Pavan Polagani

@Pavan2505

Data Engineer at Helson Software Solutions Private Limited

Hyderabad, Telangana, India

Helson Software Solutions Private Limited • Vishnu Institute of Technology

4.11 years of experience as a Data Engineer delivering cloud-native data warehousing and analytics solutions using Azure, AWS, Snowflake, Spark, and DBT, with a strong focus on building reliable, testable, and version-controlled ELT pipelines.

Experience

Data Engineer

Helson Software Solutions Private Limited

Sep 2024 - Present • Hyderabad

Project #3: Humana Healthcare Insurance. Design and implement data ingestion, transformation, and storage pipelines using Azure Data Factory, Azure Databricks, and PySpark. Manage and optimize data storage solutions, including Azure Blob Storage, Azure Data Lake, and Azure SQL Database. Implement data quality checks and monitoring to ensure data accuracy and reliability.

Data Engineer

Edgeverve Systems Limited

Aug 2022 - Sep 2024 • Bengaluru

Project #2: Enterprise Data Pipeline (AWS & Snowflake). Designed, developed, and maintained automated data pipelines using AWS Glue and PySpark, including data models, database objects, and views to support downstream analytics in Snowflake. Configured and orchestrated AWS Glue jobs for end-to-end processing from S3 ingestion to Snowflake loading. Built modular DBT models in Snowflake.

Data Engineer

Techsoft Solutions Private Limited

Aug 2021 - Aug 2022 • Bengaluru

Project #1: Claim Management System (CMS). Led end-to-end data migration using SQL, Azure SQL, Azure Data Lake, and Azure Data Factory. Built scalable Spark workloads in Azure Databricks using Spark SQL and RDDs to transform datasets of varying sizes. Leveraged DBT to convert raw claim and policy data into star-schema models.

Data Engineer

Capgemini

Apr 2021 - Aug 2021 • Hyderabad

Education

Vishnu Institute of Technology

Bachelor of Technology

Electronics & Communications Engineering

Jun 2016 - Oct 2020

Skills

Azure Blob Storage
ADLS Gen 2
AWS S3
AWS Glue
AWS Lambda
AWS EC2
Snowflake
Azure Databricks
Spark SQL
Hadoop
Spark
PySpark
Apache Airflow
Git
Azure DevOps
AWS CodePipeline
MySQL
Azure SQL
Snowflake SQL
Python
SQL
DBT
Delta Lake