Default profile banner
BV

Bhargavi Vempuluru

@Bhargavi

Data Engineer at Automatic Data Processing (ADP)

Hyderabad, Telangana, India

Automatic Data Processing (ADP)KLU

Data Engineer with 2+ years of experience building and operating scalable data pipelines using SQL, Python, PySpark, and Azure Databricks, specializing in payroll and compliance analytics with strong hands-on expertise in Azure Data Factory, ADLS Gen2, Synapse Analytics, and cost-optimized ETL workflows.

Experience

Data Engineer

Automatic Data Processing (ADP)

Nov 2023 - PresentHyderabad

Designed and implemented end-to-end ETL pipelines for payroll and compliance analytics using Python, SQL, PySpark, and Apache Spark on Azure Databricks, processing large-scale employee and financial datasets. Built and maintained analytics-ready datasets in Azure Data Lake Storage Gen2 following lakehouse principles, enabling accurate salary processing, statutory reporting, and downstream business analytics. Optimized PySpark and SQL transformations in Azure Databricks, reducing query execution and processing time by 25–45% on high-volume payroll workloads. Managed data access, governance, and schema consistency across several payroll domains by utilizing Databricks Unity Catalog. Built rerunnable SQL procedures to support schema changes and deployment-safe releases with minimal impact on payroll processing. Created Databricks dashboards for internal data quality monitoring, reconciliation, and mismatch detection between source and target payroll systems. Collaborated closely with business leaders, payroll SMEs, and cross-functional Agile teams to translate business and compliance requirements into reliable, production-ready data solutions.

Education

KLU

B. Tech

Electronics and Communication Engineering

Jun 2019 - Apr 2023

Licenses & Certifications

BEC (Business English Certificate)

Cambridge Assessment English

• No expiration

Introduction to python Programming

• No expiration

Skills

Python
SQL
PySpark
ETL / ELT Pipelines
Lakehouse Architecture
Data Modeling
Incremental Loads
CDC
Data Quality & Validation
Schema Evolution
Cost-Optimized Pipelines
Apache Spark
Delta Lake
Spark SQL
Azure Databricks
Databricks Unity Catalog
Azure Data Factory
Azure Data Lake Storage Gen2 (ADLS)
Azure Synapse Analytics
Databricks Notebooks & Jobs
Jira
CI/CD Pipelines
Production Deployments
Rerunnable Pipelines
GIT