Experience

Data Engineer

Automatic Data Processing (ADP)

•Nov 2023 - Present•Hyderabad

Designed and implemented end-to-end ETL pipelines for payroll and compliance analytics using Python, SQL, PySpark, and Apache Spark on Azure Databricks, processing large-scale employee and financial datasets. Built and maintained analytics-ready datasets in Azure Data Lake Storage Gen2 following lakehouse principles, enabling accurate salary processing, statutory reporting, and downstream business analytics. Optimized PySpark and SQL transformations in Azure Databricks, reducing query execution and processing time by 25–45% on high-volume payroll workloads. Managed data access, governance, and schema consistency across several payroll domains by utilizing Databricks Unity Catalog. Built rerunnable SQL procedures to support schema changes and deployment-safe releases with minimal impact on payroll processing. Created Databricks dashboards for internal data quality monitoring, reconciliation, and mismatch detection between source and target payroll systems. Collaborated closely with business leaders, payroll SMEs, and cross-functional Agile teams to translate business and compliance requirements into reliable, production-ready data solutions.

Education

KLU

B. Tech

Electronics and Communication Engineering

Jun 2019 - Apr 2023

Licenses & Certifications

BEC (Business English Certificate)

Cambridge Assessment English

• No expiration

Introduction to python Programming

• No expiration

Skills

Python

SQL

PySpark

ETL / ELT Pipelines

Lakehouse Architecture

Data Modeling

Incremental Loads

CDC

Data Quality & Validation

Schema Evolution

Cost-Optimized Pipelines

Apache Spark

Delta Lake

Spark SQL

Azure Databricks

Databricks Unity Catalog

Azure Data Factory

Azure Data Lake Storage Gen2 (ADLS)

Azure Synapse Analytics

Databricks Notebooks & Jobs

Jira

CI/CD Pipelines

Production Deployments

Rerunnable Pipelines

GIT

Bhargavi Vempuluru

Experience

Education

KLU

Licenses & Certifications

Skills