Default profile banner
VP

Vaishnavi Parkhi

@vaishnaviparkhi7378

Senior System Associate at Infosys Limited

Maharashtra, India

Infosys LimitedSuryodaya College of Engineering and Technology

Data Engineer with 4+ years of experience in building and optimizing data pipelines, ETL/ELT workflows, and cloud-based big data solutions. Hands-on expertise in Azure Data Factory, Azure Databricks, PySpark, SQL, Delta Lake, and Azure Synapse. Proven ability to design scalable architectures, optimize performance, and manage multi-terabyte datasets. Strong focus on Lakehouse architecture, CI/CD, data quality, data governance

Experience

Senior System Associate

Infosys Limited

Dec 2025 - PresentNagpur, India

Designed and optimized large-scale Spark data pipelines, improving workflow execution time by 30% using broadcast joins, caching, and repartition strategies. Implemented Medallion (Bronze–Silver–Gold) Lakehouse architecture in Delta Lake on ADLS, improving scalability and analytical efficiency. Built and maintained batch + streaming pipelines using ADF and Databricks, reducing data turnaround time by 25%. Processed 20M+ records per batch, ensuring high reliability, data quality, and governance compliance. Led Azure DevOps CI/CD deployment pipelines for versioning, testing, and automated releases.

System Associate

Infosys Limited

Jan 2022 - Dec 2024Pune, India

Developed PySpark workflows and ETL integrations in Azure Synapse Notebooks and Databricks. Converted complex business logic into Spark SQL, RDD, and DataFrame transformations, improving pipeline scalability. Built ingestion pipelines handling CSV, JSON, Parquet, Delta, ensuring efficient schema inference and validation. Monitored and resolved ETL failures using AutoSys and Airflow, improving uptime and reducing job failures. Developed SQL-based transformations and supported SSIS, SSRS workloads. Analyzed 400+ execution logs via Databricks and Unix to troubleshoot bugs and ensure smooth sprint cycles.

Education

Suryodaya College of Engineering and Technology

Masters of Computer Application

Computer Application

Jan 2025Grade: 70%

Kamla Nehru Mahavidyalaya

Bachelor of Computer Application

Computer Application

Jan 2021Grade: 73.3%

Licenses & Certifications

Databricks Certified Data Engineer Associate

Databricks

• No expiration

Microsoft – AZ-900: Microsoft Azure Fundamentals

Microsoft

• No expiration

Microsoft – DP-700: Fabric Data Engineer Associate

Microsoft

• No expiration

Skills

SQL
PySpark
Spark SQL
RDD
DataFrame API
Azure Data Factory (ADF)
Azure Databricks
Azure Synapse Analytics
Azure Data Lake Storage (ADLS)
Azure Logic Apps
Lakehouse Architecture
Delta Lake
Medallion Architecture
Power BI
SSMS
Azure SQL Database
ETL / Data Engineering
ETL/ELT Development
Data Modeling
Data Quality
Data Governance
Batch Processing
Streaming Pipelines
Performance Optimization