Default profile banner
IS

Ipsita Sarkar

@Ipsita17

Data Engineer (Associate Software Engineer) at Accenture

Gurgaon, Haryana, India

AccentureBharati Vidyapeeth’s Institute of Computer Applications and Management (GGSIPU), Delhi

Innovative Data Engineer specializing in building scalable data pipelines and lakehouse architectures leveraging PySpark, Databricks, Azure Cloud, and GCP. Proven expertise in designing enterprise-grade ETL/ELT work-flows, optimizing big data processing, and implementing robust data quality frameworks. Databricks Certified with hands-on experience in Apache Spark, Delta Lake, and real-time streaming.

Experience

Data Engineer (Associate Software Engineer)

Accenture

Oct 2024 - Present

Architected production-grade ETL/ELT pipelines using PySpark and Spark SQL, processing 100M+ records daily with 30-40% performance improvement through partition optimization, broadcast joins, and adaptive query execution; Designed multi-layer lakehouse architecture (Landing → Bronze → Silver → Gold) ensuring data governance, lineage, and quality across enterprise platform serving 500+ business users; Developed reusable PySpark transformation modules with complex joins, window functions, and aggregations, reducing code duplication by 40% and accelerating feature delivery by 2-3 weeks; Orchestrated end-to-end workflows using Azure Data Factory and GCP Cloud Composer, enabling reliable cross-platform data movement between ADLS, BigQuery, and Cloud Storage with 99.8% pipeline success rate; Built optimized SQL models with materialized views, partitioned tables, and clustered indexes, reducing dash-board query latency from 45s to 18s (60% improvement) and supporting real-time analytics; Implemented comprehensive automated data quality framework including schema validation, duplicate detec-tion, null checks, and threshold-based alerts, reducing production defects by 30% and improving data

Education

Bharati Vidyapeeth’s Institute of Computer Applications and Management (GGSIPU), Delhi

Master of Computer Applications

Computer Applications

Sep 2022 - Sep 2024Grade: 8.6/10

Asansol Engineering College, West Bengal

Bachelor of Computer Applications

Computer Applications

Jan 2019 - Jan 2022Grade: 9.3/10

Licenses & Certifications

Databricks Certified Data Engineer Associate

Databricks

Issued: Oct 2025• No expiration

Credential ID: 163966870

Databricks Certified Generative AI Engineer

Databricks

Issued: Jan 2026• No expiration

Credential ID: 163966870

Skills

Python
SQL
PySpark
Spark SQL
Java
ML / GenAI
Apache Spark
Databricks
Delta Lake
Delta Live Tables
Apache Kafka
Data Lakehouse Architecture
Azure (ADF, ADLS Gen2, Synapse Analytics, Databricks)
GCP (BigQuery, Cloud Storage, Cloud Composer, Pub/Sub)
ETL/ELT Pipelines
Data Modeling
Medallion Architecture (Bronze-Silver-Gold)
SCD Type 2
CDC
Incremental Loads
PostgreSQL
Snowflake
BigQuery
Azure Synapse
MySQL
Git
GitHub
VS Code
pytest
CI/CD
Azure DevOps
Unity Catalog
Jira