Default profile banner
SK

Sankalp Kumar

@sankalpk4u

Lead Associate - Data Engineering at WNS Global Services

Noida, Uttar Pradesh, India

WNS Global ServicesSharda University

Sankalp Kumar is a Data Engineer and Azure & Databricks Certified professional currently serving as a Lead Associate - Data Engineering at WNS Global Services. He has extensive experience in developing scalable ETL/ELT pipelines using PySpark, Spark SQL, and Delta Lake within Azure and Databricks environments. He holds a Bachelor of Technology in Information Technology from Sharda University and is a Databricks Certified Data Engineer Professional.

Experience

Lead Associate - Data Engineering

WNS Global Services

Dec 2025 - PresentGurugram, Haryana

Project: Business Systems Integration Platform. Client: 84.51° (Kroger). Developed Azure Databricks (PySpark, Spark SQL) pipelines integrating enterprise data from Salesforce, Dynamics 365, Workday, and Concur into centralized analytics datasets. Built scalable Delta Lake pipelines on ADLS for workforce, and financial data for downstream analytics and reporting. Implemented data validation, schema alignment, and transformation logic across multiple enterprise source systems. Collaborated with client analytics teams to deliver curated datasets supporting marketing analytics, campaign insights, and operational reporting. Managed Git-based development and Databricks Jobs orchestration for production ETL pipelines.

Data Engineer – (Junior Consultant)

Digivate Labs Pvt. Ltd

May 2024 - Aug 2025Gurugram, Haryana

Developed scalable batch and streaming pipelines using PySpark, Spark SQL and Delta Live Tables (DLT), reducing manual processing time by 60%. Managed fine-grained data access and governance using Unity Catalog. Automated workflows with Databricks Jobs API and GitHub Actions, reducing scheduling overhead by 70%. Collaborated with analytics teams to deliver validated, production-grade data assets. Key Projects: Snapdeal – E-commerce Data Platform Migration (Vertica → Databricks). Migrated 35+ TB historical data, 150+ pipelines, 10K+ tables using metadata-driven PySpark frameworks. Improved query performance 3x and reduced infra cost by 40% via Delta optimizations. Implemented automated audit logging & validation for trust and traceability.

Data Engineer Intern

Digivate Labs Pvt. Ltd

Nov 2023 - May 2024Gurugram, Haryana

Assisted in ongoing Databricks migration and pipeline development projects under senior engineers. Supported teams with data validation, schema alignment, and ETL enhancements in PySpark and SQL. Built foundational skills in Spark optimization, Medallion architecture, and workflow orchestration.

Education

Sharda University

Bachelor of Technology

Information Technology

Aug 2020 - Jul 2024

Licenses & Certifications

Certified Data Engineer Professional

Databricks

Issued: Feb 2025• No expiration

Certified Data Engineer Associate

Databricks

Issued: Jul 2024• No expiration

Data Analytics with Python

IIT Roorkee (NPTEL)

Issued: May 2023• No expiration

Skills

Python
PySpark
Pandas
NumPy
SQL
T-SQL
Spark SQL
Shell scripting
Databricks
Delta Live Tables (DLT)
Delta Lake
Fivetran
HighTouch
Apache Spark
Kafka
Azure
Data Lake (ADLS)
Blob Storage
Data Factory (ADF)
Synapse Analytics
Event Hub
AWS
S3
IAM
Snowflake
Azure Synapse
NoSQL
MongoDB
Databricks Unity Catalog
ETL/ELT pipelines
Medallion architecture
Data Modelling
Git
GitHub Actions
Databricks Repos
Azure DevOps
DataDog
Power BI
Databricks Dashboards