Shubhankar Singhal

@Shubhankar

Data Engineer at Vaco Binary Semantics

Delhi, India

Vaco Binary Semantics • Krishna Engineering College

Azure Data Engineer with 2.5 years of experience in building, optimizing, and maintaining cloud-based data pipelines using Azure Data Factory, Azure Data Lake Storage, Azure Blob Storage, Azure Synapse Analytics, and Databricks. Strong expertise in ETL/ELT development, SQL-based transformations, data modeling, and performance optimization. Experienced in handling large-scale batch and near-real-time data ingestion for analytics and reporting use cases.

Experience

Data Engineer

Vaco Binary Semantics

Jun 2024 - Present • Gurgaon, India

Designed and maintained scalable ETL pipelines using Azure Data Factory to ingest and transform large volumes of 2-wheeler and 4-wheeler data from international markets (US, Vietnam).
Implemented data ingestion using Azure Blob Storage and Azure Data Lake Storage Gen2 as the central data lake for analytics workloads.
Optimized existing data pipelines, achieving an ~80% reduction in data processing time and improved data freshness.
Performed data transformations and aggregations using SQL and PySpark (Databricks) for downstream analytics.
Built SQL-based analytical datasets and views for reporting, reducing manual reporting effort by 60% and saving 200+ hours.
Integrated curated datasets with Azure Synapse Analytics for high-performance querying and reporting.
Integrated batch and near-real-time data ingestion strategies to ensure high data availability.
Conducted data profiling, anomaly detection, and data quality validation to improve the reliability of analytics data.
Collaborated with analytics and product teams to design data models optimized for business intelligence use cases.

Associate Software Engineer

Nagarro

Mar 2023 - Dec 2023 • Gurgaon, India

Designed and implemented Azure Data Factory (ADF) batch pipelines to ingest structured data from Azure Blob Storage into Azure Data Lake Storage Gen2 (ADLS).
Configured Linked Services, Datasets, and Pipelines in ADF for scheduled batch data ingestion.
Implemented basic data transformations using Mapping Data Flows and SQL-based transformations.
Loaded curated data into Azure SQL Database / Synapse SQL (serverless) for reporting and analysis.
Implemented pipeline scheduling, monitoring, and failure handling using ADF triggers and logs.

Education

Krishna Engineering College

Bachelor of Engineering

Computer Science

Aug 2023 • Grade: 7.56/10.0

Skills

SQL
Python
C++
C#
PySpark
MySQL
SQLite
Knowledge Graph (Graph DB)
Azure SQL Database
Azure Data Factory
Azure Blob Storage
Azure Data Lake Storage
Azure Synapse Analytics
Databricks
Google Cloud Platform (GCP)
ETL Development
Data Warehousing
Data Quality Check
Schema Validation