Default profile banner
AG

Abhishek Giri

@user.2518283

Data Engineer | Azure Databricks | ETL Pipelines | Microsoft Certified

Kolkata, India

https://www.linkedin.com/in/abhishekgiri99

Polestar Solutions and ServicesJadavpur University

Abhishek Kumar Giri is a Data Engineer at Polestar Solutions and Services with around 4 years of experience in BI and Data Engineering. He is proficient in Databricks, Azure Data Factory, PySpark, and cloud data warehousing, and has automated credit-back processes improving system efficiency by 53%, and built scalable data pipelines reducing load time by 80%. He holds dual Microsoft certifications as Azure Data Engineer Associate (DP-203) and Fabric Analytics Engineer Associate (DP-600), and earned a BE in Mechanical Engineering from Jadavpur University with a GPA of 8.28.

Experience

Data Engineer

Polestar Solutions and Services

Full-time•Jul 2021 - Present

Project 3 (Electric appliances/Consumer electronics): Automated credit-back process improving efficiency by 53%. Built scalable data pipelines integrating structured and unstructured data from multiple API sources using Azure Blob Storage, reducing load time by 80%. Developed business logic transformations using PySpark and SQL with orchestration pipelines in Synapse. Project 2 (Footwear/Accessories): Developed pipeline to process semi-structured data in real-time using Multiprocessing, analyzing ~1 Lakh XML POS files. Project 1 (Construction/Mining): Migrated data from various databases and developed complex DAX logic to support Power BI.

Education

Jadavpur University

BE

Mechanical Engineering

Jan 2017 - Jan 2021•Grade: 8.28 GPA

Kolkata

Licenses & Certifications

Microsoft Certified: Azure Data Engineer Associate (DP-203)

Microsoft

Microsoft Certified: Fabric Analytics Engineer Associate (DP-600)

Microsoft

Azure Data Fundamentals (DP-900)

Microsoft

Azure Fundamentals (AZ-900)

Microsoft

Databricks Lakehouse Fundamentals

Databricks

Generative AI Fundamentals

Databricks

Skills

Azure Databricks
Microsoft Fabric
Azure Data Factory
Azure Synapse Analytics
Delta Lake
Azure Key Vault
Snowflake
Google Cloud Platform
Apache Airflow
PySpark
Apache Spark
Python
Pandas
NumPy
T-SQL
Spark SQL
PostgreSQL
MySQL
MS SQL Server
Data Modelling
ETL/ELT
GitHub
Jira
Agile
Power BI
DAX