Sachin Waditake

@sachinwaditake

Hadoop-Spark Developer at Capgemini India

Pune

Capgemini IndiaPune University

Sachin is an experienced Big Data/Hadoop and Spark Developer with over 3 years of experience. He has a strong background in file distribution systems and data processing using technologies like Spark, SQL, Hive, and HDFS. His expertise includes developing end-to-end data transformation pipelines across both Hadoop and Azure environments, utilizing tools like Azure Data Factory and Azure Databricks.

Experience

Hadoop-Spark Developer

Capgemini India

•May 2022 - Present•Pune

Developed and implemented data pipelines using Spark, SQL, and Hive. Proficient in PySpark programming, Spark Architecture (Core, SQL, RDD, Data Frames), and handling structured/semi-structured data. Experienced with Azure services including ADF, ADLS Gen2, Azure Databricks, Synapse, and Azure SQL. Managed end-to-end data transformation pipelines, including delta load functionality, using technologies like Apache Spark, Hadoop, and Azure Data Lake Storage.

Education

Pune University

Bachelor of Engineering

Jan 2021

Skills

Big Data

Hadoop

Spark

PySpark

SQL

Hive

HDFS

Python

Azure Data Factory (ADF)

ADLS Gen2

Azure Databricks

Synapse Analytics

Azure SQL

MySQL

Sqoop

YARN

Data Transformation

ETL

Delta Lake