Sachin Waditake

@sachinwaditake

Hadoop-Spark Developer at Capgemini India

Pune

linkedin.com/in/sachin-722a2523b

Sachin is a Big Data/Hadoop and Spark Developer with over 3 years of experience. He has a strong background in distributed file systems and data processing using technologies such as Spark, SQL, Hive, and HDFS. His expertise includes developing end-to-end data transformation pipelines across both Hadoop and Azure environments, using tools such as Azure Data Factory and Azure Databricks.

Experience

Hadoop-Spark Developer

Capgemini India

May 2022 - Present • Pune

Developed and implemented data pipelines using Spark, SQL, and Hive. Proficient in PySpark programming, Spark architecture (Core, SQL, RDDs, DataFrames), and handling structured and semi-structured data. Experienced with Azure services including ADF, ADLS Gen2, Azure Databricks, Synapse, and Azure SQL. Managed end-to-end data transformation pipelines, including delta load functionality, using Apache Spark, Hadoop, and Azure Data Lake Storage.

Education

Pune University

Bachelor of Engineering

Jan 2021

Skills

Big Data
Hadoop
Spark
PySpark
SQL
Hive
HDFS
Python
Azure Data Factory (ADF)
ADLS Gen2
Azure Databricks
Synapse Analytics
Azure SQL
MySQL
Sqoop
YARN
Data Transformation
ETL
Delta Lake