Saurabh Dhawale
@saurabhdhawale
Data Engineer at NeoSoft Systems and Cloud Services
Pune, India
Saurabh is a Data Engineer with over 3 years of experience in designing, implementing, and supporting big data applications using Apache Spark, Hadoop, and AWS. He possesses strong expertise in Spark-core, Spark-SQL, and streaming, along with proficiency in Python and complex SQL queries. His experience spans the full project lifecycle, including data modeling, ETL processes, and optimizing performance across various cloud environments like AWS S3 and Redshift.
Experience
Data Engineer
NeoSoft Systems and Cloud Services
Developed solutions based on customer requirements. Key projects included: 1) Data Warehouse Analysis (Banking Domain), involving Pyspark data extraction, Hive table creation, Spark/Hive optimization, Airflow automation, and handling CDC/SCD. 2) Enterprise Data Hub (Telecommunication Domain), utilizing Medallion architecture (Bronze, Silver, Gold) with Pyspark and AWS Glue ETL. Managed job scheduling with AWS Glue and delivered reports using AWS S3. Tools used include Hadoop, Spark, Hive, AWS RDS, EMR, S3, Python, AWS Glue, and Athena.
Education
University of Pune
Master of Business Administration
MBA