Default profile banner
DS

DHRUVIL SHAH

@dhruvilshah

Senior Data Engineer at LTIMindtree Ltd.

Mumbai, India

https://linkedin.com/in/dhruvil-shah-gada

LTIMindtree Ltd.Vidyalankar School of Information Technology | University of Mumbai

Dhruvil Shah is an experienced Data Engineer with 3.7 years of expertise in ETL Development and Cloud Technologies. He specializes in designing and implementing robust ETL pipelines using tools like PySpark, Talend, and AWS Glue. His skills include managing data lakes, optimizing complex SQL queries, and ensuring data integrity across various enterprise systems.

Experience

Senior Data Engineer

LTIMindtree Ltd.

Jun 2021 - Present

Part of the Enterprise Data Solutions Team, working for a well-known American banking client: Citizens Bank, where I play a pivotal role in supporting the Datalake as well as various ETL applications delivering high-quality solutions tailored to the specific requirements for the enterprise. Designed, developed, and maintained ETL processes for extracting data from various sources, like Databases, Fixed Width/Delimited Files, Mainframe Files, APIs, etc. transforming it, and loading it into the Data Lake and Data Marts. Worked on several enhancements and job optimizations addressing bottlenecks and achieving significant improvement in job performance. Identified and rectified data quality defects, ensuring accuracy, consistency, and integrity across datasets. Migrated several Data ingestion batches from Talend to Pyspark achieving atleast a 50% reduction in execution time. Tuned SQL queries to handle larger datasets, with millions of records, efficiently using resources and reducing overall load on the infrastructure. Resolved complex and critical production issues, performed root cause analysis, proactively identified system inefficiencies, and applied creative solutions and bug fixes to ensure system stability and data accuracy. Implemented an automation script to add partitions to tables instead of using crawlers, reducing service costs by over 90% and significantly speeding up partitioning processes. Conducted thorough code reviews and facilitated handovers from other teams, ensuring adherence to best practices and maintaining high code quality. Mentored several team members in ETL best practices and advanced use of AWS services, creating documentations as well as training materials.

Education

Vidyalankar School of Information Technology | University of Mumbai

M.Sc. in Information Technology

Jan 2021 - Jan 2023Grade: CGPA – 8.3

S. K. Somaiya College of Science and Commerce | University of Mumbai

B.Sc. in Information Technology

Jan 2018 - Jan 2021Grade: CGPA – 8.4

Licenses & Certifications

AWS Certified Cloud Practitioner (Badge)

AWS

AWS Partner: Technical Accredited (Badge)

AWS

AWS Certified Data Engineer – Associate

AWS

• No expiration

Skills

ETL Development
Cloud Computing
Python
PySpark
Shell Scripting
SQL
AWS S3
AWS Redshift
AWS Glue
AWS EMR
AWS Athena
AWS Lambda
AWS LakeFormation
Talend
Autosys
Apache Spark
Hadoop
Hive Airflow
Git
Pandas
Numpy
MySQL
Oracle
SQL Server
PostgreSQL
MongoDB
Snowflake
Java
JSON