Default profile banner
RC

RIYA CHAUHAN

@riyachauhan

Associate Data Engineer at Celebal Technologies

Gurgaon

Ardent MillsManav Rachna International Institute of Research and Studies

Riya Chauhan is an Associate Data Engineer with experience leading data projects using Databricks, Azure, and ADF. She specializes in migrating large datasets and optimizing ETL pipelines, significantly improving data flow and system performance. Proficient in Python, SQL, and various cloud services, she is skilled in ensuring high data accuracy and quality across complex data warehousing environments.

Experience

Senior Consultant

Ardent Mills

Jul 2023 - Present

Led the migration of 10+ TB data from Microsoft Dynamics 365 and Azure SQL to OneLake, improving data flow by 30% and system performance. Designed 15+ pipelines in Azure Data Factory and Databricks, ensuring 99.9% data accuracy, and automated workflows in Azure DevOps, saving 20+ hours weekly. Built a Databricks alert system with Azure Logic Apps for 100% real-time monitoring and developed validation logic using hashing techniques for consistent data. Implemented Lakehouse federation, integrating Databricks with SQL Server and Azure DevOps for seamless cross-system access for 5+ teams.

Associate Data Engineer

Celebal Technologies

Feb 2023 - PresentIndia

Led data projects using Databricks, Azure, ADF, and DevOps, delivering 95% efficient solutions. Designed Databricks models, boosting insights by 25% and cutting processing time by 40%. Decreased default rates by 15% and maintained a 90% investor retention rate. Developed data cleaning processes, improving data cleanliness by 96%. Conducted Spark queries to process the dataset by 95%. Enhanced ETL speed by 80%, reducing latency to under 5 minutes.

Associate DE

Manulife

Jan 2023 - Apr 2024

Led the migration of a 5,000+ line Data Integration Hub (DIH) codebase to Azure, optimizing performance by 20%. Conducted 50+ unit and system integration tests, identifying and resolving 95% of codebase issues pre-deployment. Improved code quality by 30% through detailed pre/post-migration comparisons. Created 10+ concise Excel documents for project documentation, aiding in future maintenance. Collaborated with 5+ cross-functional teams to troubleshoot issues, ensuring 100% project success rate.

Education

Manav Rachna International Institute of Research and Studies

B. Tech

Faridabad, Haryana

Jul 2019 - Jul 2023Grade: CGPA: 7.82

Licenses & Certifications

Databricks Certified: Data Engineer Professional

Databricks

Issued: Jan 2024

Databricks Certified: Data Engineer Associate

Databricks

Issued: Sep 2023

Microsoft Certified: Azure Data Fundamentals

Microsoft

Issued: Apr 2023

Skills

SQL
Python
MS SQL Server
MySql
Apache Spark
Pyspark
Spark SQL
Microsoft Azure
ADLS Gen 2
Azure DataBricks
Azure Data Factory
Azure DevOps
Apache HBase
Spark Structured Streaming
Autoloader
ETL Pipeline
Data Engineering
Big Data
Data Quality