Madhurika Jain
@madhurikajain
Data Engineer at Synoriq R&D Private Limited
Jaipur
Madhurika Jain is a Data Engineer with over a year of hands-on experience utilizing technologies such as PySpark, AWS Data Lake, and AWS Redshift. She has a proven track record of designing robust data pipelines, implementing data quality checks, and optimizing cloud services. She is skilled in making NoSQL data available in tabular formats and optimizing AWS costs.
Experience
Senior Software Engineer
Synoriq R&D Private Limited
Designed data pipeline to ingest data from various databases along with CDC operations. Designed framework for Data Quality Checks of different data assets. Built data ingestion pipelines to ingest data from different external data sources to data lake for consumption of end users. Maintained the data quality at UDP side by pointing out the issues in data at source vendor side. Implemented zero fault tolerance in the pipelines. Dealt with OOM issues in AWS Glue. Analyzed spark UI for optimization of Glue pipelines. Made the mongoDB (NoSQL) data available in tabular format for end users by using cloud services like redshift & query federation lambda. Implemented materialized views & schema level changes to maintain the data security. Implemented database & table level access control using TBAC in AWS Lake Formation. AWS cost optimization - saved around 1,00,400 USD/month on the AWS cloud services costing (s3, glue, redshift, lambda, DMS, quicksight etc.). Found out many bugs in AWS services like Lake Formation, redshift, lambda etc.
Assistant Manager
RITES Limited
Teaching Assistant
GCEW
Electrical Engineer
Hindalco Industries Limited
Education
M.B.M. Engineering College, Jodhpur
B.E. in Electrical Engineering
Electrical Engineering
B.C.S.C. Girls School, Jodhpur
Senior Secondary (class XII)
B.C.S.C. Girls School, Jodhpur
Secondary (class X)
Licenses & Certifications
Foundations in Data Science
PadhAI, One Fourth Labs (IIT-M Professionals)