Data engineer with 3 years of experience designing and implementing scalable data pipelines. Proficient in Python and SQL, and in ETL tools such as Apache Airflow and Databricks. Adept at optimizing data architecture and tuning database performance.
Experience
Data Engineer
ZS Associates
Migrated ETL workloads from Boomi to AWS Lambda, Step Functions, and Glue, improving data processing efficiency and reducing costs. Led development of data ingestion pipelines using Databricks on AWS, PySpark, and a generic ingestion, logging, and DQM framework, making data ingestion more efficient, scalable, and cost-effective. Developed and maintained scalable, efficient data pipelines. Automated 3,600 hours of manual work using Airflow and Python, reducing manual workload by 75%. Mentored and coached junior team members, providing technical guidance and support.
Machine Learning Engineer
Xebia IT Architects
Developed and deployed machine learning models using frameworks such as scikit-learn. Worked with DevOps teams to deploy models in production environments. Collected and cleaned data from various sources and prepared it for modeling. Developed scripts and pipelines to transform and preprocess data.
Education
Vellore Institute of Technology
Master of Computer Applications
Computer Applications
Licenses & Certifications
AWS Solutions Architect - Associate
AWS