Default profile banner
AY

ANMOL YADAV

@anmolyadav

Data Scientist/Data Engineer

Greater Noida, India

linkedin.com/in/anmolk-yadav

GigforceIndian Institute of Technology (BHU)

Anmol Yadav is a Data Scientist and Data Engineer with over 2 years of experience in Data Analytics and ETL domains. He is skilled in machine learning, programming, and processing data to generate insights across sectors like Staffing, OTT, E-commerce, and Pharma. He is experienced in managing enterprise data infrastructures, utilizing cloud computing (AWS, Azure), and big data technologies.

Experience

Data Engineer

Gigforce

•Invalid Date - Present•Gurugaon

Managing team of 3 Analysts and 3 interns. Scrapped client's platform data to AWS S3 using Selenium and automated the process on AWS lambda,Glue Catalog, Athena. Integrated and automates APIs shared by client to store the attendance and payment data in s3. Performing cohort analysis month on month basis, also per client basis, to get the metrics on churn and retention.

Data Engineer

I2E Consulting Pvt Ltd

•Invalid Date - Invalid Date

Planned, engineered, configured, deployed Glue scripts to reduce license cost,improve efficiency. Assisted solution providers with the definition and implementation of technical and business strategies using MongoDb, Lambda, and Glue. Developed scripts to generate details inside scientific publications with DOI number.

Data Scientist

ThinkBumblebee Analytics

•Invalid Date - Invalid Date•Pune, India

Developed the ML pipeline for churn prediction for OTT clients using various services in AWS. Led the team of 4 members including Data Engineers and Web developers to build an analytics dashboard for UAE-based gaming clients.

Education

Indian Institute of Technology (BHU)

Bachelor of Technology

Invalid Date - Invalid Date•Grade: 8.38 Cpi

Licenses & Certifications

AWS Certified

• No expiration

Azure Certified

• No expiration

Skills

AWS Glue
AWS S3
AWS Lambda
API Gateway
EC2
Machine Learning
ETL
Data Preparation
Data Visualization
Exploratory Data Analysis
Azure
Python
SQL
PySpark
R-Programming
Oracle
Redshift
No-SQL
MongoDB
OpenSearch
Selenium