Abhishek Pundir

@abhishekpundir

Data Scientist at Absolute Data Labs

B-100, Brij Vihar, Ghaziabad- 201011, Uttar Pradesh

Absolute Data Labs • DIT University

Performance-driven Data Scientist with over 4 years of experience specializing in Machine Learning, AI, and Natural Language Processing. Proven ability to develop and deploy scalable data models and ML solutions using Python and cloud platforms like AWS. Expertise includes end-to-end data pipelines (ETL/ELT), statistical analysis, and translating complex data insights into actionable business strategies.

Experience

Data Scientist

Absolute Data Labs

Jan 2001 - Present • Gurugram, Haryana

Spearheading the development of ELT pipelines that acquire data from SQL Server and optimize its storage in Snowflake, drawing on expertise in data warehouse design. Collaborating with multi-disciplinary teams to understand business requirements. Cleaning, preprocessing, analyzing, and visualizing statistical data. Identifying and implementing statistical models, including linear models, multivariate analysis, stochastic models, sampling, optimization, and time series analysis. Developing and deploying machine learning solutions in Python across the end-to-end ML cycle, and automating that workflow with Apache Airflow. Conducting statistical analysis and data mining to gain insights into market behavior and consumer trends. Achievements include improving claim outcome prediction by 82%, deploying and maintaining real-time machine learning models on AWS Cloud, and implementing a Python-based time-series model (fbprophet) to forecast non-alcoholic beverage consumption.

Data Scientist

Infogain India Pvt. Ltd.

Sep 2001 - Dec 2001 • Noida, Uttar Pradesh

Spearheaded the development and execution of Python-based ETL pipelines that extracted data from diverse sources (emails, S3 bucket, Google Analytics, social media) and loaded it into SQL Server. Streamlined data extraction through a cron job. Conducted comprehensive data preprocessing and analysis. Partnered with cross-functional teams to plan and implement A/B tests, resulting in a 20% improvement in conversion rates. Achievements include deploying and maintaining a real-time machine learning model (XGBoost) on AWS Cloud, and developing, optimizing, and assessing ML models (XGBoost, LightGBM, and Random Forests) to improve claim cost accuracy by 75%.

Education

DIT University

Bachelor's in Information Technology

Information Technology

Jan 2019

Licenses & Certifications

Machine Learning by Andrew Ng

Coursera

• No expiration

AWS Cloud Practitioner

AWS

• No expiration

Statistics

Udemy

• No expiration

Python

RCPL

• No expiration

Skills

Python
SQL
Java
HTML/CSS
C/C++
Scikit-learn
NumPy
Pandas
Matplotlib
XGBoost
TensorFlow
Keras
PyTorch
Deep Learning
Natural Language Processing
Data Engineering
Snowflake
Apache Airflow
Docker
AWS
Git
Statistical Analysis
Data Visualization
Machine Learning
Time Series
A/B Testing
Data Mining
Business Intelligence