Default profile banner
SS

Shashikant Singh

@shashikantsingh

Senior Data Scientist

Gurgaon

TheMathCompanyGreat Lakes Institute of Management

Seasoned Data Scientist with 8+ years of experience pioneering business transformation through innovative Machine Learning solutions. Proficient in modern AI technologies, including GenAI, Transformers, LangChain, and GPTs. Specialized in Deep Learning using PyTorch, Keras, and TensorFlow, with a strong background in developing and deploying complex predictive models.

Experience

Senior Data Scientist (Senior Associate)

TheMathCompany

Aug 2022 - Present

Worked on next Gen Artificial Intelligence technologies, including but not limited to Transformers, GPTs, OpenAI, LangChain, HuggingFace, LLMs, Fine Tuning of LLMs. Developed and deployed a Customer Sentiment prediction machine learning model on Azure Databricks utilizing MLOps including model experiments, model logging, model registry, achieved 90% accuracy which resulted in enhanced product ratings by 10 % utilized SiBERT model from HuggingFace. Worked on multiple Natural Language Processing (NLP) models both using traditional machine learning and latest LLMs using PyTorch and Python. Fine-tuned LLM models. Actively participated in multiple competitions on Kaggle on machine learning, deep learning, image classification, NLP etc. Collaborated closely with teams, business stakeholders and engineers to ensure smooth integration of models on production environments. Well versed with Large Language Models (LLMs) and their fine tuning methods. Led a team of 2 Data Scientists and 1 Data Engineer in developing Multi-Touch Attribution Model that led to 21% reduction in cost per lead. Achieved €2.8 million in operational cost savings for a Fortune 100 company by developing and optimizing multi-variate hierarchical time series forecasting models utilizing feature engineering and hyperparameter tuning with 14% reduction in forecast error. Led a team of 7 for development of comprehensive end-to-end stability study and CPK analysis product for a Fortune 500 pharmaceutical client based on regression model. Identified trends in Control charts and Stability analysis for Mebeverine and saved client $10 million by reducing scrap batches. Performed K-means clustering on time series data , reduced forecasting time by 40%. Acted as an SME and thought leader, including the creation of project timelines, interviewing data scientists and bringing the best talent to MathCo. Mentored Data Science freshers on Demand Forecasting Capstone project.

Data Science and Engineering Analyst (Data Scientist)

QL2 Software

Nov 2021 - Jul 2022Gurgaon

Increased client retention by 33% by building a classification machine learning model to predict retention with an accuracy of 75%. Developed a deep learning image matching model using CNN, to match products across different competitors for a client with an accuracy of 72%. Collaborated with stakeholders to understand problem statement, refined the scope of analysis, and used the results to drive informed decisions. Automated data cleaning processes with Python scripts, improving data quality by 25% and reducing preparation time for analytical tasks. Refactored legacy code to improve reliability, scalability and maintainability. Mentored Data Science students of Cornell University on their Capstone Project in QL2 Software.

Assistant Manager (Data Scientist)

Bank of Baroda

Jan 2018 - Nov 2021New Delhi

Reduced credit card default rate by 12% by building propensity classification model to predict default with an accuracy of 82%. Developed a regression machine learning model to predict credit card limit for different individuals based on different parameters. Analyzed large data sets and assisted in the development of custom models/algorithms to uncover trends, patterns and insights. Performed time series analyses for different products of the bank. Analyzed business data to produce reports on PowerBI and polished presentation, highlighted findings and recommended changes. Developed clean and organized machine learning codes using standard data science methodologies. Involved in parsing and aggregating messy, incomplete, and unstructured data sources to produce datasets that can be used in analytics/predictive modeling. Developed an ETL pipeline on AWS using python scripts to automate tasks.

Software Developer (Data Analyst)

CloudSoft Technology

Oct 2015 - Dec 2017Noida

Developed a customer churn classification model for a client which predicted churn with an accuracy of 67%. Worked with clients to understand their business objectives and leveraged profound understanding of mathematical and statistical principles to deliver data driven solutions. Implemented business intelligence solutions with Tableau to improve data-drilling capabilities for non-technical users. Reduced report generation time by 35% by automating SQL scripts.

Education

Great Lakes Institute of Management

Post Graduate

Data Science and Business Analytics

Jan 2022

Ajay Kumar Garg Engineering College

Bachelor of Technology

Jan 2015

Licenses & Certifications

GenAI Fundamentals

Databricks

• No expiration

Introduction to GenAI

Google

• No expiration

GenAI and Prompt Engineering

IBM

• No expiration

Microsoft Azure Fundamentals

• No expiration

Project Management

• No expiration

Skills

Transformers
GPTs
OpenAI models
HuggingFace
LangChain
LLMs
Fine Tuning of LLMs
GenAI
Keras
TensorFlow
PyTorch
ANN
Image Classification
CNN
Sequential DL models
RNN
LightGBM
LSTM
Prophet
SARIMA
Regression
Classification
NLP
TextBlob
Clustering
PCA
Statistics
Feature Engineering
Ensemble
Bagging
Boosting
Databricks
PySpark
Azure
AWS
Hive
Python
SQL
MLOps
Git
PowerBI
Tableau
Seaborn
Plotly
MLFlow