Default profile banner
AY

Abhishek Yadav

@user.2501465

Data Scientist at Quara Holding

Gurugram, India

https://github.com/abhishek127

Quara HoldingIndian Institute of Technology, Delhi

Abhishek Yadav is a data scientist with experience at OLX Group, HSBC Technologies, and Quara Holding, holding a B.Tech from IIT Delhi. He specializes in machine learning, deep learning, LLMs, NLP, and data engineering, with hands-on experience in Python, scikit-learn, PyTorch, LangChain, and cloud platforms including GCP and AWS. He has also published research papers and competed in national analytics competitions.

Experience

Data Scientist

Quara Holding

•Jul 2023 - Present•Remote

Implemented a Logistic Regression classification model using past campaign data to optimize luxury real estate conversion through recall prioritization. Employed OpenAI GPT-3.5 API for few-shot listing description generation. Experimented with OpenAI GPT-3.5, OpenAI Embeddings, LangChain, and LanceDB/FAISS vector Database to prototype a RAG-based AI chatbot.

Data Scientist

OLX Group

•Oct 2022 - Jun 2023•Gurugram, India

Developed a highly accurate CalibratedClassifier ML model with Light Gradient Boosting as base estimator to enable retail agents in India to prioritize leads and boost the important metric of I2P from 14% to an impressive 18%. Implemented Price fraud detection utilizing XGBRegressor and employed fraud detection using Text classification with LDA(Latent Dirichlet Allocation) topic modeling for the two-wheelers category within Trust and Safety.

ML Engineer

Bezant Technologies

•Feb 2022 - Oct 2022•Remote

Developed a scalable feature engineering pipeline that generates 60+ technical indicators for any trading asset following the OHLCV pattern. Used Adaboost and RandomForestClassifier models from scikit-learn with the triple barrier method to predict price movements and achieved positive Sharpe ratio for strategies by backtesting models using TimeSeriesSplit.

Data Scientist

HSBC Technologies

•Sep 2020 - Feb 2022•Pune, India

Developed an automated Django web application for AMG IAP, streamlining operations by 95% through process automation and report generation. Created an analytics platform utilizing streamlit, plotly, seaborn, and wordcloud. Mobile app; expense tracking, investment advice, and query management.

Data Science Intern

Innoplexus Consulting Services Private Limited

Internship•May 2019 - Jul 2019•Pune, India

Created a deduplication model via fuzzy-wuzzy matching and k-nearest neighbors, alongside a sentiment analysis model. Utilized BeautifulSoup, lxml, tika, and Selenium within NiFi for diverse pharma domain data mining. Awarded Certificate of Excellence for outstanding performance as Intern.

Education

Indian Institute of Technology, Delhi

B.Tech.

Textile Technology

May 2020•Grade: 7.19/10

Kendriya Vidyalaya Mathura Cantt.

AISSCE (Class XII)

Jan 2016•Grade: 94.8 %

Kendriya Vidyalaya Mathura Cantt.

AISSE (Class X)

Jan 2014•Grade: CGPA: 10/10

Licenses & Certifications

Architecting with GCP; Google Compute Engine- Coursera Specialisation

Coursera

Issued: Dec 2020•Expires: Jan 2021

The Complete Neural Networks Bootcamp: Theory, Applications

Issued: Apr 2023• No expiration

Skills

Machine Learning
Deep Learning
Supervised Models
Unsupervised Models
Predictive Modeling
Time Series Analysis
NLP
Neural Networks
Backpropagation
Gradient Descent
Regularization
RNN
LSTM
CNN
YOLO
Transformers
BERT
GPTs
LLMs
Langchain
Prompt Engineering
Vector Databases
R
Excel
Python
Pandas
NumPy
Seaborn
Matplotlib
Plotly
Feature Engineering
Data Cleaning
Data Wrangling
EDA
Data Transformations
MySQL
Postgres
MongoDB
BigQuery
RedShift
GCP
AWS Sagemaker
Docker
Django
Flask
FastAPI
Streamlit
PyTorch
Scikit-learn
Pycaret