Kajal Yadav

@user.2590553

Data Scientist at Hexo.ai

Delhi, India

https://kajalyadav.com

Hexo.ai | Central University of Rajasthan

Completed an M.Sc. in Data Science; 2 years of varied working experience in NLP, computer vision, and generative AI, plus writing data science and AI articles. Core skills: generative AI, prompt engineering, image generation, NLP, and extrapolating data to provide actionable business insights grounded in math, modeling, statistics, and domain knowledge.

Experience

Data Scientist

Hexo.ai

Full-time | Feb 2023 - Present | Bangalore, Remote

Reduced human involvement by 70% and retained clients through loophole investigation and feature launches. Customized diffusion models for clients by fine-tuning on client-specific data. Enhanced existing generative AI models using deep learning frameworks and libraries such as PyTorch, Transformers, xFormers, and Segment Anything. Added features such as segmentation, background removal, inpainting, outpainting, photorealism, and color control to an end-to-end pipeline.

Data Science Consultant

Own Startup

Feb 2022 - Feb 2023 | Remote

Built an NLP-based news aggregator app prototype that required 60% less human involvement.

Data Science Consultant

DagsHub

Feb 2022 - Feb 2023 | Remote

Developed a content pipeline with DagsHub's data version control services, improving data scientists' efficiency by 30%.

Data Science Consultant

Octoparse

Feb 2022 - Feb 2023 | Remote

Authored 20+ data-rich articles for Octoparse, drawing on AI and data science expertise and increasing customer retention by 15%.

Data Science Consultant

Medium

Feb 2022 - Feb 2023 | Remote

Authored open-source data science and AI articles, benefiting a global audience of 2,000 followers.

Data Scientist

Omdena (NGO)

Aug 2021 - Jan 2022 | California, US (Remote)

Worked in a team of 50 and assessed them 70% of the time. Detected anomalies on the Mars surface (86% confidence). Preprocessed images: annotated 200 images with anomalies using the VGG Image Annotator and developed models for the project. Detected all anomalies using YOLOv4, achieving 86% confidence. Led 100% of the task and successfully deployed the model on Streamlit.

Data Scientist intern

Blue ThinQ (UK)

Internship | Sep 2020 - Jul 2021 | Remote

Built an NLP project to derive business metrics on how competitor sites were performing, increasing revenue by 20%. Trained a team of 10, guiding them on scraping, ML, DL, and data insights. Mastered 20 skills including market research, data scraping, EDA, data cleaning, data analysis, data visualization, data modeling, data mining, topic modeling, data labeling, sentiment analysis, n-grams, clustering, classification, text summarization, keyword extraction, collocations, grammar patterns, and business consulting.

Education

Central University of Rajasthan

Masters

Big Data Analytics (C.S.)

Jan 2019 - Jan 2021 | Grade: 7.61/10

Delhi University

Bachelors

C.S.

Jan 2016 - Jan 2019 | Grade: 7.5/10

Licenses & Certifications

Build a Data Science Web App with Streamlit & Python

Coursera

Natural Language Processing

NPTEL

Deep Learning

NPTEL

Programming for Everybody

Coursera

Skills

Python
SQL
Linux
Git
Anaconda
Excel
MySQL
Jupyter
Google Colab
VS Code
Regex
Sklearn
Pandas
NumPy
Prompt Engineering
PySpark
Matplotlib
Seaborn
Plotly
SciPy
Bar-Charts
Histograms
Pie-Charts
Clustering
Supervised learning
Unsupervised learning
Classification
Regression
Linear Regression
K-Means
KNN
Logistic Regression
SVM
Naïve Bayes
Decision trees
Random Forest
XGBoost
ML
PyTorch
Torch
Torchvision
DL
Streamlit
Gradio
NLTK
TextBlob
Tweepy
Gensim
SpaCy
Image Classification
YOLO
Object Detection
OpenCV
Cv2
Neural Networks
AWS services
EC2
S3
Hugging Face Hub
VAE
LLM
GPT
SAM
Diffusers
Transformers
Fastai
Xformers
Dreambooth
Stable Diffusion
BERT
RNN
LSTM
RLHF
UNet
Statistical Modeling
Linear Algebra
Probability
Statistical Inference