Default profile banner
DC

Dhruv Choudhary

@user.2599046

Data Scientist at proism

Jaipur, India

https://github.com/docstard

proismManipal University Jaipur

Dhruv Choudhary is a Data Science Engineering student at Manipal University Jaipur with hands-on experience as a Data Scientist at proism and a Data Science Intern at Bharti Airtel. He has expertise in NLP, machine learning, computer vision, and web scraping, including engineered custom NER models achieving 40% improvement in skills extraction and deploying homography-based forgery detection with 80% reduction in false positive rates.

Experience

Data Scientist

proism

•Sep 2023 - Apr 2024•Remote

Applied advanced data preprocessing techniques to cleanse and enrich scraped job data, ensuring high quality and reliability for subsequent analysis and modeling endeavors. Utilized N-grams methodology to extract nuanced patterns and insights from textual data, enhancing the depth and accuracy of information derived from job descriptions and resumes. Developed and deployed machine learning models utilizing sklearn’s TfidfVectorizer and NearestNeighbors algorithms to construct feature matrices and calculate distances between words from job descriptions and resume skills. Developed a robust database curation process for resume skills using Beautiful Soup. Engineered custom NER models using Spacy to proficiently detect technical skills, libraries, and frameworks, leveraging a comprehensive dataset of over 4,000 entries, leading to a 40% improvement in skills extraction. Developed a machine learning model utilizing Naive Bayes algorithm to accurately classify over 5,000 job descriptions into 10 different job categories with an average accuracy rate of 85%.

Data Science Intern

Bharti Airtel Ltd.

•Jun 2023 - Sep 2023•Remote

Developed and deployed a homography-based forgery detection system using SURF on Aadhaar card images, identifying tampered regions by matching key points and computing descriptors, resulting in an 80% reduction in false positive rates compared to previous methods. Implemented error level analysis, blockiness analysis, and JPEG artefact analysis techniques, identifying and flagging tampered pixels with over 70% efficiency. Applied thresholding and contour detection to identify forged regions in Aadhaar card images based on the calculated difference between genuine and warped images. Conducted extensive research and experimentation, analyzing over 10 research papers and implementing findings into practical forgery detection solutions. Collaborated effectively with cross-functional teams to integrate forgery detection features into Airtel’s document verification systems, ensuring robust security measures.

Education

Manipal University Jaipur

B.Tech.

Data Science Engineering

Oct 2020 - Present•Grade: 8.22 CGPA

Skills

Python
JavaScript
MYSQL
MongoDB
Apache Hive
Pandas
Numpy
scikit-learn
Data Mining
Matplotlib
Keras
NLP
ML
DL
Power BI
Excel
Naive Bayes
N-grams
vectorization
classification models
Gaussian model
regression
vector machines
spacy
Text-embedding
tokenization
Data Analysis
OpenCV
SURF
Homography Transformations
RANSAC Algo
HTML
CSS
Bootstrap
MUI
ReactJS
Django
Github
Jupyter Notebook
Google Colab
JQuery
TF-IDF Vectorizer
Nearest Neighbors Algorithm
Cosine Similarity
Selenium
SpaCy Matcher
Beautiful Soup
Nextjs
Tailwind CSS
Computer Vision Algorithms
RANSAC Algorithm
Git
Data Preprocessing