Default profile banner
AS

Ankit Srivastava

@ankitsrivastava

Senior Data Scientist at Velocitai Digital Pvt Ltd

Gurugram, India

https://linkedin.com/in/ankit-srivas

Velocitai Digital Pvt LtdUniversity of Hamburg (UHH)

Ankit Srivastava is a Senior Data Scientist with over 9 years of experience in Machine Learning, Deep Learning, and software development. He specializes in the complete software development lifecycle, from requirement analysis to the deployment of robust analytical models. Ankit has a proven track record of leading data science teams and implementing advanced AI solutions across various industries.

Experience

Senior Data Scientist

Velocitai Digital Pvt Ltd

•Apr 2023 - Present•Gurugram, India

Lead DS team, providing mentorship, & overseeing project lifecycles from problem definition to deployment. Lead weekly team meetings to align objectives, discuss ongoing projects, & strategize improvements. Scraped, cleaned, analyzed, & fine-tuned LLMs utilizing PEFT (LoRA) for various applications of our product including Sentiment Analysis, Aspect Analysis, Spam Detection, Topic Modelling, & Text Summarizing. Developed natural language query to SQL using langchain function calling & utilized vector DB to search context. Utilized RAG map-reduce technique for summarizing topics and providing an overall summary for queried texts.

Research Data Scientist

WeClapp SE

•Jul 2022 - Jan 2023•Kitzingen, Germany

Led DS team & spearheaded ML-driven solutions to automate manual processes, and enhance software efficiency. Developed model for the Catalogue Automation (182 categories), utilizing XGBClassifier (fit on tabular data with sentence embeddings & Word2Vec for description) to categorize products with 92% accuracy, and 82.3% F-Score. Analyzed data for Item Return Shipment, employing predictive modeling techniques to anticipate returned items.

Research Assistant (AI)

University of Hamburg

•Jul 2020 - Apr 2022•Hamburg, Germany

Collaborated with the NDR news channel team to conduct multi-class (9 classes) classification on their news website, achieving accuracy 93% & F-Score 82% using a Neural Network-based approach. Led the development of a multimodal application for second language learners, integrating complex word identification models by fine-tuning RoBERTa, resulting in a model accuracy of 95% and an F-Score of 87%. Played a key role in data collection, feature engineering, and model development, demonstrating expertise in data science techniques and advanced neural network architectures.

Software Engineer - Data Science

Infor (India) Pvt Ltd

•Nov 2016 - Sep 2018•Bengaluru, India

Developed and maintained Hospitality Property Management System from requirements analysis to development. Analyzed data & developed a machine learning model for Customer Market Segmentation using clustering algorithms (K-means, DBSCAN, HAC), achieving a Silhouette Score of 0.57.

Senior Software Engineer

JK Technosoft Pvt Ltd

•Mar 2013 - Nov 2016•Noida, India

Managed client interactions (from USA/UK) and conducted comprehensive requirement analysis. Worked as full-stack developer using Progress 4GL, delivering new features & optimizing legacy code. Successfully resolved a decade-old issue stemming from legacy code, showcasing problem-solving skills. Mentored junior developers and provided onboarding training for experienced team members.

Software Developer

DSI Software Pvt Ltd

•Sep 2012 - Mar 2013•Noida, India

Developed an E-Commerce web application prototype for client utilizing Java and Oracle Database.

Education

University of Hamburg (UHH)

Master of Science (MSc.)

Intelligent Adaptive Systems (AI/ML)

Oct 2018 - Apr 2020•Grade: 1.8

Dr. MGR Educational & Research Institute

Bachelor of Technology (BTech.)

Computer Science & Engineering

Aug 2008 - Aug 2012•Grade: 8.2

Licenses & Certifications

Fundamentals of Machine Learning on AWS

Pluralsight, India

Issued: Jan 2023

Agile Methodology (Scrum)

JK Technosoft, India

Issued: Jan 2015

Oracle Certified Professional Java Programmer

Oracle, India

Issued: Jan 2011

Skills

Python
SQL
PostgreSQL
MySQL
Oracle
Vector Search
Pinecone DB
Progress 4GL
Java
Natural Language Processing (NLP)
Supervised Learning
Unsupervised Learning
Deep Learning
Generative AI (GenAI)
RAG
PEFT (LoRA)
Boosting
Bagging
RNN
PCA
Reinforcement Learning
Git
Docker
AWS (ECR, ECS, Lambda, Eventbridge, S3 bucket, CloudWatch)
VS Code
Notebook
LangSmith
Postman
PyCharm
Tableau
Jira
Confluence
Pandas
GeoPandas
NumPy
Transformers
LangChain
Scikit-Learn
XGBoost
Spacy
Gensim
NLTK
FastAPI
PyTorch
Keras
Tensorflow
Matplotlib
Seaborn
Optuna