Default profile banner
AR

Ashish Rawat

@ashishr

Data Scientist | Researcher

Bangalore, India

ashish-rawat

KOANTEKJAMIA MILIA ISLAMIA

Ashish Rawat is a Data Scientist and Researcher with over three years of experience in advanced analytics. He possesses hands-on expertise in machine learning, NLP, and predictive modeling, utilizing tools like Python, PySpark, Azure, and AWS. His experience includes developing LLM applications, time series forecasting, and implementing complex anomaly detection systems.

Experience

Data Scientist

KOANTEK

Full-timeSep 2021 - PresentBangalore, India

LLM | SOLIS – Fine-tune, quantize, CPU inference. Refine “Wizard Vicuna Model” on Dolly Dataset, apply PEFT quantization for smaller size, and perform CPU inference. Implementation of Langchain to the prompt engineering. Employed the vector database (“Chroma DB and FAISS”) to store the vector embeddings for faster retrieval. Forecasting| Solis – Employed, Deployed and served Time series Model. Forecast daily maximum concurrently active devices for next fifteen days to aid better planning in Japan market for a portable wi-fi solution provider. Forecast daily internet data usage for next fifteen days to aid better planning in Japan market for a portable wi-fi solution provider. Heimdall Co-relation | Fluid – Build and optimize Databricks pipeline to find similarity and word correlation between Google searches in different states in the US for Marketing Start-up. Calculate the correlation on the digital marketing data. Performed EDA Hypothetical testing to understand the traffic behavior w.r.t time and other variables (daily/hourly). Propensity and Sentiment Modelling – Employed re-trainable sentiment and Propensity model for client. Build a lookalike audience based on propensity, on a per-client basis, and automatically. Also, employed the sentiment model with state of the art techinques such as xlnet, albert, robert, distillbert models to capture the sentiment of the auidience. Incremental Return on AdSpend | Swiftly Systems – Contributed in the delivery of a full-fledged pipeline for analyzing campaign lift for improving the incremental returns. Conducted 2 tests (two-tailed and right-tailed) for incremental lift analysis. Built K-means clustering model for 500K+ users, documented code, and generalized for 250+ campaigns. Anomaly Detection on orders data | Casella Waste Systems – Tested with LSTM Autoencoder with 50 K parameters and Isolation Forest for Unsupervised Anomaly Detection. Implemented SHAPLEY values over Isolation Forest model to infer the model and unde

Researcher

NLP RESEARCH GROUP

ResearchApr 2021 - PresentVirtual, Online

Drug Adverse Event Detection Using Text-Based Convolutional Neural Networks (TextCNN) Technique.

DATA SCIENTIST

TATA CONSULTANCY SERVICES

Full-timeJul 2019 - Aug 2021Noida, India

Safety Literature Review (SLR) - Tool for helping life sciences organisations meet their regulatory requirements. Competitive Intelligence Platfrom (CIP) - Pharma competitive intelligence tool: Gather, analyze, leverage market data. Medical Literature Review (MLR) - Multi-market compliance tool: Validate logos, grammar, claims, pharma content in assets.

DATA SCIENCE INTERN

TATA CONSULTANCY SERVICES

InternshipJan 2019 - Mar 2019Noida, India

ETL process analyzes EMR, insurance data for incidence/prevalence insights. Optimize sales revenue with promotional modeling across channels. Drug sentiment analysis on tweets using Twitter API and NLP tools.

Education

JAMIA MILIA ISLAMIA

MSC, MATHEMATICS & CS

Mathematics and Computer Science

Jan 2017 - Jan 2019

Minor Project: Object Detection through Deep learning implemented in Python

DELHI UNIVERSITY

BSC, MATHEMATICS (HONORS)

Mathematics

Jan 2014 - Jan 2017

Skills

Python
iPython Notebook
Pyspark SQL
Azure
Aws
Databricks
Google cloud platform
Propensity
Regression
Customer Churn
Natural Language Processing
Support Vector Machines
Clustering
Lift Analysis
Topic Modelling
LSTM
CNN
RNN
Image Analytics
OCR
Large Language Model
OPEN AI
Chat-gpt
Hugging Face
Transformer
Quantization models
PEFT
PostgreSQL
MongoDb
Jira
Git
Github
Airflow
Delta table