Sarsiz Chauhan
@sarsizchauhan
Assistant Manager (Data Science) at BANK OF AMERICA
Gurugram, India
Sarsiz Chauhan is an experienced Data Science professional with 4 years of experience developing and implementing advanced ML solutions. His expertise includes automating complex processes, building financial cost models using Random Forest, and developing NLP applications for legal document analysis. He has a strong background in data pipeline creation, dashboard reporting, and utilizing deep learning techniques, contributing to data-driven decision-making across various industries.
Experience
Assistant Manager (Data Science)
BANK OF AMERICA
Automating Loan Documents processing for different types and formats; Reducing time to check thousands of loan documents; Using Machine Learning (Random Forest) to create Financial Cost model to detect at risk Financial Centers.
Sr. Modeling Analyst (Data Scientist)
Moody’s Analytics
Created a library to convert any kind of money related term($1.5M, One Million Dollar, etc) to numeric ($1,000,000). Published on Python Software Foundation. Data Scraping, saved thousands of man hours by creating and automating the data pipeline processing and storing clean information. Developed a Program to identify Money Related Entities pertaining to Legal Documents with the application of NLP. Trained a binary classification model to classify data acquired from various sources. Creating APIs (in FastAPI, Flask and CherryPy) for the model trained. Establish a data-driven culture with Dashboard reporting, visualization and analytics. Used Deep Learning to make an image captioning model to label the images if they are related to natural calamity or not. Multi-processing was implemented to speed up the task and to improve the utilization of system resources. Socialize different machine learning and data science models with Earthquake, Flood and Remote Sensing Teams. Promoting best practices in Data Management, Analytics and Machine learning.
Machine Learning Engineer
Think Future Technologies
Developed a POC for Chatbot for HR. Used Tensorflow and Google NMT. Explored RASA NLU for conversational bot. Responsibilities included roadmap, development and implementation of real-time risk and pricing models, maintenance and development of analytical calculation framework.
Data Science and Big Data
Eventful-India(now SiteSutra)
Written several Scripts in Python to scrap Secondary Data from hundred of websites. Data gathered was cleaned using Pandas, excel and visualized on specific parameters using matplotlib and visualization tools. Used mongodb in backend.
Education
GGSIPU
B.Tech
Computer Science and Engineering
Little Angels School
12th
Physics - Chem - Maths - English
Little Angels School
10th
Science
Licenses & Certifications
NLP with Python for Machine Learning Essential Training
Linkedin Learning
Introduction to Natural Language Processing in Python
Datacamp
Neural Networks and Deep Learning
Deep Learning.AI, Coursera
Machine Learning: A case study approach
Coursera
Introduction to Data Science in Python
Coursera
Big Data Specialization
Coursera
Apache Spark I, Hadoop Foundations, Hadoop Programming
IBM