Default profile banner
SC

Sarsiz Chauhan

@sarsizchauhan

Assistant Manager (Data Science) at BANK OF AMERICA

Gurugram, India

https://linkedin.com/in/sarsiz-chauhan

BANK OF AMERICAGGSIPU

Sarsiz Chauhan is an experienced Data Science professional with 4 years of experience developing and implementing advanced ML solutions. His expertise includes automating complex processes, building financial cost models using Random Forest, and developing NLP applications for legal document analysis. He has a strong background in data pipeline creation, dashboard reporting, and utilizing deep learning techniques, contributing to data-driven decision-making across various industries.

Experience

Assistant Manager (Data Science)

BANK OF AMERICA

Jan 2021 - Present

Automating Loan Documents processing for different types and formats; Reducing time to check thousands of loan documents; Using Machine Learning (Random Forest) to create Financial Cost model to detect at risk Financial Centers.

Sr. Modeling Analyst (Data Scientist)

Moody’s Analytics

Jan 2018 - Jan 2021

Created a library to convert any kind of money related term($1.5M, One Million Dollar, etc) to numeric ($1,000,000). Published on Python Software Foundation. Data Scraping, saved thousands of man hours by creating and automating the data pipeline processing and storing clean information. Developed a Program to identify Money Related Entities pertaining to Legal Documents with the application of NLP. Trained a binary classification model to classify data acquired from various sources. Creating APIs (in FastAPI, Flask and CherryPy) for the model trained. Establish a data-driven culture with Dashboard reporting, visualization and analytics. Used Deep Learning to make an image captioning model to label the images if they are related to natural calamity or not. Multi-processing was implemented to speed up the task and to improve the utilization of system resources. Socialize different machine learning and data science models with Earthquake, Flood and Remote Sensing Teams. Promoting best practices in Data Management, Analytics and Machine learning.

Machine Learning Engineer

Think Future Technologies

Jan 2018 - Jan 2018

Developed a POC for Chatbot for HR. Used Tensorflow and Google NMT. Explored RASA NLU for conversational bot. Responsibilities included roadmap, development and implementation of real-time risk and pricing models, maintenance and development of analytical calculation framework.

Data Science and Big Data

Eventful-India(now SiteSutra)

Jan 2017 - Jan 2017

Written several Scripts in Python to scrap Secondary Data from hundred of websites. Data gathered was cleaned using Pandas, excel and visualized on specific parameters using matplotlib and visualization tools. Used mongodb in backend.

Education

GGSIPU

B.Tech

Computer Science and Engineering

Jan 2014 - Jan 2018Grade: 80.1%

Little Angels School

12th

Physics - Chem - Maths - English

Jan 2013 - Jan 2014Grade: 93.8%

Little Angels School

10th

Science

Jan 2011 - Jan 2012Grade: 10 CGPA

Licenses & Certifications

NLP with Python for Machine Learning Essential Training

Linkedin Learning

Issued: Jan 2020• No expiration

Introduction to Natural Language Processing in Python

Datacamp

Issued: Dec 2001• No expiration

Neural Networks and Deep Learning

Deep Learning.AI, Coursera

Issued: Aug 2001• No expiration

Machine Learning: A case study approach

Coursera

Issued: Mar 2001• No expiration

Introduction to Data Science in Python

Coursera

Issued: Jul 2001• No expiration

Big Data Specialization

Coursera

Issued: Invalid Date• No expiration

Apache Spark I, Hadoop Foundations, Hadoop Programming

IBM

Issued: Invalid Date• No expiration

Skills

Python
R
Java
C
C++
SQL Server
MongoDB
MySQL
Pandas
Matplotlib
Plotly
Dash
SpaCy
OpenCV
Flask
CherryPy
TensorFlow
Keras
Scikit-learn
Random Forest
Deep Learning
NLP
Spark
Splunk
HTML
JS
JSP