Chidananda AV

@chidanandaav

ML Software Engineer at ULTRIA

Bangalore, KA

www.linkedin.com/in/chidanandav55

ULTRIAREVA UNIVERSITY

Chidananda is an experienced Machine Learning Engineer with 3 years of experience building successful ML/DL algorithms and predictive models. He is highly adept at core NLP tasks, including Classification, Clustering, and NER, utilizing transformer-based language models and embeddings. His background includes end-to-end problem solving in NLP product startups, from design and data collection to production deployment.

Experience

SOFTWARE ENGINEER - ML

ULTRIA

•Oct 2019 - Nov 2021

Collected, cleaned and built a huge corpus of legal data. Using this trained In-house word2vec, fasttext embeddings and fine tuned ELMO, BERT. Using these instead of original embeddings resulted in huge improvements in accuracy across all models. Extensively worked on contextual sentence similarity. Experimented with SIF, Siamese networks, USE, Sent2Vec, SentBERT. Built and deployed >10 NER models to extract multiple entities(parties, values, timelines) from various parts of documents. Built and deployed a classifier to identify and link master agreements and their child agreements(addendum, amendment). Experimented with ELMO, BERT, RoBERTa, DistillBERT and achieved a robust solution with >90% accuracy. Built an asynchronous service to automatically train an NER model. The Intention was to allow anyone to build an NER model instantly. Only annotated data was to be provided by the user in an excel and the service would train, tune and save the model.

ASSOCIATE SOFTWARE ENGINEER - ML

ULTRIA

•Jun 2019 - Sep 2019

Used Attention mechanism to improve >300 class paragraph classifier. The approach resulted in >20% increase in accuracy while making the model more robust to OCR errors. Built and deployed a model to identify Intent, liabilities and risks present in the contract. Used unsupervised approach to solve the problem with recall value of ~90% with almost no annotated data available. Extensively worked on document clustering building many models for multiple clients using K-means, KNN, GMM etc., further converted this into a service with sufficient flexibility to decrease redundancy.

DATA SCIENCE INTERN

ULTRIA

•Jan 2019 - May 2019

Built a POC RNN based language model from scratch to predict the next probable phrase based on a user prompt to help draft contracts easily, the language model was trained using > 100k contracts consisting of ~500M tokens. Built and deployed many multiclass document classifiers for Ad Hoc customer requests for easy document processing. Used algorithms such as SGD, SVM, Naive bayes, LR, multinomial depending on effectiveness.

Education

REVA UNIVERSITY

BTech

Computer Science

Jan 2015 - Jan 2019•Grade: GPA: 8.5 / 10

Licenses & Certifications

Machine learning from Stanford

Coursera

• No expiration

Deep learning Specialization

Coursera

• No expiration

IBM DS professional certification

IBM

• No expiration

Skills

Python

Linux

Gitlab

Regex

Matlab

Tableau

SQL

Git

DVC

Dataturks

SkLearn

NumPy

Pandas

Flask

SciPy

NLTK

Matplotlib

PyTesseract

Tensorflow

Keras

PyTorch

Flair

Spacy

Gensim

FastText

HuggingFace

Linear/Logistic Regression

Classification

Clustering

NER

LSTM

CRF

Language Models

CNN

Object detection

Recommendation system

SVM

Naive Bayes

Decision Tree

Random Forest