Default profile banner
VM

VIVEK MANOHARAN

@vivekmanoharan

Machine Learning Engineer at Tata Consultancy Services

Chennai, TN

Tata Consultancy Services – Google TVCSASTRA University

Machine learning Engineer with 6 years of experience in developing computer vision solutions and web applications. Proficient in Python, Image classification, Objection detection, Image processing and Web application development.

Experience

Machine Learning Engineer

Tata Consultancy Services – Google TVC

Dec 2021 - Present

DocumentAI Workbench [Python, GCP, DocumentAI, Flask, Apache Beam, Protobuf] - Developed parsers for identity documents, lending and tax forms using DocumentAI’s custom document extractor. - Added feature improvements to fake document synthesis pipeline and template creation application - flask based application hosted on Google App Engine which creates templates for creating synthetic documents. - Ran experiments to demonstrate the effect of models trained using synthetic documents vs real labeled documents, achieved a F1 score of 0.86 using noise added synthetic documents.

Machine Learning Engineer

Tata Consultancy Service – Bank of America TVC

Dec 2018 - Nov 2021

Document Extraction Solution [Python, Tensorflow, CNN, Faster R-CNN, SSD, SpaCy, OpenCV, Tesseract OCR, Flask, Docker] - Led design and development of entity extraction solution for tabular structured data using objection detection, image processing, NLP and template based approaches. The solution extracted entities from 70+ document variations with an accuracy of 0.85 which reduced the manual effort by 35%. - Developed multiple ner models for extracting non-tabular structured data from 20+ documents. The models were exposed as REST APIs using TF Serving. - Developed CNN based image classifier to identify different formats of documents, cheques and summary pages. - Wrote image pre-processing scripts to clean documents and improve data capture with OCR. - Set up an on-prem application pipeline by containerizing the modules and exposing them as REST APIs. - Developed a labeling tool for ner models leveraging Tesseract OCR’s hOCR values.

DotNet Developer

Tata Consultancy Services – United Airlines TVC

Sep 2016 - Nov 2018

Ticketing and User Management Applications [C#, ASP.NET MVC, SQL Server] - Migrated application features from windows-based application to web application. - Involved in end-to-end SDLC, integrated new application features and implemented bug fixes. - Involved in production support and incident management, L3 support.

Education

SASTRA University

Bachelor of Technology

Electrical and Electronics Engineering

Jan 2012 - Jan 2016

Skills

Python
C#
C++
Javascript
SQL
Flask
Tensorflow
Scikit-learn
Keras
Numpy
Pandas
Apache Beam
SpaCy
GCP
KubeFlow
ASP.NET
Docker
Git
Linear Regression
Logistic Regression
Decision Trees
Random Forest
SVM
KNN
Neural Networks
CNN
Image Classification
Object Detection
Named Entity Recognition