Default profile banner
UD

UTSAV DATTA

@utsavdatta

AI Engineer at Edgeverve Systems (An Infosys Company)

Howrah, India

https://www.linkedin.com/in/utsav-datta/

Edgeverve Systems (An Infosys Company)The University of Sydney

Utsav Datta is an AI Engineer with 14 years of experience in the IT industry. He possesses expertise in software product design and development using Python and Deep Learning. His skills include Azure ML, PySpark, and advanced tools like Azure Cognitive Services and Docker. He has robust experience in end-to-end machine learning, NLP, and business intelligence solutions.

Experience

AI Engineer

Edgeverve Systems (An Infosys Company)

May 2020 - Present

Achieved 85% accuracy in ML based spend classification for a consumer goods company’s procurement workflow to quantify and improve their savings opportunities. Achieved over 40% reduction in redundant/duplicate data using Locality Sensitive Hashing and Jaccard distance techniques among others to reduce training data labelling effort. Designed and productionized a supplier news analytics module that included news categorization, sentiment analysis, topic modelling and entity recognition using an ensemble of Deep Learning techniques for a proprietary procurement management product. Designed / developed / productionized a proprietary product which extracts & formalizes unstructured information into knowledge representation using Python and Deep Learning models. Key role in improving product efficiency by transforming concepts from Deep Learning research papers into usable features.

ML Engineer

Cognizant Technology Solutions

Jan 2014 - Jan 2019

Responsible for designing & development of an end-to-end machine learning model to predict outcome of new contract proposals for a US based print and digital document seller with baseline accuracy of 80%. Integral role in improving baseline accuracy of the above-mentioned model by 4% using Box Cox transformation, Gradient Boosting algorithm and Voting classifiers from Scikit-Learn. Engaged in creation of an end-to-end natural language processing system to identify and extract relevant portions from a large corpora of customer grievance emails for a Switzerland based insurance provider. Successfully achieved 70% F1-score (5% more than client expectation) using spaCy, NLTK, fastText, LSTM and Python regular expressions. Accomplished 50% faster comprehension of the above-mentioned emails thereby improving the overall turnaround time of the system. Instrumental in implementing a continuous model improvement pipeline in production to track system efficiency; gathered training data over time using qualitative & quantitative methods. Achieved optimum balance between performance, scalability and cost of production deployments of training / inferencing pipelines using both closed source (Azure Machine Learning compute clusters, Azure Kubernetes services) and open source (Docker, fastAPI) technologies. Established trackable and sustainable pipelines of above-mentioned ML models utilising Azure ML, Databricks and ML flow technologies.

BI Developer

Cognizant Technology Solutions

Dec 2010 - Dec 2013

Successfully created a business intelligence semantic models using SQL Server Analysis Services (SSAS) Tabular to facilitate advanced analytics on retail data. Involved in migration of 450+ reports to Power BI from legacy systems. Served as a Team leader (comprising of 5 team members), onsite coordinator and SPOC for US and APAC clients.

ETL Developer

Tech Mahindra

Jun 2006 - Nov 2010

Developed Oracle PL/SQL packages procedures for ETL of telecom inventory data from legacy source systems to a modern data warehouse to support better service assurance.

Education

The University of Sydney

Master Of Data Science

Jan 2020

Maulana Abul Kalam Azad University of Technology

Bachelor of Technology

Computer Science & Engineering

Jan 2006

Licenses & Certifications

Microsoft Certified: Azure Data Scientist Associate

Microsoft

• No expiration

Skills

SQL
Python
TensorFlow
Keras
Azure ML SDK
Scikit-Learn
PySpark
Pandas
Numpy
Scipy
OpenCV
Spacy
NLTK
Hugging Face
Azure Cognitive Services
Azure Machine Learning
Azure Databricks
Docker
Power BI
Git
Artificial Intelligence
Natural Language Processing
Business Intelligence