Default profile banner
VM

Vishvajit Mohite

@mohitevishvajit

ML & GenAI Engineer / Data Engineer at Tata Consultancy Services

Mumbai

Tata Consultancy ServicesG V. ACHARYA INSTITUTE OF ENGINEERING, MUMBAI

AI Engineer with 2.6 years of experience building and deploying production-grade Machine Learning and Generative AI solutions in enterprise environments. Proven expertise in end-to-end AI system development, including feature engineering, model training, evaluation, deployment, and monitoring. Delivered a production GenAI classification system using Azure OpenAI with Retrieval-Augmented Generation (RAG), achieving ~95% accuracy across 27 regulatory categories. Built and productionized ML forecasting models (Random Forest, XGBoost) achieving R² = 0.82 with MAPE < 18%, enabling improved inventory planning and data-driven decision-making. Strong AWS background with hands-on experience in scalable cloud-native deployments using Lambda, RDS, S3, and Spark-based data pipelines.

Experience

ML & GenAI Engineer / Data Engineer

Tata Consultancy Services

Aug 2023 - PresentMumbai

Led end-to-end development and production deployment of a Generative AI solution using Azure OpenAI with Retrieval-Augmented Generation (RAG) for automotive complaint classification (Client: Stellantis), achieving ~95% accuracy across 27 regulatory categories and reducing manual review effort by ~40%. Built and productionized ML forecasting models (Random Forest, XGBoost) to predict recall completion rates and parts demand, achieving R² = 0.82 with MAPE < 18%, enabling improved inventory planning and data-driven budget allocation. Designed schema-aware retrieval pipelines and prompt orchestration workflows to ensure reliable LLM-based reasoning and structured database query execution in enterprise production systems. Architected and maintained a highly available AWS RDS platform with encryption at rest/in transit, optimized indexing, and automated backups, reducing query latency by 30% under production load. Engineered scalable AWS Lambda-based data workflows integrated with RDS, S3, and Secrets Manager, achieving 99.9% SLA uptime for real-time accident and recall data processing. Developed Spark and PySpark-based ETL pipelines for large-scale data transformation and feature engine

Data Engineering Intern

Truecopy

Sep 2022 - Apr 2023Pune

Built automated data ingestion and web scraping pipelines using Python (BeautifulSoup, Scrapy), increasing data availability by 70%. Developed supervised ML models using scikit-learn, reducing manual analysis effort by 60%. Automated data preprocessing, cleaning, and validation workflows to support analytics and ML use cases.

Data Analyst Intern

Caterninja

Jun 2022 - Sep 2022Mumbai

Conducted exploratory data analysis (EDA) using FADS and GADS frameworks to identify trends and operational bottlenecks. Delivered KPI reports and dashboards that improved operational efficiency and management decision-making by 20%.

Education

G V. ACHARYA INSTITUTE OF ENGINEERING, MUMBAI

Bachelor of Engineering

Computer Science

May 2023

Licenses & Certifications

AWS Certified Data Engineer – Associate

AWS

• No expiration

GitHub Copilot Fundamentals Certification

GitHub

• No expiration

Skills

Python
SQL
Java
Apache Spark
PySpark
AWS Glue
ETL Pipelines
Data Modeling
NLP
Generative AI
Azure OpenAI
Retrieval-Augmented Generation (RAG)
Agentic Workflows
Random Forest
XGBoost
scikit-learn
AWS (RDS, Lambda, S3)
PostgreSQL
MySQL
DynamoDB
Git
CI/CD
JMeter
Agile/Scrum
Data Encryption
Secure Data Design
Application Security