Vishvajit Mohite
@mohitevishvajit
ML & GenAI Engineer / Data Engineer at Tata Consultancy Services
Mumbai
AI Engineer with 2.6 years of experience building and deploying production-grade Machine Learning and Generative AI solutions in enterprise environments. Proven expertise in end-to-end AI system development, including feature engineering, model training, evaluation, deployment, and monitoring. Delivered a production GenAI classification system using Azure OpenAI with Retrieval-Augmented Generation (RAG), achieving ~95% accuracy across 27 regulatory categories. Built and productionized ML forecasting models (Random Forest, XGBoost) achieving R² = 0.82 with MAPE < 18%, enabling improved inventory planning and data-driven decision-making. Strong AWS background with hands-on experience in scalable cloud-native deployments using Lambda, RDS, S3, and Spark-based data pipelines.
Experience
ML & GenAI Engineer / Data Engineer
Tata Consultancy Services
Led end-to-end development and production deployment of a Generative AI solution using Azure OpenAI with Retrieval-Augmented Generation (RAG) for automotive complaint classification (Client: Stellantis), achieving ~95% accuracy across 27 regulatory categories and reducing manual review effort by ~40%. Built and productionized ML forecasting models (Random Forest, XGBoost) to predict recall completion rates and parts demand, achieving R² = 0.82 with MAPE < 18%, enabling improved inventory planning and data-driven budget allocation. Designed schema-aware retrieval pipelines and prompt orchestration workflows to ensure reliable LLM-based reasoning and structured database query execution in enterprise production systems. Architected and maintained a highly available AWS RDS platform with encryption at rest/in transit, optimized indexing, and automated backups, reducing query latency by 30% under production load. Engineered scalable AWS Lambda-based data workflows integrated with RDS, S3, and Secrets Manager, achieving 99.9% SLA uptime for real-time accident and recall data processing. Developed Spark and PySpark-based ETL pipelines for large-scale data transformation and feature engine
Data Engineering Intern
Truecopy
Built automated data ingestion and web scraping pipelines using Python (BeautifulSoup, Scrapy), increasing data availability by 70%. Developed supervised ML models using scikit-learn, reducing manual analysis effort by 60%. Automated data preprocessing, cleaning, and validation workflows to support analytics and ML use cases.
Data Analyst Intern
Caterninja
Conducted exploratory data analysis (EDA) using FADS and GADS frameworks to identify trends and operational bottlenecks. Delivered KPI reports and dashboards that improved operational efficiency and management decision-making by 20%.
Education
G V. ACHARYA INSTITUTE OF ENGINEERING, MUMBAI
Bachelor of Engineering
Computer Science
Licenses & Certifications
AWS Certified Data Engineer – Associate
AWS
GitHub Copilot Fundamentals Certification
GitHub