Default profile banner
JK

Jyotirmay Khavasi

@jyotirmaykhavasi

Data Science Intern at Wolters Kluwer

Pune, India

Wolters KluwerVishwakarma Institute of Information Technology

Jyotirmay Khavasi is a data science professional with experience in optimizing deep learning models and developing advanced AI solutions. His expertise includes working with LLM agents, RAG techniques, and various deep learning frameworks like PyTorch. He has practical experience in NLP, image processing, and building full-stack web applications using Django and FastAPI.

Experience

Data Science Intern

Wolters Kluwer

Jan 2024 - PresentPune, India

Optimized production-deployed Vision Encoder-Decoder and Layout Transformer models through batch classification and data augmentation techniques resulting in a 4% improvement across all metrics. Converted models to ONNX format for a 40% reduction in evaluation time for clients. Involved in Fine-Tuning Layout Transformer models for classification and entity recognition on custom data, enabling accurate multi-class classification and precise extraction of information across diverse documents. Developed an OpenCV based Table Detection pipeline utilizing object detection, image segmentation, morphological operations, contour extraction and coordinate geometry to identify, segment and reconstruct tables from PDFs, enabling accurate extraction of tabular data. Building domain specific LLM agents using Advanced RAG techniques with Qdrant VectorDB through Llama-Index framework to provide accurate and contextually relevant responses on custom data.

Open Source Contributor

Google Summer of Code @PyTorch-Ignite

May 2023 - Sep 2023Remote

Created a template for Reinforcement Learning using Advantage Actor Critic algorithm, configuring code to efficiently utilize parallel processing for spawning multiple environments and handling various Reinforcement Learning tasks, using interchangeable and flexible components of TorchRL. Researched methods to efficiently isolate and segment relevant video sections frame by frame which enabled to extract high-quality input data for the RL model. Enhanced CI/CD pipelines with GitHub workflows and enabled Docker containerization. Improved the configuration by centralizing YAML attributes, allowing command-line overrides, and helped in integrating with Google Fire and Hydra. Implemented code refactoring with Vue.js tags, enabling template inheritance and resulting in more than 1000-line reduction across templates.

Research Intern

HCL Technologies

Sep 2022 - Mar 2023Pune, India

Worked on a NLP Model which extracts Rules from given Text. Software converts and Processes Given Text to output the Mathematical Expressions of the Rules. Worked on custom BERT architecture which is used for Named Entity Recognition with a Recall of 88%. Employed Random Forests for Classification of a given sentence into ’Rule’ or ’not Rule’ with 92% Accuracy.

Education

Vishwakarma Institute of Information Technology

B.Tech

Artificial Intelligence and Data Science

Jan 2020 - Jan 2024Grade: CGPA: 9.34

Relevant Coursework: Discrete Mathematics, Probability and Statistics, Data Structures, Artificial Intelligence, Operating Systems, Database Management, Software Engineering, Cloud Computing.

Skills

C++
Python
SQL
Git
Docker
Vim
Shell
LATEX
VS Code
MySQL
PyTorch
Machine Learning
Image Processing
Deep Learning
Natural Language Processing
MongoDB
Neural Networks
Torchvision
Dockerfile
FastAPI
REST API
LLaMA-Index
Langchain
RAG
VectorDB
Streamlit
Leadership
Mentorship
Communication
Analytical Thinking
Teamwork
Problem Solving