PGParag Gawai
@parag.gawai
Machine Learning Intern at Vaultedge Software
Amravati, Maharashtra, India
Parag Gawai is a Machine Learning professional with an M.Tech in Earth System Science and Technology from IIT Kharagpur (9.15/10) and experience spanning ML internships at Vaultedge Software and ISRO's Space Applications Centre. At Vaultedge, he fine-tuned a Donut Multimodal Transformer achieving 93% accuracy using GPT-4o for document classification. He is skilled in Python, TensorFlow, PyTorch, NLP, generative AI, and cloud platforms including AWS and Docker, and has scored in the 95th percentile in GATE (CE) 2022.
Experience
Machine Learning Intern
Vaultedge Software
Fine-tuning, Training, and Evaluating Donut Multimodal Transformer Model. Prepared the image dataset from pdfs and tiffs using Hugging Face Build and Load in Apache Arrow format with Multiprocessing. Fine-tuned and Trained Donut Transformer for classification task with different tokenization technique for better performance. Achieved 85% accuracy and 0.41 F1 score, analyzed errors, compared with other text models. Text Extraction from documents using Different OCRs. Leveraged different types of OCRs, including Google Cloud Vision OCR and AWS Textract, to extract text from the documents. Utilized GPT-4o model to check whether the document is OSV or not. Achieved accuracy of 93% with exact match using GPT-4o.
Teaching Assistant
IIT Kharagpur
Serving as Teaching Assistant for Data Analytics Lab of CORAL Department, IIT Kharagpur by teaching around 25 students.
Data Science Intern
Space Applications Centre, ISRO
Developed a model for Satellite and Model oceanic current Data Validation with in-situ drifter data from eight large datasets. Worked on Linux Server associated with High Performance Computer to calculate Correlation Coefficient and RMSE. Performed the yearly and monthly analysis for the year 2020 for Indian Ocean, Arabian Sea, Bay of Bengal and Equatorial IO.
Education
IIT Kharagpur
M.Tech
Earth System Science and Technology
Government College of Engineering, Amravati
B.Tech
Civil Engineering
Maharashtra State Board
HSC
Computer Science
Maharashtra State Board
SSC
Licenses & Certifications
Machine Learning: Probability and Statistics
Data Analysis: Jupyter, Numpy, Pandas, Plotting, Visualization, Dimensionality Reduction, Time Series Analysis
Python: VSCode, Data Structures, Computational Complexity, OOP, Error Handling, Debugging