Default profile banner
SS

Shaik Shaheer

@shaikshaheer

Data Scientist Trainee

Hyderabad

https://github.com/shaikshaheer123

Innomatics Research labsCrescent University

Shaik Shaheer is a Data Scientist Trainee with practical experience in developing advanced machine learning and Generative AI solutions. His expertise includes sentiment analysis, building RAG systems using LangChain and Gemini 1.5 Pro, and deploying models on AWS. He is proficient in Python, TensorFlow, and various statistical methods, applying skills across data visualization, predictive modeling, and NLP.

Experience

Data Science Intern

Innomatics Research labs

InternshipJan 2024 - May 2024Hyderabad

• Sentiment Analysis on Product Reviews with MLflow Integration and Prefect Workflow • Implemented advanced data preprocessing techniques for training machine learning models, utilizing F1-Score for sentiment classification evaluation, alongside MLflow for model registration and management, and integrated Prefect Workflow for automated scheduling and execution of sentiment analysis tasks. • Deployed the trained sentiment classification model integrated with a Flask web application on an AWS EC2 instance, enabling real-time inference and user interaction. • AI Code Reviewer App • Led the development of Generative AI App - AI Code Reviewer, a Python application offering comprehensive feedback on code submissions, featuring a user-friendly interface built with Streamlit for seamless interaction. Engineered an efficient system utilizing the OpenAI API for advanced code review, enabling automated analysis of code for bugs and errors. • System on “Leave No Context Behind” Paper using LangChain • Created a RAG (Retrieval-Augmented Generation) system leveraging the LangChain framework, integrating Gemini 1.5 Pro LLM capabilities with external data for enhanced information retrieval and generation, and connecting LLMs with company-specific data from PDFs and text files for improved applications and user experiences.

Education

Crescent University

B.Tech

Jan 2019 - Jan 2023

Licenses & Certifications

Uber Fare Prediction and Analytics

Innomatics Research Lab

Issued: Invalid Date• No expiration

Data Analysis Hackthon on IPL Dataset

Innomatics Research Lab

Issued: Invalid Date• No expiration

6-Hours Hackathon on Building a Movie Recommendation System

Innomatics Research Lab

Issued: Invalid Date• No expiration

Skills

Python
Matplotlib
Seaborn
MySQL
Scikit-learn
Tensorflow
Open-CV
ANN
CNN
RNN
LSTM
GRU
Pandas
Numpy
Power-BI
KNN
Naive Bayes
Decision Tree
Linear Regression
Logistic Regression
SVM
Ensemble
OpenAI API
LangChain
RAG
NLP
Vector Database