Default profile banner
MS

Mukut Sharma

@mukutsharma

Data Scientist at Syngene International Limited

Bengaluru, India

Syngene International Limited

Mukut Sharma is an experienced Data Scientist skilled in developing and implementing advanced machine learning and deep learning models. He possesses expertise in Natural Language Processing, graph data analysis, and handling complex multi-regression problems. Proficient in Python, AWS services, and various ML frameworks, he has successfully applied these skills in projects ranging from mass spectra prediction to building Question Answering systems.

Experience

Data Scientist

Syngene International Limited

•Aug 2023 - Present•Bengaluru, IN

Implementation of customized neural network architectures to optimize model performance. Model building for multi-regression problems, specifically adept at handling scenarios with approximately 1000 input variables and 500 output variables. Implementation of machine learning and deep learning models derived from research papers and evaluating the results. Graph data analysis and proficient in evaluating neural network models using diverse graph embedding techniques for enhanced insights and performance optimization.

Data Scientist

MinionLabs

•Mar 2022 - Jul 2023•Bengaluru, IN

Worked on challenging fundamental Artificial Intelligence problems in areas of Machine Learning, Deep Learning, Natural Language Processing. Used machine learning tools to select features, create and optimize classifier. Processed, cleaned, and validated the integrity of data to be used for analysis. Written automated scripts to extract data from different sources. Used statistical tools to identify, analyse, and interpret patterns and trends in large data sets that could aid in diagnosis and prediction.

Data Analyst and Scientist

RedTechno

•Jan 2021 - Feb 2022•Jaipur, IN

Performed exploratory data analysis to uncover trends, patterns, and insights. Worked on extracting meaningful features from diverse datasets to improve model accuracy. Utilized tools like Matplotlib, Seaborn, or Tableau to present findings in a clear and understandable manner. Documented the entire data science process, including data collection, preprocessing steps, model development, and evaluation metrics. Engaged in mentorship opportunities and seek guidance to enhance skills and knowledge in the field of data science.

Skills

Python
scikit-learn
numpy
pandas
matplotlib
tensorflow
keras
Linear Regression
Logistic Regression
K-Nearest Neighbors
Ensemble Methods (Boosting, Bagging, Stacking)
Random Forest
Gradient Boosting
SVM
K-Means
DBSCAN
PCA
T-SNE
UMAP
Word Embedding
Transformer
BERT
Neural Network
Recurrent Neural Network
LSTM
CNN
Langchain
Large Language Model
Data science pipeline (cleansing, wrangling, visualization, modeling, interpreta
Statistics
Time series
OOP
APIs
Github
Linux
AWS
IoT (Raspberry-pi)
Excel
SQL
Apache Cassandra (vector Db)
Faiss
Llama 2
Hugging Face Transformers
Gemini
OpenAI API
Claude
S3 Bucket
Lambda
Bedrock
API Gateway