Default profile banner
KS

Kamran Shaikh

@kamranshaikh

Data Scientist

Pune, MH

Infosys Ltd.International Institute of Information Technology, Bangalore

Kamran Shaikh is a data professional with approximately 2 years of experience as a Data Scientist. He is skilled in Data Analysis, Statistics, and Data Visualization, utilizing tools like Python and SQL. His expertise covers Machine Learning, Deep Learning, and various statistical modeling techniques, including improving lead conversion rates by 10% for an EdTech company.

Experience

Technology Analyst

Infosys Ltd.

Performed EDA using Univariate, Multivariate data analysis using various Python libraries like Pandas, Numpy. Data Visualization using tools like matplotlib and Plotly. Worked on building various Machine Learning models for predicting the potential leads of an EdTech Company. Investigated the data using various EDA techniques for generation of initial insights. Handled data for missing values and outliers. Performed pre-processing of data to check for multicollinearity. Built various machine learning models including Logistic Regression, Decision Tree, Random Forest, Adaptive Boosting. Selected best model based on evaluation metrics. Improved the lead conversion rate by almost 10%. Involved in Software Development work utilizing XML scripts and SQL queries.

Systems Engineer

Infosys Ltd.

Education

International Institute of Information Technology, Bangalore

PGD in Data Science and Machine Learning

Data Science and Machine Learning

Walchand Institute of Technology, Solapur

Bachelor of Engineering

Licenses & Certifications

Infosys Certified Data Science using Python Professional

Infosys

• No expiration

Data Toolkit for Data Science

Infosys

• No expiration

Infosys Global Agile Developer

Infosys

• No expiration

Skills

Python
Pandas
NumPy
Scikit-Learn
Matplotlib
Seaborn
Statistical Modelling
Exploratory Data Analysis
Data Preprocessing
Feature Engineering
PCA
Predictive Modeling
Machine Learning
Linear Regression
Logistic Regression
Ridge & Lasso Regression
Decision Trees
Random Forests
Adaptive Boosting
Gradient Boosting
Time Series Forecasting
ARIMA
K-Means Clustering
Deep Learning
Jupyter Notebook
SQL Developer
Google Colab
JIRA
MS Excel
Power BI
SQL