Kashifa Naseem

@kashifanaseem

Associate Data Scientist at Optum (UHG)

New Delhi

LinkedIn

Optum (UHG), IBM

Kashifa Naseem is an experienced data scientist who works in Python with NumPy, Pandas, SciPy, scikit-learn, Keras, and Transformers. She develops machine learning models for classification, regression, and clustering, using frameworks such as TensorFlow. Her experience includes building complex pipelines with Airflow and working with databases such as MongoDB and Hive. She holds an M.Tech from AMU, Aligarh, specializing in scientific computing and data-driven modeling.

Experience

Associate Data Scientist

Optum (UHG)

•Feb 2022 - Present•New Delhi

Developed behavioral and risk models using K-means and PCA. Built machine learning models using logistic regression, CatBoost, and LightGBM for predicting member lapse and optimizing call rates. Managed data processing using Hive and performed feature engineering and data analysis on highly imbalanced datasets.

Jr. Data Scientist

Qualitics-Philips Healthcare

•Jul 2021 - Feb 2022

Developed an NLP project to classify product reviews into multiple categories. Built an ETL process to ingest data from APIs and store it in MongoDB. Improved text classification accuracy using ensemble techniques and added topic modeling for deeper insights.

Trainee

BitsActive System

•Jan 2018 - Apr 2021•Bangalore

Worked on a deep learning project using TensorFlow and Keras to predict building energy consumption. Created visualizations using Matplotlib and saved the model for integration into Android applications.

Education

AMU, Aligarh

Master’s degree

Chemical Engineering (Process Modeling and Simulation)

Jan 2015 - Jan 2018

M.J.P Rohilkhand University

Bachelor's degree

Chemical Engineering

Jan 2010 - Jan 2014

IBM

Data science certificate

Data science

Jan 2019

Licenses & Certifications

Data science certificate

IBM

Issued: Jan 2019• No expiration

Skills

Python
SQL
MATLAB
PySpark
Airflow
Pandas
NumPy
SciPy
Scikit-learn
Keras
TensorFlow
Transformers
SHAP
MongoDB
Hive
Selenium
Matplotlib
Seaborn
ggplot
Classification
Regression
Clustering
NLP
ETL
CatBoost
LightGBM