Kashifa Naseem is an experienced Data Scientist skilled in Data Science and Analytics using Python, NumPy, Pandas, Scipy, SKlearn, Keras, and Transformers. She has expertise in developing machine learning models for classification, regression, and clustering, utilizing frameworks like TensorFlow. Her experience includes building complex pipelines using Airflow and working with databases such as MongoDB and Hive. She holds an M.Tech from AMU, Aligarh, specializing in scientific computing and data-driven modeling.
Experience
Associate Data Scientist
Optum (UHG)
Developed behavioral and risk models using Kmeans and PCA. Built machine learning models using Logistic regression, CatBoost, and LGBM for predicting member lapse and optimizing call rates. Managed data processing using Hive and performed feature engineering and data analysis on highly imbalanced datasets.
Jr. Data Scientist
Qualitics-Philips Healthcare
Developed an NLP project to classify product reviews into multiple categories. Built an ETL process to ingest data from APIs and store it in MongoDB. Improved text classification accuracy using ensemble techniques and added topic modeling for deeper insights.
Trainee
BitsActive System
Worked on a deep learning project using TensorFlow and Keras to predict building energy consumption. Created visualizations using Matplotlib and saved the model for integration into Android applications.
Education
AMU, Aligarh
Master’s degree
Chemical Engineering (Process Modeling and Simulation)
M.J.P Rohilkhand University
Bachelor's degree
Chemical Engineering
IBM
Data science certificate
Data science
Licenses & Certifications
Data science certificate
IBM