Default profile banner
DD

Dipika Dhara

@dipikadhara

Data Scientist

Hyderabad, India

www.linkedin.com/in/dipika-dhara-70926a7b

Kasmo Technologies Pvt. Ltd.Maulana Abul Kalam Azad University of Technology

Dipika Dhara is a Data Scientist with 2+ years of experience specializing in Machine Learning and Data Science. She has a proven track record of developing complex ML/DL solutions, including building multilingual chatbots using LLMs (AWS Bedrock, Langchain) and developing predictive models like Customer Churn Prediction using PySpark and XGBoost. Her technical expertise spans Python, AWS services (S3, Sagemaker), PySpark, and deep learning frameworks like TensorFlow and Keras.

Experience

Lead Developer

Kasmo Technologies Pvt. Ltd.

Project•Apr 2023 - Present•Hyderabad, India

Developed a document-based Multilingual conversational chatbot using Large Language Model (LLM). Worked as lead developer, contributing to AWS AL/ML services integration, building the chain, implementing Zero-Shot prompt, and Streamlit UI.

PhD Scholar / Assistant Professor

Maulana Abul Kalam Azad University of Technology

Academic•Jan 2019 - Present

Taught subjects including Object Oriented Programming with python, Automata Theory, Basic Big Data Analytics, Deep Learning, Artificial Neural Network, C++, and Image Processing. Also involved in continuous monitoring of students’ projects.

Data Scientist

CloudSEK

Project•Jul 2023 - Dec 2023

Developed PySpark pipelines to extract data from Salesforce CRM to Databricks using AWS S3. Performed Feature Engineering, sampling, and built ML models (Logistic Regression, XGBoost) to predict customer churn. Used MLflow for end-to-end model tracking.

Data Engineer

PRESCINTO

Project•Apr 2023 - Jun 2023

Developed a Data Migration tool using PySpark pipelines to extract data from various database servers (MySQL, Oracle, Greenplum, Postgres) and load it to cloud servers (Snowflake or Redshift). Used S3 as intermediate storage.

ML Developer

Internal

Project•Apr 2023 - Jun 2023

Developed a sophisticated supervised learning model to precisely identify tumor regions using Mask-RCNN augmented with COCO transfer learning. Leveraged VGG Image Annotator for precise bounding box annotation.

Junior Research Fellow

Govt. College of Engineering and Ceramic Technology

Research•Apr 2019 - Jul 2019

Worked on microstructural evaluation of sintered ceramic materials using image processing.

Education

Maulana Abul Kalam Azad University of Technology

Master of Technology (Post Graduate)

Information Technology

Jan 2016 - Jan 2018

Maulana Abul Kalam Azad University of Technology

Bachelor of Technology (Graduate)

Computer Science & Engineering

Jan 2010 - Jan 2014•Grade: CGPA 7.86

Licenses & Certifications

Databricks Data Engineer Associate

Databricks

• No expiration

Snowpro Core

Snowflake

• No expiration

Snowpro Advanced - Data Scientist

Snowflake

• No expiration

Skills

Python
Java
C
Shell Script
SQL
Oracle
MongoDB
Streamlit
HTML
CSS
JavaScript
AWS S3
Lambda
Sagemaker
Bedrock
Azure Blob Storage
Azure SQL
PySpark
Snowflake
Databricks
Deep Learning
Machine Learning
NumPy
pandas
Matplotlib
Seaborn
Keras
Tensorflow
SciKit-learn
scipy
opencv
Regression
Dimensionality Reduction
SVM
Classification
Random Forest
K-Means Clustering
KNN
CNN
RNN
DNN
Decision Trees
Autoencoders
NLP
Generative Model
Transformers
GAN
VAE
Generative AI
LLMs
GPT
Llama
Falcon
Git
Github
FastAPI
Flask-RESTful
Docker
Databricks Lakehouse Fundamentals