Default profile banner
SD

Sohini Das

@Sohini

Data Scientist (Remote) at SELF EMPLOYED

India

https://www.linkedin.com/in/sohini-das-1092

SELF EMPLOYEDIGNOU

Sohini Das is a Data Scientist with expertise in predictive modeling and data engineering. She has a proven track record of optimizing business operations, including reducing infrastructure costs and fresh produce wastage using machine learning techniques like Random Forest and XGBoost. Sohini holds an M.A. in Economics and is proficient in Azure, Databricks, and SQL. Her experience spans roles at Fractal Analytics and Iksula Services, where she focused on data-driven decision-making and automation.

Experience

Data Scientist (Remote)

SELF EMPLOYED

Dec 2022 - PresentRemote

Optimize fresh produce wastage focusing on fruits and vegetables. Achieved a predictive accuracy rate of 86.59% using Random Forest and Decision Trees. Implemented multi-layered classification approaches using SMOTE, social network analysis, and XGBoost. Integrated insights into decision-making processes to support strategic planning.

Associate Engineer - Role: Data Scientist

FRACTAL ANALYTICS

Nov 2021 - Dec 2022Bengaluru, India

Collected and preprocessed 5 years of purchase history data from Azure Data Lake and SQL server. Developed and fine-tuned machine learning models using statistical tests and K-Means clustering. Transformed data into PowerBI reports using Databricks and Pyspark. Maintained Data Factory pipelines, leading to a 25% reduction in infrastructural cost.

Digital Marketing Analyst

IKSULA SERVICES

Dec 2020 - Oct 2021Mumbai, India

Executed web scraping on e-commerce platforms for competitive benchmarking. Cleaned and pre-processed data for a Predictive Regression Model. Automated MySQL-Python pipeline, reducing infrastructure costs by 65%.

Data Science Content Writer

SELF EMPLOYED

Feb 2020 - Nov 2020Kolkata, India

Authored Data Science articles covering foundational Hypothesis Testing to advanced Convolutional Neural Networks (CNN). Initiated a peer review culture and co-authored ventures across blog platforms.

Virtual Intern

KPMG

May 2020 - Jun 2020Mumbai, India

Assisted in predictive sales reporting and Data Quality Assessment. Analyzed US vehicle purchase data, achieving 93% accurate purchase tendency forecast.

Junior Data Analyst

NEO PARISRUTAN PVT. LTD.

Feb 2018 - Feb 2020Kolkata, India

Led a team of 3 in the development of a bid forecasting and sales prediction system. Designed a Python-Sklearn-MySql-Tableau integration framework for visualization and prediction.

Education

IGNOU

M.A.

Economics

Jun 2021

University of Calcutta

B.Sc.

Economics

Aug 2015

Licenses & Certifications

Fractal Certified Data Engineer

Fractal Analytics

• No expiration

Microsoft Certified: Data Engineer Associate

Microsoft

• No expiration

Virtual Experience Program Participant - KPMG

KPMG

• No expiration

Data Analysis with Python: Zero to Pandas - Jovian

Jovian

• No expiration

Complete Data Science Bootcamp

• No expiration

Skills

Tableau
Power BI
Advance Excel
Microsoft Azure
Databricks
Data Factory
ML Studio
ML Ops
Azure Data Lake
Snowflake
Jupyter Notebook
VS Code
Spark
Hadoop
Python
SQL
Apache Hive – HQL
pandas
numpy
sklearn
matplotlib
seaborn
pyspark
MySQL
MS SQL Server
Mongo
Postgres