Default profile banner
SC

Sayan Chakraborty

@sayan97

Data Engineer at Fractal Analytics

Kolkata, West Bengal, India

Fractal AnalyticsUniversity of Essex

Sayan Chakraborty is a Data Engineer at Fractal Analytics with expertise in architecting real-time data platforms using Azure Databricks and PySpark. He holds an MSc in Data Science from the University of Essex and a BTech in Computer Science. His professional experience includes roles at Tata Consultancy Services and Wipro Technologies, where he focused on big data workflows, cloud migrations, and automated data pipelines.

Experience

Data Engineer

Fractal Analytics

•Mar 2024 - Present•Bengaluru, KA

Architected an end-to-end real-time data platform on Azure Databricks using Medallion Architecture (Bronze/Silver/Gold) to support scalable analytics and AI/ML workloads. Built scalable batch and streaming pipelines using PySpark Structured Streaming and Delta Lake, enabling real-time visibility of logistics operations. Optimized truck load planning using data-driven models, improving vehicle fill rates by 18% and delivering over $1M in annual cost savings. Developed a material recommendation engine using PySpark, improving operational efficiency and inventory planning accuracy. Migrated legacy ingestion pipelines to Delta Lake, enabling ACID transactions, schema evolution, and significantly improving data reliability. Implemented event-driven ingestion pipelines for shipment data from TMS and FourKites, powering near real-time logistics analytics. Established CI/CD pipelines for data workflows using Git and automated deployments, improving release reliability and reducing manual intervention. Built a GenAI-powered NL-to-SQL platform using LangChain, enabling business users to query operational data using natural language.

Data Science Intern

Blackcoffer

•Nov 2023 - Feb 2024•Delhi, IN

Built Python-based data extraction pipelines and dashboards for business insights.

Assistant System Engineer

Tata Consultancy Services

•Apr 2021 - Sep 2022•Kolkata, WB

Automated batch workflows and improved job scheduling reliability for enterprise systems. Supported cloud migration initiatives improving system performance and stability. Collaborated with global teams to troubleshoot production issues and ensure SLA adherence.

Project Engineer

Wipro Technologies

•Sep 2020 - Mar 2021•Bengaluru, KA

Developed big data workflows using Hadoop, Spark, and Hive for large-scale analytics. Improved data quality by 30% through validation, cleansing, and deduplication processes.

Education

University of Essex

Master of Science

Data Science

Oct 2022 - Oct 2023

Asansol Engineering College, MAKAUT

BTech

Computer Science

Aug 2016 - Aug 2020

Licenses & Certifications

Databricks Certified Data Engineer Associate

Databricks

• No expiration

Skills

Python
Java
SQL (MSSQL)
MongoDB
HTML/CSS
Swift
PySpark
Spark SQL
Databricks
Azure Data Factory
FastAPI
ETL/ELT Pipelines
Microsoft Azure
AWS
GCP
NumPy
Pandas
Matplotlib
Seaborn
Scikit-learn
TensorFlow
Keras
LangChain
LangGraph
Git
Docker
CI/CD
Monitoring & Logging