Sayan Chakraborty
@sayan97
Data Engineer at Fractal Analytics
Kolkata, West Bengal, India
Sayan Chakraborty is a Data Engineer at Fractal Analytics with expertise in architecting real-time data platforms using Azure Databricks and PySpark. He holds an MSc in Data Science from the University of Essex and a BTech in Computer Science. His professional experience includes roles at Tata Consultancy Services and Wipro Technologies, where he focused on big data workflows, cloud migrations, and automated data pipelines.
Experience
Data Engineer
Fractal Analytics
Architected an end-to-end real-time data platform on Azure Databricks using Medallion Architecture (Bronze/Silver/Gold) to support scalable analytics and AI/ML workloads. Built scalable batch and streaming pipelines using PySpark Structured Streaming and Delta Lake, enabling real-time visibility of logistics operations. Optimized truck load planning using data-driven models, improving vehicle fill rates by 18% and delivering over $1M in annual cost savings. Developed a material recommendation engine using PySpark, improving operational efficiency and inventory planning accuracy. Migrated legacy ingestion pipelines to Delta Lake, enabling ACID transactions, schema evolution, and significantly improving data reliability. Implemented event-driven ingestion pipelines for shipment data from TMS and FourKites, powering near real-time logistics analytics. Established CI/CD pipelines for data workflows using Git and automated deployments, improving release reliability and reducing manual intervention. Built a GenAI-powered NL-to-SQL platform using LangChain, enabling business users to query operational data using natural language.
Data Science Intern
Blackcoffer
Built Python-based data extraction pipelines and dashboards for business insights.
Assistant System Engineer
Tata Consultancy Services
Automated batch workflows and improved job scheduling reliability for enterprise systems. Supported cloud migration initiatives improving system performance and stability. Collaborated with global teams to troubleshoot production issues and ensure SLA adherence.
Project Engineer
Wipro Technologies
Developed big data workflows using Hadoop, Spark, and Hive for large-scale analytics. Improved data quality by 30% through validation, cleansing, and deduplication processes.
Education
University of Essex
Master of Science
Data Science
Asansol Engineering College, MAKAUT
BTech
Computer Science
Licenses & Certifications
Databricks Certified Data Engineer Associate
Databricks