Snigdha Das

@user.2533296

Big Data Engineer | ETL Pipelines | Spark | Kafka | BigQuery | GCP | Infosys

Kolkata, India

Infosys•Maulana Abul Kalam Azad University of Technology

Snigdha Das is a Big Data Engineer with over three years of experience at Infosys, specializing in optimizing data architecture and improving data quality through automated data pipelines and robust ingestion processes. She has implemented cloud-based analytics solutions that reduced data processing costs by 30%, designed real-time data processing systems using Kafka, and managed the migration of legacy data systems to Google Cloud, reducing operational costs by 30%. She holds a B.Tech in Computer Science and Engineering from Maulana Abul Kalam Azad University of Technology with a DGPA of 8.83.

Experience

Digital Specialist Engineer (Big Data Engineer)

Infosys

Full-time•Jun 2022 - Present•India

Implemented a cloud-based analytics solution that reduced data processing costs by 30%. Developed and maintained ETL pipelines for data ingestion, processing, and distribution. Designed and implemented a real-time data processing system using Kafka. Managed the migration of legacy data systems to Google Cloud Platform, reducing operational costs by 30%. Developed high-performance, scalable data transformation workflows handling trillions of data points per month. Enhanced SonarQube integration to achieve 100% code test coverage. Built automated data cleansing and validation processes that improved data accuracy by 45%.

Digital Software Engineer

IDC Technologies (worked for Infosys)

Full-time•Nov 2021 - Jun 2022•India

Developed scalable, efficient, and permanent ETL data pipelines to meet business requirements. Applied Agile methodology in team collaboration, keeping work aligned with business priorities.

Education

Maulana Abul Kalam Azad University of Technology

B.Tech

Computer Science and Engineering

Jul 2017 - Jul 2021•Grade: DGPA 8.83

Skills

Apache Spark
Apache Hive
Apache Kafka
Apache Airflow
SQL
BigQuery
Cassandra
Cosmos
Java
Scala
Python
GitHub
JIRA
Google Cloud Platform
Hadoop
HDFS
MapReduce
Incident Management
ETL Pipelines
CI/CD
SonarQube
Data Architecture
Data Quality
Agile