Sidhant Patra

@seadhant

Data Engineer at Accenture

Bangalore, Karnataka, India

Accenture

Veer Surendra Sai University

Data Engineer with 4 years of experience building end-to-end ETL/ELT data pipelines and data warehouse solutions on AWS and GCP. Experienced in Python, SQL, Apache Spark, PySpark, Airflow, BigQuery, and Dataflow to process multi-terabyte datasets for telecommunications, retail, and financial services. Proven track record in optimising large-scale batch workflows, orchestrating complex DAGs, and delivering reliable analytics platforms for business-critical reporting.

Experience

Data Engineer

Accenture

Jun 2025 - Present

Bangalore, India

• Architected and maintained complex Apache Airflow DAGs to orchestrate large-scale ELT workflows, implementing task parallelization and dynamic configuration to improve data throughput by 25% for critical pipelines.
• Led migration of legacy Talend ETL pipelines to GCP, redesigning ingestion and transformation logic using Cloud Composer, BigQuery, and Dataflow to build a scalable cloud-native data platform.
• Identified reusable patterns across pipelines and built Python automation scripts to programmatically generate and migrate 300+ DAGs, significantly reducing manual engineering effort and accelerating migration timelines.
• Developed and optimized Google Cloud Functions for event-driven data processing and API integrations, tuning performance to achieve sub-500ms latency for high-priority data feeds.
• Served as technical point of contact for the department, gathering requirements from product stakeholders and translating them into robust, production-grade data solutions with clear SLAs.
• Modified existing Apache Beam code in JAR files to incorporate new features, enhancing pipeline flexibility and enabling seamless integration of real-time analytics for 5+ production workflows.

Data Engineer

Cognizant

Jan 2022 - Jun 2025

Bangalore, India

• Engineered scalable data pipelines using Python and Apache Spark on AWS S3 to process large-scale retail transaction data, designing dimensional models (fact and dimension tables) to power high-performance point-of-sale analytics.
• Designed and implemented a sales performance incentive engine processing 100GB of daily data, ensuring 100% accuracy of commission calculations through robust validation and reconciliation logic, replacing manual spreadsheet workflows.
• Optimized Spark jobs, partitioning strategies, and file layouts on S3 to reduce end-to-end processing time and compute costs, improving report availability and reducing delays for stakeholders.

Data Analyst

Cognizant

Jan 2022 - Jun 2025

Bangalore, India

• Validated and analysed financial datasets in BigQuery and HiveSQL to meet business and regulatory requirements, implementing data quality checks that increased trust in executive-level reporting.
• Built interactive Power BI dashboards to visualise payment metrics, customer behaviour trends, and portfolio performance, enabling leadership to move from weekly to near-daily data-driven decisions.
• Collaborated in Agile sprints with data engineers, analysts, and product owners to refine requirements, prioritise backlog items, and deliver a scalable analytics environment ahead of schedule.

Education

Veer Surendra Sai University

B.Tech

Jan 2018 - Jan 2022

Licenses & Certifications

Microsoft Azure DP-900

Microsoft

• No expiration

Microsoft Azure AZ-900

Microsoft

• No expiration

GCP Professional

Google

• No expiration

Skills

AWS
Google Cloud Platform (GCP)
Cloud Composer
Dataflow
BigQuery
Apache Spark
Spark SQL
Apache Airflow
PySpark
Apache Beam
Python
SQL
Java
Node.js
MySQL
Data Warehousing