Default profile banner
SK

Sathwika Kaparthi

@Sathwika

Data Engineer at National Payments Corporation of India

Hyderabad, Telangana, India

National Payments Corporation of IndiaKITSW, Nizamabad

Data Engineer with 3.1 years of experience in designing and optimizing scalable data pipelines, ETL processes, and automation. Skilled in working with big data technologies and distributed systems to ensure high-performance data processing. Experienced in data quality, governance, compliance, and report automation, aligning data strategies with business goals to drive operational efficiency and deliver timely insights.

Experience

Data Engineer

National Payments Corporation of India

•Aug 2022 - Present

Designed and implemented an end-to-end data pipeline using Azure Data Factory for orchestration, Databricks (PySpark) for processing and aggregation of NACH and UPI transaction-level data, and MinIO (S3-compatible) for data lake storage to perform real-time insights. Designed and implemented real-time pipelines for NACH and its sub-products using Kafka, integrated with Apache Flink and Spark Streaming. Built and maintained scalable batch data pipelines using DBT/Python Client, Dagster and SQL, processing large-scale financial transactions over 200M records. Automated data workflows and validations with Python and Pyspark. Implemented Dagster alerting policies and webserver API for automated email reports. Implemented reverse geocode mapping using Python. Optimized SQL queries for performance across distributed systems. Migrated data from Hadoop (HDFS) to Minio (S3-compatible) storage. Containerized data applications using Docker and deployed them on Kubernetes cluster.

Data Analyst

National Payments Corporation of India

•Aug 2022 - Present

Built interactive dashboards using Tableau and Superset to support business intelligence and regulatory reporting (RBI, Ministry of Finance). Led claim and billing data analysis for schemes like NSAP, NREGA, AMC using SQL-driven insights to identify trends and improve cost tracking and automated reconciliation process. Collaborated with business and product teams to implement Python-based data solutions. Used Git for version control and followed CI/CD practices in a Linux-based development environment.

Education

KITSW, Nizamabad

BTech

Electronics & Communication Engineering

•Grade: 8.57/10

Licenses & Certifications

Advanced SQL and Python Certification

Hacker rank

• No expiration

PG Certification in AI/ML

IIIT, Hyderabad

• No expiration

Skills

DBT
Dagster
Airflow
Hadoop
Hive
Trino
Spark
Azure Databricks
Delta Lake
Azure Data Factory
SQL
Python
Pyspark
Shell Scripting
Linux
Apache Superset
Tableau
Data Analysis
Git
CI/CD
Validation
compliance
automation
Distributed Systems
Regulatory reporting