Default profile banner
SP

Srikant Padhy

@srikantpadhy

Senior Data Engineer at Jio Platforms

Bengaluru

https://linkedin.com/in/srikantpadhy252

Jio PlatformsVeer Surendra Sai University of Technology

Senior Big Data Engineer with 5+ years of experience designing and optimizing petabyte-scale data platforms. Expertise in real-time data streaming, batch processing, and distributed computing leveraging Apache Spark, Hadoop ecosystem, and NoSQL technologies, with a strong track record of performance tuning and deep technical knowledge with strategic thinking to transform complex data challenges into scalable solutions.

Experience

Senior Data Engineer

Jio Platforms

Invalid Date - PresentBengaluru

Designed and implemented real-time Spark Streaming pipelines processing 1 TB/day, leveraging structured streaming, dynamic resource allocation, and executor tuning, improving throughput by 30% and reducing batch processing time by 25%. Optimized Cassandra cluster performance by redesigning partitioning strategies, implementing replication tuning and indexing, reducing write latency by 40%, and improving real-time query performance by 35%. Collaborated with business teams to build a re-targeting campaign platform, driving a 45% increase in user engagement and a 20% rise in conversion rates. Pioneered a schema standardization framework to flatten nested JSON data using Spark SQL and custom parsers. Spearheaded a Delta Lake migration strategy, replacing legacy batch pipelines, leveraging Z-Order indexing and data compaction, improving data freshness by 90%, and reducing storage costs by 25%. Led a team of 4 engineers, conducting hands-on mentorship on Spark performance tuning and query optimization.

Software Engineer

Accenture

Invalid Date - Invalid DateBengaluru

Designed and optimized distributed data pipelines handling 6 TB/day, reducing end-to-end latency by 40% using memory tuning, efficient shuffle operations, and partition pruning. Developed production-grade ETL workflows for a price prediction ML model, using Apache Airflow, PySpark, and feature engineering techniques, achieving 99.9% SLA compliance and improving forecast accuracy by 15%. Engineered high-throughput Spark ingestion jobs processing 14 K records/minute with exactly-once semantics. Collaborated on ML-driven customer retention models (Python, Scikit-Learn) that reduced churn by 20%. Automated 15+ manual workflows using Bash/Python, reducing operational overhead by 10 hours/week.

Intern

CDAC, Pune

InternshipInvalid Date - Invalid DatePune

Built 15+ predictive models (XGBoost, Random Forest, L1-regularized Logistic Regression) in Python to solve resource allocation challenges in distributed systems, optimizing hyperparameters via 5-fold cross-validation. Accelerated model training by 35% using parallel programming techniques (MPI, OpenMP) to distribute workloads across multi-core architectures.

Education

Veer Surendra Sai University of Technology

Bachelor of Technology

Computer Science

Invalid Date - Invalid DateGrade: 8.24/10.0

Skills

Apache Spark
Pyspark
Apache Hadoop
Hive
Apache Kafka
Google cloud platform (GCP)
Azure
Cloudera
Cloud Security
SQL
NoSql
Cassandra
Elastic Search
Redis Cache
AWS S3
Delta Lake
Apache Airflow
Crontab
Data Warehousing
Data Pipelines
Python
Java
Scala
C/C++
Query Tuning
Indexing
Partitioning Strategies