Srikant Padhy
@srikantpadhy
Senior Data Engineer at Jio Platforms
Bengaluru
Senior Big Data Engineer with 5+ years of experience designing and optimizing petabyte-scale data platforms. Expertise in real-time data streaming, batch processing, and distributed computing leveraging Apache Spark, Hadoop ecosystem, and NoSQL technologies, with a strong track record of performance tuning and deep technical knowledge with strategic thinking to transform complex data challenges into scalable solutions.
Experience
Senior Data Engineer
Jio Platforms
Designed and implemented real-time Spark Streaming pipelines processing 1 TB/day, leveraging structured streaming, dynamic resource allocation, and executor tuning, improving throughput by 30% and reducing batch processing time by 25%. Optimized Cassandra cluster performance by redesigning partitioning strategies, implementing replication tuning and indexing, reducing write latency by 40%, and improving real-time query performance by 35%. Collaborated with business teams to build a re-targeting campaign platform, driving a 45% increase in user engagement and a 20% rise in conversion rates. Pioneered a schema standardization framework to flatten nested JSON data using Spark SQL and custom parsers. Spearheaded a Delta Lake migration strategy, replacing legacy batch pipelines, leveraging Z-Order indexing and data compaction, improving data freshness by 90%, and reducing storage costs by 25%. Led a team of 4 engineers, conducting hands-on mentorship on Spark performance tuning and query optimization.
Software Engineer
Accenture
Designed and optimized distributed data pipelines handling 6 TB/day, reducing end-to-end latency by 40% using memory tuning, efficient shuffle operations, and partition pruning. Developed production-grade ETL workflows for a price prediction ML model, using Apache Airflow, PySpark, and feature engineering techniques, achieving 99.9% SLA compliance and improving forecast accuracy by 15%. Engineered high-throughput Spark ingestion jobs processing 14 K records/minute with exactly-once semantics. Collaborated on ML-driven customer retention models (Python, Scikit-Learn) that reduced churn by 20%. Automated 15+ manual workflows using Bash/Python, reducing operational overhead by 10 hours/week.
Intern
CDAC, Pune
Built 15+ predictive models (XGBoost, Random Forest, L1-regularized Logistic Regression) in Python to solve resource allocation challenges in distributed systems, optimizing hyperparameters via 5-fold cross-validation. Accelerated model training by 35% using parallel programming techniques (MPI, OpenMP) to distribute workloads across multi-core architectures.
Education
Veer Surendra Sai University of Technology
Bachelor of Technology
Computer Science