Prathamesh Patange

@prathmeshpatange01

Data Engineer | 3 Yrs | ADF · Databricks · PySpark · Delta Lake · SQL · BFSI | Available Immediately

Pune, Maharashtra, India

Softenger India Pvt LtdWalchand Institute of Technology

Data Engineer with nearly 3 years of experience building and maintaining data pipelines and Lakehouse platforms for BFSI clients on Azure. Strong hands-on background in PySpark, Azure Databricks, ADF, Delta Lake, and SQL, with a focus on pipeline reliability, incremental ingestion, and query performance. Have taken ownership across the full data lifecycle from source extraction to BI-ready Gold tables, collaborating directly with analysts and stakeholders. Available to join immediately.

Experience

Data Engineer

Softenger India Pvt Ltd

Full-time•Sep 2023 - Present•Pune, Maharashtra, India

• Designed and developed scalable data pipelines using Azure Data Factory (ADF) and Azure Databricks, ingesting and processing 500GB+ of banking transaction and operational data daily from multiple upstream systems into Azure Data Lake Storage Gen2. • Built scalable ETL/ELT pipelines using PySpark and Spark SQL, processing 50M+ records per batch to support large-scale banking data transformations via batch and incremental loads. • Designed and implemented Medallion Architecture on Delta Lake to organize Bronze, Silver, and Gold data layers for scalable analytics and reporting. • Developed incremental data ingestion pipelines using Change Data Capture (CDC) and watermarking techniques, reducing full data loads and improving pipeline performance. • Developed a metadata-driven ingestion framework in PySpark to automate and standardize ingestion of source datasets into the data lake. • Optimized Spark-based data processing and analytical queries through partitioning strategies, caching, and Delta file compaction to improve performance of large-scale analytical workloads. • Improved analytical query performance by 20% through Databricks query profiling, ZORDER indexing, and Delta Lake optimizations on large Delta tables. • Reduced Delta Lake storage costs using OPTIMIZE and VACUUM scheduling, partition tuning, and removal of redundant intermediate tables. • Delivered curated datasets for Power BI dashboards and reporting, enabling business stakeholders across operations and analytics teams to access reliable data insights. • Implemented data governance and access control using Unity Catalog and Azure Key Vault, with service principal authentication across ADF linked services and Databricks notebooks, supporting compliance requirements. • Collaborated with data analysts, product owners, and business stakeholders to deliver scalable data solutions within an Agile/Scrum development environment.

Education

Walchand Institute of Technology

Bachelor Of Technology

Electronics and Telecommunication

Jan 2019 - Jul 2023•Grade: 8.85 GPA

Licenses & Certifications

Azure Databricks & Spark for Data Engineers

• No expiration

Oracle Cloud Infrastructure Foundations Associate

Oracle

• No expiration

Oracle Fusion AI Agent Studio Foundations Associate

Oracle

• No expiration

Skills

PySpark

SQL

Azure DataBricks

Azure Data Factory

Azure Data Lake Storage Gen2

Azure Synapse Analytics

Azure key valut

Delta Lake Architecture

Data Governance

Unity Catalog

Git / GitHub

Jira

Blob Storage

Python

Data Warehousing

PowerBi

Azure DevOps

Structured Streaming

Auto Loader