Default profile banner
MN

Marivini Naveen

@Marivini_Naveen

Senior Software Engineer – Data Engineering at Yash Technologies

Bangalore

Yash TechnologiesAditya College of Engineering

Data Engineer with 4 years of experience designing scalable ETL pipelines, data warehousing solutions, and cloud-native data platforms using AWS and Snowflake. Strong expertise in PySpark, advanced SQL, Apache Airflow, and Spark performance optimization. Proven experience in EMR migrations, incremental data processing, and implementing production-grade data quality frameworks within banking domain environments.

Experience

Senior Software Engineer – Data Engineering

Yash Technologies

Present

Migrated 100+ production PySpark ETL pipelines from EMR 6.8 to 7.2 in a live production environment. Redesigned Spark write logic to eliminate duplicate data generation post-migration. Optimized Spark configurations improving execution time by ~30%. Developed SQL-based reconciliation framework for multi-million record validation. Stabilized Airflow DAGs (daily/weekly/monthly) during production validation cycle. Reduced pipeline failures by 25% through upstream data validation checks. Supported CI/CD deployments and production monitoring in Agile delivery model. Client: NatWest Group

Senior Systems Engineer – Data Engineering

Infosys Ltd

Designed and developed end-to-end Snowflake ETL workflows for remediation processes. Implemented multi-layer data warehouse architecture (RAW, CLEANSED, STAGING, HISTORIC). Built complex SQL transformations using CTEs, window functions, and incremental delta logic. Developed Python UDFs for transformation and validation. Integrated AWS S3 staging with Snowflake ingestion pipelines. Automated Airflow DAGs reducing manual effort by 40%. Implemented data quality controls ensuring >99% dataset accuracy. Ensured production stability and collaborated with stakeholders in Agile environment. Client: Royal Bank of Scotland

Education

Aditya College of Engineering

B.Tech

Electronics & Communication Engineering

Licenses & Certifications

Infosys Certified Python Associate

Infosys

• No expiration

Infosys Certified MySQL Associate

Infosys

• No expiration

Infosys Certified Machine Learning Professional

Infosys

• No expiration

Skills

Python (PySpark)
Advanced SQL
Shell
AWS (EMR, S3, Athena, IAM)
Apache Spark
EMR Performance Optimization
Snowflake
Apache Airflow
ETL Architecture
CDC Pipelines
Batch & Incremental Processing
Data Modeling
Data Quality Frameworks
StreamSets
Git
GitLab
CI/CD Exposure
JIRA
Power BI Dataset Preparation & Reporting Enablement