Default profile banner
KI

Kshitija Ingle

@kshitijaingle

Senior System Engineer (Data Engineer) at Infosys

Pune

InfosysCollege of Engineering Pune (COEP)

Data Engineer with 3+ years of experience in designing and implementing scalable data pipelines and workflows. Skilled in Python, SQL, Databricks, PySpark, and AWS (S3, Lambda, Step Functions, EC2) to build efficient data solutions. Passionate about optimizing workflows and delivering insights for data-driven decision-making.

Experience

Senior System Engineer (Data Engineer)

Infosys

Dec 2021 - PresentPune

Designed and implemented a large-scale data pipeline using AWS Databricks and PySpark to efficiently process and analyze massive datasets. Streamlined data ingestion from multiple sources into Amazon S3, enhancing retrieval speeds by 30% through partitioning and compression. Built distributed ETL workflows in PySpark, reducing processing time by 40% and improving data quality by 25% with Python-based validations. Automated data transitions with AWS Step Functions, minimizing downtime by 15% and ensuring seamless workflows. Monitored and optimized system performance using CloudWatch, proactively resolving bottlenecks. Utilized EC2 and Databricks clusters efficiently, achieving significant cost reductions. Stored processed data in MySQL, enabling effective reporting and data visualization with optimized SQL queries.

Education

College of Engineering Pune (COEP)

Masters of Technology (M.Tech.)

Jan 2019 - Jan 2021

Sant Gadge baba Amravati University

Bachelor of Engineering

Jan 2014 - Jan 2018

Licenses & Certifications

Python for Data Science

IBM

• No expiration

The Complete SQL Bootcamp

Udemy

• No expiration

Skills

Python
SQL
Pandas
BeautifulSoup
PySpark
boto3
Apache Spark
Databricks
AWS
S3
Glue
EC2
Lambda
Kinesis
CloudWatch
SNS
Step Functions
RDS
ETL Pipelines
Real-time Streaming
Data Quality Monitoring
MySQL
DynamoDB
Git