Data Engineer with 3+ years of experience designing and optimizing ETL pipelines using tools such as SnapLogic and Databricks. Developed a robust ingestion framework in Python, SQL, and Apache Spark that streamlined data processes and significantly reduced manual intervention. Collaborated with cross-functional teams to deliver scalable, data-driven solutions that improved operational efficiency. Seeking an opportunity to apply these skills in a growth-driven environment while contributing to organizational success.
Experience
Data Engineer
Modak Analytics
Expansion and Maintenance of GAIA Platform (Jan 2024-Present): Mentored the DataOps team through knowledge transfer sessions following the prototype design. Refactored CI/CD processes with the Jira-feature-dev-main strategy, reducing deployment time by 30%. Enhanced the ingestion framework by simplifying SnapLogic blueprints and implementing a single-bundle approach in Databricks. Supported the development of GAIA usage metrics and established data contracts for the gold layer.
Data Engineer
Modak Analytics
Developed and maintained data pipelines for ingestion, profiling, and indexing using Nabu. Implemented custom Scala code to ingest data from REST APIs and to process bulk semi-structured files from S3. Debugged and resolved issues on EC2 servers, and addressed limitations in incremental loads to ensure data consistency.
SDE Intern
Modak Analytics
Completed sprints on StreamSets & Kafka, Java, Linux, PostgreSQL, Spark & Scala, and Bots & Crawlers. Prepared solutions for user stories and presented demos at the end of each sprint. Managed Azure Boards for tracking and documentation.
Education
SRKR Engineering College
Bachelor of Technology
Electronics and Communication Engineering
Executive Body Member of IETE; organised events at the college.
Licenses & Certifications
Databricks Certified Data Engineer Associate
Databricks
SnapLogic Integrator Training
SnapLogic
ELITE Certification on Programming, Data Structures and Algorithms using Python
NPTEL