Manthan Parab

@manthanparab

GCP Data Engineer at DATAMETICA

Pune, India

https://github.com/manthanparab07

DATAMETICA • Pimpri Chinchwad College of Engineering and Research

Manthan Parab is a Data Engineer with experience in GCP and ETL processes. He possesses expertise in Python, SQL, and various data tools including Informatica and Apache Airflow. He has successfully worked on data migration projects, handling data extraction and loading into BigQuery for large companies. He is skilled in developing robust data pipelines and mentoring junior engineers.

Experience

GCP Data Engineer

DATAMETICA

•Feb 2021 - Present•Pune, India

Worked on a data migration project for a large US-based storage rental company. Created Python scripts to extract data from SQL Server and load it into BigQuery on GCP through Cloud Composer (Apache Airflow), handling different file formats such as CSV and Parquet. Resolved various data issues and proposed efficient solutions, which were then implemented. Currently working on a data migration project for a large toy company. Created Python scripts to extract historical data for 1,500 tables from Oracle (SQL Developer) and load it into BigQuery in Parquet format. Converted Informatica mappings into a BigQuery-compatible format for data transformation. Worked with CA scheduler and developed shell scripts for data transformation. Created 70% of the DAGs for the ETL process. Mentored and guided junior engineers.

Education

Pimpri Chinchwad College of Engineering and Research

B. Tech.

Computer Science and Engineering

Jul 2017 - Jun 2021•Grade: CGPA 8.15/10 (Graduated with distinction)

Rani Parvati Devi High School and Jr. College

Jul 2015 - Jun 2017•Grade: Percent 72.6

Rani Parvati Devi High School and Jr. College

Jul 2012 - Jun 2015•Grade: Percent 96

Licenses & Certifications

Machine Learning

Stanford University (Coursera)

• No expiration

AWS: Fundamentals

Amazon (Coursera)

• No expiration

Cloud Technology Workshop

XenStack

• No expiration

Skills

Python
IPython Notebook
MATLAB
Bash (shell scripting)
C++
Java
MySQL
Oracle
BigQuery
MongoDB
Django
JavaScript
HTML
CSS
Informatica
Dataflow
Google Cloud Platform
Apache Airflow
CA scheduler
Linux
Git
GitHub