Pritam Prasad

@pritamprasad

Data Engineer at ZS Associates

Gurgaon

linkedin.com/in/pritam003/

ZS Associates•Government College of Engineering & Textile Technology

Pritam Prasad is an aspiring Data Engineer with experience building enterprise-level solutions. His technical expertise spans AWS, Big Data frameworks like Spark and Hadoop, and databases such as Snowflake and Redshift. He has a proven track record of developing robust data pipelines, optimizing complex SQL queries, and automating processes using Python and various ETL tools.

Experience

Data Engineer

ZS Associates

Full-time•Jun 2021 - Present•Gurgaon

Built data pipelines with the ETL tool Dataiku to process large volumes of data from Snowflake. Implemented a notification framework in Python that emails the operations team when a job fails, reducing manual monitoring. Produced the final output dataset using complex SQL queries and the pandas API. The whole process was orchestrated and automated using triggers such as change data capture and time-driven mode. Also worked on query optimization, reducing the runtime of a query from 3 hours to a few minutes.

AWS Cloud Engineer

TCS

Full-time•Nov 2019 - Jun 2021•Kolkata

Worked on processing both delta and milestone data. Built pipelines that poll data from an on-premises database, process it with AWS Glue and a crawler, and load it into Redshift. Created an archival framework that dumps 1-day-old data, which was then used for the SCD Type 2 implementation. Worked heavily with shell scripting, using it as the backend to trigger ETL processes and to report on the health of the server on which the ETL tool was installed.

Education

Government College of Engineering & Textile Technology

B. Tech in EE

Jan 2019•Grade: YGPA: 7.70

Licenses & Certifications

AWS Certified Cloud Practitioner

AWS

• No expiration

AWS Solution Architect

Udemy

• No expiration

Skills

AWS
S3
EC2
SNS
Glue
Lambda
SDK
Spark
Hive
Hadoop
SQL
Oracle
Redshift
Snowflake
Python
Shell Scripting
Dataiku
StreamSets
Control-M
Jira
Confluence