Default profile banner
AA

AARZOO AGRAWAL

@aarzooagrawal

Data Engineer at Nagarro

Noida, Uttar Pradesh, India

https://www.linkedin.com/in/aarzoo-agrawal-06041816a

NagarroGalgotias College of Engineering and Technology

Data Engineer with experience in building and optimizing complex data pipelines across AWS and Azure environments. Proficient in utilizing services like Redshift, S3, Glue, and Athena for ETL processes, including BI migration and real-time data ingestion. Skilled in Python and SQL, with a proven ability to automate workflows and reduce manual workload significantly.

Experience

Data Engineer

Nagarro

•Sep 2021 - Present

Project 2 - BI Migration: Migration from Birst BI system to AWS Quicksight using AWS S3, Redshift, Step function and Quicksight to build reports. Created Snaplogic pipelines to ingest data from multiple data sources like Redshift, MySQL, SFTP server, Email, SQLServer, Rest API in S3. Built a script using python which uses copy command to load data from S3 to redshift tables. Configured glue jobs to use fractional DPU resource using pythonshell instead of spark shell which greatly reduced the cost. Implemented RLS on Redshift tables which contains PII data to control access to specific users and roles. Created data views by joining tables in presentation layer which is used in Quicksight for dashboarding purpose. Project 1 - Payroll Integration: Designed and Constructed real-time data pipeline (writing glue jobs) to process semi-structured (json) data from workday, build a generic utility to export data through api for business use reducing cost by 60%. Ingesting data from sources like sharepoint, oracle, aws redshift into athena after performing schema validation and further writing scripts using Spark in Python to perform transformations in consumption layer. Writing SQL queries to fetch required data from Athena tables and build generic utility to calculate revenue, sum from data based on logics involving working days, absences which reduced entire manual workload saving 95% of time. Automated ETL processes across billions of rows of data, which reduced manual workload.

Azure Cloud Engineer Intern

Him Technologies

•May 2021 - Aug 2021

Learned about Azure platform, its services, types of storage accounts, load balancer, functions, data recovery, Azure AD and Basic Linux commands with hands on exercises. Created Azure Virtual Machines, applications, database, and functions on azure portal. Designed virtual networks to support workloads with the highest security and performance.

Education

Galgotias College of Engineering and Technology

B-Tech

Information Technology

Aug 2017 - Aug 2021•Grade: 73%

Licenses & Certifications

Data Engineering on Microsoft Azure (DP-203)

Microsoft

• No expiration

Microsoft Data Fundamentals (DP-900)

Microsoft

• No expiration

Microsoft Azure Fundamentals (AZ-900)

Microsoft

• No expiration

Applied Data Science with Python

Coursera

• No expiration

Skills

Python
SQL
Core Java
Flask
Pyspark
Hadoop
Snaplogic
Basic ML
AWS
Azure
Github
Gitlab
Agile Scrum
Docker
OpenCV