Default profile banner
NB

Nishith Barodia

@nishithbarodia

Data Engineer at Accenture

Delhi, IN

https://www.linkedin.com/in/nishith-b-bba569170/

AccentureGGSIPU University

Nishith Barodia is an experienced Data Engineer with a proven track record in designing, building, and maintaining scalable data pipelines and infrastructure. He specializes in data processing and analysis using technologies like Hadoop, Spark, and GCP. His expertise includes optimizing complex data workflows, ensuring GDPR compliance, and migrating data delivery infrastructure from on-premise to cloud environments.

Experience

Data Engineer

Accenture

Feb 2021 - PresentDelhi, IN

• Experience in building, designing, monitoring, fixing data pipelines using Big data components like Hadoop, Hive, Spark, Pyspark, Sqoop, SQL, Unix, NIFI. • Collaborated with cross-functional teams to identify business requirements and deliver data-driven solutions using Spark. • Optimized Spark jobs for performance by tuning various parameters such as memory usage and parallelism. • Developed a Python-based auditing system for user actions in Cloudera Data Platform's command-line interface (CLI). • Designed Apache Spark workflows for GDPR-compliant data processing, including data masking and hiding. • Created an efficient data flow pipeline in Apache Nifi, using HDFS and Hive for inbound and outbound data deliveries. • Worked on migrating data delivery pipeline infrastructure from On-Premise to Cloud. • Developed a statistical Reconciliation Framework to monitor data deliveries to all the downstream systems. • Owner of 4 Agile releases in development, supported in Test and deployed to production. • Built a MySQL data pipeline for replicating production environment to DR environment in a data warehousing project. • Cross team collaboration with business teams assisting in requirement gathering for downstream.

Education

GGSIPU University

Bachelor of Engineering

Computer Science

Jan 2016 - Jan 2020

Licenses & Certifications

Google Cloud Certified Associate Cloud Engineer

Google Cloud

• No expiration

Skills

Python
Scala
Shell Scripting
SQL
Hadoop
Hive
Spark
Pyspark
Apache NIFI
Kafka
GCP
Cloudera Data Platform (CDP)
Databricks
Windows
Linux
Unix