Default profile banner
HN

Hruthwik Nulu

@hruthwiknulu

Big Data Developer at CAE Inc.

Bengaluru

CAE Inc.Amrita School of Engineering

Hruthwik has 2 years of experience designing and developing Big Data applications using the Hadoop Ecosystem, including HDFS, Hive, and Apache Spark. He is proficient in processing large sets of structured and semi-structured data, utilizing formats like JSON and Parquet. His expertise includes optimizing Spark SQL queries, managing data pipelines via Sqoop, and deploying jobs on EMR clusters.

Experience

Big Data Developer

CAE Inc.

Project•Apr 2021 - Present

Loaded data into Hive from Spark for further processing. Performed Import and Export of data into HDFS and Hive using Sqoop and managed data within the environment. Loaded and transformed large sets of semi structured data like XML, JSON, Avro, Parquet. Queried data using Spark SQL on top of Spark engine for faster datasets processing. Was responsible for Optimizing Spark SQL queries that helped in saving Cost to the project. Created multiple Hive tables, running hive queries in those data, implemented Partitioning, Dynamic Partitioning and Bucketing in Hive for efficient data access. Generated and processed complex JSON data after all the transformations for easy storage and access as per client requirements. Performed Import and Export of data into HDFS and Hive using Sqoop and managed data within the environment. Involved in creating Hive tables, data loading and writing hive queries. Managed Hive Tables based on partitions. Involved in working on the Data Quality and data sourcing for handling the business that helped the Business team. Loaded and transformed large sets of semi structured data. Development of Code & peer review of assigned tasks and Bug fixing.

Education

Amrita School of Engineering

Jan 2017 - Jan 2021•Grade: 6.8 CGPA

Ascent Junior college

Jan 2015 - Jan 2017•Grade: 94.7%

Dr. KKR Gowtham Concept School

Jan 2014 - Jan 2015•Grade: 9.3 CGPA

Licenses & Certifications

AWS Cloud Practitioner

AWS

• No expiration

Skills

Hadoop
Sqoop
Hive
Apache Spark
PySpark
Scala
SQL
Linux
AWS
Cloudera
RDBMS (My SQL)
JSON
PARQUET
AVRO
ORC
AWS Cloud Practitioner
Data Engineering
Data Development