Priyanka P

@priyanka_p

Hadoop Developer

Bangalore

Blue Dart • Kle Technological University

Priyanka has 2 years of experience in designing, developing, and maintaining large business applications, specializing in data engineering and big data solutions. She possesses deep expertise in the Hadoop ecosystem, utilizing technologies like Spark, Hive, and Sqoop, alongside AWS components such as S3, EMR, and EC2. She is proficient in processing structured and semi-structured data formats and has strong skills in Scala, Python, and SQL.

Experience

Hadoop Developer

Blue Dart

Project•Jan 2022 - Present

Loaded data into Hive from Spark RDDs and DataFrames for further processing. Imported and exported data to and from HDFS and Hive using Sqoop, and managed data within the environment. Optimized Spark SQL and Hive queries. Loaded and transformed large sets of semi-structured data (XML, JSON, AVRO, Parquet). Created multiple Hive tables, implementing partitioning, dynamic partitioning, and bucketing. Processed web URL data using Scala and converted it to DataFrames. Created EC2 instances and EMR clusters for development and testing.

Prosaic technologies

Work Experience•Jan 2021 - Present

Hadoop Developer

CAR24

Project•Jan 2021 - Dec 2021

Validated data by running queries against the database and verifying the results. Responsible for ensuring the quality of work for the entire development team. Reviewed data models, data mappings, and architectural documentation to create and execute effective SIT test plans. Translated complex business logic into SQL or PL/SQL queries. Followed Agile testing practices and guidelines.

Education

Kle Technological University

Masters

Electronics and Communication (Digital Electronics)

Oct 2021•Grade: 8.9

Ballari Institute of Technology & Management

Bachelor of Technology

Electronics and Communication

Jun 2015•Grade: 60%

TMAEs SMV Polytechnic

Diploma

Electronics and Communication

May 2012•Grade: 70.21%

Skills

Hadoop

Sqoop

Hive

Apache Spark

AWS

Cloudera 5

MySQL

Scala

Python

IntelliJ

Eclipse

Windows

Linux

SQL

PL/SQL

JSON

XML

AVRO

Parquet

Dimensional Modeling

Data Warehousing