Hemsagar Sharma

@hemsagarsharma

Big Data Developer

Pune, MH

National Informatics Centre • Samrat Ashok Technological Institute

Hemsagar Sharma is an experienced Big Data Developer specializing in ETL operations within the Judicial Services industry. He has expertise in data cleansing, profiling, wrangling, ingestion, and loading. His technical skills include working with Spark, Hadoop, and Hive, and he has experience across the full Software Development Life Cycle.

Experience

Big Data Developer

National Informatics Centre

Project • Nov 2020 - Present • Pune, Maharashtra

Imported data from multiple sources into Spark RDDs and DataFrames. Stored data in HDFS using Avro. Optimized code to improve job performance and reduce load on the production server. Prepared queries for report development and database systems. Integrated E-Sign with the e-signing authority for signing uploaded documents. Documented all important and relevant information needed for deployment to the PROD environment. Participated in root cause analysis of defects and suggested process improvements to prevent similar defects from recurring. Assisted with production fixes. Followed an end-to-end DevOps model for deployment tracking.

Software Developer

National Informatics Centre

Project • Jul 2019 - Oct 2020 • Pune, Maharashtra

Imported data from multiple sources into Spark RDDs and DataFrames. Converted SQL queries and MapReduce programs into Spark transformations. Optimized code to improve job performance and reduce load on the production server. Stored data in HDFS using Avro and other formats. Documented all important and relevant information needed for deployment to the PROD environment. Took part in daily team meetings with the offshore and onsite teams.

Education

Samrat Ashok Technological Institute

Bachelor of Engineering

Computer Science and Engineering

Aug 2010 - Jun 2014

Skills

Spark
Scala
Hadoop
Hive
Python
Machine Learning
ETL
Data Cleansing
Data Profiling
Data Wrangling
Data Ingestion
Data Loading
SQL
Linux
JIRA
HDFS
MapReduce