SAGAR KARASKAR

@sagarkaraskar

Big Data Developer

Bangalore, India

DIGIMETRIX TECHNOLOGIES PRIVATE LIMITED

Hislop College

Sagar Karaskar is a results-oriented Data Engineer with 3 years of extensive expertise in the Big Data/Hadoop technology stack. He specializes in building scalable data pipelines using Apache Spark, Scala, and Hive. His experience includes optimizing performance on AWS services like EMR, Athena, and S3, and following Agile methodologies to deliver robust data solutions.

Experience

DATA ENGINEER

DIGIMETRIX TECHNOLOGIES PRIVATE LIMITED

•Present

Responsible for building scalable data pipelines using the Big Data/Hadoop technology stack. Used Scala and Spark DataFrames for data processing and transformation to load data into warehouse tables. Developed Spark jobs using transformation and action APIs to meet business needs. Applied Spark optimization techniques at both the code level and the resource level. Worked with AWS services (S3, EMR, Athena, Redshift) and developed data pipelines in the cloud. Followed Agile sprints and used Git for version control.

HADOOP ADMINISTRATOR

DIGIMETRIX TECHNOLOGIES PRIVATE LIMITED

Used Sqoop to load logs and other live inputs into HDFS. Used Sqoop import/export with incremental loads to extract data into HDFS. Worked on Hive performance optimization using Partitioning, Bucketing and Map on both managed and external tables.

Education

Hislop College

Bachelor of Computer Applications (BCA)

Jan 2015 - Jan 2018 • Grade: 59.60%

Skills

Big Data

Hadoop

Apache Spark

Scala

Hive

AWS

EMR

Athena

Glue

Redshift

Sqoop

SQL

MySQL

HDFS

YARN