SAGAR KARASKAR
@sagarkaraskar
Bigdata Developer
Bangalore, India
Sagar Karaskar is a results-oriented Data Engineer with 3 years of extensive expertise in the Big Data/Hadoop technology stack. He specializes in building scalable data pipelines using Apache Spark, Scala, and Hive. His experience includes optimizing performance on AWS services like EMR, Athena, and S3, and following Agile methodologies to deliver robust data solutions.
Experience
DATA ENGINEER
DIGIMETRIX TECHNOLOGIES PRIVATE LIMITED
Responsible for building scalable data pipelines using Bigdata/Hadoop technology stack. Used Scala and Spark data frames for data processing and transformation to load data to warehouse tables. Developed Spark Jobs using various transformation and action API to meet the business need. Worked on various Spark optimization techniques such as code level and resource level. Worked on AWS, S3, EMR, Athena, Redshift and developed the data pipelines on cloud. Followed agile-sprints and code versioning through GIT.
HADOOP ADMINISTRATOR
DIGIMETRIX TECHNOLOGIES PRIVATE LIMITED
Worked on Sqoop to load logs and other live inputs to HDFS. Worked on Sqoop import/export with incremental load to extract the data into HDFS. Worked on Hive optimization performance using Partitioning, Bucketing and Map on both Managed and External tables.
Education
Hislop College
Bachelors In Computer (BCA)