Default profile banner
PP

Purushottam Pawar

@purushottampawar

Data Engineer at Kingston info Solution Service Pvt Ltd.

Aurangabad

Kingston info Solution Service Pvt Ltd.Maharashtra Institute of Technology (MIT) Aurangabad

Purushottam Pawar is a Big Data Engineer with 1.9 years of experience specializing in Hadoop Ecosystem Development and Spark-scalar applications. He has proven expertise in building optimized big data pipelines using technologies like HDFS, Hive, Sqoop, and Spark on cloud platforms. His skills include data extraction logic in Scala, working with AWS services (EC2, S3), and handling structured and unstructured data from multiple sources.

Experience

Data Engineer

Kingston info Solution Service Pvt Ltd.

Employment•Sep 2021 - Present•Bangalore

Responsible for building optimized big data pipelines and performing data analysis. Key tasks include importing data into Hive from various RDBMS (Oracle, MySQL) using Sqoop, writing Hive DDL for query optimization, and ingesting flat files. Experience includes handling incremental loads, defining managed and external Hive tables with static/dynamic partitions, and optimizing Sqoop jobs. Worked with various file formats (ORC, Avro, Text File) and compression formats (Snappy, bzip2).

Education

Maharashtra Institute of Technology (MIT) Aurangabad

Bachelor of Technology

N/A

Jan 2021 - Jan 2022•Grade: N/A

N/A

Skills

Hadoop Ecosystem Development
Spark
Scala
Python
SQL
HDFS
Hive
Sqoop
Kafka
Cloudera
AWS
EC2
S3
Apache Ozzie
Data Pipeline
Structured Data Processing
Unstructured Data Processing
MySQL
Linux
Windows
Shell Scripting
Zookeeper