Komal Deshmukh
@komaldeshmukh
Industry Consulting Consultant at NTT Data
Pune, MH
Komal Deshmukh is a Big Data professional with 3+ years of experience in designing, implementing, and supporting big data applications using Apache Spark, Hadoop, and AWS. She possesses strong expertise in various Hadoop tools, including MapReduce, HiveQL, and Sqoop, and is proficient in data ingestion, processing, and analytics across structured and unstructured data. She is skilled in translating complex business requirements into functional technical solutions.
Experience
Industry Consulting Consultant
NTT Data
Developed Spark jobs for importing PostgreSQL data into HBase and performing subsequent processing using the Phoenix service. Responsibilities included designing Hive staging and external tables, implementing data validation and deduplication, generating reports to S3 buckets, and monitoring job performance. Utilized Hbase, phoenix, HDFS, RDS-postgre, and Hive.
Data Engineer
NexGen Cloud Services
Managed data migration projects, including RDBMS to Snowflake data migration using Apache Spark. Developed PySpark programs to extract, transform, and load data into Snowflake. Responsibilities included analyzing and cleaning raw data, writing DDL for Snowflake, optimizing PySpark code for performance, and building a comprehensive data ingestion framework supporting database ingestion, file-based ingestion, and incremental data load. Proficiently used tools like Oracle, PySpark, AWS S3, AWS EMR, and Hadoop components.
Education
ICEEM
BE COMPUTER SCIENCE ENGINEERING
Computer Science Engineering
IETK
DIPLOMA IN COMPUTER ENGINEERING
Computer Engineering