Default profile banner
SN

Sayali Narsale

@sayalinarsale

Senior Software Engineer at Capgemini

Pune

CapgeminiDr.Vikhe Patil College of Engineering

Highly motivated and results-oriented Big Data Engineer with 3 years of experience in designing, developing, and maintaining scalable data solutions using Hadoop, Hive-SQL, Data Warehouse, PySpark, Teradata, Snowlake, Python, and Kafka. Possesses a strong understanding of data structures, SQL concepts, and complex SQL scripting with expertise in building and optimizing data pipelines for high-volume data processing. Proven ability to collaborate effectively with cross-functional teams to deliver innovative solutions and ensure high customer satisfaction.

Experience

Senior Software Engineer

Capgemini

•Jan 2022 - Present

Designed and implemented scalable data storage solutions using Hadoop and Hive-SQL databases. Developed and maintained big data processing pipelines using Hadoop, hive, and Apache PySpark. Distributed data downstream using platforms such as Hadoop, Snowflake, and Teradata. Wrote and tested data processing scripts using Python. Utilized automation tools like Autosys, TWS (job scheduler), TCM, and batch monitoring tools for more than 50 jobs. Used UNIX shell scripts for validations and generic file-watcher scripts. Collaborated with business stakeholders to understand data requirements and identify opportunities for data-driven decision-making. Migrated tables from the hive to Snowflake at table level, and base objects containing data in TBs of data from Hadoop Data Lake to on-premise Teradata. Monitored performance and optimized big data systems. Documented and communicated technical designs, solutions, and best practices and mentored junior team members.

Education

Dr.Vikhe Patil College of Engineering

Bachelor Of Engineering

Electrical

Aug 2017 - Jun 2021•Grade: 7.22 CGPA

Licenses & Certifications

Coursera Certified Data Analysis Using PySpark

Coursera

Coursera Certified Big Data with Spark and Hadoop

Coursera

Hackerrank Certified in SQL & Python

Hackerrank

Skills

Hadoop
Hive-SQL
PySpark
Python
Kafka
Teradata
Snowflake
SQL
PostgreSQL
MS SQL Server
MyQSL
NoSQL
Unix/Shell scripting
Data Warehousing
ETL tools
Git
Impala
HDFS
Agile methodology
Data lake concepts