Suraj Sah
@surajsah
Data Engineer at TATA Consultancy Services
Kolkata, West Bengal
Suraj Prasad Sah is a Data Engineer with 4.75 years of industry experience. He possesses a strong understanding of the Hadoop ecosystem, including Spark, Hive, YARN, and JIRA. He is skilled in analyzing requirements and has good communication abilities, enabling effective client interaction.
Experience
Data Engineer
TATA Consultancy Services
Currently developing spark job to generate the XML file as per the business logic. Worked on creating the hive tables on top of HDFS and loaded it with the sample data. Worked on Hive optimization performance using Partitioning on External tables. Worked on developing the spark job using various transformation and action to meet the business need. Worked on various Spark optimization techniques such as code level and resource level.
Education
Heritage Institute of Technology
B.Tech(ECE)
Hazarimal High School
Higher Secondary
St. Marys Boarding High School
Matriculation