Default profile banner
RP

Rakshit Pandit

@rakshitpandit

Data Engineer at TATA CONSULTANCY SERVICES

Mumbai, India

TATA CONSULTANCY SERVICESKENDRIYA VIDYALAYA NO.1

Rakshit Pandit is a Data Engineer with experience at Tata Consultancy Services, specializing in building real-time data pipelines and graph database architectures using AWS Neptune and Gremlin. He has expertise in Python, PySpark, and various DevOps tools, including Docker and Jenkins. He holds a BTech in Information Technology and has strong foundational knowledge in Machine Learning and Cloud Computing.

Experience

Data Engineer

TATA CONSULTANCY SERVICES

•Nov 2020 - Present•Mumbai, India

Led and delivered Customer360 - The idea was to keep the customers’ data organized and readily available along with business insights. Designed the architecture as a Graph database using Gremlin on AWS Neptune with real-time pipelines. Developed APIs with AWS API Gateway for business and stakeholders to get information at their fingertips. Used PySpark to produce CSVs for historical data to perform bulk data load to Neptune Cluster from Parquet files on Docker container and EC2 Bastion host. Developed real-time ingestion for various reports with Lambda, SNS and EventBridge. For publication, used AWS Glue to fetch data from S3 and DynamoDB and a writer to dump data into PostgreSQL and generate publication CSVs. This helped the organization to swiftly migrate from Batch processing system to real-time. Created re-usable pipelines to orchestrate data movement between various layers like raw, staging and publication. Conducted knowledge transfer sessions with interns and new recruits to help them understand the operations from technical and business point of view.

Education

MEDICAPS UNIVERSITY

BTech

Information Technology

Jan 2016 - Jan 2020•Grade: 7.77/10

KENDRIYA VIDYALAYA NO.1

CBSE XII

Jan 2016•Grade: 82.00%

KENDRIYA VIDYALAYA NO.1

CBSE X

Jan 2014•Grade: 9.0/10

Licenses & Certifications

Machine Learning

Stanford University/Coursera

• No expiration

Neural Networks and Deep Learning

DeepLearning.AI

• No expiration

ML Bootcamp

TCS x TalentSprint

• No expiration

Design Thinking for Innovation

University of Virginia/Coursera

• No expiration

AWS SysOps

Cloud Wizard

• No expiration

Skills

Python
C
C++
MySQL
Gremlin
Spark
AWS
ETL
Docker
AI
Machine Learning
Git
Jenkins
DevOps
Agile
Data Structures and Algorithms
Cloud Computing
Natural Language Processing
Blockchain Architecture
Data Analytics