Rakshit Pandit
@rakshitpandit
Data Engineer at TATA CONSULTANCY SERVICES
Mumbai, India
Rakshit Pandit is a Data Engineer with experience at Tata Consultancy Services, specializing in building real-time data pipelines and graph database architectures using AWS Neptune and Gremlin. He has expertise in Python, PySpark, and various DevOps tools, including Docker and Jenkins. He holds a BTech in Information Technology and has strong foundational knowledge in Machine Learning and Cloud Computing.
Experience
Data Engineer
TATA CONSULTANCY SERVICES
Led and delivered Customer360 - The idea was to keep the customers’ data organized and readily available along with business insights. Designed the architecture as a Graph database using Gremlin on AWS Neptune with real-time pipelines. Developed APIs with AWS API Gateway for business and stakeholders to get information at their fingertips. Used PySpark to produce CSVs for historical data to perform bulk data load to Neptune Cluster from Parquet files on Docker container and EC2 Bastion host. Developed real-time ingestion for various reports with Lambda, SNS and EventBridge. For publication, used AWS Glue to fetch data from S3 and DynamoDB and a writer to dump data into PostgreSQL and generate publication CSVs. This helped the organization to swiftly migrate from Batch processing system to real-time. Created re-usable pipelines to orchestrate data movement between various layers like raw, staging and publication. Conducted knowledge transfer sessions with interns and new recruits to help them understand the operations from technical and business point of view.
Education
MEDICAPS UNIVERSITY
BTech
Information Technology
KENDRIYA VIDYALAYA NO.1
CBSE XII
KENDRIYA VIDYALAYA NO.1
CBSE X
Licenses & Certifications
Machine Learning
Stanford University/Coursera
Neural Networks and Deep Learning
DeepLearning.AI
ML Bootcamp
TCS x TalentSprint
Design Thinking for Innovation
University of Virginia/Coursera
AWS SysOps
Cloud Wizard