Default profile banner
SS

SHUBHAM SINGH

@shubhamsingh8122

Data Engineer

Pune, MH

https://www.linkedin.com/in/shubham0329/

Blazeclan TechnologiesCenter for Development of Advance Computing

Shubham Singh is an experienced Data Engineer skilled in building robust data pipelines and performing complex data transformations. He possesses strong expertise in AWS services, including Lambda, Glue, and DMS, and is proficient in ETL tools like Ab-Initio. Furthermore, he has sound knowledge of SQL, Python, PySpark, and Unix/Shell Scripting for data manipulation and system automation.

Experience

Data/Cloud Engineer

Blazeclan Technologies

•Apr 2021 - Present•Pune, MH

AWS Lambdas (using python) for code exectuion and scheduling within Step functions using AWS cloudwatch. S3 and RDS for Storage. AWS Glue for discover, prepare, and combine data for analytics, machine learning, and application development using python pandas and numpy. AWS DMS for migration of data. Hands on Data-Lake on AWS services with like S3, Data Pipeline, EC2, Cloud9, Lambda, Redshift, Glue, Athena. Sound scripting knowledge like UNIX/LINUX Shell Script and writing SQL queries. Strong background in Data Engineering, Database Migration, Data Modeling.

ETL Developer

Atos-Syntel

•Mar 2018 - Apr 2021•Pune, MH

Mainly involved in building Ab Initio graphs as per the client requirement, shell scripting and data analysis. Pyspark SQL Module (Loading data from oracle in form of parquet using Sqoop and loading into Hive Database. Sound scripting knowledge like UNIX/LINUX Shell Script and writing SQL queries. Having strong analytical skills and quick in providing the permanent solutions files, Multi files,database tables, etc.

Education

Center for Development of Advance Computing

Post Graduate Diploma in Advance Computing

Aug 2017 - Feb 2018•Grade: 6.0/10

Technocrats Institute of Technology

Bachelor of Engineering

Mar 2013 - May 2017•Grade: 7.51/10

Skills

AWS Lambda
AWS Glue
AWS DMS
AWS Step Functions
AWS Sagemaker
AWS ECS/ECR/Fargate
SQL
Heterogeneous Databases
Hive
MongoDB
NoSQL
Hadoop
PySpark
HDFS
Map Reduce
Streaming
RDD
YARN
Python
ETL (Ab-Initio)
Unix/Shell Scripting