Default profile banner
SS

Samriddhi Srivastava

@samriddhisrivastava

Senior Data Engineer

Greater Noida, Uttar Pradesh

https://www.linkedin.com/in/sambigdatadev/

Larsen and Toubro InfotechIndira Gandhi National Open University

Samriddhi is an experienced Data Engineer with over 5 years of exposure in the Big Data domain. They are highly motivated and passionate about solving real-world problems. Proficient in various technologies, including Spark, Python, and AWS services, with strong skills in ETL processes, data pipeline development, and performance tuning.

Experience

Senior Data Engineer

Larsen and Toubro Infotech

•Aug 2020 - Present•Bengaluru, Karnataka

Built a big data solution for Citibank to capture various data sources on EAP platform for self-service reports and analytics. Responsibilities included requirement analysis, designing the overall framework, implementing Python modules for data loading, writing PySpark scripts for transformations, designing incremental data capture approaches, performance tuning Spark jobs, and scheduling ETL jobs using Airflow DAGs.

Associate Consultant | Big Data Engineer

Capgemini

•Nov 2018 - Aug 2020•Hyderabad, Telangana

Worked on migrating a traditional warehouse system to a new data mart built on a modern data platform stack for DBS Bank. Responsibilities included analyzing and re-modeling Teradata views, constructing SparkSQL pipelines, handling SCD Type-2 data requirements using Alluxio and AWS S3, converting Teradata scripts to Spark SQL, and developing Airflow jobs.

Software Engineer | Developer

Prolifics Corporation Limited

•Jul 2017 - Oct 2018•Hyderabad, Telangana

Education

Indira Gandhi National Open University

Master of Business Administration

MBA

Jan 2020 - Jun 2022

Pursuing MBA for better understanding of business knowledge.

G. L. Bajaj Institute Of Technology And Management

Bachelor of Technology

Information Technology

Jul 2012 - Jun 2016

Laxman Public School

High School And Intermediate

Apr 2010 - Mar 2012

Skills

Spark
Python
SparkSQL
Hive
Teradata
CI/CD
Git
BitBucket
JIRA
AWS S3
EMR
EC2
Visual Studio Code
Airflow
Collibra
Scala