Abhishek Anand

@aanand

Data Engineer II at EXL

Siwan, Bihar, India

EXLNational Institute of Technology, Raipur

Abhishek is a highly motivated Data Engineer with 3.5 years of experience in designing and developing end-to-end data pipelines. He specializes in utilizing AWS services, Kafka, Airflow, PySpark, and PowerBI. He possesses a strong work ethic, consistently delivering high-quality solutions while maintaining attention to detail and mastering the latest technologies.

Experience

Data Engineer II

EXL

•Invalid Date - Present

Translated SAS code into Python using Athena SQL and scheduled Python code to optimize processes. Streamlined data ingestion into the Data Lake and leveraged S3, Glue, Athena for csv data. Created and optimized data models for PowerBI to improve data analysis and visualization.

Data Engineer

Genpact pvt ltd

•Invalid Date - Invalid Date

Successfully designed and implemented end-to-end data pipelines for multiple projects, utilizing technologies such as Apache Kafka, Debezium, MySQL, Sql-Server, Postgre, DMS, Redshift, S3, Hudi, PySpark, Hive, and Presto. Created pipelines using mysql, postgres, aws dms, redshift, s3, hudi, spark, hive, and presto. Deployed strimzi kafka cluster on Kubernetes. Streamlined data ingestion into the Data Lake and leveraged S3, Glue, Redshift for csv data. Implemented helm-chart for cp-kafka, cp-kafka-rest-api and cp-kafka-schema-registry using Kubernetes. Utilized Promrtheus and Grafana for effective monitoring of kafka, kafka-connect. Contributed to the end-to-end design of the data pipeline, ensuring smooth and efficient data flow from source to destination.

Education

National Institute of Technology, Raipur

B.Tech

Invalid Date - Invalid Date•Grade: 7.87

Licenses & Certifications

Neo4j certified professional

Neo4j GraphAcademy

Issued: Invalid Date• No expiration

Skills

Python

AWS Services

Kafka

PySpark

Airflow

Redshift

AWS Glue

Athena

MySQL

PostgreSQL

Kafka-connect

Hive

Presto

Kubernetes

Data Structures and Algorithms

DBMS

HBase

PowerBI

Debezium