Abhishek Anand
@aanand
Data Engineer II at EXL
Siwan, Bihar, India
Abhishek is a highly motivated Data Engineer with 3.5 years of experience in designing and developing end-to-end data pipelines. He specializes in utilizing AWS services, Kafka, Airflow, PySpark, and PowerBI. He possesses a strong work ethic, consistently delivering high-quality solutions while maintaining attention to detail and mastering the latest technologies.
Experience
Data Engineer II
EXL
Translated SAS code into Python using Athena SQL and scheduled Python code to optimize processes. Streamlined data ingestion into the Data Lake and leveraged S3, Glue, Athena for csv data. Created and optimized data models for PowerBI to improve data analysis and visualization.
Data Engineer
Genpact pvt ltd
Successfully designed and implemented end-to-end data pipelines for multiple projects, utilizing technologies such as Apache Kafka, Debezium, MySQL, Sql-Server, Postgre, DMS, Redshift, S3, Hudi, PySpark, Hive, and Presto. Created pipelines using mysql, postgres, aws dms, redshift, s3, hudi, spark, hive, and presto. Deployed strimzi kafka cluster on Kubernetes. Streamlined data ingestion into the Data Lake and leveraged S3, Glue, Redshift for csv data. Implemented helm-chart for cp-kafka, cp-kafka-rest-api and cp-kafka-schema-registry using Kubernetes. Utilized Promrtheus and Grafana for effective monitoring of kafka, kafka-connect. Contributed to the end-to-end design of the data pipeline, ensuring smooth and efficient data flow from source to destination.
Education
National Institute of Technology, Raipur
B.Tech
Licenses & Certifications
Neo4j certified professional
Neo4j GraphAcademy