Jaffar Ahamed Syed Abdul Kaleem

@jaffarahamedsyedabdulkalee

Data Engineer

Chennai

https://linkedin.com/in/jaffar-ahamed/

Cognizant Technology Solutions

Dhanalakshmi College of Engineering

Jaffar is a Data Engineer with over 5 years of experience in data engineering and cloud computing. He builds applications using Big Data, AWS Cloud, and data warehousing technologies, with hands-on experience in AWS services (S3, Athena, EMR), Spark Streaming, and container orchestration using Kubernetes and Docker. He is proficient in Python and SQL and skilled in developing data ingestion and transformation pipelines.

Experience

Developer - CDS

Cognizant Technology Solutions

•Aug 2021 - Present

Migrating on-premises data to a cloud data lake. Building data ingestion pipelines in Delta Lake. Developing and enhancing Python utilities. Building a real-time data analysis pipeline using Spark Streaming. Performing data harmonization, curation, optimization, and warehouse synchronization based on requirements. Monitoring daily and weekly scheduled jobs.

Data Aggregation Associate

Checktronix India Private Limited

•Oct 2018 - Aug 2021

Profiling source data and defining data structures. Designing and building data ingestion pipelines. Designing and building data curation pipelines based on requirements. Visualizing curated data using Zeppelin.

Data Aggregation Associate

Checktronix India Private Limited

•Dec 2016 - Oct 2018

LENS suite provides the most accurate parsing, searching, and matching technology anywhere for HR staff and recruiters. Our distinctive skills-based approach uses Big Data analysis to sort applicants and find the best match.

Education

Dhanalakshmi College of Engineering

B.Tech

Information Technology

Jan 2012 - Jan 2016

Licenses & Certifications

AWS Certified Cloud Practitioner

Udemy

• No expiration

AWS Solution Architect - Associate

Udemy

• No expiration

Spark and Python for Big Data with PySpark

Udemy

• No expiration

Data Analysis with Pandas and Python

Udemy

• No expiration

Docker, Kubernetes and Python Bootcamp

Udemy

• No expiration

Databricks and Spark Core

Udemy

• No expiration

Git Complete Guide

Udemy

• No expiration

Skills

AWS
S3
Athena
EC2
EMR
PySpark
Spark SQL
Spark Streaming
Kafka
Pandas
Hadoop
Hive
HDFS
YARN
MapReduce
Databricks
Jupyter
Zeppelin
Kubernetes
Docker
Rancher
Python
SQL
Boto3
Scrapy
Agile Scrum
JIRA
Confluence
Oracle
Teradata
DB2
MySQL