Default profile banner
AB

Aritra Banik

@aritrabanik

Associate at JP Morgan Chase

Bangalore, India

linkedin.com/in/aritra-banik-86169121

JP Morgan ChaseHaldia Institute Of Technology

Aritra Banik is an experienced Data Engineer with expertise in the Hadoop ecosystem, AWS, and cloud technologies. He has a strong background in building data pipelines, data warehousing, and implementing machine learning applications using tools like Spark, Python, and Scala. His professional experience includes roles at JP Morgan Chase, Accenture, and TCS, focusing on complex data migration and analytics solutions.

Experience

Associate

JP Morgan Chase

•Mar 2018 - Present

Working on AWS migration using Glue, EMR and lake formation for Data lake. Build SQL based abstraction layer for spark batch jobs. Built data warehouse on hadoop with data imported from Oracle HCM could and existing ETL application. Built data anonymization framework for prod to dev data migration. Integrated resume machine learning application with hadoop. Built application framework based on spark with full code coverage support and run the jobs with spark-submit sequentially as workflow with rerun capability. Built logging framework based on Spring AOP with data debugging and job stats capture support. Built Hbase based batch metadata framework.

Senior Software Engineer

Accenture Solutions Pvt. Ltd

•Aug 2016 - Feb 2018

Developing a real time Cisco software and hardware quality testing data insight platform on Hadoop. Developed core Spark program used for queries execution. Hive queries migration to Spark SQL. Complex hive queries performance improvements activity.

Technology Analyst

Infosys Limited

•Mar 2016 - Aug 2016

IT Analyst

Tata Consultancy Services

•Mar 2011 - Feb 2016

Processing data from different system for Revenue Assurance and creating report with that data. Developed Revenue assurance reconciliation on different Key risk areas. Performance tuning on complex hive queries. Generating daily and monthly feeds to reporting system with National and local TV, radio data. Developed the POC on migrating the Monitor plus Mainframe system to Hadoop platform. Generating daily, weekly, bi-weekly and monthly reports for Field Management Business. DB2 version up gradation, Data gap analysis between Monthly reports for Asset reporting, Generation of special yearly reports and data files for 2012 Year End.

Education

Haldia Institute Of Technology

B.Tech

Electronics & Communication

Jan 2010

Anandamath Vidyapith

Higher Secondary

Jan 2006

Ramakrishna Vivekananda Mission Vidyabhavan

Secondary

Jan 2004

Licenses & Certifications

AWS certified Solutions Architect - Associate

AWS

• No expiration

MapR Certified Hadoop Developer

MapR

• No expiration

IBM Certified Application developer on Cloud platform

IBM

• No expiration

Skills

Data Engineering
Core Java
Scala
SQL
Python
Unix/Linux/Windows
Mapreduce
Spark
Hive
Sqoop
Impala
Hue
HBase
Oozie
JUnit
Mockito
Kerberos
CDH 5.16
HDP 2.2.6
MapR 4.0
Oracle
AWS
EC2
EMR
S3
EKS
Glue
Lake formation
Kubernetes
Terraform
Linear Regression
Logistic Regression
Random forest
k-Means Clustering
Word2Vec
tf-idf vectorizer
latent dirichlet allocation