Default profile banner
NM

NITISH MALIK

@nitishmalik

Data Engineer at TelePerformance

New Delhi

https://www.kaggle.com/code/nitishmalik/movies

TelePerformanceKD College (CCS University)

Nitish Malik is a Data Engineer with experience in developing and managing complex data pipelines and cloud migration projects. He possesses strong expertise in Azure Synapse, PySpark, and SQL, having successfully shifted data from on-premises systems to Azure Cloud. His skills include performing comprehensive ETL processes, ensuring data integrity, and optimizing data loads using delta tables and row-level security.

Experience

DATA ENGINEER

TelePerformance

INTERNSHIPInvalid Date - Invalid Date

Worked on live projects related to TP’s internal services and products which includes call centre data with surveys and sentiments data and making tables out of it over cloud. Gained knowledge in the global team to control 3 servers at a time while ensuring the data integrity and data privacy. Worked on shifting the data from on premises to Azure Cloud using Synapse and writing code in synapse notebooks using combination of Pyspark and spark SQL and acted as intermediate to make sure the process is running on daily basis. Testing and checking data quality before final data load is also ensured to create a smoother work flow. Customizing the pipeline load according to the need to prevent data duplicate in final dumping of data in delta tables. Shifted the whole Mytp to cloud while creating its pipeline, creating delta tables out of raw data and its stored procedure and delivered it to Power BI team for reporting. Automated the shifting process using triggers and monitor data flow using Azure Synapse Pipelines. Used certain date columns as filters and parquet file type to tune the daily data load and increase productivity. Query and delta table optimization for faster results is also implemented for cost management. Created the Delta tables based on all 3 servers using Pyspark and spark SQL code on Azure cloud and maintain the CCMS DB with checking the data count and random value checks. ETL and modifications over TpSurvey for Power BI team based on feedbacks and comments in TP calling processes to ensure the desired goal of reporting is matched. Worked over HCRM people’s dashboard related to Brazil and created its delta tables for matrix calculations. Took the leadership and control access over views based on geographical regions and employee’s access tier. Created tables, views and stored Procedures using SQL in ICIMS to fulfil the requestor’s demand. Data validation across all the servers to meet global data team criteria to make sure regional team is not lagging i

Education

KD College (CCS University)

B.Sc

Biology

Jan 2021

Licenses & Certifications

SQL SERVER FUNDAMENTALS

DataCamp

Issued: Jan 2022Expires: Feb 2022

SQL SERVER FOR DB ADMINISTRATOR

DataCamp

Issued: Jan 2022Expires: Feb 2022

SQL DATA ANALYSIS

DataCamp

Issued: Oct 2021Expires: Jan 2022

PYTHON DATA ANALYSIS

DataCamp

Issued: Jan 2022Expires: Apr 2022

TABLEAU FUNDAMENTALS

DataCamp

Issued: May 2022Expires: Jun 2022

POWER BI FUNDAMENTALS

DataCamp

Issued: Jun 2022Expires: Jul 2022

SPREADSHEET

DataCamp

Issued: Aug 2022Expires: Aug 2022

PANDAS

KAGGLE

Issued: Apr 2023Expires: May 2023

Skills

Azure Synapse
MySQL
PySpark
Azure Pipeline
Delta table
Views
Stored Procedures
SQL
Spark SQL
ETL
Data Modeling
Power BI
Tableau
Pandas
Excel