Vishal Thakur
@Vishal_1009
data engineer at BX Data techsolution pvt ltd
Noida, Uttar Pradesh, India
Vishal Thakur is a Data Engineer with experience in PySpark, Azure Databricks, Ab Initio, and SQL. He specializes in building and supporting scalable ETL pipelines across banking, telecom, and EdTech domains. His expertise includes ETL development, performance tuning, data validation, and production support. He is currently a Data Engineer at Bx Data Techsolution Pvt. Ltd. and holds a Bachelor of Technology in Computer Science Engineering.
Experience
data engineer
BX Data techsolution pvt ltd
Project: DARTS under Bharti Airtel Pvt. Ltd – Monitored, managed, and scheduled Ab Initio graphs using Control Center, ensuring smooth execu- tion of daily workflows. – Created and maintained a structured daily tracking Excel dashboard (morning evening shifts) to ensure no critical tasks were missed, enabling real-time monitoring of key KPIs and improving team communication and accountability. – Utilized Arcos for backend data operations and executed SQL queries for data validation, extraction, and hierarchy-based reporting. – Used Spark to remove corrupted records from the reportind file to make sure reporting was delivered timely under SLA. – Worked extensively on Hadoop and Hive, managing HDFS data, partitions, and executing HiveQL queries including MSCK REPAIR TABLE and SHOW PARTITIONS. – whitelisting of the websites which was blocked, and making latency report. – Troubleshot ETL failures across Ab Initio, Hive, and Hadoop systems by analyzing error logs and performing timely corrective actions. – Performed root cause analysis (RCA) using SQL and Hive to identify data issues, performance bot- tlenecks, and upstream/downstream failures. Key Skills: Ab Initio Control Center, PySpark, SQL, Excel, Arcos, Hadoop, Hive, HiveQL, Airflow, Oozie, Data Reporting, ETL Monitoring
Education
University of Lucknow
B.Tech
Computer science
Licenses & Certifications
Databricks Certified Data Engineer Associate
Databricks