Vivek Brahmbhatt

@vivekbrahmbhatt

Data Engineer

Ahmedabad, India

https://www.linkedin.com/in/vivek-brahmbhatt/

Farm Journal

Northeastern University

Vivek Brahmbhatt is an experienced Data Engineer skilled in building robust data pipelines and managing ETL components for big data environments. He possesses hands-on expertise with relational databases including SQL Server, MySQL, and PostgreSQL. His technical proficiencies include Apache Spark, Spark SQL, and advanced data warehousing concepts. He is also skilled in generating comprehensive business reports using tools like Tableau and Power BI.

Experience

Data Engineer

Farm Journal

Full-time • Sep 2022 - Present • Ahmedabad, India

Designed and developed complex Tableau dashboards to track subscriber KPIs, resulting in a 100% increase in data utilization for BI purposes. Created and maintained data pipelines bringing in data from AWS, GA4, and SQL Server, saving 15 hours of manual work per week. Developed Python and SQL scripts using PySpark and Spark SQL to de-duplicate and master millions of subscriber records, shrinking the overall size of the demographic database by 25%. Engineered reusable, modular, and user-friendly applications for the sales team using Retool, resulting in an overall sales boost worth $240,000.

Associate Data Engineer

Credit Suisse

Jun 2021 - Aug 2022

Deployed a Databricks pipeline to automate ETL processes across billions of rows of data, saving 45 person-hours per month. Created a cloud-first data ingestion job that improved batch ingestion processing speed by 63%. Developed Python scripts to streamline real-time data using the Databricks API and to facilitate data views for BI tools like Tableau and Databricks dashboards. Engineered data quality controls and measures for the precision and efficacy of the ingested data, resulting in 90% clean data.

Junior Data Engineer

Colsh Consultants LLC

Jul 2020 - May 2021

Worked with clients to understand their business needs and translate the requirements into actionable reports using Tableau, saving hours of manual work each week. Improved the data ingestion speed of a distributed data processing job involving large-scale streaming data by 67% using PySpark. Attained 99.8% uptime for data pipelines responsible for ingesting streaming and transactional data across 8 primary data sources using Spark and Python. Created data extraction, cleansing, and loading scripts to move data from source systems to 3 data warehouses.

Data Analyst

Indiabulls Asset Management

Jul 2017 - Jun 2018

Developed new integrated data sets based on business requirements by correlating, mapping, and acquiring data from different data sources. Identified performance indicators to locate code problems by filtering and cleaning data and reviewing digital and physical reports. Improved query performance for business-critical datasets using profiling tools and SQL. Analyzed data and provided strategic insights that delivered business value to the client using BI tools such as Power BI and SQL Server Reporting Services.

Education

Northeastern University

Master of Science in Engineering Management

Engineering Management

Sep 2018 - Apr 2020

Relevant Coursework: Data Warehousing and Business Intelligence, Data Mining, Database Management and Design, Computation and Visualization, Operations Research, Probability and Statistics, Economic Decision Making

Gujarat Technological University

Bachelor of Engineering

Mechanical Engineering

Jul 2013 - Jun 2017

Skills

SQL

Python

Tableau

Power BI

Microsoft SQL Server

MySQL

PostgreSQL

Data Warehousing

ETL

Spark

PySpark

AWS S3

GA4

Retool

Alteryx

Talend

Pandas

NumPy

SciPy