Default profile banner
MP

Maneesh Peddy

@maneeshpeddy

Sr Data Engineer / Technical Program Manager at Hitachi Vantara

Pune

http://www.linkedin.com/in/maneeshpeddy

Hitachi VantaraDakota State University

Maneesh is a result-driven Data Engineer and Technical Program Manager with 10 years of experience building optimized data marts and maintaining analytics pipelines. He is skilled in translating complex data into meaningful business intelligence solutions across various industries. He possesses an excellent understanding of business operations and big data analytics tools for building scalable pipelines.

Experience

Sr Data Engineer / Technical Program Manager

Hitachi Vantara

Full-time•May 2024 - Present•Pune

Leading the development of an advanced data engineering platform designed to collect, ingest, and analyze data from various Hitachi Storage Systems and Services. Responsibilities include spearheading platform development on Azure Kubernetes Service (AKS), managing a team of 10 data engineers and data scientists, and designing robust ETL pipelines leveraging PySpark, SQL, and NoSQL databases. Built interactive dashboards using Incorta and Highcharts.

Sr Data Engineer

Gravie Insurance

Full-time•Oct 2022 - May 2024•Minneapolis, USA

Key role in data warehousing activities, including schema design and SQL data modeling, resulting in materialized models in Snowflake and Redshift. Optimized PySpark queries to process large-scale claims data. Built custom data transformation workflows using Python and utilized Tableau to develop advanced claims detail dashboards.

Sr Data Engineer

2U

Full-time•Apr 2022 - Oct 2022•Boston

Designed and developed a comprehensive SQL data model to track user activity and course block changes. Leveraged PySpark to process large-scale user activity data. Built Tableau dashboards to analyze course content effectiveness and identify potential leads for subscription plans.

Data Engineer

Kroger Foods

Full-time•Nov 2021 - Apr 2022•Cincinnati, Ohio

Led the migration of business intelligence solutions from SAP Business Objects to the Azure environment using Azure Data Factory. Employed PySpark to process and transform large volumes of asset protection data. Built Power BI reports and conducted rigorous validations against existing SAP BODS reports.

Data Engineer

USAA

Full-time•Feb 2021 - Nov 2021•SanAntonio, Texas

Designed and implemented a comprehensive BI solution from scratch to track application status and feature development. Leveraged PySpark to process large-scale compliance data. Automated compliance processes using Python scripts, resulting in a 60% time saving in data preparation.

Data Analyst

State Farm Insurance

Full-time•Jul 2018 - Aug 2021•Bloomington, Illinois

Pioneered analytics projects to build an ECRM for the AMCC business unit using Salesforce. Created visually impactful dashboards in Salesforce Einstein Analytics and Tableau, resulting in a 30% decrease in case duration and a 12% increase in closed Won percentage.

Data Scientist

Census Bureau of United States

Full-time•Sep 2019 - Jan 2021•Fulton, Maryland

Developed proof-of-concept image classifier models using CNNs to track construction phases. Conducted data quality assessments and leveraged AWS Textract and Alteryx to extract construction activity data from PDF files. Developed and automated building permit analysis pipelines.

Program Analyst

Qualcomm

Full-time•Jun 2014 - Dec 2016•Hyderabad, India

Supported program managers by developing strategic models and scripts integrating R, SQL, and QlikView. Forecasted savings by simulating supply chain metrics and developed an SQL-Query based approach to help technical teams with hardware needs.

Education

Dakota State University

Master's

Data Science & Business Analytics

Jan 2016 - Jan 2018•Grade: GPA: 3.7/4.0

Completed coursework in Predictive Analytics, Advanced Database Management Systems, Hadoop, and Spark.

Skills

Python
SQL
R
Java
JavaScript
PySpark
Pandas
NumPy
Scikit-Learn
Tableau
Power BI
QlikView
Qlik Sense
Alteryx
Snowflake
AWS
Azure Data Factory
GCP BigQuery
Airflow
DBT
Redshift
Hadoop
Spark
Kafka
Data Modeling
ETL
Data Warehousing
Machine Learning
Statistical Analysis
Cloud Computing
Azure Kubernetes Service
Grafana
Datadog