Ankur Chavda is a Data Engineer with over 4 years of experience specializing in data problems. He is proficient in technologies such as Spark, Python, SQL, and ETL processes. His expertise spans cloud environments and related areas of data engineering, including building robust data pipelines.
Experience
Engineer I (DE III)
American Express
Developed data pipelines that fed machine learning models for outlier detection, using PySpark, SQL, Python, and shell scripting. Built pipelines to clean and supply news data, Reddit posts, comments, and activity from similar forums to the machine learning models, using Python, Couchbase, NoSQL, Flask, and Docker. Developed REST APIs that provide sentiment trend data for various companies to a dashboard used by senior leadership. Built a Reputational Risk dashboard that monitors surges in negative sentiment on Twitter for American Express so that issues can be addressed proactively. Led the team and set coding standards by developing common packages, a Git workflow, and review processes.
Business Technology Analyst
ZS Associates
Transformed and processed large real-world datasets into the OMOP format for pharmaceutical giants in the US using Spark, Python, SQL, Databricks, and AWS. Built pipelines using Airflow to generate cohorts for analysis. Automated and optimized transformation runs on monthly and quarterly data feeds using Databricks, Spark, SQL, and Python. Led the migration of a Teradata ecosystem to Spark and AWS, covering design, scripts, automation, conversion of data to Parquet format, and query compatibility.
MEAN Stack Developer
PoshaQ
Developed an email scheduler for a Danish clothing & accessories giant for targeted email campaigns using Node.js, Express.js and MongoDB.
Junior Associate - Intern
Publicis Sapient
Enhanced hedge fund and portfolio management software for a US-based financial services company using C# and AngularJS.
Education
Dhirubhai Ambani Institute Of Information & Communication Technology
Bachelor of Technology
ICT
Licenses & Certifications
Data Engineer
DataTalks
Apache Spark Programming with Databricks
Taming Big Data with Apache Spark and Python