Default profile banner
AS

Archit Sharma

@archit_sharma

Subject Matter Expert in Physics at Chegg India Pvt. Ltd.

Greater Noida, UP

linkedin.com/archit-sharma

IIIT Bangalore & upGrad

Archit is a data engineer with a strong background in mathematics and physics. He possesses experience in working with large-scale datasets, designing, and implementing robust data pipelines. His technical skills encompass data analysis, visualization, and utilizing various tools and frameworks for complex problem-solving.

Experience

Data Engineer/Analyst

•Jan 2021 - Present•Bengaluru, IN

Read the questions carefully and identify the main concepts and formulas involved. Use clear and concise language to explain the reasoning and steps involved in solving the problem. Provide relevant examples or diagrams to illustrate the concepts or principles if needed. Check the answer for accuracy and consistency with the given information and units. Cite the sources of information or references used if applicable.

Subject Matter Expert in Physics

Chegg India Pvt. Ltd.

•Jan 2023•Greater Noida, UP

Data Analysis

•Jan 2018 - Jan 2021•Lucknow, IN

Finding the company's lead conversion rate is the goal. Determining whether a lead can be converted is a problem that can be solved by a well-developed logistic classification. The test data frame was used for the prediction, and the best cutoff value was 0.35 with 80% accuracy, sensitivity, and specificity. For Redshift Data Mart (Schema), create a batch ETL pipeline to read transactional data from RDS, transform it, and load it into the target dimensions and facts, utilizing Amazon S3, Apache PySpark, Amazon RedShift, and Apache Sqoop. Build the end-to-end pipeline for the e-commerce company. Read the data from the Kafka server and prepossess the data to appropriate the data. Calculate the time-based key performance indicators (KPIs) in real-time.

Education

IIIT Bangalore & upGrad

Executive Post Graduate Programme in Data Science

Data Science

Dr. A.P.J. Abdul Kalam Technical University

Bachelor of Technology

Electronics and Communication Engineering

Skills

Python
Pandas
NumPy
Scikit-learn
MySQL
PostgreSQL
IBM DB2
MongoDB
Cassandra
Apache Hive
Apache Sqoop
Flume
Kafka
Hadoop
MapReduce
AWS S3
Amazon Elastic MapReduce (EMR)
AWS EC2
Linux Scripting
IBM Cognos
Tableau
Amazon Redshift
Apache Airflow
PySpark
Data Analysis
Data Modelling
ELT/ETL
Data Streaming
Data Manipulation
Data Visualization