Data Engineer with almost 2 years of experience working across different stages of data pipelines and eager to build robust databases. The candidate has implemented natural language processing tools to create machine-readable databases.
Experience
Data Engineer
Coditas Pvt Ltd.
Developed an ETL process to clean and transform data from multiple sources and load it into a data warehouse. Developed a data quality framework to standardize and validate data. Developed an ETL process that improved data extraction speed. Worked on Pyspark and Pyspark AWS services. Worked on transforming speech audio data to text data including masking process using services like AWS Transcribe, AWS Comprehend, Google Speech to Text and Assembly AI based on NLP. Extracted and fetched data from Unstructured Data form and processed it in Structured Data. Performed data cleaning and applied machine learning algorithms to develop clusters and target specific groups to fetch insights from it. Analyzed and visualized large datasets to uncover key insights using different pre defined liabraires like MatPlotlib,Seaborn,etc. Conducted number of tests to compare the performance of different algorithms.
Software Trainee
Zensar Technologies Pvt Ltd.
Worked on core fundamental skills on JAVA language. Worked on databases like SQL and PgSql and gained insights on queries and sub queries. Learned life skills and focused on improving personality development. Made a JAVA project based on online voting panel application with Authentication and Security.
Education
Walchand Institute Of Technology
Bachelors of Engineering
Computer Science and Engineering