Yashwanth D

@ydoniken

Data Scientist at AMD

Hyderabad, Telangana, India

AMDIndian Institute of Information Technology and Management Gwalior

Yashwanth Donikena is a Data Scientist at AMD with extensive experience in Generative AI, NLP, and Machine Learning. He specializes in building RAG pipelines, LLM-driven agents, and automated CI/CD workflows. Previously, he worked at EXL Services, where he developed AI-driven insurance processing systems. Yashwanth holds an Integrated Post Graduation in Information Technology from IIITM Gwalior and is skilled in Python, SQL, and AWS cloud technologies.

Experience

Data Scientist

AMD

•Jun 2024 - Present•Hyderabad

Built a role-based AI document assistant that dynamically routes user queries to a DuckDB-backed SQL engine or a Chroma-based RAG pipeline, enforcing RBAC policies and reducing unauthorized data exposure risk by 100%. Designed and implemented an LLM-driven NL→SQL agent with a sandboxed DuckDB execution layer and intelligent fallback to RAG, reducing structured-query failures and improving answer coverage. Implemented role-aware retrieval and integrated a Cohere reranker to optimize document ranking, improving top-3 retrieval precision by approximately 30%. Built an automated RAG evaluation pipeline that generates synthetic QA pairs and computes faithfulness, relevance, and conciseness using LLM-based scoring for continuous model validation. Delivered an end-to-end system with a FastAPI backend, Chroma embeddings, DuckDB analytics layer with CI-driven automated API endpoint testing reducing regression issues by approximately 40%. Designed and implemented Dockerized CI/CD pipelines using Jenkins and GitHub Actions to automate ML model validation and test suite execution on NPU platforms, enabling dynamic scheduling across environments and reducing manual testing effort by 60%.

Data Scientist

EXL Services

•Jun 2023 - Apr 2024•Noida

Developed an automated insurance processing system leveraging advanced NLP techniques and OpenAI’s GPT-4 to optimize policy validation and accurately evaluate claim coverage. Processed large volumes of policy and claim data through robust preprocessing pipelines, utilizing spaCy for Named Entity Recognition (NER) and regex-based rules for data validation. Fine-tuned GPT-3.5 Turbo to enhance structured claim classification and decision-making efficiency, reducing inference costs while reserving GPT-4 for complex policy reasoning and edge-case evaluation. Successfully completed onboarding training at upGrad, excelling in SQL, Python, Machine Learning, Deep Learning, NLP, AWS achieving a score of 92%.

Education

Indian Institute of Information Technology and Management Gwalior

Integrated Post Graduation in Information Technology (B.Tech + M.Tech.)

Information Technology

Jul 2018 - May 2023

Skills

Python

SQL

C++

NLP

Machine Learning

Generative AI

Deep Learning

Transformers

Pandas

Scikit-learn

SpaCy

PyTorch

XGBoost

ONNX

FastAPI

LangChain

RAG

RAGAS

Prompt Engineering

OpenAI API

Open-Source LLMs

AWS Lambda

SageMaker

Bedrock

RDS

DynamoDB

Step Functions

Cursor

Jupyter Notebook

VS Code

MLflow

GitHub Actions

Docker

CI/CD