Default profile banner
AS

Antara Raman Sahay

@antararaman

Gen AI Engineer

New Delhi, Delhi, India

Test ZeusInvertis University

Experience

Gen AI Engineer

Test Zeus

Full-timeAug 2025 - Dec 2025Bangalore, Karnataka, India

Multi-Agent Optimization (Reduced execution time by 40%) • Optimized multi-agent system that executes software test cases by interacting with user interfaces in virtual environments. • Identified primary latency bottlenecks through code-level analysis, isolating excessive planner-driven LLM calls as a major overhead. • Deployed a caching-based replay mechanism using the OpenAI SDK and PocketBase, enabling deterministic re-runs by directly invoking sub-agents and tools without planner involvement. • Configured a mode-switching execution strategy (Planner Mode vs RL Mode) with automatic fallback to planner reasoning on failure. • Reduced overall execution time by 40% by eliminating redundant LLM calls and enabling sequential sub-agents and tool-driven execution for repeated execution paths. Customer-Facing Multi-Agent QA Chatbot Suite • Architected a customer-facing multi-agent QA chatbot (AG2) that guided users across the full QA lifecycle and produced production-ready test scenarios, cases, data, and environments. • Created adaptive memory and personalization using Redis, persisted finalized artifacts in pocketbase with hash-based deduplication, and orchestrated multi-modal ingestion (26+ MIME types) via Hatchet workflows and RAG tools. • Productionized the system using ARQ-based guardrails, caching, and retries, enabling reliable, end-to-end test suite generation for beginner to senior QA users.

Software Engineer Trainee

Helmerich and Payne

Full-timeMar 2024 - Aug 2025Remote, OR, USA

Automated Code Reviewer • Independently led the end-to-end development of an AI-powered code review system that analyzes pull requests and mimics senior SWE feedback, cutting manual review and release delays by 10+ hours per cycle. • Conducted in-depth research on academic papers, open-source tools, and commercial solutions; identified key gaps and selected optimal modeling strategy. • Fine-tuned the base model and improved performance through quantization and parallel processing, reducing latency by 30%. • Designed and released a CI-ready Python package, a FastAPI model server, and integrated deployment via Bitbucket Pipelines and Jenkins. • Seamlessly operationalized within organizational CI/CD workflows. Smart Drilling Dashboard • Built an NLP-powered dashboard that translates natural language queries into SQL, fetches real-time drilling data from databricks, and computes domain-specific metrics. • Constructed a dynamic graph module using sliding window analytics to detect drilling dysfunctions with embedded domain knowledge. • Generated AI-powered summaries for both data and visual insights; incorporated a memory-based multimodal RAG pipeline (ColPaLi). MDS Data Extraction using Azure AI • Structured and optimized an OCR pipeline to extract structured data from complex PDF documents containing tables and embedded images, using a hybrid approach of traditional OCR (PyTesseract, PDF readers, base64 encoding, etc) and layout-aware models (e.g., CoLPaLi). Evaluated and benchmarked various LLMs and VLMs (GPT-4o, Qwen 2/2.5/2VL-7B, CoLPaLi) for accuracy and reliability using a Human-in-the-Loop (HITL) evaluation framework, structuring outputs to conform to the organization’s Master Data Service (MDS) schema for seamless integration into internal systems. Research & Development • Architected a custom Graph RAG pipeline using domain-specific ontologies for enterprise data retrieval in the oil and gas domain. • Fine-tuned a Small Language Model (SLM) using Supervised Fine-Tuning (SFT) on internal domain data to enhance relevance and performance for enterprise-specific tasks. • Instituted a domain-specific knowledge base using ontology-guided NLP and entity linking to structure insights from drilling reports, completion summaries, and equipment manuals for semantic search and contextual retrieval. LLMOps, MLOps, and DevOps Initiatives • Engineered a Jenkins Shared Library PoC tailored for ML and LLM pipelines, accelerating CI/CD workflows and standardizing deployment across multiple AI services.

Data Science Mentor

Skillians

FreelanceNov 2024 - Jul 2025Remote, OR, USA

Deliver live interactive sessions on core and advanced Data Science topics including Python, SQL, Machine Learning, Deep Learning (NLP, Computer Vision, Time Series), and MLOps.

AI Research Intern

MTrench

Full-timeAug 2023 - Nov 2023Remote, OR, USA

Researched, conceptualized, and deployed cutting-edge language model architectures (LLMs) to address business challenges using advanced NLP techniques. • Processed and visualized large unstructured datasets, utilizing ML algorithms to extract actionable insights and enhance decision-making efficiency by 25%. • Partnered with product, engineering, and data science teams to deliver scalable AI/ML solutions, reducing processing time by 20% and enhancing model accuracy.

Education

Invertis University

Bachelors of Technology

Computer Science and Engineering

Sep 2019 - Jul 2023Grade: First Class

1. Secured scholarship as top 0.1% student of the university for the Student Exchange Program at Livingstone College, USA. 2. President at I-Tech (The Technical Club) 3. served as the Joint - Secretary of the same club 4. Elected thrice consecutively as the CR for my class

Licenses & Certifications

Google Cloud Certified: Google Cloud Digital Leader

Google Cloud

Skills

Python (OOPs, DSA)
SQL
Bash
Fine-tuning & deployment of LLMs
SLMs
VLLMs
RAG
Agents
MCP
CUA
TensorFlow
PyTorch
Scikit-learn
Hugging Face
Transformers
vLLM
UnsLoTH
LangChain
Accelerate
OpenCV
AG2
CrewAI
Parlant
FastAPI
TensorFlow Serving
Docker
CI/CD (Jenkins, Bitbucket Pipelines)
Git
MLflow
Google Cloud Platform (GCP)
Azure AI Foundry
Flask
Streamlit
Hatchet Workflows
Pocketbase DB