Ganji Kumar

@ganjikumar

Machine Learning Engineer II at Comcast

Chennai, TN

https://ganjiakhil.github.io/portfolio/

Comcast | Indian Statistical Institute

Machine Learning Engineer with expertise in Generative AI, NLP, and Computer Vision. Experienced in developing RAG applications and fine-tuning LLMs (Llama 2) in production environments. Holds a Master's in Computer Science (Data Science) and has practical skills in AWS, Docker, and Kubernetes.

Experience

Machine Learning Engineer II

Comcast

Full-time | Jul 2022 - Present | Chennai, TN

Generative AI (Nov’23 - Present): Developed a RAG application using GPT-3.5 for technicians to query technical manuals. Used OpenSearch to store chunk embeddings and deployed the application using Gradio. Extended it with a reranker for the retriever and implemented a semantic cache to reduce API cost. Fine-tuned the Llama 2 7B Instruct model for log summarization. Performed PEFT using LoRA, updating the weights of 0.3% of the parameters, to improve response quality. Implemented a Slack agent to receive the summaries.

AI For Connected Living (Oct’22 - June’23): Developed an audio signature classifier using YAMNet to classify audio from cameras and notify Xfinity customers. Built an IR image classifier using an XGBoost model with image features (blob size, skewness, entropy, intensity statistics, SSIM). Implemented an object state detector using MobileNetV2 and CLIP embeddings for various doorways (POC). Served as primary for the object detection model in production. Captured 500 errors using shadow deployment and resolved them.

Media Analytics (Jul’22 - Oct’22): In the Celebrity Recognition model, fine-tuned facial parameters and improved clustering accuracy by adding functionality to upload new faces to Elasticsearch.

AIOps Initiatives (June’23 - Nov’23): Implemented Kubeflow pipelines for automated training and deployment of DeepAnT and Prophet models for anomaly detection on metrics. Performed scaling and performance testing on the endpoint. Implemented multiprocessing to schedule parallel training of hundreds of models in Kubeflow.

Business Analytics Intern

Kofluence

Internship | Jul 2020 - Sep 2020 | Bangalore

Implemented influencer segmentation using a k-means clustering model on YouTube data to identify potential influencers with better channel engagement. Built a YouTube data scraping tool.

Engineering Management Trainee

ITC Limited (Hotels Division)

Trainee | Oct 2018 - Aug 2019 | New Delhi

Summer Trainee

Bharat Heavy Electricals Limited

Trainee | Jun 2016 - Jul 2016 | Hyderabad

Education

Indian Statistical Institute

Master's in Computer Science (Data Science)

Data Science

Oct 2020 - Jul 2022

Electives: Machine Learning, Neural Networks, Computer Vision, NLP, Information Retrieval, Discrete Mathematics, Linear Algebra, Statistics, Computational Finance, etc.

National Institute of Technology

B.Tech

Electrical Engineering

Aug 2014 - May 2018

Skills

Statistics
Machine Learning
Deep Learning
NLP
Computer Vision
Generative AI
LLM
Python
MySQL
Git
AWS
Docker
Kubernetes
Kubeflow
MLflow
Flask
NumPy
Pandas
Matplotlib
Scikit-learn
Seaborn
TensorFlow
LangChain