Default profile banner
RUHAN SIDDIQUIRS

RUHAN SIDDIQUI

@ruhansiddiqui

AI/ML || DATA SCIENTIST || GEN-AI ENGINEER

Kanpur

Think Future TechnologiesKanpur institute of technology

Enthusiastic and results-driven professional with 2.6 years of expertise in machine learning, generative AI, and data sciences. Proven track record of leveraging advanced technologies to drive impactful solutions. Excels in translating complex data into actionable insights, driving innovation, and contributing to successful project outcomes.

Experience

Machine Learning || Gen-AI engineer

Think Future Technologies

•Oct 2022 - Present•Gurgaon, India

Developed LLM's Langchain and OpenAI App. Created a Knowledge base chatbot using Langchain and various open-source LLMs (e.g., LLAMA 7B, Mistral AI). Constructed embedding models for precise responses. Implemented storage of embeddings in Chroma and FAISS vector DB. Fine-tuned OpenAI models (text-davinci-003, gpt-3.5-turbo) for specific use cases. Applied LEFT LORA techniques to fine-tune LLM models using provided data. Built a language translation model. Developed OCR models for real-time image scraping. Implemented Text-to-Speech (TTS) using style TTS and Bark AI for multiple languages. Created a Speech-to-Text (STT) model using Whisper for Hindi, English, and Punjabi. Applied prompt engineering techniques for improved output. Developed a Resume parser using Mistral AI LLM models. Constructed a Machine Learning classification model for layer identification. Established a Retrieval Augmentation Generation (RAG) pipeline for fetching accurate data from source documents to enhance response generation.

Data Ops Developer

De Soto Technologies

•May 2022 - Oct 2022•India

Worked on Python and Advance Pandas. Worked on Selenium with Python. Worked on writing SQL queries and create Reports.

Data Analyst

BDB.ai

•Sep 2021 - Apr 2022•Bangalore, India

Extracted and transformed unstructured live data from Telematics devices using Python in the Data Pipeline. Developed structured format, created a pipeline to store data in MySQL DB. Utilized MySQL for data analysis and query execution based on client specifications. Collected real-time data from Caruso website via Postman API, ingested into MongoDB using Python. Performed aggregation operations to transition nested data to a non-nested format in MongoDB. Established a pipeline environment for processing and storing live data in MongoDB Database. Applied MongoDB queries for data analysis in line with client requirements. Selected impactful KPIs in Dashboard designer, reported daily work in Agile Environment using Jira for informed business decision-making.

Education

Kanpur institute of technology

Bachelor of Technology

Computer Science

Jan 2017 - Jan 2021

Skills

Python
SQL
Pandas
Numpy
Matplotlib
Seaborn
Data Cleaning
EDA
Data Analysis and Visualization
Tableau
Machine Learning
NLP
Deep learning
OpenAI
Langchain
Vector DB
LLM
git
Gen-AI
Conversational-AI
Postman
RestAPI
Prompt Engineering
FastAPI
Swagger
PEFT LORA
TGI
OCR
STT
TTS