Projects

Research papers, production pipelines, and side experiments — things I've built that I'm proud of.

Featured Projects

TinyYoloV2 Face Detection Visualizer
Computer Vision / Research

TinyYoloV2 Face Detection Visualizer

Built a live, interactive browser-based tool that visualizes how a face detection AI "sees" the world — layer by layer, neuron by neuron. Most AI models are black boxes; you put in an image and get an answer but have no idea what happened in between. This project peels back those layers, letting users literally see what patterns each part of the neural network is detecting. Built during a research collaboration with Carnegie Mellon University (CMU), this tool was built using TensorFlow.js, enabling it to run entirely in a web browser with no installation required.

TypeScriptTensorFlow.jsface-api.jsSSD MobileNet V1TinyYoloV2
NLP / Research

Aspect-Sentiment Opinion Summarizer (Published Research)

Built an AI system that reads e-commerce product reviews, Q&As, and descriptions from multiple sources and produces a structured, aspect-based summary — telling you exactly what customers love or hate about specific features like battery life, design, or price. The system scraped real product data from Flipkart and used generative AI models (T5, mT5, GPT-3) to generate human-readable summaries. This research was published at ACM CODS-COMAD 2023, one of India's premier data science conferences.

PythonmT5T5GPT-3Selenium
AI / Full Stack

Docketry — Intelligent Document Processing Platform

Designed and built the AI core of Docketry, a SaaS platform that automates document-heavy business workflows. The platform can read any document — invoices, contracts, ID cards — understand its layout and content, and automatically sort it into the right category. The AI model I fine-tuned, LayoutLMV2, reads documents the way humans do: it looks at both the text AND the visual structure of the page simultaneously. I also built the entire server backend, designed the system architecture, and published a Python package so businesses could plug Docketry into their existing systems with just a few lines of code.

PythonLayoutLMV2Hugging FaceDjangogRPC
NLP / Research

Indic NER — Multilingual AI for Indian Languages

Built an AI system that can read mixed-language text — like when someone writes a sentence using both Hindi and English words — and automatically identify important entities such as movie names, actor names, and locations. This is called Named Entity Recognition (NER), and doing it for Indian languages is uniquely hard because people switch languages mid-sentence all the time. I built this for the top 5 Indian languages, covering 17 different entity types, trained on over 100,000 annotated entries. I also created a data augmentation technique to generate more training data for languages with limited resources, achieving a 20% improvement in accuracy.

PythonHugging Face TransformersBERT (multilingual)PyTorch
AI / Data Engineering

RAG Pipeline for Healthcare Intelligence

Built an AI-powered system that makes sense of massive healthcare databases in real time. Imagine asking a question in plain English and getting back a concise, accurate summary from thousands of patient records — that is what this system does. It uses a technique called Retrieval-Augmented Generation (RAG), where an AI first searches for the most relevant records, ranks them by how closely they match the question, and then generates a clean, readable summary. The system was deployed in production with parallel processing, handling multiple queries simultaneously without slowdowns.

PythonLangChainOpenAI / LLM APIsFastAPIasyncio

All Projects

Scroll horizontally to explore

Security / Algorithms

Image Steganography — Hiding Secrets in Plain Sight

Built a Python tool that hides secret messages inside ordinary images — a technique called steganography. The image looks completely normal to the human eye, but it secretly contains up to 1.7 MB of hidden text. The technique uses the Least Significant Bit (LSB) algorithm, which modifies the tiniest component of each pixel's color value to encode data without causing any visible change to the image. This has real-world applications in secure communication and digital watermarking.

NLP / Research

Neural Machine Translation

Built an AI system capable of translating English sentences into Hindi. Translation is one of the hardest tasks in AI because language is nuanced — the same word can mean different things in different contexts. I trained two different neural architectures on 1 million sentence pairs from the Samanantar dataset (one of the largest multilingual corpora for Indian languages) and evaluated them using BLEU score, the industry-standard metric for translation quality.