Projects
A selection of work spanning ML inference platforms, agent AI systems, creative tools, and open-source projects.
Amazon (6)
ML Inference Platform
Production ML Serving at Scale
Founding member of an ML inference platform team at Alexa AI. Built a production Tier-1 service hosting 30+ deep learning models serving all Alexa voice traffic, with zero production defects during a major infrastructure migration.
NLU Disambiguation System
Ambiguity Detection for Voice AI
Designed and launched an ambiguity detection and resolution system for Alexa, enabling millions of customers monthly to resolve ambiguous voice requests with significantly reduced error rates.
US Patent 12,494,194 B1
Incremental Asynchronous ML Inference
Granted patent for a novel architecture using neural network subgraphs for improved responsiveness in speech and NLU systems. 20 claims, active until 2044.
LLM Service Architecture
Microservices Decomposition
Authored a technical proposal to decompose a large monolithic LLM service into modular microservices. Reviewed by senior leadership and drove alignment across multiple organizations.
Agent AI Latency Optimization
LLM Chat Performance Engineering
Achieved significant latency reduction in an LLM-based chat product through preprocessing parallelization, speculative retrieval, and prompt caching techniques.
Alexa+ Launch Support
Technical Lead for Public Launch
Led a technical support team for the public Alexa+ announcement event, debugging blocking issues in real time to ensure a successful launch.
Personal (4)
vaani
Hindi Programming Language
A Hindi-based programming language comparable to Python. Original language design bridging linguistic barriers in computing, with a full parser and interpreter.
aishell
LLM-Augmented Shell in Rust
An AI-powered shell that uses LLMs to augment command-line workflows. Systems-level AI integration built in Rust.
ProofOfImpact
Blockchain + AI Social Impact
Decentralized platform using blockchain and AI to power trust, verification, and incentives in philanthropy, education, and digital media.
SecureConnect
Encrypted Communication Bridge
Modular, open-source system that creates a secure, local-first bridge between iPhone and Mac for encrypted, authenticated, policy-controlled communication.
DGX Research Lab (5)
DGX Research Lab
10+ AI Services on NVIDIA DGX Spark
Production-grade multi-blueprint AI platform with 15+ ML models, ~200 GB of aggregate GPU VRAM, and full observability. Spans LLM orchestration, multi-modal generation, voice synthesis, video analytics, and autonomous agents.
AutonomousMe
Digital Twin Agent System
Fully autonomous personal AI agent with 3-layer perception-cognition-action architecture, memory systems, and graduated trust autonomy (levels 0–4).
PDF-to-Podcast Pipeline
Transform Documents into Audio
Converts PDFs into engaging podcast audio via LLM-powered script generation. Supports dialogue and monologue modes with multiple TTS backends.
Voice Cloner
GPT-SoVITS Multi-Lingual TTS
Zero-shot and few-shot voice cloning with 5-second voice capture, supporting Chinese, English, Japanese, Korean, and Cantonese.
SongAgent
AI Music Generation
Conversational agent creating personalized songs with lyrics, beats (MusicGen), and vocals (Suno Bark). Fully autonomous from description to final mix.