Sahil Malik
Senior Software Engineer | AI Systems Architect
Summary
Senior software engineer and AI systems architect with nearly a decade at Amazon, building large-scale machine learning infrastructure and intelligent AI systems used by millions. Works at the intersection of distributed systems, applied machine learning, and production AI platforms — translating research-grade ideas into reliable, scalable services.
Led the design and delivery of Tier-1 ML inference services powering 30+ deep learning models at scale, executing zero-defect migrations across critical infrastructure. Co-inventor of a granted US patent introducing incremental asynchronous neural inference architectures using model subgraphs. Currently building agent AI systems and LLM-based products, contributing to next-generation AI initiatives including Alexa+ and enterprise AI agent platforms.
Beyond production systems, builds applied AI platforms that solve real-world problems — including an AI-driven personal finance system using hybrid rule-based and LLM approaches, and vaani, a Hindi-keyword programming language expanding linguistic accessibility in computing. Combines rigorous engineering discipline with systems thinking, from chip-level compute economics to end-user AI products.
Technical Skills
ML & AI
Languages
Infrastructure
System Design
Patent
“Machine learning model architecture for incremental asynchronous inference” — Amazon Technologies Inc, granted Dec 2025. 20 claims, active until 2044.
Experience
Sr. Software Development Engineer
2025–Present · Seattle, WA
Agent AI systems, LLM latency optimization, and developer tooling.
- •Significant latency reduction in LLM-based chat through parallelization and caching
- •Driving migration to modern agent frameworks
- •Built developer tooling for AI-assisted debugging with MCP integration
Sr. Software Development Engineer — Alexa+
2024–2025 · Seattle, WA
LLM service architecture, agent orchestration, and cross-org technical leadership.
- •Demonstrated sub-2-second Alexa response latency in cross-team demos
- •Led technical support team for Alexa+ public launch event
- •Authored monolith-to-microservices architecture proposal reviewed by senior leadership
- •Drove cross-organization alignment across 6+ teams
Sr. Software Development Engineer
2023 · Seattle, WA
Self-learning arbitration systems and containerized microservices.
- •Built self-learning arbitration systems
- •Subject matter expert for containerized microservices and cloud infrastructure
Software Development Engineer II — Alexa AI
2019–2023 · Seattle, WA
Founding member of an ML inference platform team. Grew the platform from serving a small fraction to 100% of Alexa voice traffic.
- •Delivered a production Tier-1 ML inference service hosting 30+ models
- •Built an NLU disambiguation feature serving millions of customers monthly
- •Expanded ML-based routing to 11 locales across 8 languages
- •Co-invented US Patent 12,494,194 B1 for ML inference architecture
Software Development Engineer II — Payments
2018–2019 · Hyderabad, India
Financial products platform development.
- •End-to-end owner of a customer differentiation module
- •Prototyped NoSQL migration for millions of customer records
Software Development Engineer I — Gift Cards
2016–2018 · Hyderabad, India
Gift card services, security features, and API development.
- •Built refund processing system impacting tens of thousands of customers
- •Designed and implemented security features for the gift card claim flow
- •Developed 5 APIs with >95% test coverage
Software Development Engineer Intern
2015 · Hyderabad, India
Summer internship, converted to full-time offer.
Education
B.Tech Computer Engineering
NIT Kurukshetra · 2013–2017 · CGPA: 8.99/10.0
College Projects
Secret Text (Major Project)
Android · Java · Encryption
Android app combining voice recognition with text encryption for secure message sharing
Bird Call Recognition (Minor Project)
Python · ML · Audio Processing
ML-based system to identify bird species from audio recordings of their calls