Which H100 instance to train Nanochat on
Benchmarking H100 PCIe vs SXM vs NVL on training cost, step times, and NCCL profiling to find the cheapest GPU configuration for Nanochat