Noah Hertlein

Founder @NohaTek

The Inference Scheduler: Architecting High-Throughput LLM Serving with Continuous Batching and vLLM on Kubernetes

The Embedded Retriever: Architecting Ultra-Low Latency RAG Pipelines with Zvec and Python

The Knowledge Anchor: Architecting Hallucination-Resistant RAG Pipelines with Knowledge Graphs and Python

The Escalation Engine: Architecting Seamless AI-to-Human Handoffs with LangGraph and WebSockets