The Sovereign Enclave: Architecting Confidential Computing Nodes on Kubernetes with Intel SGX and Gramine
The Context Economist: Architecting Cost-Aware Memory Systems for LLM Agents with Semantic Caching and Python
The Inference Scheduler: Architecting High-Throughput LLM Serving with Continuous Batching and vLLM on Kubernetes