Architecting Python Microservices for 1M-Token Context Windows: Preventing Memory Bloat and Timeout Cascades