With any agent sandbox or client you use to interact with LLMs, the same question always arises: how can I securely, and in an observable fashion, connect to endpoints (LLMs, MCP
Think about two scenarios that are pretty common. 1) You hit a rate limit or run out of tokens, so you have to "downgrade" to a smaller, less powerful Model. 2)
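The first scenario can be sketched as a simple fallback loop; everything here is hypothetical (the model names, the `RateLimitError` type, and the stubbed `call_model`), not tied to any real provider SDK:

```python
# Minimal sketch of scenario 1: fall back to a smaller Model when the
# preferred one is rate-limited. Names and errors are placeholders.

class RateLimitError(Exception):
    pass

MODELS = ["big-model", "small-model"]  # preferred Model first

def call_model(model: str, prompt: str) -> str:
    # Stub: pretend the big Model is rate-limited right now.
    if model == "big-model":
        raise RateLimitError(model)
    return f"{model}: answer to {prompt!r}"

def complete(prompt: str) -> str:
    for model in MODELS:
        try:
            return call_model(model, prompt)
        except RateLimitError:
            continue  # downgrade to the next Model in the list
    raise RuntimeError("all models rate-limited")

print(complete("summarize this"))
```

In a real client the fallback usually lives in a gateway or proxy layer, which is exactly where observability hooks belong too.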
"treat 'em like cattle, not pets".
This was, and continues to be, how many people view Kubernetes Pods and microservice-based architectures. It makes a lot of sense for objects like
An Agent makes a call to an LLM. The LLM decides which MCP server tool should be used for a task. The Agent then makes a call to said tool. This can happen
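That Agent → LLM → tool flow can be sketched like this; the tool names and the LLM routing stub are hypothetical stand-ins, where a real setup would call an actual LLM and an MCP server:

```python
# Minimal sketch of the loop: the LLM picks a tool, the Agent calls it.

def fake_llm(prompt: str) -> str:
    """Stand-in for a real LLM call: decides which tool fits the task."""
    if "weather" in prompt:
        return "get_weather"
    return "search_docs"

# Stand-ins for tools an MCP server might expose.
TOOLS = {
    "get_weather": lambda task: f"sunny (for: {task})",
    "search_docs": lambda task: f"3 results (for: {task})",
}

def run_agent(task: str) -> str:
    tool_name = fake_llm(task)       # LLM decides which tool to use
    return TOOLS[tool_name](task)    # Agent then calls said tool

print(run_agent("what's the weather in Oslo?"))
```

Each hop in that loop (Agent to LLM, Agent to tool) is a network call you'd want to secure and observe, which is the point of putting a gateway in between.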
Your Agent has a "mind of its own" (well, it was programmed to act a particular way). For example, Claude Code is known to downgrade your Model for particular tasks to