Skip to content

Instantly share code, notes, and snippets.

@rvennam
Last active February 25, 2026 02:06
Show Gist options
  • Select an option

  • Save rvennam/46ef0443bed53b9946c0cbd3cc1f99f2 to your computer and use it in GitHub Desktop.

Select an option

Save rvennam/46ef0443bed53b9946c0cbd3cc1f99f2 to your computer and use it in GitHub Desktop.

Solo Enterprise for Agentgateway - Feature List

A comprehensive list of functional features available in Solo Enterprise for Agentgateway, categorized by LLM and MCP capabilities.


LLM Features

Multi-Provider Routing

  • Route to OpenAI
  • Route to Anthropic Claude
  • Route to AWS Bedrock via IAM or API key auth
  • Route to Azure OpenAI
  • Route to Google Vertex AI via OAuth or service account
  • Route to Google Gemini
  • Route to any OpenAI-compatible provider
  • Path-based routing to different providers/models
  • Header-based routing to different providers/models
  • Query parameter-based routing to different providers/models
  • Model override per backend (force a specific model regardless of client request)
  • Unified /v1/chat/completions interface across all providers

Streaming

  • Streaming responses supported across all providers
  • Time to First Token (TTFT) metrics
  • Tokens Per Output Token (TPOT) metrics

Security & Access Control

  • API key masking
  • JWT token validation with JWKS
  • Claims-based RBAC using CEL expressions
  • TLS termination
  • Frontend mTLS
  • SNI matching

Prompt Engineering & Content Safety

  • Prompt enrichment
  • Built-in guardrails: string matching patterns
  • Built-in guardrails: regex patterns
  • Built-in guardrails: built-in detectors
  • Guardrail actions: reject requests
  • Guardrail actions: mask sensitive content
  • External moderation API integration
  • Custom guardrails via webhook API

Rate Limiting

  • Request-based rate limiting
  • Per-user / per-key rate limiting via headers
  • Local token-based rate limiting
  • Global token-based rate limiting

Resilience & Traffic Management

  • Configurable request timeouts
  • Retry policies with error code matching
  • Retry backoff configuration
  • LLM failover with priority groups
  • Health-based routing
  • Intra-pool failover
  • Failover from rate-limited backends to healthy backends
  • Direct response

Transformations

  • CEL expression-based request transformations
  • CEL expression-based response transformations
  • Header enrichment

Advanced LLM Features

  • Batches API support
  • Embeddings API support
  • Models API passthrough
  • Function calling / tool use support
  • Claude Code CLI integration

Observability (LLM-specific)

  • Token usage metrics
  • Cost tracking
  • LLM request count and error rate metrics
  • Request duration metrics
  • Per-model and per-route metrics
  • Access logs for LLM requests
  • Pre-built Grafana dashboards for LLM monitoring

MCP Features

Connectivity

  • Static MCP backend routing
  • Dynamic MCP backend routing
  • Multiplex MCP
  • In-cluster MCP server routing
  • Remote MCP server routing
  • HTTPS connectivity to upstream MCP servers
  • SSE protocol support
  • Streamable HTTP protocol support
  • Centralized MCP tool server registry

Security & Access Control

  • MCP OAuth 2.0 authentication
  • Dynamic MCP client registration with IdP
  • JWT authentication for MCP servers
  • Claims-based RBAC for MCP access using CEL expressions
  • Tool-level access control
  • Principle of least privilege
  • On-behalf-of (OBO) token exchange
  • Elicitations

Observability (MCP-specific)

  • Full visibility into tool interactions
  • MCP request metrics in Grafana dashboards
  • Distributed tracing for MCP tool calls

Cross-Cutting / Platform Features

Observability Stack

  • OpenTelemetry (OTel) integration
  • Centralized log collection with Grafana Loki
  • Distributed tracing with Grafana Tempo
  • Metrics collection with Prometheus
  • Unified telemetry collection with OTel Collector
  • Grafana dashboards with pre-built panels
  • Control plane metrics

Enterprise UI

  • Dashboard
  • Playground
  • Tracing viewer
  • Gateway management view
  • Route management view
  • Destination management view
  • Policy management view
  • Elicitations management view
  • RBAC for UI access
  • OIDC-secured UI access

Kubernetes-Native Architecture

  • Built on Kubernetes Gateway API
  • Gateway and GatewayClass resources
  • HTTPRoute for traffic routing
  • Custom CRDs for backends, policies, rate limiting, and auth
  • Helm-based installation
  • Air-gapped installation support
  • OpenShift support
  • Configurable proxy replicas and resources
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment