Standalone Auth Server Design RFC #28

tgrunnagle · 2026-01-25T17:39:50Z

Overview

This PR adds design documentation for deploying ToolHive's existing pkg/authserver as a standalone Kubernetes service with mTLS authentication.

What's Included

THV-0028-standalone-auth-server-overview.md - RFC overview with problem statement, goals, high-level architecture, security considerations, and implementation phases
THV-0028-standalone-auth-server-design.md - Detailed technical design with CRD specifications, code snippets, and implementation guidance

Problem Being Solved

The authserver (pkg/authserver/) is a complete OAuth2/OIDC implementation but exists as an unintegrated library with no deployment model, no proxyrunner integration, and no operator support.

Proposed Solution

New MCPAuthServer CRD - Deploy authserver as a standalone Kubernetes service
mTLS Authentication - Secure communication between proxyrunners and authserver using cert-manager
Token Exchange Endpoint - Allow proxyrunners to retrieve upstream IDP tokens via /internal/token-exchange
MCPServer CRD Updates - Add authServerClientConfig for mTLS client certificate configuration

Key Design Decisions

Per-MCPServer client certificates for identity and audit
cert-manager integration for automated certificate lifecycle
RFC 9728 OAuth Protected Resource Metadata for discovery
Token caching in proxyrunner to reduce authserver calls

OK, I think this is good in the sense that we control who can access, but don't see how we gate which tokens the server can retrieve?

And a problem we need to figure out is - when the client talks to the authserver, the only identity-like information authserver is in the /authorize request. The MCP spec mandates that the client sends the resource and I suggest we take advantage of it.

Then the new authorization server controller could maintain a mapping (based on MCPGroup or label matching or something) between the resource and the MCPServer (name,namespace) pairs which we'll get from the certificate identity:

{ "https://github.example.com/": { "namespace": "engineering", "name": "github-mcp" }, "https://slack.example.com": { "namespace": "everyone", "name": "slack-mcp" } }

The authserver storage already has a session ID. This binds the JWT to the upstream token (alice's github token vs bob's github token). We'll then extend the storage to include the owner as well (name,namespace) so when authserver stores the token we'll have something like:

sessions[tsid] = { owner = name/namespace tokens = { access, refresh, ..} }

then we issue a JWT with the tsid claim.

On token retrieval, we extract the owner from the client cert and verify:

the owner

the tsid

for vMCP, this means that the vMCP instance is the owner of all the upstream tokens for all its back ends, but I don't know how to separate those..maybe based on claims? Maybe we could have a backend prefix in the claim? OTOH I would like the backends to be quite opaque from the client perspective.

this way we authenticate the clients and authorize access to the tokens on the level of the mcpserver and the user.

jhrozek · 2026-01-26T14:53:28Z