Hermes Agent credential pools for multi-key setup

If your Hermes Agent regularly hits 429 rate limits during busy hours, credential pools are the feature you didn't know you needed. They let you associate multiple API keys with the same provider config, and Hermes rotates between them per request. Same provider, same model, just twice (or three times) the per-account rate budget.

Pools are an official Hermes feature, documented in the credential pools docs. Most tutorials skip them because they sound advanced. They aren't.

The problem credential pools solve

Two related problems.

Rate limits on a single account. Anthropic gives you N requests per minute (RPM) on a given tier. The same goes for OpenAI, OpenRouter and other hosted providers. With one key, you hit that ceiling and Hermes either queues or returns 429s. With three keys in a pool, each on an account with its own limit, you have up to 3N RPM without upgrading the tier. Keys that share a single account-level limit add no headroom.

Per-account daily quotas. Some providers cap daily token spend per account regardless of plan (especially the free tiers on OpenRouter, Gemini, Groq). Once one account is exhausted, the agent stops working. With two accounts pooled, you get double the budget.

The catch: pooling raises request ceilings, not budgets. If total spend is what you've maxed out, extra keys just get you to that limit faster. Pools shine when the per-account limit is the bottleneck, not your total spend.
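To make the arithmetic concrete, here is a small simulation of per-account limits. It's a sketch under stated assumptions: `KeyBudget` and `accepted_in_one_minute` are hypothetical names, the limiter is a simple 60-second sliding window, and every key is assumed to have its own independent limit of 5 RPM. Real providers count limits in their own ways.

```python
from dataclasses import dataclass, field


@dataclass
class KeyBudget:
    """Hypothetical per-account limiter: at most `rpm` requests per 60-second window."""
    name: str
    rpm: int
    sent: list = field(default_factory=list)  # timestamps of accepted requests

    def try_send(self, now: float) -> bool:
        # Forget requests that have aged out of the 60-second window.
        self.sent = [t for t in self.sent if now - t < 60.0]
        if len(self.sent) >= self.rpm:
            return False  # a real provider would answer 429 here
        self.sent.append(now)
        return True


def accepted_in_one_minute(keys: list, attempts: int) -> int:
    """Send `attempts` evenly spaced requests within one minute, rotating round-robin."""
    accepted = 0
    for i in range(attempts):
        now = i * (60.0 / attempts)
        key = keys[i % len(keys)]
        if key.try_send(now):
            accepted += 1
    return accepted


single = accepted_in_one_minute([KeyBudget("a", rpm=5)], attempts=30)
pooled = accepted_in_one_minute([KeyBudget(n, rpm=5) for n in "abc"], attempts=30)
print(single, pooled)  # 5 15
```

One key accepts 5 of the 30 requests, three pooled keys accept 15. The gain disappears if the keys share one limit.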

Setting up a pool

The CLI exposes pool management through hermes auth. Exact syntax has shifted between versions; check hermes auth --help on your install to confirm. As of recent versions:

Add multiple keys to a provider

hermes auth add anthropic --pool primary
hermes auth add anthropic --pool secondary
hermes auth list anthropic

The CLI walks you through pasting each key. The pool name is arbitrary (I use "primary" and "secondary" but "key1" / "key2" works too).

Tell the provider config to use the pool

hermes provider set anthropic --credential-pool primary,secondary
hermes provider show anthropic

Now Hermes will rotate between the two keys on each request. Round-robin by default.

Rotation strategies

Hermes supports a few rotation modes. Pick based on what you're optimising.

Round-robin (default)

Each request goes to the next key in the list. Predictable, easy to reason about. Fine for most use cases.
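As an illustration of what round-robin means here, the sketch below cycles through key names in list order. `RoundRobinPool` is a hypothetical class for this article, not Hermes's actual implementation.

```python
from itertools import cycle


class RoundRobinPool:
    """Illustrative key rotation: hand out key names in list order, forever."""

    def __init__(self, key_names):
        names = list(key_names)
        if not names:
            raise ValueError("a pool needs at least one key")
        self._cycle = cycle(names)

    def next_key(self) -> str:
        return next(self._cycle)


pool = RoundRobinPool(["primary", "secondary"])
order = [pool.next_key() for _ in range(5)]
print(order)  # ['primary', 'secondary', 'primary', 'secondary', 'primary']
```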

Random

Each request picks a key at random. Slightly better at spreading load if your traffic is bursty.

hermes provider set anthropic --pool-strategy random

Failover (least common)

Always use the first key. Only switch to the next one if the first 429s or 5xxs. This is closer to fallback-provider behaviour applied within a single provider.

hermes provider set anthropic --pool-strategy failover

I use round-robin in production. Random has slightly less predictable cost attribution (you can't trace which key paid for which request from external billing alone). Failover is fine if you actively want one account to be the "primary" and the other a safety net.
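Failover is the least obvious of the three, so here is a rough sketch of the idea. `FailoverPool` is hypothetical, and the cooldown that lets the first key come back into use is my assumption. The article above doesn't say how Hermes decides when to return to the primary key.

```python
class FailoverPool:
    """Illustrative failover: prefer the earliest key that is not cooling down."""

    def __init__(self, key_names, cooldown_seconds=60.0):
        self._names = list(key_names)
        self._cooldown = cooldown_seconds  # assumed value, not a Hermes default
        self._blocked_until = {}  # key name -> time it may be used again

    def current_key(self, now: float):
        for name in self._names:
            if self._blocked_until.get(name, 0.0) <= now:
                return name
        return None  # every key is cooling down

    def report_failure(self, name: str, status: int, now: float) -> None:
        # Only rate limits and server errors push a key out of rotation.
        if status == 429 or 500 <= status <= 599:
            self._blocked_until[name] = now + self._cooldown


pool = FailoverPool(["primary", "secondary"], cooldown_seconds=60.0)
before = pool.current_key(now=0.0)
pool.report_failure("primary", status=429, now=0.0)
during = pool.current_key(now=10.0)
after = pool.current_key(now=61.0)
print(before, during, after)  # primary secondary primary
```

Note that a 400 or 401 deliberately does not trigger a switch in this sketch, because a second key won't fix a malformed request.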

Credential pools vs fallback providers

These two features sound similar. They aren't. Quick decision guide.

| Concern | Use this |
| --- | --- |
| Same provider hits rate limit | Credential pool |
| Same provider hits daily quota | Credential pool |
| Whole provider goes down | Fallback provider |
| Want to compare provider quality | Fallback provider |
| Need same model across two billing relationships | Credential pool (one provider) or fallback (Anthropic + OpenRouter routing to Sonnet) |

You can use both together. I do. Pool of two Anthropic keys as primary, single OpenRouter key as fallback. Rate limits get absorbed by the pool, full provider outages get caught by the fallback. Setup pattern is in our Hermes 402 quota fallback piece.
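The division of labour between the two features can be sketched like this. Everything here is hypothetical scaffolding: `RateLimited`, `ProviderDown`, `complete` and `make_sender` are names made up for the example, and the real routing logic inside Hermes may differ.

```python
class RateLimited(Exception):
    """Stands in for an HTTP 429 from the provider."""


class ProviderDown(Exception):
    """Stands in for a provider-wide outage (5xx or connection failure)."""


def complete(send, provider, pool_keys, fallback_provider, fallback_key):
    """Try each pooled key once, then hand the request to the fallback provider."""
    for key in pool_keys:
        try:
            return send(provider, key)
        except RateLimited:
            continue  # the next key in the pool absorbs the rate limit
        except ProviderDown:
            break  # another key on the same provider won't help
    return send(fallback_provider, fallback_key)


def make_sender(rate_limited=(), down=()):
    """Build a fake `send` function plus a log of the calls it received."""
    calls = []

    def send(provider, key):
        calls.append((provider, key))
        if provider in down:
            raise ProviderDown(provider)
        if (provider, key) in rate_limited:
            raise RateLimited(key)
        return f"answered by {provider}/{key}"

    return send, calls


send, busy_calls = make_sender(rate_limited={("anthropic", "primary")})
busy = complete(send, "anthropic", ["primary", "secondary"], "openrouter", "fallback")

send, outage_calls = make_sender(down={"anthropic"})
outage = complete(send, "anthropic", ["primary", "secondary"], "openrouter", "fallback")

print(busy)    # answered by anthropic/secondary
print(outage)  # answered by openrouter/fallback
```

A rate limit stays inside the pool, while an outage skips the remaining keys and goes straight to the fallback.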

Where you can't pool

A few edge cases worth knowing.

Some providers tie keys to specific projects or organisations. Pooling keys from two different projects on the same provider is fine. Pooling keys belonging to the same user in the same org sometimes triggers anti-abuse detection ("looks like one user spinning up multiple keys to dodge limits"). Provider TOS varies. Read the fine print.

Local providers (Ollama, LM Studio, vLLM) don't have rate limits in the same way and pooling doesn't apply. Just point at one endpoint.

Streaming responses from some providers are pinned to a specific key for the duration of the stream. Pooling between requests works fine, but a single long streaming response stays on one key, so a pool can't rescue a stream whose key gets rate-limited partway through.

Billing visibility with pools

Your provider bill now has charges across two (or more) accounts. If you need clean per-team or per-channel cost attribution, pooling makes it harder. You can mitigate this by:

  • Naming the keys descriptively when you create them on the provider dashboard ("hermes-bot-primary", "hermes-bot-secondary")
  • Setting separate budgets on each account at the provider level
  • Tracking spend on the Hermes side instead, covered in our Hermes cost tracking and budgets tutorial

For small teams this isn't worth worrying about. For org-level deployments where finance needs to allocate per-team, decide upfront how you want to track and stick to it.
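If you go the track-it-yourself route, the core of it is a ledger keyed by both key name and team. The sketch below is a generic illustration, not the mechanism from the cost tracking tutorial. `SpendLedger` and the team names are invented, and it counts tokens rather than money.

```python
from collections import defaultdict


class SpendLedger:
    """Illustrative usage ledger: tokens recorded per (key name, team) pair."""

    def __init__(self):
        self._tokens = defaultdict(int)  # (key_name, team) -> total tokens

    def record(self, key_name, team, input_tokens, output_tokens):
        self._tokens[(key_name, team)] += input_tokens + output_tokens

    def totals_by(self, dimension):
        """Sum tokens by 'key' or by 'team'."""
        index = {"key": 0, "team": 1}[dimension]
        totals = defaultdict(int)
        for pair, tokens in self._tokens.items():
            totals[pair[index]] += tokens
        return dict(totals)


ledger = SpendLedger()
ledger.record("hermes-bot-primary", "support", 1200, 300)
ledger.record("hermes-bot-secondary", "support", 800, 200)
ledger.record("hermes-bot-primary", "sales", 500, 100)

print(ledger.totals_by("team"))  # {'support': 2500, 'sales': 600}
print(ledger.totals_by("key"))   # {'hermes-bot-primary': 2100, 'hermes-bot-secondary': 1000}
```

The per-team view stays correct no matter which pooled key served a request, which is exactly what the provider's own bill can't give you.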

Real-world setup: My production pool

I have two Anthropic keys (primary and secondary, both on the same workspace) and one OpenRouter key as fallback. Pool strategy: round-robin. Anthropic monthly budgets are set on the dashboard so an unusual day doesn't blow my month. OpenRouter has a hard cap because it's only a fallback.

This setup has survived two Anthropic rate-limit incidents and one OpenRouter outage in the past four months. Users noticed nothing. The only operational task was checking the gateway logs the next morning to confirm what happened.

What happens when both keys in the pool 429

Hermes retries with exponential backoff a few times, then falls through to whatever fallback provider you've configured. If you have no fallback, the request errors. Setup detail in the fallback providers tutorial.
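For a feel of what "exponential backoff, then fallback" looks like, here is a minimal sketch. The retry count, base delay, factor and cap are assumptions chosen for the example, not Hermes defaults, and `call_with_backoff` is a hypothetical helper.

```python
import time


def backoff_delays(retries, base=1.0, factor=2.0, cap=30.0):
    """Delay before each retry: base, base*factor, base*factor^2, ... capped at `cap`."""
    return [min(cap, base * factor ** attempt) for attempt in range(retries)]


def call_with_backoff(try_pool, fallback=None, retries=3, sleep=time.sleep):
    """`try_pool` returns a response, or None when every pooled key answered 429."""
    for delay in backoff_delays(retries):
        response = try_pool()
        if response is not None:
            return response
        sleep(delay)
    if fallback is None:
        raise RuntimeError("all pooled keys rate limited and no fallback configured")
    return fallback()


slept = []  # record the delays instead of really sleeping
result = call_with_backoff(
    try_pool=lambda: None,               # pretend both keys keep returning 429
    fallback=lambda: "served by fallback",
    retries=3,
    sleep=slept.append,
)
print(backoff_delays(4))  # [1.0, 2.0, 4.0, 8.0]
print(slept, result)      # [1.0, 2.0, 4.0] served by fallback
```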

How pools interact with the 401 troubleshooting flow

If only one key in the pool is broken (you regenerated one and forgot to update it in Hermes), you get intermittent 401s that look random. With a two-key round-robin pool, roughly half your requests succeed and half fail. The fix is to verify each key in the pool independently. See our Hermes 401 auth errors tutorial for the curl-based verification flow. Run it once per key in the pool.
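The per-key check is easy to script. The sketch below is not the curl flow from the 401 tutorial. It assumes Anthropic's model-listing endpoint and headers as a cheap authenticated probe, so confirm both against the provider's current API reference before relying on it. `find_rejected_keys` and `expected_failure_rate` are hypothetical helpers, and the demo uses a stub probe so it runs without network access or real keys.

```python
import urllib.error
import urllib.request


def probe_anthropic(api_key: str) -> int:
    """Return the HTTP status of a cheap authenticated call (assumed endpoint)."""
    request = urllib.request.Request(
        "https://api.anthropic.com/v1/models",
        headers={"x-api-key": api_key, "anthropic-version": "2023-06-01"},
    )
    try:
        with urllib.request.urlopen(request, timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as error:
        return error.code  # 401 means the key was rejected


def find_rejected_keys(keys: dict, probe) -> list:
    """Return the names of the keys whose probe came back 401."""
    return [name for name, secret in keys.items() if probe(secret) == 401]


def expected_failure_rate(total_keys: int, rejected_keys: int) -> float:
    """Share of requests that fail when rotation spreads load evenly across keys."""
    return rejected_keys / total_keys


# Stub probe for the demo; pass probe_anthropic instead to test real keys.
def fake_probe(secret: str) -> int:
    return 401 if secret == "sk-stale" else 200


keys = {"primary": "sk-good", "secondary": "sk-stale"}
rejected = find_rejected_keys(keys, fake_probe)
print(rejected, expected_failure_rate(len(keys), len(rejected)))  # ['secondary'] 0.5
```

One bad key out of two gives the 50% failure pattern described above. One out of three would show up as roughly a third of requests failing.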

What I'd skip

Don't bother with pools if you're running a personal Hermes that gets a few dozen messages a day. The complexity isn't worth the wins. Pools start mattering at the point where you're routinely seeing 429s during peak hours or your daily budget exhausts before the day is over.

Pre-configured on LumaDock

The Hermes Agent template on LumaDock supports pools out of the box once you add a second key through the auth wizard. No special setup beyond what's covered above. Unmetered bandwidth and no setup fees on every plan. Full setup walkthrough in our Hermes Agent complete guide.

Questions?

What is a credential pool in Hermes Agent?

A way to associate multiple API keys with the same provider config. Hermes rotates between the keys on each request, so per-account rate limits don't block you as quickly.
