Skip to content

OpenClaw's Model Requirements and Provider Ecosystem

Mar 28, 2026 1 min
TL;DR OpenClaw supports 35+ model providers. The minimum requirement is that the model supports tool use + streaming. It has built-in auth rotation and model failover mechanisms.

🌏 中文版

OpenClaw is a model-agnostic AI gateway that supports 35+ providers. But you can’t just plug in any model — it has explicit functional requirements. This post covers OpenClaw’s model requirements, how to configure the major providers, and the failover mechanisms when things go wrong.

Functional Requirements for Models

OpenClaw’s agent operation relies on these capabilities:

Tool Use (Required) — The model must support function calling. This is the foundation of OpenClaw agent operation; without tool use, no tools can be executed.

Streaming (Required) — Supports streaming output, used for real-time responses and chunked delivery to chat platforms.

Extended Thinking (Optional) — Deep reasoning capability. Claude 4.6 supports this natively; it automatically degrades to normal mode on failure.

Schema Compatibility — Different providers use different tool schema formats. OpenClaw has a built-in normalizer that handles schema differences between Gemini and OpenAI, so no manual processing is needed.

The documentation recommends: “use the strongest latest-generation model available”. If using local models (Ollama, vLLM, SGLang), make sure the model supports tool calling.

Configuration

All models use a unified provider/model format, configured in ~/.openclaw/openclaw.json:

{
  agents: {
    defaults: {
      model: { primary: "anthropic/claude-opus-4-6" }
    }
  }
}

Supported Providers

CategoryProviders
Top-tier CommercialAnthropic (Claude), OpenAI (GPT), Google (Gemini)
Chinese VendorsDeepSeek, Qwen/Alibaba Cloud, GLM (Zhipu), MiniMax, Moonshot (Kimi), Qianfan (Baidu), Volcengine (Doubao), Xiaomi
Inference AccelerationGroq (LPU), Together AI
Local DeploymentOllama, vLLM, SGLang
Proxy GatewaysOpenRouter, LiteLLM, Vercel AI Gateway, Cloudflare AI Gateway
OthersxAI, Mistral, NVIDIA, Hugging Face, Venice, Amazon Bedrock, GitHub Copilot
Speech TranscriptionDeepgram

Configuration Details for the Three Major Providers

Anthropic (Claude)

Three authentication methods (choose one): API Key, Claude CLI, or Setup-Token.

openclaw onboard --anthropic-api-key "$ANTHROPIC_API_KEY"
ItemDetails
Recommended Modelanthropic/claude-opus-4-6
Thinking ModeClaude 4.6 defaults to adaptive
Fast ModeAPI Key only, maps to service_tier: "auto"
Prompt Cachingnone / short (5 min) / long (1 hr), API Key defaults to short
1M ContextBeta, requires setting params.context1m: true

Limitations: CLI mode does not support tool use or streaming. Setup-Token does not support fast mode.

OpenAI (GPT)

Two authentication methods: API Key or Codex subscription (OAuth).

{
  agents: { defaults: { model: { primary: "openai/gpt-5.4" } } }
}
ItemDetails
Recommended Modelopenai/gpt-5.4, openai/gpt-5.4-pro
Transport ProtocolAutomatic (WebSocket preferred, SSE fallback)
Fast Modereasoning.effort = "low" + text.verbosity = "low"
Auto CompactionTriggers server-side compaction at 70% context window

Codex subscription users can also use openai-codex/gpt-5.3-codex-spark.

Google (Gemini)

Primarily uses API Key authentication.

openclaw onboard --auth-choice google-api-key
ItemDetails
Recommended Modelgoogle/gemini-3.1-pro-preview
Special CapabilitiesImage generation, image/audio/video understanding, Web search (Grounding)
ThinkingGemini 3.1+ supports reasoning mode

Model Failover Mechanism

OpenClaw has a built-in two-stage failover:

Stage 1: Round-robin auth profile rotation within the same provider
         ↓ All exhausted
Stage 2: Switch to fallback model

Configuring fallbacks:

{
  agents: {
    defaults: {
      model: {
        primary: "anthropic/claude-opus-4-6",
        fallbacks: ["openai/gpt-5.4", "google/gemini-3.1-pro-preview"]
      }
    }
  }
}

Cooldown Mechanism

Error TypeCooldown Escalation
General Failure1 min → 5 min → 25 min → 1 hr (cap)
Billing/Quota Failure5 hr → 10 hr → 20 hr → 24 hr (cap)

Multi-Account Rotation

You can configure multiple sets of API Keys; on failure, it automatically switches to the next one. Auth profiles are stored in ~/.openclaw/agents/<agentId>/agent/auth-profiles.json. The default ordering prioritizes OAuth over API Key, and within the same type, uses “oldest first.”

Session Stickiness

Once an auth profile is selected, it remains unchanged for the entire session duration, maintaining provider cache efficiency. Switching only occurs on session reset, compaction, or cooldown trigger.

Thinking Degradation

If an extended thinking call fails, it automatically degrades to normal mode without interrupting the conversation.

Overall

OpenClaw’s baseline requirement for models is tool use + streaming. On top of that, it uses three layers of mechanisms — auth rotation, model fallback, and thinking degradation — to ensure uninterrupted service. If you have API Keys from multiple providers, configuring fallbacks gives you very high availability. Local models can also be connected, but you need to verify tool calling support.

References

This post is compiled from the following OpenClaw source documents: