OpenClaw Additional Providers: DeepSeek, Groq, Ollama, OpenRouter, Bedrock...

TL;DR Beyond the big three (Anthropic/OpenAI/Google), OpenClaw supports 30+ providers — from DeepSeek to local Ollama and everything in between.

#openclaw #deepseek #groq #ollama #openrouter #vllm #bedrock #sglang #mistral

Table of Contents

Unified Configuration Format
Cost-Effective: DeepSeek
Ultra-Fast Inference: Groq
Local Models: Ollama
Local Models: vLLM
Local Models: SGLang
Unified Gateway: OpenRouter
Enterprise Cloud: Amazon Bedrock
Other Providers at a Glance
Using Proxy Gateways
Speech Transcription: Deepgram
Community Tool: Claude Max API Proxy
The Big Picture
References

🌏 中文版

The previous model article covered Anthropic, OpenAI, and Google — the big three providers. But OpenClaw supports 35+ providers. This article covers the rest — from cost-effective DeepSeek, to ultra-fast Groq inference, to locally deployed Ollama and vLLM.

Unified Configuration Format

All providers follow the same pattern:

{
  env: { PROVIDER_API_KEY: "your-key" },
  agents: {
    defaults: {
      model: { primary: "provider/model-name" }
    }
  }
}

Most providers can be configured interactively with openclaw onboard.

Cost-Effective: DeepSeek

A Chinese AI company with an OpenAI-compatible API at extremely low prices.

openclaw onboard --auth-choice deepseek-api-key

Model	Use Case	Context
`deepseek-chat` (V3.2)	General conversation	128K
`deepseek-reasoner` (V3.2)	Reasoning / chain-of-thought	128K

Set the environment variable DEEPSEEK_API_KEY. If Gateway runs as a daemon, make sure the key is in ~/.openclaw/.env.

Ultra-Fast Inference: Groq

Groq uses its proprietary LPU hardware to run open-source models with extremely fast inference speeds.

{
  env: { GROQ_API_KEY: "gsk_..." },
  agents: {
    defaults: {
      model: { primary: "groq/llama-3.3-70b-versatile" }
    }
  }
}

Commonly used models:

Model	Highlights
Llama 3.3 70B Versatile	Broad capabilities, long context
Llama 3.1 8B Instant	Speed-oriented
Gemma 2 9B	Lightweight
Mixtral 8x7B	MoE architecture, complex reasoning

Bonus feature: Groq’s Whisper can do fast speech transcription — just configure it as the media-understanding provider.

Local Models: Ollama

Run open-source models on your own machine at zero cost.

# Install Ollama
# Pull a model
ollama pull glm-4.7-flash

# OpenClaw configuration
openclaw onboard  # Select Ollama

OpenClaw automatically discovers local Ollama models. It also supports cloud models (kimi-k2.5:cloud, minimax-m2.5:cloud, etc.).

Important warning: Do not use the /v1 OpenAI-compatible URL. This breaks tool calling — the model will output tool JSON as plain text. Use the native Ollama API URL: http://host:11434 (without /v1).

Minimal setup: Set OLLAMA_API_KEY=ollama-local, and OpenClaw handles auto-discovery.

Local Models: vLLM

Serves open-source and custom models via an OpenAI-compatible HTTP API.

# Start vLLM server
# Configure
export VLLM_API_KEY="vllm-local"

{
  agents: {
    defaults: {
      model: { primary: "vllm/your-model-id" }
    }
  }
}

Like Ollama, vLLM supports auto-discovery — if VLLM_API_KEY is set but no provider config is specified, OpenClaw will query GET http://127.0.0.1:8000/v1/models.

Manual configuration allows specifying parameters like contextWindow, maxTokens, etc.

Local Models: SGLang

Another local model runtime. Documentation is in docs/providers/sglang.md, and the configuration approach is similar to vLLM.

Unified Gateway: OpenRouter

One API key to access models from multiple providers. Model format: openrouter/<provider>/<model>.

openclaw onboard --auth-choice apiKey --token-provider openrouter --token "$OPENROUTER_API_KEY"

{
  agents: {
    defaults: {
      model: { primary: "openrouter/anthropic/claude-sonnet-4-6" }
    }
  }
}

Ideal for those who want a single bill for accessing models from multiple providers. OpenRouter automatically injects cache control for Anthropic models.

Enterprise Cloud: Amazon Bedrock

No API key needed — uses the AWS SDK default credential chain.

{
  models: {
    providers: {
      "amazon-bedrock": {
        api: "bedrock-converse-stream"
        // No apiKey needed, uses AWS credentials
      }
    }
  }
}

Authentication priority: AWS_BEARER_TOKEN_BEDROCK → standard AWS credentials → profiles → SDK chain.

Supports auto-discovery — if AWS credentials are available, it automatically lists available Bedrock models (via bedrock:ListFoundationModels, cached for 1 hour).

When using instance roles on EC2, set AWS_PROFILE=default to tell OpenClaw that credentials are available.

Required IAM permissions: bedrock:InvokeModel, bedrock:InvokeModelWithResponseStream, bedrock:ListFoundationModels.

Other Providers at a Glance

Each provider has its own dedicated documentation. Here are the highlights:

Provider	Provider ID	Auth Method	Highlights
Mistral	`mistral`	API Key	European company, strong multilingual support
xAI	`xai`	API Key	Grok models
NVIDIA	`nvidia`	API Key	NIM inference service
Hugging Face	`huggingface`	API Key	Inference API
Together AI	`together`	API Key	Multi-model inference platform
Qwen / Alibaba Cloud	`qwen_modelstudio`	API Key	Qwen series
GLM (Zhipu AI)	`glm`	API Key	GLM series
MiniMax	`minimax`	API Key	Chinese AI
Moonshot (Kimi)	`moonshot`	API Key	Kimi series, includes Kimi Coding
Qianfan (Baidu)	`qianfan`	API Key	ERNIE Bot
Volcengine (Doubao)	`volcengine`	API Key	ByteDance
Xiaomi	`xiaomi`	API Key	Xiaomi AI
Venice	`venice`	API Key	Privacy-focused
GitHub Copilot	`github-copilot`	OAuth	Uses Copilot subscription
LiteLLM	`litellm`	Custom	Unified proxy gateway
Vercel AI Gateway	—	Custom	Vercel proxy
Cloudflare AI Gateway	—	Custom	Cloudflare proxy

Using Proxy Gateways

LiteLLM, Vercel AI Gateway, and Cloudflare AI Gateway are not model providers — they are proxy layers. You place them between OpenClaw and the actual provider for unified billing, rate limiting, or routing.

Speech Transcription: Deepgram

Not a language model, but a speech transcription service. Once configured as the transcription provider, voice messages are automatically converted to text.

Community Tool: Claude Max API Proxy

A community-maintained proxy that uses Claude subscription credentials to access the API. Make sure to verify compliance with Anthropic’s Terms of Service before using it.

The Big Picture

OpenClaw’s provider ecosystem breaks down into four tiers:

Top-tier commercial (Anthropic / OpenAI / Google) — Strongest capabilities, most expensive
Cost-effective (DeepSeek / Groq / Mistral) — Cheap or with free tiers, solid capabilities
Local deployment (Ollama / vLLM / SGLang) — Zero cost, but requires your own hardware
Proxy gateways (OpenRouter / LiteLLM) — One key to access multiple providers

You can mix and match across tiers in model.fallbacks — for example, use Claude as the primary, fall back to DeepSeek, then fall back to local Ollama.

References

This article is compiled from the following OpenClaw source documents:

docs/providers/index.md — Provider directory
docs/providers/deepseek.md — DeepSeek
docs/providers/groq.md — Groq
docs/providers/ollama.md — Ollama
docs/providers/vllm.md — vLLM
docs/providers/sglang.md — SGLang
docs/providers/openrouter.md — OpenRouter
docs/providers/bedrock.md — Amazon Bedrock
docs/providers/mistral.md — Mistral
docs/providers/xai.md — xAI
docs/providers/nvidia.md — NVIDIA
docs/providers/huggingface.md — Hugging Face
docs/providers/together.md — Together AI
docs/providers/qwen_modelstudio.md — Qwen
docs/providers/glm.md — GLM
docs/providers/minimax.md — MiniMax
docs/providers/moonshot.md — Moonshot
docs/providers/qianfan.md — Qianfan
docs/providers/volcengine.md — Volcengine
docs/providers/xiaomi.md — Xiaomi
docs/providers/venice.md — Venice
docs/providers/github-copilot.md — GitHub Copilot
docs/providers/litellm.md — LiteLLM
docs/providers/vercel-ai-gateway.md — Vercel AI Gateway
docs/providers/cloudflare-ai-gateway.md — Cloudflare AI Gateway
docs/providers/deepgram.md — Deepgram
docs/providers/claude-max-api-proxy.md — Claude Max API Proxy