Configure Models
This page explains how to configure a large language model for Hermes, including the differences between the three configuration methods, how to obtain an API Key, console configuration steps, supported models, and model selection guidance.
> 💡 Configuration is optional > > The Launch plan includes 100 trial credits. After creating Hermes and connecting a channel, you can start chatting directly. Configure a model after the credits run out.
---
This page covers
- Differences between the three configuration methods (Coding Plan / API / Custom Model)
- How to obtain an API Key
- Console configuration steps
- Model switching
- Supported models
- Model selection guidance
---
Three configuration methods
The first-level dropdown in the console "Model" card provides three options:
| Method | Billing mode | API Key source | Use case |
|---|---|---|---|
| Model Coding Plan | Monthly / yearly | Provider | High-frequency usage |
| Model API | Pay-as-you-go | Provider | Low-frequency use or trial |
| Custom model | Depends on endpoint | User-provided | Connect overseas models or OpenAI-compatible endpoints |
> ⚠️ Prepare your own API Key > > All three methods require your own API Key. Please register with the corresponding provider and create a Key first.
---
General process for obtaining an API Key
Provider interfaces differ, but the process is generally the same:
- Visit the provider website: After selecting a model in the console model card, the "Get API KEY" link appears below. Click it to open the provider website
- Register or sign in: Most Chinese platforms support WeChat or phone-number login
- Complete real-name verification: Chinese model providers generally require real-name verification before API calls are allowed
- Open the API Key management page: Usually under "Console → API Management / Key Management / Credentials"
- Create a new Key: Name the Key (for example,
LightVela) and copy it immediately after creation. Most platforms show the complete Key only once
- Return to LightVela and paste the Key
> ⚠️ API Keys are sensitive credentials > > Do not send Keys to chat groups, publish them on web pages, or commit them to Git repositories. If a Key is leaked, revoke or reset it immediately in the provider console.
---
Configure in the console
Enter the console and open the "Model" button.
Configuration steps
- Choose source (first-level dropdown): Model API / Model Coding Plan / Custom Model
- Choose specific model (second-level dropdown): Available models are shown based on the source selected above
- Enter API Key: Paste the Key you obtained. The eye icon on the right toggles visibility
- Click "Add as default"
- The corresponding record appears in the "Current model" list, indicating successful configuration

Additional fields for custom models
When selecting "Custom Model", you usually need to fill in the following fields in addition to the API Key:
- Base URL: The model service API address, for example
https://api.openai.com/v1
- Model Name: The model identifier, for example
gpt-4oorclaude-3-5-sonnet-20241022
The actual fields are subject to the console interface.
> 💡 How to configure custom models (such as OpenAI GPT, Google Gemini, etc.) > > In addition to common models listed in the console, such as MiniMax, Kimi, and Zhipu GLM, users can also configure OpenAI GPT, Google Gemini, Claude, or other models not shown in the list through "Custom Model".
---
Switch models
How to switch
Run the configuration steps again. The new "Add as default" action overwrites the previous configuration.
> ⚠️ Only one model is active > > Currently, each Hermes can have only one active model. To compare different model performance, configure and try them separately.
Impact on memory
Switching models does not affect conversation memory. Memory belongs to Hermes itself; the model is only responsible for generating responses.
---
Supported models
Model Coding Plan (monthly)
| Name | Provider |
|---|---|
| Zhipu AI (GLM International - Coding Plan) | Zhipu AI |
| Kimi Coding Plan | Moonshot AI |
| Xiaomi Token Plan | Xiaomi |
| Bailian Coding Plan | Alibaba Cloud Bailian |
| Volcano Engine Ark Coding Plan | Volcano Engine (ByteDance) |
Model API (pay-as-you-go)
| Name | Provider |
|---|---|
| OpenAI | OpenAI |
| Anthropic | Anthropic |
| Google Gemini | |
| OpenRouter | OpenRouter |
| DeepSeek | DeepSeek |
| Bailian (Qwen) | Alibaba |
| MiniMax (International) | MiniMax |
| Moonshot AI (Kimi International) | Moonshot |
| Zhipu AI (GLM International) | Zhipu |
| Volcano Engine (Doubao) | ByteDance |
| Xiaomi MiMo | Xiaomi |
Custom models
Any endpoint compatible with the OpenAI Chat Completions protocol can be connected, including:
- Self-hosted models: vLLM, Ollama, LM Studio, and other services exposing OpenAI-compatible endpoints
- Aggregation platforms: OpenRouter, etc.
---
Selection guidance
Selection directions for different priorities:
| Priority | Suggested direction |
|---|---|
| Free quota | Many providers offer free quota for new accounts; start with a trial |
| Long-context processing | Choose models with larger context windows (such as Moonshot AI) |
| Response speed | Providers vary significantly; rely on actual experience |
| Chinese understanding | Chinese models generally support Chinese better |
| High-frequency usage | Choose the corresponding provider's monthly Coding Plan |
Recommendation: use the Launch plan trial credits for several conversations first. After confirming that Hermes works normally, decide which model to use.
---
FAQ
| Issue | Possible cause | Solution |
|---|---|---|
| "Current model" is still empty after entering a Key | Key format is wrong or contains spaces | Copy and paste again; confirm there are no leading or trailing spaces |
| No response for a long time after sending a message | Provider balance is insufficient or the Key is disabled | Check balance and Key status in the provider console |
| Response content is garbled or meaningless | Custom model endpoint is not compatible with the OpenAI protocol | Verify endpoint availability with another OpenAI-compatible client first |
| "Model response timeout" | Provider rate limit or network instability | Try again later or switch to another provider |
| Not sure about the Base URL of a custom model | — | Check the corresponding service API documentation; it is usually https://xxx.com/v1 |
Conversation error "HTTP 401:InvalidAuthentication" or similar 401 errors
If HTTP 401:InvalidAuthentication or another 401 xxx error appears during a conversation with OpenClaw, it is usually caused by one of the following:
- The model API Key is configured incorrectly;
- The Coding Plan API Key is confused with a regular model API Key;
- The domestic or international version of a model provider is selected incorrectly.
When this happens, check the model configuration again.
---