Configure Models

This page explains how to configure a large language model for Hermes, including the differences between the three configuration methods, how to obtain an API Key, console configuration steps, supported models, and model selection guidance.

> 💡 Configuration is optional > > The Launch plan includes 100 trial credits. After creating Hermes and connecting a channel, you can start chatting directly. Configure a model after the credits run out.

---

This page covers

  • Differences between the three configuration methods (Coding Plan / API / Custom Model)
  • How to obtain an API Key
  • Console configuration steps
  • Model switching
  • Supported models
  • Model selection guidance

---

Three configuration methods

The first-level dropdown in the console "Model" card provides three options:

MethodBilling modeAPI Key sourceUse case
Model Coding PlanMonthly / yearlyProviderHigh-frequency usage
Model APIPay-as-you-goProviderLow-frequency use or trial
Custom modelDepends on endpointUser-providedConnect overseas models or OpenAI-compatible endpoints

> ⚠️ Prepare your own API Key > > All three methods require your own API Key. Please register with the corresponding provider and create a Key first.

---

General process for obtaining an API Key

Provider interfaces differ, but the process is generally the same:

  • Visit the provider website: After selecting a model in the console model card, the "Get API KEY" link appears below. Click it to open the provider website
  • Register or sign in: Most Chinese platforms support WeChat or phone-number login
  • Complete real-name verification: Chinese model providers generally require real-name verification before API calls are allowed
  • Open the API Key management page: Usually under "Console → API Management / Key Management / Credentials"
  • Create a new Key: Name the Key (for example, LightVela) and copy it immediately after creation. Most platforms show the complete Key only once
  • Return to LightVela and paste the Key

> ⚠️ API Keys are sensitive credentials > > Do not send Keys to chat groups, publish them on web pages, or commit them to Git repositories. If a Key is leaked, revoke or reset it immediately in the provider console.

---

Configure in the console

Enter the console and open the "Model" button.

Configuration steps

  • Choose source (first-level dropdown): Model API / Model Coding Plan / Custom Model
  • Choose specific model (second-level dropdown): Available models are shown based on the source selected above
  • Enter API Key: Paste the Key you obtained. The eye icon on the right toggles visibility
  • Click "Add as default"
  • The corresponding record appears in the "Current model" list, indicating successful configuration

Additional fields for custom models

When selecting "Custom Model", you usually need to fill in the following fields in addition to the API Key:

  • Base URL: The model service API address, for example https://api.openai.com/v1
  • Model Name: The model identifier, for example gpt-4o or claude-3-5-sonnet-20241022

The actual fields are subject to the console interface.

> 💡 How to configure custom models (such as OpenAI GPT, Google Gemini, etc.) > > In addition to common models listed in the console, such as MiniMax, Kimi, and Zhipu GLM, users can also configure OpenAI GPT, Google Gemini, Claude, or other models not shown in the list through "Custom Model".

---

Switch models

How to switch

Run the configuration steps again. The new "Add as default" action overwrites the previous configuration.

> ⚠️ Only one model is active > > Currently, each Hermes can have only one active model. To compare different model performance, configure and try them separately.

Impact on memory

Switching models does not affect conversation memory. Memory belongs to Hermes itself; the model is only responsible for generating responses.

---

Supported models

Model Coding Plan (monthly)

NameProvider
Zhipu AI (GLM International - Coding Plan)Zhipu AI
Kimi Coding PlanMoonshot AI
Xiaomi Token PlanXiaomi
Bailian Coding PlanAlibaba Cloud Bailian
Volcano Engine Ark Coding PlanVolcano Engine (ByteDance)

Model API (pay-as-you-go)

NameProvider
OpenAIOpenAI
AnthropicAnthropic
Google GeminiGoogle
OpenRouterOpenRouter
DeepSeekDeepSeek
Bailian (Qwen)Alibaba
MiniMax (International)MiniMax
Moonshot AI (Kimi International)Moonshot
Zhipu AI (GLM International)Zhipu
Volcano Engine (Doubao)ByteDance
Xiaomi MiMoXiaomi

Custom models

Any endpoint compatible with the OpenAI Chat Completions protocol can be connected, including:

  • Self-hosted models: vLLM, Ollama, LM Studio, and other services exposing OpenAI-compatible endpoints
  • Aggregation platforms: OpenRouter, etc.

---

Selection guidance

Selection directions for different priorities:

PrioritySuggested direction
Free quotaMany providers offer free quota for new accounts; start with a trial
Long-context processingChoose models with larger context windows (such as Moonshot AI)
Response speedProviders vary significantly; rely on actual experience
Chinese understandingChinese models generally support Chinese better
High-frequency usageChoose the corresponding provider's monthly Coding Plan

Recommendation: use the Launch plan trial credits for several conversations first. After confirming that Hermes works normally, decide which model to use.

---

FAQ

IssuePossible causeSolution
"Current model" is still empty after entering a KeyKey format is wrong or contains spacesCopy and paste again; confirm there are no leading or trailing spaces
No response for a long time after sending a messageProvider balance is insufficient or the Key is disabledCheck balance and Key status in the provider console
Response content is garbled or meaninglessCustom model endpoint is not compatible with the OpenAI protocolVerify endpoint availability with another OpenAI-compatible client first
"Model response timeout"Provider rate limit or network instabilityTry again later or switch to another provider
Not sure about the Base URL of a custom modelCheck the corresponding service API documentation; it is usually https://xxx.com/v1

Conversation error "HTTP 401:InvalidAuthentication" or similar 401 errors

If HTTP 401:InvalidAuthentication or another 401 xxx error appears during a conversation with OpenClaw, it is usually caused by one of the following:

  • The model API Key is configured incorrectly;
  • The Coding Plan API Key is confused with a regular model API Key;
  • The domestic or international version of a model provider is selected incorrectly.

When this happens, check the model configuration again.

---

Next steps