- | Agent / Plan | Qwen 3 Coder (480B), Qwen 3 Coder (30B), Devstral (24B), GLM 4.5 (355B), GLM 4.5 Air (106B), Kimi K2 (1T), gpt-oss (120B), gpt-oss (20B) | [Claude Opus 4.1](https://hub.continue.dev/anthropic/claude-4-1-opus), [Claude Sonnet 4](https://hub.continue.dev/anthropic/claude-4-sonnet), [GPT-5](https://hub.continue.dev/openai/gpt-5), [Gemini 2.5 Pro](https://hub.continue.dev/google/gemini-2.5-pro) | Closed models are slightly better than open models |
- | Chat / Edit | Qwen 3 Coder (480B), Qwen 3 Coder (30B), gpt-oss (120B), gpt-oss (20B) | [Claude Opus 4.1](https://hub.continue.dev/anthropic/claude-4-1-opus), [Claude Sonnet 4](https://hub.continue.dev/anthropic/claude-4-sonnet), [GPT-5](https://hub.continue.dev/openai/gpt-5), [Gemini 2.5 Pro](https://hub.continue.dev/google/gemini-2.5-pro) | Closed and open models have pretty similar performance |
- | Autocomplete | [QwenCoder2.5 (1.5B)](https://hub.continue.dev/ollama/qwen2.5-coder-1.5b), QwenCoder2.5 (7B) | [Codestral](https://hub.continue.dev/mistral/codestral), Mercury Coder | Closed models are slightly better than open models |
+ | Agent / Plan | Qwen 3 Coder (480B), Qwen 3 Coder (30B), Qwen2.5-Coder (32B), Devstral (27B), Devstral (24B), GLM 4.5 (355B), GLM 4.5 Air (106B), Kimi K2 (1T), gpt-oss (120B), gpt-oss (20B) | [Claude Opus 4.1](https://hub.continue.dev/anthropic/claude-4-1-opus), [Claude Sonnet 4](https://hub.continue.dev/anthropic/claude-4-sonnet), GPT-4, [GPT-5](https://hub.continue.dev/openai/gpt-5), [Gemini 2.5 Pro](https://hub.continue.dev/google/gemini-2.5-pro), DeepSeek models | Closed models are slightly better than open models |
+ | Chat / Edit | Qwen 3 Coder (480B), Qwen 3 Coder (30B), gpt-oss (120B), gpt-oss (20B), DeepSeek Chat | [Claude Opus 4.1](https://hub.continue.dev/anthropic/claude-4-1-opus), [Claude Sonnet 4](https://hub.continue.dev/anthropic/claude-4-sonnet), [GPT-5](https://hub.continue.dev/openai/gpt-5), [Gemini 2.5 Pro](https://hub.continue.dev/google/gemini-2.5-pro) | Closed and open models have pretty similar performance |
+ | Autocomplete | [QwenCoder2.5 (1.5B)](https://hub.continue.dev/ollama/qwen2.5-coder-1.5b), QwenCoder2.5 (7B) | [Codestral](https://hub.continue.dev/mistral/codestral), Mercury Coder, Mercury Coder Small, DeepSeek Coder | Closed models are slightly better than open models |
  | Apply | N/A | [Relace Instant Apply](https://hub.continue.dev/relace/instant-apply), [Morph Fast Apply](https://hub.continue.dev/morphllm/morph-v2) | Open models are basically non-existent / not good enough for this model role |
- | Embed | N/A | [Voyage Code 3](https://hub.continue.dev/voyageai/voyage-code-3), [Morph Embeddings](https://hub.continue.dev/morphllm/morph-embedding-v2), Codestral Embed | Open models are basically non-existent / not good enough for this model role |
+ | Embed | Nomic Embed Text | [Voyage Code 3](https://hub.continue.dev/voyageai/voyage-code-3), [Morph Embeddings](https://hub.continue.dev/morphllm/morph-embedding-v2), Codestral Embed, text-embedding-3-large, text-embedding-004 | Open embeddings models are emerging but closed models still perform better |
  | Rerank | zerank-1, zerank-1-small | rerank-2.5, Relace Code Rerank, [Morph Rerank](https://hub.continue.dev/morphllm/morph-rerank-v2) | Open models are beginning to emerge for this model role |
  | Next Edit | Zeta | Mercury Coder | Closed models are significantly better than open models |
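
As a rough illustration of how the role recommendations in this table come together, the sketch below shows a minimal Continue `config.yaml` that assigns open models to the autocomplete and embed roles. This is only a sketch: the schema fields, Ollama model tags, and assistant name are assumptions based on Continue's YAML config format and are not part of this diff.

```yaml
# Minimal sketch (assumed config.yaml schema): open models for two of the roles above
name: open-models-assistant        # hypothetical assistant name
version: 0.0.1
schema: v1
models:
  - name: Qwen2.5-Coder 1.5B       # open autocomplete model from the table
    provider: ollama
    model: qwen2.5-coder:1.5b      # assumed Ollama tag
    roles:
      - autocomplete
  - name: Nomic Embed Text         # open embeddings model from the table
    provider: ollama
    model: nomic-embed-text        # assumed Ollama tag
    roles:
      - embed
```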
docs/features/autocomplete/model-setup.mdx (1 addition, 39 deletions)
@@ -10,45 +10,7 @@ Setting up the right model for autocomplete is crucial for a smooth coding exper
  For a complete comparison of all autocomplete models, see our [comprehensive model recommendations](/customization/models#recommended-models).
  </Info>

- ## Recommended Models for Autocomplete in Continue
-
- ### Hosted (Best Performance)
-
- For the highest quality autocomplete suggestions, we recommend **[Codestral](https://hub.continue.dev/mistral/codestral)** from Mistral.
-
- This model is specifically designed for code completion and offers excellent performance across multiple programming languages.
-
- **Codestral Quick Setup:**
-
- 1. Get your API key from [Mistral AI](https://console.mistral.ai)
- 2. Add [Codestral](https://hub.continue.dev/mistral/codestral) to your assistant on Continue Hub
- 3. Add `MISTRAL_API_KEY` as a [User Secret](https://docs.continue.dev/hub/secrets/secret-types#user-secrets) on Continue Hub [here](https://hub.continue.dev/settings/secrets)
- 4. Click `Reload config` in the assistant selector in the Continue IDE extension
-
- ### Hosted (Best Speed/Quality Tradeoff)
-
- For fast, quality autocomplete suggestions, we recommend **[Mercury Coder Small](https://hub.continue.dev/inceptionlabs/mercury-coder-small)** from Inception.
-
- This model is specifically designed for code completion and is particularly fast because it is a diffusion model.
-
- **Mercury Coder Small Quick Setup:**
-
- 1. Get your API key from [Inception](https://platform.inceptionlabs.ai/)
- 2. Add [Mercury Coder Small](https://hub.continue.dev/inceptionlabs/mercury-coder-small) to your assistant on Continue Hub
- 3. Add `INCEPTION_API_KEY` as a [User Secret](https://docs.continue.dev/hub/secrets/secret-types#user-secrets) on Continue Hub [here](https://hub.continue.dev/settings/secrets)
- 4. Click `Reload config` in the assistant selector in the Continue IDE extension
-
- ### Local (Offline / Privacy First)
-
- For a fully local autocomplete experience, we recommend **[Qwen 2.5 Coder 1.5B](https://hub.continue.dev/ollama/qwen2.5-coder-1.5b)**.
-
- This model provides good suggestions while keeping your code completely private.
-
- **Quick Setup:**
-
- 1. Install [Ollama](https://ollama.ai/)
- 2. Add [Qwen 2.5 Coder 1.5B](https://hub.continue.dev/ollama/qwen2.5-coder-1.5b) to your assistant on Continue Hub
- 3. Click `Reload config` in the assistant selector in the Continue IDE extension
+ For model recommendations, please refer to our [Model Recommendations page](/customization/models).
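
The removed quick-setup sections above all amount to the same thing: pick a model, supply its API key (or run it locally), and give it the autocomplete role. As a rough sketch of what the hosted Codestral setup looks like in a Continue `config.yaml`, assuming the provider name, model ID, and secret-reference syntax shown below (none of which come from the removed docs), it might be:

```yaml
# Sketch only: hosted Codestral wired to the autocomplete role
models:
  - name: Codestral
    provider: mistral
    model: codestral-latest                     # assumed Mistral model ID
    apiKey: ${{ secrets.MISTRAL_API_KEY }}      # the User Secret from step 3
    roles:
      - autocomplete
```

The Mercury Coder Small and local Qwen 2.5 Coder setups would follow the same shape, just with different provider, model, and key values.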
docs/features/chat/model-setup.mdx (1 addition, 10 deletions)
@@ -7,16 +7,7 @@ The model you use for Chat mode will be
  For a comprehensive comparison of all available models by role, see our [model recommendations table](/customization/models#recommended-models).
  </Info>

- ## What Models Are Recommended for Chat?
-
- Our strong recommendation is to use [Claude Sonnet 4](https://hub.continue.dev/anthropic/claude-4-sonnet) from Anthropic.
-
- Its strong tool calling and reasoning capabilities make it the best model for Agent mode.
-
- 1. Get your API key from [Anthropic](https://console.anthropic.com/)
- 2. Add [Claude Sonnet 4](https://hub.continue.dev/anthropic/claude-4-sonnet) to your assistant on Continue Hub
- 3. Add `ANTHROPIC_API_KEY` as a [User Secret](https://docs.continue.dev/hub/secrets/secret-types#user-secrets) on Continue Hub [here](https://hub.continue.dev/settings/secrets)
- 4. Click `Reload config` in the assistant selector in the Continue IDE extension
+ For model recommendations, please refer to our [Model Recommendations page](/customization/models).
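
For completeness, the removed Claude Sonnet 4 steps map onto the same config shape as the autocomplete example, just with the chat-oriented roles. A minimal sketch, assuming the same `config.yaml` fields and secret syntax as above (the Anthropic model ID is also an assumption):

```yaml
# Sketch only: Claude Sonnet 4 for the Chat / Edit roles
models:
  - name: Claude Sonnet 4
    provider: anthropic
    model: claude-sonnet-4-20250514             # assumed Anthropic model ID
    apiKey: ${{ secrets.ANTHROPIC_API_KEY }}    # the User Secret from step 3
    roles:
      - chat
      - edit
```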