Models

Clew supports 100+ models across 27 providers. Pick the right model for each task — from frontier reasoning to fast local inference.

Model Selection Guide

WorkloadRecommended ModelsContextNotes
Complex architecture & large editsclaude-opus-4-7, claude-sonnet-4-6, gpt-5.5, deepseek-v4-pro1M tokensBest tool calling, strong reasoning, largest context
Code review & debuggingclaude-sonnet-4-6, gpt-5.5, gemini-3.1-pro1M-2M tokensGood instruction following, reliable output
Quick search & summarizationclaude-haiku-4-5, deepseek-v4-flash, mistral-small-4, groq/llama-3.3-70b128K-200KFast, cost-effective, lower latency
Offline / air-gappedollama/llama4:70b, ollama/llama3.3131K tokensFully local, no network needed
Cost-sensitive bulkkilocode/kilo-auto/free, opencode-go/minimax-m2.7, groq/llama-3.1-8bVariesFree or minimal cost tiers available
Research & reasoningclaude-opus-4-7, deepseek-v4-pro, gemini-3.1-pro1M-2M tokensDeep reasoning, chain-of-thought
Vision / multimodalclaude-sonnet-4-6, gpt-5.5, gemini-3.1-pro, deepseek-v4-flash1M tokensImage understanding, document analysis

Notable Models by Provider

ProviderModels Available
Anthropicclaude-opus-4-7 (1M ctx), claude-sonnet-4-6 (1M ctx, recommended), claude-haiku-4-5 (200K ctx, fast)
OpenAIgpt-5.5 (1M ctx), gpt-5.4 (1M ctx), gpt-5.4-mini (fast)
Googlegemini-3.1-pro (2M ctx), gemini-3.1-flash-lite (1M ctx, fast)
DeepSeekdeepseek-v4-pro (1M ctx, MoE), deepseek-v4-flash (1M ctx, fast MoE)
xAIgrok-4.3 (128K ctx)
Mistralmistral-medium-3.5, mistral-small-4, mistral-large-3
Groqllama-3.3-70b, llama-3.1-8b-instant (fast)
Ollamallama4:70b (local), llama3.3 (local), qwen3.6-plus (local)
GitHub Copilotclaude-opus-4.7, claude-sonnet-4.6, claude-haiku-4.5, gpt-5.5
OpenCodeclaude-opus-4-7, claude-sonnet-4-6, gpt-5.5, gemini-3.1-pro, kimi-k2.6
KiloCodekilo-auto/free, kilo-auto/balanced, kilo-auto/frontier
Coherecommand-a-plus (128K ctx)
Perplexitysonar-pro, sonar, sonar-reasoning-pro
Together AIdeepseek-v4, Qwen3.6, Llama-4
NVIDIA NIMdeepseek-v4-pro, glm-5.1, nemotron-3
Cerebrasqwen-3-235b (extremely fast)
Moonshot (Kimi)kimi-k2.6 (128K ctx)
Zhipu (GLM)glm-4.7 (128K ctx)
Hugging Facellama-3.3-70b-instruct
Poeclaude-3-7-sonnet

Model Switching

/model                          # Interactive picker (fetches live models from provider API when supported)
/model list                     # List all available models from the active provider API
/model claude-sonnet-4-7        # Switch by full ID
/model sonnet                   # Alias-based switching
--model opus                    # CLI flag at startup

For providers with a /v1/models endpoint (OpenAI, DeepSeek, NVIDIA, etc.), the picker fetches live models from the API instead of the static providers.json list. The picker shows all models from every provider in named sections — not just the active provider's models. Recent models appear at top with the active provider's default. Select any model from any provider directly.

Model switching is instant — the next conversation turn uses the new model. Previous context is preserved.

Model Aliases

Short aliases are resolved to full model IDs: sonnet, opus, haiku, gpt5, flash, etc.

CLI Flags for Models

FlagDescription
--model <name>Set model at startup (e.g., --model sonnet or --model claude-opus-4-7)
--effort <level>Set reasoning effort: low, medium, high, max
--max-turns <N>Limit agentic turns in non-interactive mode
--thinking <mode>Thinking mode: enabled, adaptive, disabled
--fallback-model <model>Fallback when primary model is overloaded (print mode only)
--task-budget <tokens>API-side task budget in tokens
--max-budget-usd <amount>Maximum spend on API calls (print mode only)

Model Capabilities

Each model declaration in providers.json includes:

  • Model ID and display label
  • Context window (maxContext) and max output tokens (maxOutput)
  • Tool calling type (native, none)
  • Vision support
  • Streaming mode (full, partial)
  • Reasoning support
  • System prompt support
  • Tags (for filtering): fast, verified, recommended, latest, free, vision, tools, native, moe, local