Models

Claude Code supports 100+ models across 27 providers. Pick the right model for each task, from frontier reasoning to fast local inference.

Model Selection Guide

| Workload | Recommended Models | Context | Notes |
| --- | --- | --- | --- |
| Complex architecture & large edits | claude-sonnet-4-7, claude-opus-4-7, gpt-5.5-pro, deepseek-v4-pro | 1M tokens | Best tool calling, strong reasoning, largest context |
| Code review & debugging | claude-sonnet-4-7, gpt-5.5, gemini-3.1-pro | 1M-2M tokens | Good instruction following, reliable output |
| Quick search & summarization | claude-haiku-4-5, deepseek-v4-flash, mistral-small-latest, groq/llama-4-70b | 128K-200K tokens | Fast, cost-effective, lower latency |
| Offline / air-gapped | ollama/llama4:70b, ollama/llama3.3 | 131K tokens | Fully local, no network needed |
| Cost-sensitive bulk | kilocode/kilo-auto/free, opencode-go/deepseek-v4-flash-free, groq/llama-4-8b | Varies | Free or minimal cost tiers available |
| Research & reasoning | claude-opus-4-7, gpt-5.5-thinking, gemini-3.1-pro, deepseek-v4-pro | 1M-2M tokens | Deep reasoning, chain-of-thought |
| Vision / multimodal | claude-sonnet-4-7, gpt-5.5-pro, gemini-3.1-flash, deepseek-v4-flash | 1M tokens | Image understanding, document analysis |
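
The guide above can be read as a plain lookup from workload to candidate models. The Python sketch below encodes a few of its rows; the `RECOMMENDED` dictionary, the workload keys, and the `pick_model` helper are hypothetical names used for illustration and are not part of Claude Code. Picking the first listed model is only an illustrative default.

```python
from typing import Dict, List, Optional, Set

# Hypothetical lookup copied from a few rows of the selection guide.
# The workload keys are invented for this sketch; they are not Claude Code identifiers.
RECOMMENDED: Dict[str, List[str]] = {
    "complex-architecture": ["claude-sonnet-4-7", "claude-opus-4-7", "gpt-5.5-pro", "deepseek-v4-pro"],
    "code-review": ["claude-sonnet-4-7", "gpt-5.5", "gemini-3.1-pro"],
    "offline": ["ollama/llama4:70b", "ollama/llama3.3"],
}


def pick_model(workload: str, available: Optional[Set[str]] = None) -> str:
    """Return the first recommended model for a workload.

    If `available` is given, skip models that are not in it.
    """
    for model in RECOMMENDED[workload]:  # raises KeyError for an unknown workload
        if available is None or model in available:
            return model
    raise LookupError(f"no recommended model available for {workload!r}")
```

For example, `pick_model("code-review", {"gpt-5.5"})` skips `claude-sonnet-4-7` and returns `"gpt-5.5"`.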

Notable Models by Provider

| Provider | Models Available |
| --- | --- |
| Anthropic | claude-opus-4-7 (1M ctx), claude-sonnet-4-7 (1M ctx, recommended), claude-haiku-4-5 (200K ctx, fast), claude-sonnet-4-5 (1M ctx) |
| OpenAI | gpt-5.5-pro (1M ctx, reasoning), gpt-5.5-thinking (1M ctx), gpt-5.5 (1M ctx), gpt-5.4-mini (fast) |
| Google | gemini-3.1-pro (2M ctx), gemini-3.1-flash (1M ctx, fast) |
| DeepSeek | deepseek-v4-pro (1M ctx, MoE), deepseek-v4-flash (1M ctx, fast MoE) |
| xAI | grok-4-20 (128K ctx), grok-4 (128K ctx, recommended) |
| Mistral | mistral-large-4 (128K ctx), mistral-small-latest (recommended), ministral-3-14b (fast) |
| Groq | llama-4-70b (fast), llama-4-8b (instant) |
| Ollama | llama4:70b (local), deepseek-v4 (local), llama3.3 (local, verified) |
| GitHub Copilot | gpt-5.5, gpt-4o, claude-opus-4-7, claude-sonnet-4.5, claude-haiku-4.5 |
| OpenCode | claude-opus-4-7, claude-sonnet-4-6, gpt-5.5, gemini-3.1-pro, kimi-k2.6, glm-5.1, minimax-m2.7 |
| KiloCode | kilo-auto/free (free tier), gpt-5.5, gpt-5.5-pro, claude-opus-4.7, kimi-k2.6, deepseek-v4-pro, mimo-v2.5-pro |
| NVIDIA NIM | deepseek-v4-pro, llama-3.1-405b, llama-3.1-70b, nemotron-4-340b |
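
Some IDs in the selection guide carry a provider prefix, such as `groq/llama-4-70b` or `kilocode/kilo-auto/free`. The sketch below shows one way such an ID could be split, assuming the first path segment names the provider and everything after it is the model name. That assumption matches the examples on this page, but the actual resolver may behave differently; `split_model_id` is a hypothetical helper.

```python
from typing import Optional, Tuple


def split_model_id(model_id: str) -> Tuple[Optional[str], str]:
    """Split "provider/model" into (provider, model).

    Assumption: only the first "/" separates provider from model, so
    "kilocode/kilo-auto/free" keeps "kilo-auto/free" as the model name.
    IDs without a "/" are returned with provider None.
    """
    provider, sep, name = model_id.partition("/")
    if not sep:
        return None, model_id
    return provider, name
```

Splitting on the first slash only keeps model names that themselves contain a slash intact, and tags such as `llama4:70b` pass through unchanged.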

Model Switching

/model                          # Interactive picker (recent models at top)
/model list                     # List all available models
/model claude-sonnet-4-7        # Switch by full ID
/model sonnet                   # Alias-based switching
--model opus                    # CLI flag at startup

Model switching takes effect immediately: the next conversation turn uses the new model, and the previous context is preserved.

Model Aliases

Short aliases such as sonnet, opus, haiku, gpt5, and flash resolve to full model IDs.
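
Alias resolution can be pictured as a dictionary lookup that falls back to the input when no alias matches. The mapping below is an assumption made for illustration: this page does not state which full ID each alias points to, so the targets are guesses based on the models listed above, and `gpt5` and `flash` are left out because their targets are ambiguous. `ALIASES` and `resolve_model` are hypothetical names.

```python
# Hypothetical alias table; the real alias targets are not documented on this page.
ALIASES = {
    "sonnet": "claude-sonnet-4-7",  # assumed target
    "opus": "claude-opus-4-7",      # assumed target
    "haiku": "claude-haiku-4-5",    # assumed target
}


def resolve_model(name: str) -> str:
    """Return the full model ID for an alias, or the name unchanged if it is not an alias."""
    cleaned = name.strip()
    # Aliases are matched case-insensitively in this sketch; full IDs pass through as typed.
    return ALIASES.get(cleaned.lower(), cleaned)
```

With this table, `resolve_model("sonnet")` returns `"claude-sonnet-4-7"`, while a full ID such as `"gpt-5.5"` is returned unchanged.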

CLI Flags for Models

| Flag | Description |
| --- | --- |
| --model <name> | Set model at startup (e.g., --model sonnet or --model claude-opus-4-7) |
| --effort <level> | Set reasoning effort: low, medium, high, max |
| --max-turns <N> | Limit agentic turns in non-interactive mode |
| --thinking <mode> | Thinking mode: enabled, adaptive, disabled |
| --fallback-model <model> | Fallback when primary model is overloaded (print mode only) |
| --task-budget <tokens> | API-side task budget in tokens |
| --max-budget-usd <amount> | Maximum spend on API calls (print mode only) |
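
When launching the CLI from a script, it can help to validate flag values before building the argument list. The sketch below is a hypothetical helper, not part of Claude Code: it uses only flags and values from the table above and leaves out the executable name and the print-mode switch, since neither is specified here. Rejecting `--fallback-model` outside print mode is a choice made in this sketch; the CLI itself may simply ignore the flag.

```python
from typing import List, Optional

# Allowed values copied from the flag table above.
EFFORT_LEVELS = ("low", "medium", "high", "max")
THINKING_MODES = ("enabled", "adaptive", "disabled")


def build_model_args(
    model: str,
    effort: Optional[str] = None,
    thinking: Optional[str] = None,
    max_turns: Optional[int] = None,
    fallback_model: Optional[str] = None,
    print_mode: bool = False,  # caller states whether the run is in print mode
) -> List[str]:
    """Build the model-related CLI arguments, validating enumerated values."""
    args = ["--model", model]
    if effort is not None:
        if effort not in EFFORT_LEVELS:
            raise ValueError(f"unknown effort level: {effort!r}")
        args += ["--effort", effort]
    if thinking is not None:
        if thinking not in THINKING_MODES:
            raise ValueError(f"unknown thinking mode: {thinking!r}")
        args += ["--thinking", thinking]
    if max_turns is not None:
        args += ["--max-turns", str(max_turns)]
    if fallback_model is not None:
        if not print_mode:
            raise ValueError("--fallback-model is documented as print mode only")
        args += ["--fallback-model", fallback_model]
    return args
```

For example, `build_model_args("sonnet", effort="high")` returns `["--model", "sonnet", "--effort", "high"]`, which can be appended to whatever command line starts the CLI.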

Model Capabilities

Each model declaration in providers.json includes: