Skip to content

LLM providers

Cascade abstracts the LLM behind a single interface, so swapping providers is one config command. Each provider has its own setup story.

The default. Uses Anthropic’s tool-use feature for reliable structured output.

Terminal window
cascade configure llm anthropic --key sk-ant-xxx --set-default
SettingValue
Default modelclaude-opus-4-7
AuthAPI key
Env varANTHROPIC_API_KEY
Best forMaximum reliability of structured output

Uses OpenAI’s Structured Outputs (response_format with json_schema). Works with any OpenAI-compatible endpoint via --base-url.

Terminal window
cascade configure llm openai --key sk-xxx --set-default
# Azure OpenAI:
cascade configure llm openai --key sk-xxx --base-url https://your.openai.azure.com/v1
SettingValue
Default modelgpt-5
AuthAPI key
Env varOPENAI_API_KEY
Best forTeams already on OpenAI; Azure OpenAI deployments

Uses response_schema with Pydantic models for structured output.

Terminal window
cascade configure llm google --key gemini-xxx --set-default
SettingValue
Default modelgemini-2.0-flash
AuthAPI key
Env varsGOOGLE_API_KEY or GEMINI_API_KEY
Best forTeams on Google Cloud; cost-sensitive use cases

Uses your local Claude Code subscription. No separate API key.

Terminal window
pip install "cascade-agent[claude-code]"
cascade configure llm claude_code --set-default
SettingValue
Default modelclaude-opus-4-7
AuthYour Claude Code installation
Env varNone
Best forZero-setup adoption; teams with Claude subscriptions

Full guide: run without an API key →

Any OpenAI-compatible local LLM server.

Terminal window
cascade configure llm ollama --model llama3.1 --set-default
# Custom endpoint:
cascade configure llm ollama --base-url http://gpu-host:8000/v1 --set-default
SettingValue
Default modelllama3.1
Default base URLhttp://localhost:11434/v1
AuthNone
Env varNone
Best forAir-gapped environments; data residency requirements

Per command:

Terminal window
cascade prompt "Add health endpoint" --model claude-opus-4-7

Per project (via cascade.yaml):

agent:
provider: openai
model: gpt-5

Globally (default):

Terminal window
cascade configure llm anthropic --set-default
  • GitHub Copilot CLI (best-effort; Copilot’s API isn’t structured-output-friendly)
  • OpenRouter as a one-config gateway to all providers
  • Per-stage routing: planner on Gemini, coder on Claude, etc.