1---
2title: AI Models and Pricing - Zed
3description: AI models available via Zed Pro including Claude, GPT-5.4, Gemini 3.1 Pro, and Grok. Pricing, context windows, and tool call support.
4---
5
6# Models
7
8Zed's plans offer hosted versions of major LLMs with higher rate limits than direct API access. Model availability is updated regularly. To use your own API keys instead, see [LLM Providers](./llm-providers.md). For general setup, see [Configuration](./configuration.md).
9
10> **Note:** Claude Opus models and GPT-5.4 pro are not available on the [Student plan](./plans-and-usage.md#student).
11
12| Model | Provider | Token Type | Provider Price per 1M tokens | Zed Price per 1M tokens |
13| ---------------------- | --------- | ------------------- | ---------------------------- | ----------------------- |
14| Claude Opus 4.5 | Anthropic | Input | $5.00 | $5.50 |
15| | Anthropic | Output | $25.00 | $27.50 |
16| | Anthropic | Input - Cache Write | $6.25 | $6.875 |
17| | Anthropic | Input - Cache Read | $0.50 | $0.55 |
18| Claude Opus 4.6 | Anthropic | Input | $5.00 | $5.50 |
19| | Anthropic | Output | $25.00 | $27.50 |
20| | Anthropic | Input - Cache Write | $6.25 | $6.875 |
21| | Anthropic | Input - Cache Read | $0.50 | $0.55 |
22| Claude Opus 4.7 | Anthropic | Input | $5.00 | $5.50 |
23| | Anthropic | Output | $25.00 | $27.50 |
24| | Anthropic | Input - Cache Write | $6.25 | $6.875 |
25| | Anthropic | Input - Cache Read | $0.50 | $0.55 |
26| Claude Sonnet 4.5 | Anthropic | Input | $3.00 | $3.30 |
27| | Anthropic | Output | $15.00 | $16.50 |
28| | Anthropic | Input - Cache Write | $3.75 | $4.125 |
29| | Anthropic | Input - Cache Read | $0.30 | $0.33 |
30| Claude Sonnet 4.6 | Anthropic | Input | $3.00 | $3.30 |
31| | Anthropic | Output | $15.00 | $16.50 |
32| | Anthropic | Input - Cache Write | $3.75 | $4.125 |
33| | Anthropic | Input - Cache Read | $0.30 | $0.33 |
34| Claude Haiku 4.5 | Anthropic | Input | $1.00 | $1.10 |
35| | Anthropic | Output | $5.00 | $5.50 |
36| | Anthropic | Input - Cache Write | $1.25 | $1.375 |
37| | Anthropic | Input - Cache Read | $0.10 | $0.11 |
38| GPT-5.4 pro | OpenAI | Input | $30.00 | $33.00 |
39| | OpenAI | Output | $180.00 | $198.00 |
40| GPT-5.4 | OpenAI | Input | $2.50 | $2.75 |
41| | OpenAI | Output | $15.00 | $16.50 |
42| | OpenAI | Cached Input | $0.025 | $0.0275 |
43| GPT-5.3-Codex | OpenAI | Input | $1.75 | $1.925 |
44| | OpenAI | Output | $14.00 | $15.40 |
45| | OpenAI | Cached Input | $0.175 | $0.1925 |
46| GPT-5.2 | OpenAI | Input | $1.75 | $1.925 |
47| | OpenAI | Output | $14.00 | $15.40 |
48| | OpenAI | Cached Input | $0.175 | $0.1925 |
49| GPT-5.2-Codex | OpenAI | Input | $1.75 | $1.925 |
50| | OpenAI | Output | $14.00 | $15.40 |
51| | OpenAI | Cached Input | $0.175 | $0.1925 |
52| GPT-5 mini | OpenAI | Input | $0.25 | $0.275 |
53| | OpenAI | Output | $2.00 | $2.20 |
54| | OpenAI | Cached Input | $0.025 | $0.0275 |
55| GPT-5 nano | OpenAI | Input | $0.05 | $0.055 |
56| | OpenAI | Output | $0.40 | $0.44 |
57| | OpenAI | Cached Input | $0.005 | $0.0055 |
58| Gemini 3.1 Pro | Google | Input | $2.00 | $2.20 |
59| | Google | Output | $12.00 | $13.20 |
60| Gemini 3 Flash | Google | Input | $0.50 | $0.55 |
61| | Google | Output | $3.00 | $3.30 |
62| Grok 4 | X.ai | Input | $3.00 | $3.30 |
63| | X.ai | Output | $15.00 | $16.5 |
64| | X.ai | Cached Input | $0.75 | $0.825 |
65| Grok 4 Fast | X.ai | Input | $0.20 | $0.22 |
66| | X.ai | Output | $0.50 | $0.55 |
67| | X.ai | Cached Input | $0.05 | $0.055 |
68| Grok 4 (Non-Reasoning) | X.ai | Input | $0.20 | $0.22 |
69| | X.ai | Output | $0.50 | $0.55 |
70| | X.ai | Cached Input | $0.05 | $0.055 |
71| Grok Code Fast 1 | X.ai | Input | $0.20 | $0.22 |
72| | X.ai | Output | $1.50 | $1.65 |
73| | X.ai | Cached Input | $0.02 | $0.022 |
74
75## Recent Model Retirements
76
77As of February 19, 2026, Zed Pro serves newer model versions in place of the retired models below:
78
79- Claude Opus 4.1 → Claude Opus 4.5, Claude Opus 4.6, or Claude Opus 4.7
80- Claude Sonnet 4 → Claude Sonnet 4.5 or Claude Sonnet 4.6
81- Claude Sonnet 3.7 (retired Feb 19) → Claude Sonnet 4.5 or Claude Sonnet 4.6
82- GPT-5.1 and GPT-5 → GPT-5.2 or GPT-5.2-Codex
83- Gemini 2.5 Pro → Gemini 3.1 Pro
84- Gemini 3 Pro → Gemini 3.1 Pro
85- Gemini 2.5 Flash → Gemini 3 Flash
86
87## Usage {#usage}
88
89Any usage of a Zed-hosted model will be billed at the Zed Price (rightmost column above). See [Plans and Usage](./plans-and-usage.md) for details on Zed's plans and limits for use of hosted models.
90
91> LLMs can enter unproductive loops that require user intervention. Monitor longer-running tasks and interrupt if needed.
92
93## Context Windows {#context-windows}
94
95A context window is the maximum span of text and code an LLM can consider at once, including both the input prompt and output generated by the model.
96
97| Model | Provider | Zed-Hosted Context Window |
98| --------------------------- | --------- | ------------------------- |
99| Claude Opus 4.5 | Anthropic | 200k |
100| Claude Opus 4.6 | Anthropic | 1M |
101| Claude Opus 4.7 | Anthropic | 1M |
102| Claude Sonnet 4.5 | Anthropic | 200k |
103| Claude Sonnet 4.6 | Anthropic | 1M |
104| Claude Haiku 4.5 | Anthropic | 200k |
105| GPT-5.4 pro | OpenAI | 400k |
106| GPT-5.4 | OpenAI | 400k |
107| GPT-5.3-Codex | OpenAI | 400k |
108| GPT-5.2 | OpenAI | 400k |
109| GPT-5.2-Codex | OpenAI | 400k |
110| GPT-5 mini | OpenAI | 400k |
111| GPT-5 nano | OpenAI | 400k |
112| Gemini 3.1 Pro | Google | 200k |
113| Gemini 3 Flash | Google | 200k |
114| Grok 4 | X.ai | 128k |
115| Grok 4 Fast | X.ai | 128k |
116| Grok 4 Fast (Non-Reasoning) | X.ai | 128k |
117| Grok Code Fast 1 | X.ai | 256k |
118
119> Context window limits for hosted Gemini 3.1 Pro/3 Flash may increase in future releases.
120
121Each Agent thread in Zed maintains its own context window.
122The more prompts, attached files, and responses included in a session, the larger the context window grows.
123
124Start a new thread for each distinct task to keep context focused.
125
126## Tool Calls {#tool-calls}
127
128Models can use [tools](./tools.md) to interface with your code, search the web, and perform other useful functions.