---
title: AI Models and Pricing - Zed
description: AI models available via Zed Pro including Claude, GPT-5.4, Gemini 3.1 Pro, and Grok. Pricing, context windows, and tool call support.
---

# Models

Zed's plans offer hosted versions of major LLMs with higher rate limits than direct API access. Model availability is updated regularly. To use your own API keys instead, see [LLM Providers](./llm-providers.md). For general setup, see [Configuration](./configuration.md).

> **Note:** Claude Opus models and GPT-5.4 pro are not available on the [Student plan](./plans-and-usage.md#student).

| Model                  | Provider  | Token Type          | Provider Price per 1M tokens | Zed Price per 1M tokens |
| ---------------------- | --------- | ------------------- | ---------------------------- | ----------------------- |
| Claude Opus 4.5        | Anthropic | Input               | $5.00                        | $5.50                   |
|                        | Anthropic | Output              | $25.00                       | $27.50                  |
|                        | Anthropic | Input - Cache Write | $6.25                        | $6.875                  |
|                        | Anthropic | Input - Cache Read  | $0.50                        | $0.55                   |
| Claude Opus 4.6        | Anthropic | Input               | $5.00                        | $5.50                   |
|                        | Anthropic | Output              | $25.00                       | $27.50                  |
|                        | Anthropic | Input - Cache Write | $6.25                        | $6.875                  |
|                        | Anthropic | Input - Cache Read  | $0.50                        | $0.55                   |
| Claude Opus 4.7        | Anthropic | Input               | $5.00                        | $5.50                   |
|                        | Anthropic | Output              | $25.00                       | $27.50                  |
|                        | Anthropic | Input - Cache Write | $6.25                        | $6.875                  |
|                        | Anthropic | Input - Cache Read  | $0.50                        | $0.55                   |
| Claude Sonnet 4.5      | Anthropic | Input               | $3.00                        | $3.30                   |
|                        | Anthropic | Output              | $15.00                       | $16.50                  |
|                        | Anthropic | Input - Cache Write | $3.75                        | $4.125                  |
|                        | Anthropic | Input - Cache Read  | $0.30                        | $0.33                   |
| Claude Sonnet 4.6      | Anthropic | Input               | $3.00                        | $3.30                   |
|                        | Anthropic | Output              | $15.00                       | $16.50                  |
|                        | Anthropic | Input - Cache Write | $3.75                        | $4.125                  |
|                        | Anthropic | Input - Cache Read  | $0.30                        | $0.33                   |
| Claude Haiku 4.5       | Anthropic | Input               | $1.00                        | $1.10                   |
|                        | Anthropic | Output              | $5.00                        | $5.50                   |
|                        | Anthropic | Input - Cache Write | $1.25                        | $1.375                  |
|                        | Anthropic | Input - Cache Read  | $0.10                        | $0.11                   |
| GPT-5.4 pro            | OpenAI    | Input               | $30.00                       | $33.00                  |
|                        | OpenAI    | Output              | $180.00                      | $198.00                 |
| GPT-5.4                | OpenAI    | Input               | $2.50                        | $2.75                   |
|                        | OpenAI    | Output              | $15.00                       | $16.50                  |
|                        | OpenAI    | Cached Input        | $0.025                       | $0.0275                 |
| GPT-5.3-Codex          | OpenAI    | Input               | $1.75                        | $1.925                  |
|                        | OpenAI    | Output              | $14.00                       | $15.40                  |
|                        | OpenAI    | Cached Input        | $0.175                       | $0.1925                 |
| GPT-5.2                | OpenAI    | Input               | $1.75                        | $1.925                  |
|                        | OpenAI    | Output              | $14.00                       | $15.40                  |
|                        | OpenAI    | Cached Input        | $0.175                       | $0.1925                 |
| GPT-5.2-Codex          | OpenAI    | Input               | $1.75                        | $1.925                  |
|                        | OpenAI    | Output              | $14.00                       | $15.40                  |
|                        | OpenAI    | Cached Input        | $0.175                       | $0.1925                 |
| GPT-5 mini             | OpenAI    | Input               | $0.25                        | $0.275                  |
|                        | OpenAI    | Output              | $2.00                        | $2.20                   |
|                        | OpenAI    | Cached Input        | $0.025                       | $0.0275                 |
| GPT-5 nano             | OpenAI    | Input               | $0.05                        | $0.055                  |
|                        | OpenAI    | Output              | $0.40                        | $0.44                   |
|                        | OpenAI    | Cached Input        | $0.005                       | $0.0055                 |
| Gemini 3.1 Pro         | Google    | Input               | $2.00                        | $2.20                   |
|                        | Google    | Output              | $12.00                       | $13.20                  |
| Gemini 3 Flash         | Google    | Input               | $0.50                        | $0.55                   |
|                        | Google    | Output              | $3.00                        | $3.30                   |
| Grok 4                 | X.ai      | Input               | $3.00                        | $3.30                   |
|                        | X.ai      | Output              | $15.00                       | $16.50                  |
|                        | X.ai      | Cached Input        | $0.75                        | $0.825                  |
| Grok 4 Fast            | X.ai      | Input               | $0.20                        | $0.22                   |
|                        | X.ai      | Output              | $0.50                        | $0.55                   |
|                        | X.ai      | Cached Input        | $0.05                        | $0.055                  |
| Grok 4 Fast (Non-Reasoning) | X.ai | Input               | $0.20                        | $0.22                   |
|                        | X.ai      | Output              | $0.50                        | $0.55                   |
|                        | X.ai      | Cached Input        | $0.05                        | $0.055                  |
| Grok Code Fast 1       | X.ai      | Input               | $0.20                        | $0.22                   |
|                        | X.ai      | Output              | $1.50                        | $1.65                   |
|                        | X.ai      | Cached Input        | $0.02                        | $0.022                  |

## Recent Model Retirements

As of February 19, 2026, Zed Pro serves newer model versions in place of the retired models below:

- Claude Opus 4.1 → Claude Opus 4.5, Claude Opus 4.6, or Claude Opus 4.7
- Claude Sonnet 4 → Claude Sonnet 4.5 or Claude Sonnet 4.6
- Claude Sonnet 3.7 (retired Feb 19) → Claude Sonnet 4.5 or Claude Sonnet 4.6
- GPT-5.1 and GPT-5 → GPT-5.2 or GPT-5.2-Codex
- Gemini 2.5 Pro → Gemini 3.1 Pro
- Gemini 3 Pro → Gemini 3.1 Pro
- Gemini 2.5 Flash → Gemini 3 Flash

## Usage {#usage}

Usage of any Zed-hosted model is billed at the Zed Price (the rightmost column of the pricing table above). See [Plans and Usage](./plans-and-usage.md) for details on Zed's plans and their limits on hosted-model usage.
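As a rough illustration of how the Zed Price applies, the sketch below estimates the cost of a single request from its token counts. The rates are copied from the Claude Sonnet 4.6 rows of the pricing table; the `estimate_cost` helper, its keys, and the example token counts are hypothetical, and actual billing may count or round tokens differently.

```python
from decimal import Decimal

# Zed Price per 1M tokens for Claude Sonnet 4.6, copied from the pricing table.
ZED_PRICE_PER_MILLION = {
    "input": Decimal("3.30"),
    "output": Decimal("16.50"),
    "cache_write": Decimal("4.125"),
    "cache_read": Decimal("0.33"),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(tokens_by_type: dict[str, int]) -> Decimal:
    """Return an estimated cost in USD for the given token counts.

    Hypothetical helper: keys must match ZED_PRICE_PER_MILLION.
    """
    total = Decimal(0)
    for token_type, count in tokens_by_type.items():
        rate = ZED_PRICE_PER_MILLION[token_type]
        total += rate * Decimal(count) / TOKENS_PER_MILLION
    return total


# Example: 120k input tokens, 8k output tokens, 50k tokens read from cache.
cost = estimate_cost({"input": 120_000, "output": 8_000, "cache_read": 50_000})
print(f"Estimated cost: ${cost:.4f}")  # prints "Estimated cost: $0.5445"
```

`Decimal` is used instead of `float` so that sub-cent rates such as $4.125 stay exact when summed.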

> LLMs can enter unproductive loops that require user intervention. Monitor longer-running tasks and interrupt if needed.

## Context Windows {#context-windows}

A context window is the maximum span of text and code an LLM can consider at once, including both the input prompt and output generated by the model.

| Model                       | Provider  | Zed-Hosted Context Window |
| --------------------------- | --------- | ------------------------- |
| Claude Opus 4.5             | Anthropic | 200k                      |
| Claude Opus 4.6             | Anthropic | 1M                        |
| Claude Opus 4.7             | Anthropic | 1M                        |
| Claude Sonnet 4.5           | Anthropic | 200k                      |
| Claude Sonnet 4.6           | Anthropic | 1M                        |
| Claude Haiku 4.5            | Anthropic | 200k                      |
| GPT-5.4 pro                 | OpenAI    | 400k                      |
| GPT-5.4                     | OpenAI    | 400k                      |
| GPT-5.3-Codex               | OpenAI    | 400k                      |
| GPT-5.2                     | OpenAI    | 400k                      |
| GPT-5.2-Codex               | OpenAI    | 400k                      |
| GPT-5 mini                  | OpenAI    | 400k                      |
| GPT-5 nano                  | OpenAI    | 400k                      |
| Gemini 3.1 Pro              | Google    | 200k                      |
| Gemini 3 Flash              | Google    | 200k                      |
| Grok 4                      | X.ai      | 128k                      |
| Grok 4 Fast                 | X.ai      | 128k                      |
| Grok 4 Fast (Non-Reasoning) | X.ai      | 128k                      |
| Grok Code Fast 1            | X.ai      | 256k                      |

> Context window limits for hosted Gemini 3.1 Pro and Gemini 3 Flash may increase in future releases.

Each Agent thread in Zed maintains its own context window.
The more prompts, attached files, and responses a thread accumulates, the more of that window it consumes.

Start a new thread for each distinct task to keep context focused.
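To make the budgeting concrete, here is a minimal sketch that checks whether a thread still fits in a model's window. The window sizes come from the table above, read as 200k = 200,000 and 1M = 1,000,000 tokens (an assumption about the table's units). The `fits_in_context` helper and the token counts are hypothetical, and real token counts depend on each model's tokenizer.

```python
# Zed-hosted context windows in tokens, taken from the table above.
# Assumes "k" means 1,000 and "M" means 1,000,000.
CONTEXT_WINDOW_TOKENS = {
    "Claude Sonnet 4.6": 1_000_000,
    "Claude Haiku 4.5": 200_000,
    "GPT-5.4": 400_000,
    "Grok 4": 128_000,
}


def fits_in_context(model: str, thread_tokens: int, reserved_output_tokens: int) -> bool:
    """Return True if the thread plus the reserved output fits in the window.

    The window covers both the input and the generated output, so room
    for the response is reserved up front.
    """
    window = CONTEXT_WINDOW_TOKENS[model]
    return thread_tokens + reserved_output_tokens <= window


# A long-running thread of 190k tokens with 16k tokens reserved for output.
print(fits_in_context("Claude Haiku 4.5", 190_000, 16_000))   # False: 206k > 200k
print(fits_in_context("Claude Sonnet 4.6", 190_000, 16_000))  # True: 206k <= 1M
```

A failing check is one signal that the task may be better continued in a fresh thread.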

## Tool Calls {#tool-calls}

Models can use [tools](./tools.md) to interface with your code, search the web, and perform other useful functions.