# Models

Zed’s plans offer hosted versions of major LLMs, generally with higher rate limits than individual API keys.
We’re working hard to expand the models supported by Zed’s subscription offerings, so please check back often.

| Model             | Provider  | Burn Mode | Context Window | Price per Prompt | Price per Request |
| ----------------- | --------- | --------- | -------------- | ---------------- | ----------------- |
| Claude 3.5 Sonnet | Anthropic | ❌        | 60k            | $0.04            | N/A               |
| Claude 3.7 Sonnet | Anthropic | ❌        | 120k           | $0.04            | N/A               |
| Claude 3.7 Sonnet | Anthropic | ✅        | 200k           | N/A              | $0.05             |
| Claude Sonnet 4   | Anthropic | ❌        | 120k           | $0.04            | N/A               |
| Claude Sonnet 4   | Anthropic | ✅        | 200k           | N/A              | $0.05             |

## Usage {#usage}

The models above can be used with the prompts included in your plan. For models not marked with [“Burn Mode”](#burn-mode), each prompt is counted against the monthly limit of your plan.

If you’ve exceeded your limit for the month and are on a paid plan, you can enable usage-based pricing to continue using models for the rest of the month. See [Plans and Usage](./plans-and-usage.md) for more information.

Non-Burn Mode usage allows up to 25 tool calls per prompt. If your prompt extends beyond 25 tool calls, Zed will ask if you’d like to continue; continuing consumes a second prompt.
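
The 25-tool-call allowance can be turned into a quick estimate of how many prompts a single task consumes. The sketch below is illustrative only: the function name is hypothetical, and it assumes you always choose to continue when Zed asks.

```python
import math

# The 25-call allowance comes from the text above. Everything else here
# (names, the "always continue" behaviour) is an assumption for illustration.
TOOL_CALLS_PER_PROMPT = 25


def prompts_consumed(tool_calls: int) -> int:
    """Estimate prompts counted for one non-Burn Mode task making `tool_calls` tool calls."""
    if tool_calls < 0:
        raise ValueError("tool_calls must be non-negative")
    # The initial prompt always counts, even if the model uses no tools.
    return max(1, math.ceil(tool_calls / TOOL_CALLS_PER_PROMPT))


if __name__ == "__main__":
    for calls in (0, 25, 26, 60):
        print(f"{calls} tool calls -> {prompts_consumed(calls)} prompt(s)")
```

Under these assumptions, a task that stays within 25 tool calls costs one prompt, and the 26th tool call is what triggers the second.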

## Burn Mode {#burn-mode}

> Note: "Burn Mode" is the new name for what was previously called "Max Mode".
> Currently, the new terminology is only available in Preview and will follow to Stable in the next version.

In Burn Mode, we enable models to use [large context windows](#context-windows), unlimited tool calls, and other capabilities for expanded reasoning, to allow an unfettered agentic experience.

Because of the increased cost to Zed, each subsequent request beyond the initial user prompt in Burn Mode models is counted as a prompt for metering.

In addition, usage-based pricing per request is slightly more expensive for Burn Mode models ($0.05 per request in the table above) than usage-based pricing per prompt for regular models ($0.04 per prompt).
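
To make the price difference concrete, here is a rough sketch of usage-based cost in each mode. The prices come from the table at the top of this page; the function names are hypothetical, and the sketch assumes the initial user prompt in Burn Mode is billed as one request.

```python
from decimal import Decimal

# Prices copied from the model table on this page.
PRICE_PER_PROMPT = Decimal("0.04")   # regular (non-Burn Mode) models
PRICE_PER_REQUEST = Decimal("0.05")  # Burn Mode models


def normal_mode_cost(prompts: int) -> Decimal:
    """Usage-based cost of `prompts` prompts on a regular model."""
    return PRICE_PER_PROMPT * prompts


def burn_mode_cost(follow_up_requests: int) -> Decimal:
    """Usage-based cost of one Burn Mode prompt plus its follow-up requests.

    Assumption: the initial prompt is billed as one request, and each
    subsequent request adds one more.
    """
    return PRICE_PER_REQUEST * (1 + follow_up_requests)


if __name__ == "__main__":
    print("normal, 1 prompt:", normal_mode_cost(1))
    print("burn, 1 prompt + 9 follow-up requests:", burn_mode_cost(9))
```

These figures only apply once usage-based pricing is in effect; prompts included in your plan are metered by count rather than billed individually.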

> Note that the Agent Panel using a Burn Mode model may consume a significant share of your monthly prompt capacity if many tool calls are used.
> We encourage you to think through which model is best for your needs before leaving the Agent Panel to work.

By default, all threads and [text threads](./text-threads.md) start in normal mode.
However, you can use the `agent.preferred_completion_mode` setting to have Burn Mode activated by default.
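
As a sketch of what that setting might look like, the snippet below builds the corresponding settings fragment and prints it as JSON. Two things here are assumptions not confirmed by this page: that the dotted name maps to a nested `agent` object in your settings file, and that `"burn"` is the value that selects Burn Mode (on versions that still say "Max Mode", the value may differ). The helper name is hypothetical.

```python
import json


def with_burn_mode_default(settings: dict) -> dict:
    """Return a copy of `settings` with Burn Mode as the preferred completion mode.

    Assumptions: `agent.preferred_completion_mode` is stored as a nested
    "agent" object, and "burn" is the accepted value. Verify both against
    the settings reference for your Zed version.
    """
    updated = dict(settings)
    agent = dict(updated.get("agent", {}))  # copy so the input is not mutated
    agent["preferred_completion_mode"] = "burn"
    updated["agent"] = agent
    return updated


if __name__ == "__main__":
    # Prints a fragment you could merge into your settings by hand.
    print(json.dumps(with_burn_mode_default({}), indent=2))
```

The helper only produces a fragment to copy; it deliberately does not read or write your actual settings file.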

## Context Windows {#context-windows}

A context window is the maximum span of text and code an LLM can consider at once, including both the input prompt and output generated by the model.

In [Burn Mode](#burn-mode), we increase context window size to allow models to have enhanced reasoning capabilities.

Each Agent thread and text thread in Zed maintains its own context window.
The more prompts, attached files, and responses a thread includes, the more of its context window is consumed.

For best results, we recommend a purpose-based approach to Agent thread management: start a new thread for each unique task.
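
To get a feel for how quickly a thread fills up, the sketch below estimates how much of the window remains after adding prompts, files, and responses. The 120k and 200k limits are the Claude Sonnet 4 figures from the table above. The four-characters-per-token ratio is a rough rule of thumb rather than the tokenizer any of these models uses, and all names are hypothetical.

```python
import math

# Rough heuristic only; real token counts depend on the model's tokenizer.
CHARS_PER_TOKEN = 4

# Context window sizes for Claude Sonnet 4, taken from the table on this page.
CONTEXT_LIMITS = {"normal": 120_000, "burn": 200_000}


def estimate_tokens(text: str) -> int:
    """Approximate the token count of `text` from its character length."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def remaining_tokens(thread_items: list[str], mode: str = "normal") -> int:
    """Estimate tokens left in the window after all items in a thread.

    `thread_items` stands in for the prompts, attached files, and
    responses that make up one thread.
    """
    used = sum(estimate_tokens(item) for item in thread_items)
    return CONTEXT_LIMITS[mode] - used


if __name__ == "__main__":
    thread = ["Refactor the parser module.", "x" * 40_000]
    print("normal mode, tokens left:", remaining_tokens(thread, "normal"))
    print("burn mode, tokens left:", remaining_tokens(thread, "burn"))
```

A result near zero or below it is a hint that the task would be better served by a fresh thread.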

## Tool Calls {#tool-calls}

Models can use [tools](./tools.md) to interface with your code, search the web, and perform other useful functions.

In [Burn Mode](#burn-mode), models can make an unlimited number of tool calls per prompt, with each tool call counting as a prompt for metering purposes.

For non-Burn Mode models, you’ll need to interact with the model every 25 tool calls to continue, at which point a new prompt will be counted against your plan limit.