AI Models
TeXRA supports models from multiple providers. Select models from the dropdown in the TeXRA UI. Hover over an option to see its context window and cost estimate.
The model picker: monospace model ids with a T badge on thinking variants, and a hover popover showing context window and per-1M token pricing.
Model ID suffixes:
T= Thinking/reasoning mode enabled (shows chain-of-thought)-= Lighter/faster variant- Numbers indicate version (e.g.,
45= 4.5,25= 2.5)
Anthropic Models
| Model ID | Use Case | Cost | Speed |
|---|---|---|---|
fable5 | Most capable, always-on adaptive thinking | $$$$ | Slow |
opus48T | Top-tier reasoning, long-horizon work | $$$$ | Slow |
opus48 | Most capable for agentic coding | $$$$ | Slow |
sonnet46T | All-rounder with reasoning | $$$ | Medium |
sonnet46 | Strong all-rounder | $$$ | Medium |
haiku45T | Fast with reasoning | $$ | Fast |
haiku45 | Fast responses | $$ | Fast |
Fable 5, Opus 4.8, and Sonnet 4.6 include the full 1M context window at standard pricing — no opt-in or beta header required. Other Claude models use a 200K context window.
Claude Fable 5 (fable5) is Anthropic's most capable model. Thinking is always on — adaptive, with summarized reasoning — so there is no separate T variant. It supports the full reasoning-effort range up to Extra High and the top Max tier, and is eligible for context compaction in tool-use mode.
Claude Opus 4.8 uses adaptive thinking only (extended thinking with a manual budget_tokens is no longer accepted). TeXRA's reasoning-effort selector maps to Anthropic's effort levels automatically — pick opus48T with Extra High (or the top Max tier) effort for the strongest agentic coding and long-horizon tasks. Opus 4.8 also supports high-resolution images (up to 2576px / 3.75MP) for better figure, chart, and screenshot understanding; note that TeXRA downscales images above texra.maxImageDimension (default 2000px) before sending, so raise that setting if you want to take full advantage of Opus 4.8's higher limit.
OpenAI Models
| Model ID | Use Case | Cost | Speed |
|---|---|---|---|
gpt55pro | Top-tier reasoning, 1M context | $$$$$ | Slow |
gpt55 | Flagship reasoning | $$$$ | Medium |
gpt54 | Mid-tier reasoning | $$$ | Medium |
gpt54- | Lower-cost reasoning | $$ | Fast |
gpt54-- | Budget reasoning | $ | Fast |
GPT-5.5 is OpenAI's latest flagship and the model TeXRA pins the Codex integration to. GPT-5.5 Pro (gpt55pro) extends GPT-5.5 with a 1.05M context window and xhigh default reasoning for the hardest planning and long-horizon tasks, at premium pricing — it is hidden by default; enable it from Settings → Models when you need it. The GPT Pro variants (gpt5pro, gpt52pro, gpt55pro) charge $15-$30 per 1M input and $120-$180 per 1M output, so for one-off hard questions consider enabling the inquiry tool and pasting the answer from your own ChatGPT subscription instead of running a full agent turn against the API. gpt54 and its mini/nano variants remain the lower-cost option for most workloads. See the API reference for full capabilities.
GPT-5 reasoning summaries require account verification. Enable with texra.model.gpt5ReasoningSummary.
Google Models
| Model ID | Use Case | Cost | Speed |
|---|---|---|---|
gemini31p | Pro with reasoning, 1M context | $$$ | Medium |
gemini35f | Flash model with 1M context | $$ | Fast |
DeepSeek Models
| Model ID | Use Case | Cost | Speed |
|---|---|---|---|
deepseek | V4 Flash chat mode | $ | Fast |
deepseekT | V4 Flash with reasoning | $ | Medium |
deepseekpro | V4 Pro chat mode | $ | Medium |
deepseekproT | V4 Pro with reasoning | $ | Medium |
dsr1 | Advanced reasoning | $$ | Medium |
Moonshot Kimi Models
| Model ID | Use Case | Cost | Speed |
|---|---|---|---|
kimi26T | K2.6 with thinking mode | $$$ | Medium |
kimi26 | K2.6, agent tasks | $$$ | Medium |
DashScope Qwen Models
| Model ID | Use Case | Cost | Speed |
|---|---|---|---|
qwenplus | Hybrid thinking, 1M context | $$ | Medium |
qwenturbo | Fast with optional thinking | $ | Fast |
MiniMax Models
| Model ID | Use Case | Cost | Speed |
|---|---|---|---|
minimaxM27 | Flagship with interleaved thinking | $$ | Medium |
minimaxM25 | Strong all-rounder | $$ | Medium |
MiniMax uses interleaved thinking (chain-of-thought woven into responses). API keys are region-specific — international keys (api.minimax.io) and China keys (api.minimaxi.com) are not interchangeable. Toggle the region in the Models tab.
- International: Get your API key at platform.minimax.io
- China: Get your API key at platform.minimaxi.com
- Coding Plan: MiniMax offers monthly subscription plans ($10/$20/$50/mo) as an alternative to pay-as-you-go. Coding Plan keys are not interchangeable with standard API keys — enter your Coding Plan key via "Set API Key" as usual. Subscribe here.
GLM (Zhipu AI / Z.AI) Models
| Model ID | Use Case | Cost | Speed |
|---|---|---|---|
glm5 | Flagship open-source model | $$$ | Medium |
glm5turbo | Fast inference, agent-optimized | $$$ | Medium |
glm47 | Programming and multi-step reasoning | $$ | Medium |
glm46v | Multimodal vision model | $$ | Medium |
glm45 | Hybrid reasoning (MoE) | $$ | Medium |
GLM models support thinking mode (reasoning is shown inline). The API uses a non-standard base path (/api/paas/v4), which TeXRA handles automatically.
- International (Z.AI): Get your API key at z.ai — endpoint: api.z.ai
- China (BigModel): Get your API key at open.bigmodel.cn — endpoint: open.bigmodel.cn (default)
- Coding Plan: GLM offers monthly subscription plans as an alternative to pay-as-you-go, with access to all GLM models. Coding Plan uses a separate endpoint (
/api/coding/paas/v4). Enable the "Coding Plan" toggle in the Models tab. Subscribe here.
Grok / xAI Models
| Model ID | Use Case | Cost | Speed |
|---|---|---|---|
grok4 | Large context (256k), reasoning | $$$ | Medium |
grok3 | Large context (131k) | $$$ | Medium |
grok2v | Vision-enabled | $$ | Medium |
Choosing a Model
Pick a model by intent: each use case maps to a short list of recommended model ids.
Customizing the Model List
Choose which models appear in the extension picker from the Dashboard → Models tab — toggle them on or off per provider, no JSON required (the choice is saved in the extension).
In the CLI TUI, use /model after a chat starts to switch among models that are runnable in the active API mode. Startup also asks for a model when the launcher flow needs one after the agent or team choice.
For headless CLI runs, list what's available with texra models list (or texra models show <id> for details), then pick a default for your project by setting the model key in .texra/config.json, or override per run with --model <id>.
Using OpenRouter
To access additional models or alternative pricing:
- Get an OpenRouter API key
- Add via
TeXRA: Set API Keycommand - In the Dashboard → Models tab → API Configuration, expand the OpenRouter row and enable "Use OpenRouter for All Models"
Expanding any provider's row in API Configuration reveals its key field plus the per-provider toggles described here and under Streaming:
Expand a provider's API Configuration row to reveal its masked key field, the per-provider Enable streaming toggle, and Use OpenRouter for All Models.
Streaming
Streaming is configured per provider rather than through a single global setting. Open the Dashboard → Models tab, expand a provider's API Configuration, and toggle streaming there (see the expanded row above). Enabling it makes long responses arrive incrementally instead of in one large reply.
Next Steps
- Built-in Agents: See which agents work with different models
- Configuration: Model-related settings