Grok-code-fast-1 is now out and available for everyone to use ๐๐๏ธ๐จ
When I joined the coding team, the team was just 3 people and we very quickly built a model which was SOTA on SWEBench. But as things go, in the real world benchmarks matter less. Over the last few months we approached the modelling + data + infra perspective from a different lens, putting developers and users first over everything else.
This required us to tune the data recipe, get the infra in place to do a lot of rollouts and create a set of grounded evals which was powered by both human judgement and an in-house auto-evaluation framework that captured real world usability.
This is the first model of many in the grok coding family, we are going to make quick iterations and improve the model performance over time.
We thrive on your feedback, please share your unfiltered and honest thoughts so we can keep pushing new boundaries for agentic coding
Introducing Grok Code Fast 1, a speedy and economical reasoning model that excels at agentic coding.
Now available for free on GitHub Copilot, Cursor, Cline, Kilo Code, Roo Code, opencode, and Windsurf.
https://t.co/3tMbmLbxOP
Bug fixes shipping to Grok Build 0.2.16 (release notes will be available in the TUI and on change-log website)
โข fix stuck "Starting session..." / "Connecting MCPs (0/0)..."
โข show where external skills/mcps/hooks/plugins originate in grok inspect
โข fix Terminal app unsafe shortcuts
โข render streaming bash output
โข use grok to derive /loop interval from the request instead of hardcoded parsing
โข fix parent session state (background tasks, monitor) leaking into subagent conversations
โข make Claude/Cursor skills, AGENTS, mcps, and plugins configurable
โข wire --permission-mode acceptEdits to auto-approve edits
โข share FS watchers by working directory
โข inject UTF-8 env defaults for Windows child processes
โข fix blank completed bash / code-execution cards
โข fix scroll wrap on scrolling vs. shortcuts on all menus
Bug fixes shipping to Grok Build 0.2.13 (release notes will be available in the TUI and on change-log website)
We are leveraging the alt-screen to better handle your background tasks, subagents, monitors with smart grouping allowing you to navigate between them quickly
โข Group Subagents โ Tasks โ Watchers, within subagents, order by agent type (Explore, General, Plan)
โข Show timestamps on pinned user messages at top of scrollback
โข Fix command highlighting when prompt has paste chips
โข Update the context-usage indicator to show tokens by default
โข ANSI16 fallback for themes
โข Better context usage breakdown/rendering
โข Update highlighting for /loop, monitor, tag colors
โข Tab now cycles Prompt โ Scrollback โ Tasks โ Prompt
โข Order gateway turn-completion after streamed content
โข Formatting: fix language-tagged fenced code blocks and fix code under a list item
We are excited for all of you to try out Composer 2.5 in Grok Build starting today!
To use composer-2-5 do `/model` in Grok Build and type in Composer to switch
Composer 2.5 comes with 200k context window and supports: subagents, MCPs, skills and additionally also works with your .cursor settings
Grok Build: X search and backend web search are rolling out again, we will complete it over the next few hours; currently at 25%
cache hit rates and ttft are healthy again
Bug fixes shipping to Grok Build 0.2.11 (release notes will be available in the TUI and on change-log website)
- Make tab bar arrow-focusable in picker modals (MCPs, plugins, hooks)
- Show "Switched to mode" banner above prompt for `Shift+Tab` cycles
- Instant loading indicator on model switch
- Fix broken branch glyph on Windows
- Fix WSL `Ctrl+V` image paste
- Fix UX bug where `/context` always showed "Auto-compact at 85%"
- Fix slash command autocomplete and rendering
- Boost terminal video playback to 30fps
- `Left`/`Right` arrows switch tabs in extensions modal
- `Up`/`Down` arrows at list edges focus search input
- Fix terminal resize on multiplexer (zellij/tmux)
- Open buttons for imagine media output
- Render streaming bash tool output
- Increase default retry budget to ~5 min and simplify retry UI
@abdurrahmanregi@Jaaneek this will work if the model is trained and can handle OOD traces, without doing evals supporting a mode like this can lead to not so optimal traces