- Speed is the core feature: Built on NVIDIA GB200 NVL72 (Blackwell) clusters, GPT-5.3 Codex generates a complete React component in 4.2 seconds and scores 77.3% on Terminal Bench 2.0 — 13.3 points ahead of its predecessor.
- Lives in the terminal, not the chat window: The model is trained to execute CLI commands, run tests, fix lint errors, and manipulate files directly on the local codebase — not just generate passive code snippets.
- The –steerable flag is a game changer: Real-time intervention lets developers pause the output stream mid-generation, inject corrections, and redirect the agent without waiting for a completed (potentially wrong) result.
- Know its limits: GPT-5.3 Codex suffers from context drift on long-horizon tasks with unstructured documents, skips clarifying questions (risking fast hallucinations), and over-refuses legitimate security and refactoring tasks due to conservative filters.
📖 This article is part of our comprehensive ChatGPT guide. Read the full guide →
OpenAI releases GPT-5.3 Codex and makes a radical pivot from pure reasoning depth to extreme inference speed and direct terminal integration. The model dominates with 77.3 percent accuracy in CLI tasks and positions itself as an “interactive teammate” that deliberately prioritizes latency and control over the absolute autonomy of its competitors. We classify the specs and the decisive comparison with Claude Opus 4.6. Read our in-depth review of Claude Opus 4.6 — the depth-focused rival.