GPT-5.3 Codex: The autonomous coding agent is here

TL;DR — GPT-5.3 Codex at a Glance

  • Speed is the core feature: Built on NVIDIA GB200 NVL72 (Blackwell) clusters, GPT-5.3 Codex generates a complete React component in 4.2 seconds and scores 77.3% on Terminal Bench 2.0 — 13.3 points ahead of its predecessor.
  • Lives in the terminal, not the chat window: The model is trained to execute CLI commands, run tests, fix lint errors, and manipulate files directly on the local codebase — not just generate passive code snippets.
  • The --steerable flag is a game changer: Real-time intervention lets developers pause the output stream mid-generation, inject corrections, and redirect the agent without waiting for a completed (potentially wrong) result.
  • Know its limits: GPT-5.3 Codex suffers from context drift on long-horizon tasks with unstructured documents, skips clarifying questions (risking fast hallucinations), and over-refuses legitimate security and refactoring tasks due to conservative filters.

With GPT-5.3 Codex, OpenAI pivots from pure reasoning depth to inference speed and direct terminal integration. The model scores 77.3 percent on Terminal Bench 2.0 CLI tasks and positions itself as an “interactive teammate” that deliberately prioritizes latency and control over the absolute autonomy of its competitors. We break down the specs and the decisive comparison with Claude Opus 4.6. Read our in-depth review of Claude Opus 4.6, the depth-focused rival.

Crawlee: Powerful web scraping in 2 minutes

Crawlee combines high-speed HTTP scraping and modern browser automation in a single, scalable Node.js library to extract complex web data more efficiently. Intelligent fallback mechanisms that only activate browsers when necessary can save up to 90 percent of system resources compared to pure Playwright setups. We take a look at the architecture, performance benchmarks, and real-world challenges in continuous operation.

Xcode 26.3: Agentic Coding with Claude & Codex

With the release candidate of Xcode 26.3, Apple is opening up the IDE architecture to autonomous AI agents via the Model Context Protocol (MCP) for the first time. With direct access to build servers and error consoles, models can not only suggest code but also fix compilation errors independently in a “closed loop” and validate the results visually. We analyze the technical specs surrounding macOS Tahoe and why developers are warning of potential security risks.

OpenAI releases native Codex app for macOS

OpenAI has released a standalone Codex app for macOS that deeply integrates coding agents based on GPT-5.2 into the operating system. The tool relies on isolated Git worktrees to work on complex tasks in parallel in the background without blocking the developer’s active workflow in the main editor. We analyze how this asynchronous “manager” approach compares directly to Anthropic’s CLI competition.

Cowork Plugins: Build your own Claude

Anthropic is rolling out a new plugin infrastructure for Claude Cowork that integrates AI agents deeply into local file systems and workflows for the first time. Unlike OpenAI’s web-based approach, the system is based on local “config-as-code” via JSON and Markdown, enabling complex automations in isolated sandboxes. We analyze the technical specifications of the Model Context Protocol (MCP) and the critical security debate surrounding potential “prompt injections” on your own computer.

OpenAI unveils GPT-5.2 Codex: New security standards for coding agents

With an addendum to the System Card, OpenAI radically shifts the security focus of GPT-5.2 Codex from content moderation to the safety of functional capabilities. The updated model now blocks malware, obfuscation, and prompt injections directly during token generation instead of relying on external guardrails.
