Flash News

OpenAI Launches Chrome Extension for Codex, Enabling Task Execution Directly in Browser

OpenAI announced that Codex now officially supports a Chrome extension, which Mac and Windows users can install in the Codex app starting today.

The extension allows Codex to run in the background across multiple tabs in the browser without taking over the user's current interface, overcoming previous limitations that were primarily confined to terminals, IDEs, and cloud environments.

Users can handle scenarios requiring login, such as querying internal backends, updating CRM records, and completing complex multi-page form processes. Codex writes and executes code in real-time to operate the page and intelligently selects between plugin or Chrome methods.

Source: Public Information

ABAB AI Insight

OpenAI previously launched the Codex app, IDE extensions, and cloud proxies. This Chrome extension further extends the agent's capabilities from isolated environments to real user browser sessions, continuing its product path of "enabling AI to operate within users' existing toolchains."

From a capital perspective, OpenAI is expanding the use cases of Codex from developer code editing to enterprise business process automation by offering the extension for free to paid users. This shifts resources from model inference optimization to deep integration with browser protocols, aiming to enhance enterprise subscription stickiness and capture the AI agent entry point in browsers, differentiating itself from competitors like Microsoft Copilot.

Similar to Anthropic Claude's deployment across Microsoft 365 and the rise of browser-based agent tools like Browserbase, AI programming agents are currently transitioning from code generation to full browser operation control.

This represents a fundamental restructuring of the industry chain: browsers are evolving from "information consumption interfaces" to "programmable execution environments." The mechanism lies in Codex driving mixed operations of DOM and APIs with code, eliminating the script maintenance costs of traditional RPA tools, allowing agent capabilities to seamlessly integrate into already logged-in enterprise systems, thus shifting pricing power from single development tools to AI agent platforms covering entire workflows.

ABAB News · Cognitive Laws

The browser is not just a window; it is the next operating system.
When agents become truly powerful, users won't need to switch interfaces; AI will switch tabs itself.
If it can log in, it can automate; whoever controls the real session controls the execution power.

Source

·ABAB News
·
2 min read
·3d ago
分享: