GPT-5 Rumors
Does OpenAI have much more powerful models than they admit?
OpenAI already is heavily restricting access to their new o3 model.
Was OpenAIโs multi-year total pause in SOTA model training, after the completion of GPT-4, just to help kill off potentially-competing startups?
Does OpenAI have much more powerful models than they admit?
OpenAI already is heavily restricting access to their new o3 model.
Was OpenAIโs multi-year total pause in SOTA model training, after the completion of GPT-4, just to help kill off potentially-competing startups?
๐ฏ20๐คฃ9๐8๐7๐คฏ4โ2๐ฟ2๐จ1๐1
OpenAI Introduces โOperatorโ
Today weโre releasing Operatorโ (opens in a new window), an agent that can go to the web to perform tasks for you. Using its own browser, it can look at a webpage and interact with it by typing, clicking, and scrolling.
To ensure a safe and iterative rollout, we are starting small. Starting today, Operator is available to Pro users in the U.S. at operator.chatgpt.comโ
Operator is powered by a new model called Computer-Using Agent (CUA). Combining GPT-4o's vision capabilities with advanced reasoning through reinforcement learning, CUA is trained to interact with graphical user interfaces (GUIs)โthe buttons, menus, and text fields people see on a screen.
Operator can โseeโ (through screenshots) and โinteractโ (using all the actions a mouse and keyboard allow) with a browser, enabling it to take action on the web without requiring custom API integrations.
To get started, simply describe the task youโd like done and Operator can handle the rest. Users can choose to take over control of the remote browser at any point, and Operator is trained to proactively ask the user to take over for tasks that require login, payment details, or when solving CAPTCHAs.
OpenAI Announcement
Today weโre releasing Operatorโ (opens in a new window), an agent that can go to the web to perform tasks for you. Using its own browser, it can look at a webpage and interact with it by typing, clicking, and scrolling.
To ensure a safe and iterative rollout, we are starting small. Starting today, Operator is available to Pro users in the U.S. at operator.chatgpt.comโ
Operator is powered by a new model called Computer-Using Agent (CUA). Combining GPT-4o's vision capabilities with advanced reasoning through reinforcement learning, CUA is trained to interact with graphical user interfaces (GUIs)โthe buttons, menus, and text fields people see on a screen.
Operator can โseeโ (through screenshots) and โinteractโ (using all the actions a mouse and keyboard allow) with a browser, enabling it to take action on the web without requiring custom API integrations.
To get started, simply describe the task youโd like done and Operator can handle the rest. Users can choose to take over control of the remote browser at any point, and Operator is trained to proactively ask the user to take over for tasks that require login, payment details, or when solving CAPTCHAs.
OpenAI Announcement
๐ฑ21๐15๐4๐4๐ฅ1๐1๐ฏ1๐1๐1
This media is not supported in your browser
VIEW IN TELEGRAM
OpenAIโs Operator reading through hotel reviews on Tripadvisor to find the best hotel sauna in Stockholm
๐คฏ30๐11๐7๐5๐ฟ2
This media is not supported in your browser
VIEW IN TELEGRAM
A more realistic Hal 9000
๐คฃ82๐8๐ฅ2
Forwarded from DoomPosting
OpenAI releases o3 and o4-mini
Highlights,
(1) Reinforcement learning training โ a DECADE after DeepMind thoroughly showed that the RL approach is the way to go, and AFTER OpenAI getting thoroughly wrecked by DeepSeek who heavily used the RL approach โ OpenAI FINALLY gets off of their retarded wordcel asses, and starts to shift in the rotator direction, finally making much heavier use of RL training. OpenAI also pretends to be surprised in further confirming the ~unbounded RL training scaling laws that DeepMind thoroughly confirmed a decade ago.
(2) Real thinking with images โ OpenAI FINALLY making images more of a first-class medium, โFor the first time, these models can integrate images directly into their chain of thought. They donโt just see an imageโthey think with it.โ
(3) Codex CLI โ apparently a new AI agent, through which their latest models were trained to be good at using the terminal - huge for giving the AI real agency, instead of constantly needing help from humans for mundane things
(4) Agentic tool use โ OpenAI FINALLY gives their top models full access to all their internal โtoolsโ, and heavily trains the models with RL to best use the tools. Big question hear is WHY THE F&$& did they wait years before finally giving their top models full tool access. OpenAI had been weirdly restricting their top models to different arbitrary subsets of the tools up to now.
^^ OpenAI has finally done the few things critical needed to enable powerful agentic AIs
New huge AI wave finally inbound?
OpenAI Announcement
๐ณ๐พ๐พ๐ผ๐ฟ๐พ๐ ๐ ๐ธ๐ฝ๐ถ
Highlights,
(1) Reinforcement learning training โ a DECADE after DeepMind thoroughly showed that the RL approach is the way to go, and AFTER OpenAI getting thoroughly wrecked by DeepSeek who heavily used the RL approach โ OpenAI FINALLY gets off of their retarded wordcel asses, and starts to shift in the rotator direction, finally making much heavier use of RL training. OpenAI also pretends to be surprised in further confirming the ~unbounded RL training scaling laws that DeepMind thoroughly confirmed a decade ago.
(2) Real thinking with images โ OpenAI FINALLY making images more of a first-class medium, โFor the first time, these models can integrate images directly into their chain of thought. They donโt just see an imageโthey think with it.โ
(3) Codex CLI โ apparently a new AI agent, through which their latest models were trained to be good at using the terminal - huge for giving the AI real agency, instead of constantly needing help from humans for mundane things
(4) Agentic tool use โ OpenAI FINALLY gives their top models full access to all their internal โtoolsโ, and heavily trains the models with RL to best use the tools. Big question hear is WHY THE F&$& did they wait years before finally giving their top models full tool access. OpenAI had been weirdly restricting their top models to different arbitrary subsets of the tools up to now.
^^ OpenAI has finally done the few things critical needed to enable powerful agentic AIs
New huge AI wave finally inbound?
OpenAI Announcement
๐ณ๐พ๐พ๐ผ๐ฟ๐พ๐ ๐ ๐ธ๐ฝ๐ถ
๐5๐4๐3๐ฏ1
Forwarded from DoomPosting
OpenAI Releases Codex
Codex CLI is built for developers who already live in the terminal and want ChatGPTโlevel reasoning plus the power to actually run code, manipulate files, and iterate โ all under version control. In short, itโs chatโdriven development that understands and executes your repo.
โข Zero setup โ bring your OpenAI API key and it just works!
โข Full auto-approval, while safe + secure by running network-disabled and directory-sandboxed
โข Multimodal โ pass in screenshots or diagrams to implement features
โข Fully open-source
Usage examples:
Interactive REPL:
Initial prompt for interactive REPL:
Nonโinteractive "quiet modeโ:
OpenAI Codex
๐ณ๐พ๐พ๐ผ๐ฟ๐พ๐ ๐ ๐ธ๐ฝ๐ถ
Codex CLI is built for developers who already live in the terminal and want ChatGPTโlevel reasoning plus the power to actually run code, manipulate files, and iterate โ all under version control. In short, itโs chatโdriven development that understands and executes your repo.
โข Zero setup โ bring your OpenAI API key and it just works!
โข Full auto-approval, while safe + secure by running network-disabled and directory-sandboxed
โข Multimodal โ pass in screenshots or diagrams to implement features
โข Fully open-source
Usage examples:
Interactive REPL:
codex
Initial prompt for interactive REPL:
codex "fix lint errorsโ
Nonโinteractive "quiet modeโ:
codex -q --json "explain utils.tsโ
OpenAI Codex
๐ณ๐พ๐พ๐ผ๐ฟ๐พ๐ ๐ ๐ธ๐ฝ๐ถ
๐26๐ฅ6โคโ๐ฅ5๐5๐พ2๐ฅฐ1๐จ1