TestingCatalog AI News 🗞
4.74K subscribers
2.92K photos
378 videos
40 files
3.86K links
Reporting AI nonsense. A future news media, driven by virtual assistants 🤖
BREAKING 🚨: A new Gemini checkpoint has been spotted in A/B testing.

Will we see this live? 👀

h/t x@marmaduke091
BREAKING 🚨: CLAUDE OPUS 4.6 HAS BEEN SPOTTED IN PERPLEXITY APIs!

* keep in mind that this doesn’t imply an imminent release.

h/t x@synthwavedd
BREAKING 🚨: OPENAI ANNOUNCED OPENAI FRONTIER, A NEW ENTERPRISE PLATFORM TO CREATE AND MANAGE AI COWORKERS.

"Frontier gives agents the same skills people need to succeed at work: Understand how work gets done, Use a computer and tools, Improve quality over time, Stay governed & observable"

The biggest part 👀

"Built-in ways to evaluate and optimise performance make it clear to human managers and AI coworkers what’s working and what isn’t, so good behaviours improve over time. Over time, AI coworkers learn what good looks like and get better at the work that matters most."
BREAKING 🚨: A BIG DROP IS EXPECTED FOR CODEX TODAY! CODEX GITHUB ALSO DOESN’T STATE “LATEST” NEXT TO GPT-5.2 ANYMORE.
❀74πŸ‘€3
BREAKING 🚨: PERPLEXITY IS PREPARING CLAUDE OPUS 4.6 FOR RELEASE ON THE WEB. A STRONG SIGNAL THAT IT WILL ARRIVE TODAY.

We are super close 👀
❀6πŸ‘1
BREAKING 🚨: CLAUDE OPUS 4.6 IS ROLLING OUT ON THE WEB, APPS AND DESKTOP!

TESTING TIME 🔥
❀10😭5πŸ‘1
TestingCatalog AI News πŸ—ž
BREAKING 🚨: CLAUDE OPUS 4.6 IS ROLLING OUT ON THE WEB, APPS AND DESKTOP! TESTING TIME πŸ”₯
This media is not supported in your browser
VIEW IN TELEGRAM
BREAKING 🚨: Claude Opus 4.6 has been officially announced. Opus 4.6 comes with improved performance across various agentic, research and coding tasks.

What would you test first? 👀
❀3πŸ‘1
TestingCatalog AI News πŸ—ž
BREAKING 🚨: Claude Opus 4.6 has been officially announced. Opus 4.6 comes with an improved performance across various agentic, reasearch and coding tasks. What would you test first? πŸ‘€
Opus 4.6 brings a big improvement in agentic search, agentic financial analysis and office tasks.

"Financial professionals use AI to research across multiple data sources, support financial analyses, and create deliverables that their teams and customers can act on."
❀2πŸ‘1
BREAKING 🚨: GPT-5.3-CODEX IS ROLLING OUT ON CODEX CLI AND DESKTOP APP!

COMPETITION AT SCALE 🔥
❀11πŸ‘1
TestingCatalog AI News πŸ—ž
BREAKING 🚨: GPT-5.3-CODEX IS ROLLING OUT ON CODEX CLI AND DESKTOP APP! COMPETITION AT SCALE πŸ”₯
BREAKING 🚨: GPT‑5.3‑CODEX WAS USED TO SUPPORT CREATING ITSELF, ACCORDING TO OPENAI'S BLOG!

It achieves a SOTA score of 57% on SWE Bench Pro and 76% on TerminalBench.

"With GPT‑5.3-Codex, Codex goes from an agent that can write and review code to an agent that can do nearly anything developers and professionals can do on a computer."
❀4πŸ‘1πŸ”₯1
TestingCatalog AI News πŸ—ž
BREAKING 🚨: GPT‑5.3‑CODEX WAS USED TO SUPPORT CREATING ITSELF, ACCORDING TO OPENAI'S BLOG! It achieves SOTA score of 57% at SWE Bench Pro and 76% on TerminalBench. "With GPT‑5.3-Codex, Codex goes from an agent that can write and review code to an agent…
OpenAI opens up Trusted Access framework to accelerate cyber defence.

GPT-5.3-Codex was the first model to hit a "High" on OpenAI's preparedness framework.

Shit is about to get real 👀
❀6😍6πŸ€”3
TestingCatalog AI News πŸ—ž
BREAKING 🚨: CLAUDE OPUS 4.6 IS ROLLING OUT ON THE WEB, APPS AND DESKTOP! TESTING TIME πŸ”₯
Claude subscribers can claim $50 worth of credits for TESTING Claude Opus 4.6!

Claim it 👀
❀8πŸ”₯5πŸ™3
Anthropic readies upgraded Claude voice mode for desktop

Anthropic is testing upgraded voice functionality for Claude across web and mobile, alongside a new knowledge base feature to organize and retain conversation context. A near-term launch is possible, potentially aligning with upcoming marketing efforts.

🗞 #claude
❀2πŸ‘2
TestingCatalog AI News 🗞
BREAKING 🚨: Anthropic prepares an upgraded voice mode for Claude desktop and mobile! Here is an early look at how it works 👀
Anthropic keeps working on Knowledge Bases, as a new "Save to knowledge base" button has been spotted in testing. Isn't this a continuous learning solution?

Save button triggers this prompt 👀
It turns out that Telegram boosts vaporise quite quickly. We are 5 short of level 3, which enables auto translations, and 5 more will be lost later in February.

Do you have these sparks? ⚡
https://t.iss.one/boost/testingcatalog
OpenAI debuts Frontier to deploy AI agents for enterprise users

OpenAI launched Frontier, an enterprise platform to build and manage AI agents as coworkers across business systems. It integrates existing tools and data sources and supports feedback-driven learning.

🗞 #chatgpt