TestingCatalog AI News 🗞 – Telegram

TestingCatalog AI News 🗞

@testingcatalog

4.74K subscribers

2.92K photos

378 videos

40 files

3.86K links

Reporting AI nonsense. A future news media, driven by virtual assistants 🤖

TestingCatalog AI News 🗞

Perplexity working on Model Council, combining 3 AI models

Perplexity is developing Model Council, a Max-tier feature enabling users to compare outputs from top AI models like GPT-5.2, Gemini 3 Pro, and Claude Opus 4.5. A separate mode, Gamma, hints at experimental high-tier capabilities.

🗞 #perplexity

Perplexity working on Model Council, combining 3 AI models

What do we know so far? Perplexity may soon introduce Model Council for Max users, allowing multi-model system advancements and hinting at a new ASI mode.

👍331

662 views, tc_zapier_bot, edited 22:31

TestingCatalog AI News 🗞

BREAKING 🚨: Perplexity is working on a new Model Council multi-model system, combining outputs from GPT-5.2, Opus 4.5 and Gemini 3 Pro into one response.

In addition, a new mode named Gemma is in the works, labelled as "ASI"

❤12👍3👎1

975 views, Alexey, edited 22:38

TestingCatalog AI News 🗞

Perplexity launches Advanced Deep Research for Max users

Perplexity launched the DRACO Benchmark to publicly assess AI research tools on real-world tasks across ten domains. It measures accuracy, depth, presentation, and sourcing, with initial results showing Perplexity leads in both precision and speed.

🗞 #perplexity

Perplexity launches Advanced Deep Research for Max users

What's new? Perplexity launches the DRACO benchmark for AI research in law, medicine, finance and academia; it is public and uses an LLM as judge.

👍3

695 views, tc_zapier_bot, 22:52

TestingCatalog AI News 🗞

GitHub Copilot Pro+ and Copilot Enterprise subscribers can now use Codex and Claude agents on GitHub.

GitHub Codex Pilot or GitHub Claude Pilot?

❤5👍2

746 views, Alexey, edited 23:26

TestingCatalog AI News 🗞

According to The Information, Meta's upcoming Avocado model is described internally as its “most capable” to date.

Soon? 👀

👍11😁2🤮1

776 views, Alexey, 23:32

TestingCatalog AI News 🗞

BREAKING 🚨: A new Gemini checkpoint has been spotted in A/B testing.

Will we see this live? 👀

h/t x@marmaduke091

🔥14👍33

774 views, Alexey, 08:01

TestingCatalog AI News 🗞

Who will crash benchmarks this week?

Anonymous Poll

43

269 voters, 714 views, Alexey, 10:40

TestingCatalog AI News 🗞

BREAKING 🚨: CLAUDE OPUS 4.6 HAS BEEN SPOTTED IN PERPLEXITY APIs!

* keep in mind that this doesn’t imply an imminent release.

h/t x@synthwavedd

👍85

701 views, Alexey, 13:25

TestingCatalog AI News 🗞

BREAKING 🚨: OPENAI ANNOUNCED OPENAI FRONTIER, A NEW ENTERPRISE PLATFORM TO CREATE AND MANAGE AI COWORKERS.

"Frontier gives agents the same skills people need to succeed at work: Understand how work gets done, Use a computer and tools, Improve quality over time, Stay governed & observable"

The biggest part 👀

"Built-in ways to evaluate and optimise performance make it clear to human managers and AI coworkers what’s working and what isn’t, so good behaviours improve over time. Over time, AI coworkers learn what good looks like and get better at the work that matters most."

🔥5👏41

684 views, Alexey, edited 15:16

TestingCatalog AI News 🗞

BREAKING 🚨: A BIG DROP IS EXPECTED FOR CODEX TODAY! CODEX GITHUB ALSO DOESN’T STATE “LATEST” NEXT TO GPT-5.2 ANYMORE.

❤74👀3

628 views, Alexey, 16:09

TestingCatalog AI News 🗞

BREAKING 🚨: PERPLEXITY IS PREPARING CLAUDE OPUS 4.6 FOR RELEASE ON THE WEB. A STRONG SIGNAL THAT IT WILL ARRIVE TODAY.

We are super close 👀

❤6👍1

668 views, Alexey, 16:18

TestingCatalog AI News 🗞

BREAKING 🚨: PERPLEXITY LAUNCHES MODEL COUNCIL, A NEW MODE WHERE GEMINI 3 PRO, OPUS 4.5 AND GPT 5.2 WILL WORK AS A SWARM OF ASYNC AGENTS ON A GIVEN TASK.

Perplexity MAX 👀

❤5👍1

876 views, Alexey, edited 16:24

TestingCatalog AI News 🗞

BREAKING 🚨: CLAUDE OPUS 4.6 IS ROLLING OUT ON THE WEB, APPS AND DESKTOP!

TESTING TIME 🔥

❤10😭5👍1

893 views, Alexey, 17:33

TestingCatalog AI News 🗞

BREAKING 🚨: Claude Opus 4.6 has been officially announced. Opus 4.6 comes with improved performance across various agentic, research and coding tasks.

What would you test first? 👀

❤3👍1

578 views, Alexey, 17:53

TestingCatalog AI News 🗞

Opus 4.6 comes with big improvements in agentic search, agentic financial analysis and office tasks.

"Financial professionals use AI to research across multiple data sources, support financial analyses, and create deliverables that their teams and customers can act on."

❤2👍1

562 views, Alexey, 18:00

TestingCatalog AI News 🗞

BREAKING 🚨: GPT-5.3-CODEX IS ROLLING OUT ON CODEX CLI AND DESKTOP APP!

COMPETITION AT SCALE 🔥

❤11👍1

607 views, Alexey, 18:10

TestingCatalog AI News 🗞

BREAKING 🚨: GPT‑5.3‑CODEX WAS USED TO SUPPORT CREATING ITSELF, ACCORDING TO OPENAI'S BLOG!

It achieves SOTA scores of 57% on SWE-Bench Pro and 76% on Terminal-Bench.

"With GPT‑5.3-Codex, Codex goes from an agent that can write and review code to an agent that can do nearly anything developers and professionals can do on a computer."

❤4👍1🔥1

583 views, Alexey, 18:24

TestingCatalog AI News 🗞

OpenAI opens up Trusted Access framework to accelerate cyber defence.

GPT-5.3-Codex is the first model to be rated "High" for cybersecurity under OpenAI's Preparedness Framework.

Shit is about to get real 👀

❤6😍6🤔3

604 views, Alexey, 18:30