TestingCatalog AI News ๐Ÿ—ž
4.74K subscribers
2.92K photos
378 videos
40 files
3.86K links
Reporting AI nonsense. A future news media, driven by virtual assistants ๐Ÿค–
Download Telegram
TestingCatalog AI News ๐Ÿ—ž
BREAKING๐Ÿšจ: Claude Sonnet 5 has been spotted in the configs! UPD: And removed ๐Ÿ˜‹
UPD: No Sonnet 5 today ๐Ÿค–

1. v0 is not reliable
2. Config changes do not gaurantee same day release
๐Ÿ˜ข17๐Ÿ˜8โค1
Kimi AI adds website generation from images, new annotation tools

Kimi AI expands its toolset with a feature that converts images or videos into websites using its K2.5 model, which leads open source benchmarks. A new edit mode also allows visual annotations and review within the chat interface.

๐Ÿ—ž #kimi
๐Ÿ‘6
Anthropic published several 30s Super Bowl ads pointing at OpenAI for introducing ads in ChatGPT.

According to WSJ, Anthropic will also reveal another 60s ad, that targets consumer audience.

Stars keep aligning ๐Ÿ‘€
๐Ÿ˜6๐Ÿค32
BREAKING ๐Ÿšจ: ElevenLabs raised $500M in a Series D round at an $11B valuation.

ElevenLabs market valuation now matches its company name.

Perfect match ๐Ÿ‘€
๐Ÿ‘6
Mistral AI released Voxtral Transcribe 2 speech-to-text models and a new Audio Playground on La Platforme.

Testing time! ๐Ÿ‘€
๐Ÿ‘8
Claude Cowork now supports G Suite connectors - Google Drive, Google Calendar and Gmail.
๐Ÿ”ฅ6๐Ÿ‘3๐Ÿ˜ด1
Apple adds Claude Agent SDK and Codex into Xcode 26.3

Appleโ€™s Xcode 26.3 introduces built-in Claude Agent SDK support, enabling subagents, background tasks, plugins, and full project reasoning within the IDE. This update targets productivity for solo developers and small teams.

๐Ÿ—ž #ai
โค5๐Ÿ‘2๐Ÿ”ฅ2
v0 live announcement ๐Ÿ‘€

"Important announcement from the v0 team", the next evolution of the v0 platform for vibe coding.

* v0 has been teasing Sonnet 5 earlier today.
โค8๐Ÿ‘31
Perplexity released a new Advanced Deep Research upgrade with SOTA performance.

It scores 79% at Google DeepMind Deep Research QA and tops the newly introduced DRACO Benchmark. Currently available to Max users and expected to arrive for Pro users soon.
โค11๐Ÿ”ฅ3๐Ÿ‘1
Perplexity working on Model Concil, combining 3 AI models

Perplexity is developing Model Council, a Max-tier feature enabling users to compare outputs from top AI models like GPT-5.2, Gemini 3 Pro, and Claude Opus 4.5. A separate mode, Gamma, hints at experimental high-tier capabilities.

๐Ÿ—ž #perplexity
๐Ÿ‘331
TestingCatalog AI News ๐Ÿ—ž
Perplexity working on Model Concil, combining 3 AI models Perplexity is developing Model Council, a Max-tier feature enabling users to compare outputs from top AI models like GPT-5.2, Gemini 3 Pro, and Claude Opus 4.5. A separate mode, Gamma, hints at experimentalโ€ฆ
BREAKING ๐Ÿšจ: Perplexity is working on a new Model Council multi-model system, combining outputs from GPT-5.2, Opus 4.5 and Gemini 3 Pro into one response.

In addition, a new mode named Gemma is in the works, labelled as "ASI"
โค12๐Ÿ‘3๐Ÿ‘Ž1
Perplexity launches Advanced Deep Research for Max users

Perplexity launched the DRACO Benchmark to publicly assess AI research tools on real-world tasks across ten domains. It measures accuracy, depth, presentation, and sourcing, with initial results showing Perplexity leads in both precision and speed.

๐Ÿ—ž #perplexity
๐Ÿ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
GitHub Copilot Pro+ and Copilot Enterprise subscribers can now use Codex and Claude agents on GitHub.

Github Codex Pilot or Github Claude Pilot?
โค5๐Ÿ‘2
According to The Information, upcoming Avocado model from Meta is referenced as the โ€œmost capable โ€œ to date internally.

Soon? ๐Ÿ‘€
๐Ÿ‘11๐Ÿ˜2๐Ÿคฎ1
BREAKING ๐Ÿšจ: A new Gemini checkpoint has been spotted in A/B testing.

Will we see this live? ๐Ÿ‘€

h/t x@marmaduke091
๐Ÿ”ฅ14๐Ÿ‘33