TestingCatalog AI News πŸ—ž
4.63K subscribers
2.87K photos
370 videos
40 files
3.85K links
Reporting AI nonsense. A future news media, driven by virtual assistants πŸ€–
Download Telegram
Dario Amodei explicitly argues there is β€œa strong chance” that β€œpowerful AI”, smarter than Nobel laureates across domains and able to operate as a β€œcountry of geniuses in a datacenter” with millions of instances, arrives within the next few years, possibly as soon as 2027.

He notes that Claude Sonnet 4.5 could recognise when it was being evaluated and adjust behaviour, and that when researchers altered a model’s β€œbeliefs” to make it think it was not being evaluated.
πŸ”₯5πŸ‘2🀑1
This media is not supported in your browser
VIEW IN TELEGRAM
Anthropic released interactive apps for Claude. Interactive apps can respond with interactive UI Widgets to enable additional usecases.

"MCP Apps is a new extension to MCP that lets any MCP server deliver an interactive interface within any supporting AI product"
πŸ”₯7❀6πŸ‘4
Qwen3-Max-Thinking debuts with focus on hard math, code

Qwen3-Max-Thinking is Alibaba Cloud’s advanced reasoning model for math, coding, and agent workflows, now accessible via Qwen Chat and Model Studio. It supports long-context tasks, tool use, and selective compute for accuracy-critical prompts.

πŸ—ž #ai
πŸ‘3❀1
Anthropic integrates interactive MCP apps into Claude

Anthropic updates Claude to support direct use of tools like Asana, Slack, Figma, and Box within its interface. Users on paid plans can now collaborate on projects without switching apps, using real-time UIs powered by the open-source MCP protocol.

πŸ—ž #claude
πŸ‘3
An early Grok 4.20 checkpoint has been spotted on Prediction Arena, achieving +10% gain after a 2 weeks long round.

Soon? πŸ‘€
πŸ‘7
Anthropic is working on a new inline voice mode UI for its mobile apps. Users will be able to seamlessly switch between text and voice conversations.
πŸ‘5
TestingCatalog AI News πŸ—ž
Anthropic is working on a new inline voice mode UI for its mobile apps. Users will be able to seamlessly switch between text and voice conversations.
Besides this πŸ‘€

- Claude Code is about to get prompt suggestions.
- A new Thinking effort selector will be added to the model selector.
πŸ‘5
OpenAI Town Hall starts soon πŸ‘€

"Sam Altman sits down with builders from across the AI ecosystem to answer questions and talk about the future of building with AI."
πŸ‘4
This media is not supported in your browser
VIEW IN TELEGRAM
BREAKING 🚨: Kimi K2.5 open-source model is now live on Kimi Chat and APIs with a leading 50% score on HLE benchmark!

It comes along with an Agentic Swarm feature, where up to 100 sub-agents would be working on a problem in parallel (Available in beta for some customers)
πŸ”₯5πŸ‘2
TestingCatalog AI News πŸ—ž
It turned out, in fact, that Clawdbot is all you need. This is the best thing you can test at this moment. Have you tried it yet? πŸ¦€
ICYMI: A popular GitHub project Clawdbot is now a Moltbot as Anthropic pushed the project to get a new name.

Molty 🦞
😭6πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
Mistral AI released Mistral Vibe 2.0, an upgraded SWE agent CLI with subagents, skills support and new unified agent modes. Now available on Team and Pro plans.

Terminal Testing Time πŸ‘€
πŸ”₯4πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
Manus AI now supports Skills, a common standard introduced by Anthropic, so now you can reuse them in Manus as well.

Skill is not an issue anymore πŸ‘€
πŸ”₯3πŸ‘2
TestingCatalog AI News πŸ—ž
Anthropic works on customizable Commands for Claude Code Anthropic is developing a Customize section for Claude, consolidating tools like Skills, Connectors, and a new Commands feature to support tailored workflows and modular use, aiming to serve professional…
Seems like Anthropic is preparing to release a dedicated Plugins section for Connectors and Skills, as it now appears on Claude Desktop. However, it is not clickable yet.

This feature was named "Customize" earlier, during the development phase.

The customizable commands option was removed after this publication, and yet unclear if those will be discontinued (very likely).
πŸ”₯5πŸ‘1
Moonshot AI launches Kimi K2.5 with swarm of 100 parallel agents

Moonshot AI's Kimi K2.5 introduces a multimodal open-source model with parallel-agent capabilities, achieving faster task execution, advanced code generation, and visual debugging for developers, enterprises, and researchers via API and apps.

πŸ—ž #kimi
πŸ”₯3πŸ‘1
Manus AI launches Agent Skills open standard for AI workflows

Manus AI launches full support for Agent Skills, an open standard enabling modular workflows and reusable expertise across platforms. The update supports Python and Bash, reduces context cost, and promotes AI system interoperability.

πŸ—ž #manus
πŸ‘2