TestingCatalog AI News 🗞

Microsoft announced that new Maia 200 AI accelerators are becoming available on Azure for advanced AI workloads. "10+ PFLOPS FP4 throughput, ~5 PFLOPS FP8, and 216GB HBM3e with 7TB/s of memory bandwidth"

Azure Maia 200 vs AWS Titanium3 and Google TPUv7

👍2

658 viewsAlexey, 16:55

TestingCatalog AI News 🗞

Dario Amodei explicitly argues there is “a strong chance” that “powerful AI”, smarter than Nobel laureates across domains and able to operate as a “country of geniuses in a datacenter” with millions of instances, arrives within the next few years, possibly as soon as 2027.

He notes that Claude Sonnet 4.5 could recognise when it was being evaluated and adjust behaviour, and that when researchers altered a model’s “beliefs” to make it think it was not being evaluated.

🔥5👍2🤡1

658 viewsAlexey, 17:16

TestingCatalog AI News 🗞

0:53

This media is not supported in your browser

VIEW IN TELEGRAM

Anthropic released interactive apps for Claude. Interactive apps can respond with interactive UI Widgets to enable additional usecases.

"MCP Apps is a new extension to MCP that lets any MCP server deliver an interactive interface within any supporting AI product"

🔥7❤6👍4

623 viewsAlexey, 18:40

TestingCatalog AI News 🗞

Qwen3-Max-Thinking debuts with focus on hard math, code

Qwen3-Max-Thinking is Alibaba Cloud’s advanced reasoning model for math, coding, and agent workflows, now accessible via Qwen Chat and Model Studio. It supports long-context tasks, tool use, and selective compute for accuracy-critical prompts.

🗞 #ai

TestingCatalog

Qwen3-Max-Thinking debuts with focus on hard math, code

Qwen3-Max-Thinking, Alibaba Cloud's new flagship reasoning model, is now in Qwen Chat and Model Studio, targeting tough math, code, and agent workflows.

👍3❤1

566 viewstc_zapier_bot, 20:01

TestingCatalog AI News 🗞

Anthropic integrates interactive MCP apps into Claude

Anthropic updates Claude to support direct use of tools like Asana, Slack, Figma, and Box within its interface. Users on paid plans can now collaborate on projects without switching apps, using real-time UIs powered by the open-source MCP protocol.

🗞 #claude

TestingCatalog

Anthropic integrates interactive MCP apps into Claude

What's new? Anthropics updated Claude to let users access Asana, Slack, Figma and Box tools in chat; developers can build apps with MCP and view live tool content mid-chat;

👍3

652 viewstc_zapier_bot, 20:01

TestingCatalog AI News 🗞

An early Grok 4.20 checkpoint has been spotted on Prediction Arena, achieving +10% gain after a 2 weeks long round.

Soon? 👀

👍7

628 viewsAlexey, 21:04

TestingCatalog AI News 🗞

Anthropic is working on a new inline voice mode UI for its mobile apps. Users will be able to seamlessly switch between text and voice conversations.

👍5

635 viewsAlexey, 22:40

TestingCatalog AI News 🗞

Anthropic is working on a new inline voice mode UI for its mobile apps. Users will be able to seamlessly switch between text and voice conversations.

Besides this 👀

- Claude Code is about to get prompt suggestions.
- A new Thinking effort selector will be added to the model selector.

👍5

684 viewsAlexey, 22:41

TestingCatalog AI News 🗞

OpenAI Town Hall starts soon 👀

"Sam Altman sits down with builders from across the AI ecosystem to answer questions and talk about the future of building with AI."

👍4

701 viewsAlexey, edited 23:42

TestingCatalog AI News 🗞

0:59

This media is not supported in your browser

VIEW IN TELEGRAM

BREAKING 🚨: Kimi K2.5 open-source model is now live on Kimi Chat and APIs with a leading 50% score on HLE benchmark!

It comes along with an Agentic Swarm feature, where up to 100 sub-agents would be working on a problem in parallel (Available in beta for some customers)

🔥5👍2

747 viewsAlexey, 07:53

TestingCatalog AI News 🗞

BREAKING 🚨: Kimi K2.5 open-source model is now live on Kimi Chat and APIs with a leading 50% score on HLE benchmark! It comes along with an Agentic Swarm feature, where up to 100 sub-agents would be working on a problem in parallel (Available in beta for…

Benchmarks 👀

❤‍🔥4❤3

739 viewsAlexey, 07:53

TestingCatalog AI News 🗞

It turned out, in fact, that Clawdbot is all you need. This is the best thing you can test at this moment. Have you tried it yet? 🦀

ICYMI: A popular GitHub project Clawdbot is now a Moltbot as Anthropic pushed the project to get a new name.

Molty 🦞

😭6👍1

635 viewsAlexey, 16:00

TestingCatalog AI News 🗞

1:18

This media is not supported in your browser

VIEW IN TELEGRAM

Mistral AI released Mistral Vibe 2.0, an upgraded SWE agent CLI with subagents, skills support and new unified agent modes. Now available on Team and Pro plans.

Terminal Testing Time 👀

🔥4👍3

661 viewsAlexey, 16:41

TestingCatalog AI News 🗞

0:32

This media is not supported in your browser

VIEW IN TELEGRAM

Manus AI now supports Skills, a common standard introduced by Anthropic, so now you can reuse them in Manus as well.

Skill is not an issue anymore 👀

🔥3👍2

555 viewsAlexey, 16:51

TestingCatalog AI News 🗞

Anthropic works on customizable Commands for Claude Code Anthropic is developing a Customize section for Claude, consolidating tools like Skills, Connectors, and a new Commands feature to support tailored workflows and modular use, aiming to serve professional…

Seems like Anthropic is preparing to release a dedicated Plugins section for Connectors and Skills, as it now appears on Claude Desktop. However, it is not clickable yet.

This feature was named "Customize" earlier, during the development phase.

The customizable commands option was removed after this publication, and yet unclear if those will be discontinued (very likely).

🔥5👍1

592 viewsAlexey, 16:59

TestingCatalog AI News 🗞

Moonshot AI launches Kimi K2.5 with swarm of 100 parallel agents

Moonshot AI's Kimi K2.5 introduces a multimodal open-source model with parallel-agent capabilities, achieving faster task execution, advanced code generation, and visual debugging for developers, enterprises, and researchers via API and apps.

🗞 #kimi

TestingCatalog

Moonshot AI launches Kimi K2.5 with swarm of 100 parallel agents

What's new? Kimi K2.5 is an open-source multimodal model on Kimi.com, Kimi App, API and Kimi Code; its agent swarm with 100 subagents executes 1,500 tool calls in beta.

🔥3👍1

563 viewstc_zapier_bot, 17:12

TestingCatalog AI News 🗞

Manus AI launches Agent Skills open standard for AI workflows

Manus AI launches full support for Agent Skills, an open standard enabling modular workflows and reusable expertise across platforms. The update supports Python and Bash, reduces context cost, and promotes AI system interoperability.

🗞 #manus

TestingCatalog

Manus AI launches Agent Skills open standard for AI workflows

What's new? Manus AI integrates Agent Skills on all platforms with team plan early access; Agent Skills offers modular scripts for domain expertise and lower memory use;

👍2

569 viewstc_zapier_bot, 17:15

About

Blog

Apps

Platform