TestingCatalog AI News π
Early look: Meta silently launches Vibes AI editor to challenge rivals Metaβs Vibes is evolving from an AI video feed into a standalone creation studio with project workflows, timeline editing, and asset libraries. While output quality still lags, the toolingβ¦
Media is too big
VIEW IN TELEGRAM
BREAKING π¨: Meta silently launched its new standalone Vibes AI video editor!
Image and video generation, ingredients, characters, lip sync, start & end frames, music, and timeline editor are supported.
Some extra features in development include moodboards, shared styles and character libraries, and text overlays for the timeline.
Models are still the same for now π
Image and video generation, ingredients, characters, lip sync, start & end frames, music, and timeline editor are supported.
Some extra features in development include moodboards, shared styles and character libraries, and text overlays for the timeline.
Models are still the same for now π
π₯4 3π1 1
ICYMI: OpenClaw got upgraded to support newly released GPT-5.4 and Gemini 3.1 Flash Lite models.
π₯11π4π΄4β€3
This media is not supported in your browser
VIEW IN TELEGRAM
BREAKING π¨: Microsoft announced Copilot Cowork for M365 users.
"When you hand off a task to Cowork, it turns your request into a plan and executes it across your apps and files, grounded in your work data and operating within M365βs security and governance boundaries."
"When you hand off a task to Cowork, it turns your request into a plan and executes it across your apps and files, grounded in your work data and operating within M365βs security and governance boundaries."
β€4π2π΄2
Microsoft reveals Copilot Cowork for M365 users to rival Anthropic
Microsoft has introduced Copilot Cowork for Microsoft 365, enabling automated task execution across apps like Outlook, Teams, and Excel. Powered by Work IQ and integrated with Claude, it supports secure, compliant workflow delegation, with broader rollout planned for March 2026.
π #microsoftcopilot
Microsoft has introduced Copilot Cowork for Microsoft 365, enabling automated task execution across apps like Outlook, Teams, and Excel. Powered by Work IQ and integrated with Claude, it supports secure, compliant workflow delegation, with broader rollout planned for March 2026.
π #microsoftcopilot
TestingCatalog
Microsoft reveals Copilot Cowork for M365 users to rival Anthropic
What's new? Copilot Cowork executes tasks across Microsoft 365 apps using Work IQ with emails, meetings and docs; it integrates Anthropic's Claude Cowork for meeting management and research;
π3π΄1
This media is not supported in your browser
VIEW IN TELEGRAM
Perplexity Computer can now operate Claude Code as a subagent and GitHub CLI to open PRs for you.
Have you tried it yet? π
Have you tried it yet? π
β€2π2
OpenAI acquired Promptfoo, an AI security platform for enterprises.
βPromptfoo brings deep engineering expertise in evaluating, securing, and testing AI systems at enterprise scaleβ
βPromptfoo brings deep engineering expertise in evaluating, securing, and testing AI systems at enterprise scaleβ
π9
This media is not supported in your browser
VIEW IN TELEGRAM
Figure published a new demonstration of Helix 02, where it can clean up your living room fully autonomously.
In several years, we will be testing completely different things.
In several years, we will be testing completely different things.
πΎ4π2π₯2
This media is not supported in your browser
VIEW IN TELEGRAM
Anthropic released Claude Review for Claude Code, a new code-review solution that uses parallel agents to hunt for bugs and issues.
"Agents search for bugs in parallel, verify each bug to reduce false positives, and rank bugs by severity. You get one high-signal summary comment plus inline flags."
In general, I expect that in 2026 we will see a rise of "parallelization", which will also significantly bump average token consumption. Models will get cheaper, but we will start consuming them more and more.
"Agents search for bugs in parallel, verify each bug to reduce false positives, and rank bugs by severity. You get one high-signal summary comment plus inline flags."
In general, I expect that in 2026 we will see a rise of "parallelization", which will also significantly bump average token consumption. Models will get cheaper, but we will start consuming them more and more.
β€6π€3π1 1
BREAKING π¨: Advanced Machine Intelligence (AMI), founded by Yann LeCun raised $1.03B in a seed round.
βAMI is building a new breed of AI systems that understand the world, have persistent memory, can reason and plan, and are controllable and safe.β
βAMI is building a new breed of AI systems that understand the world, have persistent memory, can reason and plan, and are controllable and safe.β
π€5π2
BREAKING π¨: According to Axios, Meta has aquired @moltbook, a social network for AI agents which became popular along with a rise of OpenClaw agents.
Looks like all AI agents will get their Facebook page at some point.
Looks like all AI agents will get their Facebook page at some point.
π6π4πΏ2
This media is not supported in your browser
VIEW IN TELEGRAM
Google is rolling out a new Gemini experience in Docs, Sheets, and Slides, allowing users to offload more tasks to AI.
Gemini will be able to pull context from relevant sources and generate or modify the document's content.
I have big hopes on this feature π
Gemini will be able to pull context from relevant sources and generate or modify the document's content.
I have big hopes on this feature π
β€7π1
This media is not supported in your browser
VIEW IN TELEGRAM
Google released a new embedding multimodal model, Gemini Embedding 2, with SOTA performance!
Unimodal and multimodal π
Unimodal and multimodal π
π₯6π4π€1π1
Google tests Multi-agent planning mode on Gemini Business
Google is testing a Gemini Enterprise feature in Workspace to identify task specialists and create delegation plans.
π #gemini
Google is testing a Gemini Enterprise feature in Workspace to identify task specialists and create delegation plans.
π #gemini
TestingCatalog
Google tests Multi-agent planning mode on Gemini Business
Google is testing a Gemini Enterprise feature in Workspace to identify task specialists and create delegation plans.
π3
Anthropic launches AI code review tool for Claude Teams & Enterprise
Anthropic has introduced Code Review, an automated PR evaluation tool in research preview for Team and Enterprise users. It deploys multiple AI agents to detect, verify, and prioritize bugs, delivering detailed feedback. Pricing ranges from $15β25 per review.
π #claude
Anthropic has introduced Code Review, an automated PR evaluation tool in research preview for Team and Enterprise users. It deploys multiple AI agents to detect, verify, and prioritize bugs, delivering detailed feedback. Pricing ranges from $15β25 per review.
π #claude
TestingCatalog
Anthropic launches AI code review tool for Claude Teams & Enterprise
What's new? Anthropic launched Code Review PR evaluation tool using AI agents to inspect errors and rank severity for team and enterprise beta;
β€2π1
Google launches new multimodal Gemini Embedding 2 model
Google has launched Gemini Embedding 2 in Public Preview via the Gemini API and Vertex AI, delivering a unified multimodal embedding model for text, images, video, audio, and PDFs. It supports 100+ languages and flexible dimensions for search, RAG, and clustering use cases.
π #gemini
Google has launched Gemini Embedding 2 in Public Preview via the Gemini API and Vertex AI, delivering a unified multimodal embedding model for text, images, video, audio, and PDFs. It supports 100+ languages and flexible dimensions for search, RAG, and clustering use cases.
π #gemini
TestingCatalog
Google launches new multimodal Gemini Embedding 2 model
What's new? Gemini embedding 2 supports text, image, video, audio and document embeddings in a unified space; available via Gemini API and Vertex AI with adjustable output dimensions;
π2
Claude for mobile got updated π
βImprovements to voice mode, transcription, LaTeX rendering, artifact display, large prompts perf, MCP connections, attachment uploads, and more.β
βImprovements to voice mode, transcription, LaTeX rendering, artifact display, large prompts perf, MCP connections, attachment uploads, and more.β
π3π₯3