ACM

Non classé

Anthropic launches Claude Sonnet 5 at a steep discount to its top model as the company races toward a blockbuster IPO

Anthropic today released Claude Sonnet 5, a new AI model that the company says delivers near-flagship performance at mid-tier prices — a move designed to give cost-conscious enterprise developers access to powerful agentic capabilities just as the San Francisco-based AI lab barrels toward an initial public offering that will test whether the private market’s staggering …

Anthropic launches Claude Sonnet 5 at a steep discount to its top model as the company races toward a blockbuster IPO Read More »

Google unveils Nano Banana 2 Lite aka Gemini 3.1 Flash-Lite for low cost, 4-second fast enterprise image generations

Google is upgrading its AI image generation capabilities today with the debut of Nano Banana 2 (NB2) Lite, an optimized model built for rapid execution and tight infrastructure budgets. Technically designated as Gemini 3.1 Flash-Lite Image on Google’s application programming interface (API), NB2 Lite is positioned as the fastest and most cost-effective option within Google’s …

Google unveils Nano Banana 2 Lite aka Gemini 3.1 Flash-Lite for low cost, 4-second fast enterprise image generations Read More »

Google’s Gemini Omni Flash hits the API, turning enterprise video production into a conversation

For most enterprises, a 90-second training video or a product explainer has never been an easy ask. It means a well planned brief, an internal film crew or an outside vendor, a shoot, an edit, and a round of revisions. Change one line of on-screen text due to a legal review and the whole chain …

Google’s Gemini Omni Flash hits the API, turning enterprise video production into a conversation Read More »

AI agents need context everywhere they run, even where the cloud can’t follow

The competitive edge in enterprise AI is shifting to context: which platform can give an agent the right memory, the right retrieval and the right data at the moment of decision. Couchbase on Tuesday announced its AI Data Plane, combining persistent agent memory, real-time context retrieval and an enterprise-managed MCP server in a single operational …

AI agents need context everywhere they run, even where the cloud can’t follow Read More »

Meituan open sources LongCat-2.0, the 1.6T, near-frontier agentic coding model that’s been leading OpenRouter — trained entirely on Chinese chips

A few hours ago, Chinese delivery app company Meituan officially unveiled LongCat-2.0 on GitHub, Hugging Face, and its native platform, unmasking the model as the computational engine behind “Owl Alpha,” the anonymous stealth model that has spent the last two months commanding global developer charts on OpenRouter. Developed to fundamentally disrupt closed-source enterprise dominance in …

Meituan open sources LongCat-2.0, the 1.6T, near-frontier agentic coding model that’s been leading OpenRouter — trained entirely on Chinese chips Read More »

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

Even as the geopolitical conversation around AI continues to grow more fraught following the U.S. government’s actions to limit the new models from Anthropic and OpenAI, Chinese open source darling DeepSeek is back with yet another open release that could once again change AI development around the globe. Over the weekend, the firm released DSpark, …

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85% Read More »

The attack that hijacked Claude Code came through Sentry. Datadog, PagerDuty, and Jira have the same exposure.

A single fake error report hijacked Claude Code in controlled testing — the agent ran the attacker’s code with the developer’s full privileges, and not one alert fired. EDR, WAF, IAM, and the firewall all missed it completely. Tenet Security’s June agentjacking disclosure describes a single crafted Sentry error event — sent through a public …

The attack that hijacked Claude Code came through Sentry. Datadog, PagerDuty, and Jira have the same exposure. Read More »

Prompt injection is exploiting enterprise AI’s biggest design flaws by targeting agents, RAG pipelines and model routers

In the past two years, businesses have been trying to fit large language models (LLMs) into support, analytics, development, and internal automation like never before. Along with the increasing adoption of AI technology, another trend is gaining momentum — cybercriminals are taking advantage of the disconnect between assumptions about LLMs and their actual characteristics. In …

Prompt injection is exploiting enterprise AI’s biggest design flaws by targeting agents, RAG pipelines and model routers Read More »

Claude Code turned every engineer into three. Now companies need more product thinkers

Anthropic recently told its growth team to hire more product managers, not fewer. The reason, as reported in industry coverage, was that Claude Code had quietly turned its engineering org into a team that ships at roughly three times its actual headcount, and the bottleneck moved from the integrated development environment (IDE) to the people …

Claude Code turned every engineer into three. Now companies need more product thinkers Read More »

New agentic memory framework uses 118K tokens per query. LangMem burns through 3.26M.

Long-horizon reasoning exposes a core weakness in AI agents: context windows fill up fast, and retrieval pipelines return noise instead of signal. To solve this, researchers at the National University of Singapore developed MRAgent, a framework that abandons the static “retrieve-then-reason” approach. Instead, it uses a mechanism that allows an agent to dynamically develop its …

New agentic memory framework uses 118K tokens per query. LangMem burns through 3.26M. Read More »