ACM

Uncategorized

Anthropic ships major Claude Design overhaul with design system imports, code round-trips, and a fix for its token-burning problem

When Anthropic quietly released Claude Design in April as a “research preview,” it generated the kind of instant traction most product teams dream about: more than one million users in its first week. It also generated a problem. The tool consumed tokens so voraciously that a PCWorld reviewer burned through 80 percent of his weekly …

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

On Sunday, a team of nine researchers at Sina Weibo — the Chinese social media giant better known for its microblogging platform than for cutting-edge artificial intelligence — quietly posted a 14-page technical report to arXiv that sent shockwaves through the AI research community. Their claim: a language model with just 3 billion parameters can …

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost

Today, Chinese AI startup Z.ai (formerly Zhipu AI) announced the immediate release of GLM-5.2, a 753-billion parameter open-weights large language model (LLM) engineered specifically to dominate “long-horizon” autonomous coding and engineering tasks. Available immediately on Hugging Face, the Z.ai API, and more than 20 third-party coding environments, the model boasts a highly stable 1-million-token context …

Databricks says it solved the decades-old data pipeline problem that’s been slowing AI agents

For decades, data professionals have struggled with the challenge of managing both operational and analytical databases in a unified approach that doesn’t introduce latency and performance degradation. Agents made the problem structural. A system that reasons continuously and acts on live data cannot tolerate a pipeline between itself and the information it needs to act …

Stanford’s DeLM cuts multi-agent task costs 50% — without a central orchestrator

One of the assumptions behind today’s AI frameworks is that agents require a “boss” at the center; this orchestrator runs the show, routes requests, and makes sure the whole system doesn’t descend into chaos. That assumption may be wrong, and the cost of carrying it could be measured in inference dollars and coordination latency. A …

When deep research isn’t enough for your business: Sakana AI launches ‘ultra deep research’ agent for 100+ page reports in 8 hours

Tokyo-based AI startup Sakana AI has officially launched its first commercial product, Sakana Marlin. Billed as a “Virtual CSO” (Chief Strategy Officer), Marlin is an autonomous, B2B research agent that deliberately abandons the instantaneous text generation of modern chatbots in favor of deep, long-horizon reasoning. What sets Marlin apart from the current ecosystem of AI …

Satya Nadella warns that AI could hollow out entire industries, echoing the damage done by globalization

Microsoft CEO Satya Nadella published a sweeping essay on Sunday laying out what he describes as the defining economic challenge of the AI era: the risk that a handful of frontier models will absorb the expertise of entire industries and commoditize it, leaving businesses stripped of their competitive moats. “The last thing any of us …

85% of IT teams claim every AI agent is under control. Only 42% actually know who owns them.

Organizational leaders are nearly twice as likely to hide their AI use compared to all other employees, at 42% versus 23%, according to new Ivanti research surveying 3,900 employees across six countries. Among leaders who conceal that usage, 52% say they do it for a “secret advantage.” The same research found 85% of IT professionals …

Vibe coding can build your pipeline. It can’t explain it six months later

AI coding agents are rapidly accelerating data engineering by generating transformations, pipelines, orchestration workflows, validation tests, and infrastructure configurations from prompts. However, enterprise data platforms have long operated across fragmented systems owned by different teams and built on different technologies. As these systems evolve independently, organizations increasingly struggle with inconsistent business logic, duplicated implementations, difficult …

Attackers scale deception with AI. Defenders need truth at machine speed.

Presented by Splunk

AI has changed the economics of cyber deception. An attacker can now generate thousands of convincing phishing lures, fake identities, and tailored pretexts before a defender finishes a single change-control cycle. That is the new security challenge: deception got faster and cheaper, while verification did not. Much of the discussion around AI …
