ACM

Non classé

Ontology is the real guardrail: How to stop AI agents from misunderstanding your business

Enterprises are investing billions of dollars in AI agents and infrastructure to transform business processes. However, we are seeing limited success in real-world applications, often due to the inability of agents to truly understand business data, policies and processes. While we manage the integrations well with technologies like API management, model context protocol (MCP) and …

Ontology is the real guardrail: How to stop AI agents from misunderstanding your business Read More »

Why observable AI is the missing SRE layer enterprises need for reliable LLMs

As AI systems enter production, reliability and governance can’t depend on wishful thinking. Here’s how observability turns large language models (LLMs) into auditable, trustworthy enterprise systems. Why observability secures the future of enterprise AI The enterprise race to deploy LLM systems mirrors the early days of cloud adoption. Executives love the promise; compliance demands accountability; …

Why observable AI is the missing SRE layer enterprises need for reliable LLMs Read More »

Anthropic says it solved the long-running AI agent problem with a new multi-session Claude SDK

Agent memory remains a problem that enterprises want to fix, as agents forget some instructions or conversations the longer they run.  Anthropic believes it has solved this issue for its Claude Agent SDK, developing a two-fold solution that allows an agent to work across different context windows. “The core challenge of long-running agents is that …

Anthropic says it solved the long-running AI agent problem with a new multi-session Claude SDK Read More »

Beyond math and coding: New RL framework helps train LLM agents for complex, real-world tasks

Researchers at the University of Science and Technology of China have developed a new reinforcement learning (RL) framework that helps train large language models (LLMs) for complex agentic tasks beyond well-defined problems such as math and coding.  Their framework, Agent-R1, is compatible with popular RL algorithms and shows considerable improvement on reasoning tasks that require …

Beyond math and coding: New RL framework helps train LLM agents for complex, real-world tasks Read More »

Prompt Security’s Itamar Golan on why generative AI security requires building a category, not a feature

VentureBeat recently sat down (virtually) with Itamar Golan, co-founder and CEO of Prompt Security, to chat through the GenAI security challenges organizations of all sizes face. We talked about shadow AI sprawl, the strategic decisions that led Golan to pursue building a market-leading platform versus competing on features, and a real-world incident that crystallized why …

Prompt Security’s Itamar Golan on why generative AI security requires building a category, not a feature Read More »

Alibaba’s AgentEvolver lifts model performance in tool use by ~30% using synthetic, auto-generated tasks

Researchers at Alibaba’s Tongyi Lab have developed a new framework for self-evolving agents that create their own training data by exploring their application environments. The framework, AgentEvolver, uses the knowledge and reasoning capabilities of large language models for autonomous learning, addressing the high costs and manual effort typically required to gather task-specific datasets. Experiments show …

Alibaba’s AgentEvolver lifts model performance in tool use by ~30% using synthetic, auto-generated tasks Read More »

A weekend ‘vibe code’ hack by Andrej Karpathy quietly sketches the missing layer of enterprise AI orchestration

This weekend, Andrej Karpathy, the former director of AI at Tesla and a founding member of OpenAI, decided he wanted to read a book. But he did not want to read it alone. He wanted to read it accompanied by a committee of artificial intelligences, each offering its own perspective, critiquing the others, and eventually …

A weekend ‘vibe code’ hack by Andrej Karpathy quietly sketches the missing layer of enterprise AI orchestration Read More »

Black Forest Labs launches Flux.2 AI image models to challenge Nano Banana Pro and Midjourney

It’s not just Google’s Gemini 3, Nano Banana Pro, and Anthropic’s Claude Opus 4.5 we have to be thankful for this year around the Thanksgiving holiday here in the U.S. No, today the German AI startup Black Forest Labs released FLUX.2, a new image generation and editing system complete with four different models designed to …

Black Forest Labs launches Flux.2 AI image models to challenge Nano Banana Pro and Midjourney Read More »

OpenAI now lets enterprises choose where to host their data

OpenAI expanded its data residency regions for ChatGPT and its API, giving enterprise users the option to store and process their data closest to their business operations and better comply with local regulations. This expansion removes one of the biggest compliance blockers preventing global enterprises from deploying ChatGPT at scale. Data residency, often an overlooked piece …

OpenAI now lets enterprises choose where to host their data Read More »