ACM

Non classé

Researchers introduce Self-Harness, a framework that lets AI agents rewrite their own rules, boosting performance up to 60%

Not every company can or should build their own frontier AI language model. However, the harness controlling the model is something that most enterprises can and should customize for their specific purposes. Of course, this is easier said than done. Agent harnesses are still largely tuned through manual, ad hoc debugging — a process that …

Researchers introduce Self-Harness, a framework that lets AI agents rewrite their own rules, boosting performance up to 60% Read More »

AI hit the memory wall — now it needs a new context tier

Presented by Solidigm As inference workloads evolve from discrete question-and-answer exchanges into persistent, multi-step agentic systems, GPU availability is no longer the most critical AI bottleneck. Instead, the bottleneck has migrated from compute to context, says Jeff Harthorn, AI applied research lead at Solidigm. “Why context management has become a primary bottleneck, more than GPU …

AI hit the memory wall — now it needs a new context tier Read More »

Why agentic enterprises need to become learning systems

Presented by Splunk Every day, organizations learn things their AI systems never get to use. A security analyst corrects an AI-generated investigation. A network engineer identifies the root cause of a recurring outage. An observability team discovers that a pattern of latency, logs and infrastructure changes predicts service degradation. A customer operations team learns which …

Why agentic enterprises need to become learning systems Read More »

7,000 Langflow servers are under attack. LangGraph and LangChain have the same holes

Your AI agent did exactly what it was designed to do. The framework underneath it just handed an attacker a shell on the box that holds your OpenAI key, your database credentials, and your CRM tokens. That is not a hypothetical. In a few months, three of the most widely deployed AI agent frameworks each …

7,000 Langflow servers are under attack. LangGraph and LangChain have the same holes Read More »

Fine-tuning forgets. RAG leaks context. Hypernetworks build the model your agent needs on demand.

Enterprise teams keep watching the same thing happen. An AI agent demos beautifully, goes to production, and stalls: it runs for a short stretch, then needs a human to top up its context and check its output, and the promised efficiency drains into supervision. The agent did the work; you did the watching. It’s one …

Fine-tuning forgets. RAG leaks context. Hypernetworks build the model your agent needs on demand. Read More »

Anthropic’s Claude Code Artifacts update brings live, shared dashboards and interactive workspaces to enterprises

Anthropic announced a potentially game-changing new feature for users of Claude Code on the Claude Team and Enterprise subscription plans: Artifacts. This update turns a Claude Code session’s work into a live, interactive, and shareable, custom HTML webpage, allowing a Claude Code user to plug in live code, multiple data sources, and have it surface …

Anthropic’s Claude Code Artifacts update brings live, shared dashboards and interactive workspaces to enterprises Read More »

Copilot searched your mailbox. LiteLLM handed out admin keys. Run this 5-check audit before your stack is next

Two AI tools broke in the same way in the same two weeks, and four research teams proved it. The pattern underneath every disclosure is one sentence: enterprise AI accepts external input with no trust boundary. On June 15, Varonis disclosed SearchLeak (CVE-2026-42824), a proof-of-concept exfiltration chain in Microsoft 365 Copilot Enterprise Search. A victim …

Copilot searched your mailbox. LiteLLM handed out admin keys. Run this 5-check audit before your stack is next Read More »

New AI optimization framework beats Claude Code and Codex by 2.5x on the same compute budget

Imagine your engineering team just deployed an AI agent to search through internal company documents and answer employee questions. It works perfectly in development, but in production, it consistently hallucinates or misses key constraints. Fixing this is rarely a simple patch. It requires a tedious, trial-and-error process of tweaking chunking strategies, retrieval methods, and system …

New AI optimization framework beats Claude Code and Codex by 2.5x on the same compute budget Read More »

Adobe embeds agentic AI workflows across Creative Cloud, shifting from media generation to production orchestration

Adobe has announced a major expansion of its “creative agent” across its flagship Creative Cloud suite and upgraded Firefly AI studio. Available in public beta starting today across Premiere Pro, Photoshop, Illustrator, InDesign, and Frame.io, the agent is designed to serve everyone from individual creators to enterprise marketing teams. Unlike first-generation generative AI tools that …

Adobe embeds agentic AI workflows across Creative Cloud, shifting from media generation to production orchestration Read More »

AWS enters the context layer race with a graph that learns from agents, not manual curation

Building a context layer between enterprise data stores and AI agents is bespoke work, with no standard service to automate or maintain the graphs over time. Amazon is making a direct play to change that. Amazon on Wednesday entered the space, announcing a series of three products it’s positioning as a context intelligence stack for …

AWS enters the context layer race with a graph that learns from agents, not manual curation Read More »