ACM

Uncategorized

Autonomous security agents need complete data. Here’s how to check if yours is ready.

An endpoint agent cannot report its own absence. The 2026 Axonius Actionability Report, conducted with the Ponemon Institute and surveying 662 IT and security professionals, put a number on a gap SOC teams have worked around for years. Across the Axonius customer base, 12.7% of devices in a 298,000-device median inventory are missing their expected …

OpenAI unveils GPT-5.6 Sol, Terra and Luna models — but only accessible to limited preview partners for now, per US Gov

OpenAI is announcing a limited preview of its next-generation GPT-5.6 model series today, introducing three distinct, capability-tiered models—Sol, Terra, and Luna—designed to re-engineer developer and enterprise workflows. The initial rollout is available through the API and Codex to a narrow set of trusted partners and organizations after OpenAI previewed the models and release plans to …

Most companies think they’re building a software factory. They’re actually just shipping bugs faster.

Industrialized factories changed how the world produced physical goods: more output, lower costs, faster than anything that came before. Now a similar shift is happening with software. LLMs have lowered the barrier to writing code, increased individual output, and pushed organizations to think about software development as a production system. The standard software development lifecycle …

Liquid AI’s smallest model yet LFM2.5-230M beats models 4X its size at data extraction, can run ‘anywhere’

Liquid AI, founded by former MIT computer scientists, today released its smallest AI language model yet, LFM2.5-230M, and enterprises would do well to consider it for data extraction and for local deployment on smartphones, laptops and robotics. This is a 230-million-parameter foundation model explicitly designed for on-device agentic workflows, and as Liquid states …

OpenAI’s updated GPT-5.5 Instant is better at shopping, complex constraints, and understanding user intent  — and it’s already in the API

OpenAI has made a significant update to its most widely used language model, GPT-5.5 Instant, which is the default in the free version of ChatGPT. The company announced the upgraded version of GPT-5.5 Instant yesterday on X, calling it “much more fun to talk to” and saying it is “better at understanding the intent behind …

Your enterprise AI agents should automatically remember which model is right for which task. Mindstone built the capability with Rebel

AI agent orchestration platforms are popping up like weeds these days, but London-based AI transformation startup Mindstone’s Rebel might be among the most promising I’ve come across. That’s because the system, which officially launched this week, is a local-first, agentic AI operating system distributed under a “Fair Source” license, allowing teams of under 100 users …

Mistral launches OCR 4, turning document extraction into a full enterprise AI play

Mistral AI on Tuesday released OCR 4, a document intelligence model that moves beyond raw text extraction to return structured representations of entire documents — complete with bounding boxes, block-type classification, and per-word confidence scores. The release marks Mistral’s fourth generation of optical character recognition technology in roughly 15 months and lands at a moment …

Stanford researchers will discuss their agentic ‘scientists’ that are on course to reshape drug discovery at VB Transform 2026

Drug discovery is notoriously inefficient. Pharmaceutical projects span years, moving from one specialized human team to the next through disconnected workflows that lose knowledge at each handoff. A shocking 90% to 95% of drug discovery projects reportedly fail, one of the highest failure rates of any industry. A single successful drug can …

Alibaba’s model never trained as an agent — and improved agent performance across seven benchmarks

Alibaba’s Qwen team released Qwen-AgentWorld on Tuesday — two models trained not to act inside agent environments, but to predict what those environments return. The release covers seven domains under a single architecture: MCP, Search, Terminal, Software Engineering, Android, Web, and OS. The release extends Alibaba’s recent push into autonomous agents. Qwen3.7-Max, released in May, …

Xiaomi’s HarnessX rewrites its own AI scaffolding mid-task — and smaller models gain the most

As enterprise AI agents take on increasingly complex, long-horizon tasks, their performance is often restricted by their harness, the software scaffolding that connects the backbone LLM to its environment. Currently, harnesses are largely static and hand-crafted. Improving them is mostly manual work, and they do not automatically improve based on the execution data they collect from …
