ACM

Uncategorized

OpenAI drastically updates Codex desktop app to use all other apps on your computer, generate images, preview webpages

Confirming it has reached 3 million weekly developers, OpenAI is massively updating its Codex developer environment via its Mac and Windows desktop apps today to bring it closer to the “Super App” the company has confirmed it is pursuing. Before today, Codex was primarily an environment for using OpenAI’s underlying language models to write, edit, …

OpenAI debuts GPT-Rosalind, a new limited access model for life sciences, and broader Codex plugin on GitHub

The journey from a laboratory hypothesis to a pharmacy shelf is one of the most grueling marathons in modern industry, typically spanning 10 to 15 years and billions of dollars in investment. Progress is often stymied not just by the inherent mysteries of biology, but by the “fragmented and difficult to scale” workflows that force …

Anthropic releases Claude Opus 4.7, narrowly retaking lead for most powerful generally available LLM

Anthropic is publicly releasing its most powerful large language model yet, Claude Opus 4.7, today — as it continues to keep an even more powerful successor, Mythos, restricted to a small number of external enterprise partners for cybersecurity testing and patching vulnerabilities in the software said enterprises use (which Mythos exposed rapidly). The big headlines …

AI lowered the cost of building software. Enterprise governance hasn’t caught up

Presented by Retool

The logic used to be: buying software is cheaper, faster, and safer for most use cases. Building was reserved for companies with large engineering teams, deep pockets, and problems so specific that no vendor could address them. But now, the cost to code a piece of software has dropped to zero. Anyone …

Microsoft patched a Copilot Studio prompt injection. The data exfiltrated anyway.

Microsoft assigned CVE-2026-21520, a CVSS 7.5 indirect prompt injection vulnerability, to Copilot Studio. Capsule Security discovered the flaw, coordinated disclosure with Microsoft, and the patch was deployed on January 15. Public disclosure went live on Wednesday. That CVE matters less for what it fixes and more for what it signals. Capsule’s research calls Microsoft’s decision …

Meta researchers introduce ‘hyperagents’ to unlock self-improving AI for non-coding tasks

Creating self-improving AI systems is an important step toward deploying agents in dynamic environments, especially in enterprise production environments, where tasks are not always predictable or consistent. Current self-improving AI systems face severe limitations because they rely on fixed, handcrafted improvement mechanisms that only work under strict conditions such as software engineering. To overcome this …

Frontier models are failing one in three production attempts — and getting harder to audit

AI agents are now embedded in real enterprise workflows, and they’re still failing roughly one in three attempts on structured benchmarks. That gap between capability and reliability is the defining operational challenge for IT leaders in 2026, according to Stanford HAI’s ninth annual AI Index report. This uneven, unpredictable performance is what the AI Index …

We tested Anthropic’s redesigned Claude Code desktop app and ‘Routines’ — here’s what enterprises should know

The transition from AI as a chatbot to AI as a workforce is no longer a theoretical projection; it has become the primary design philosophy for the modern developer’s toolkit. On April 14, 2026, Anthropic signaled this shift with a dual release: a complete redesign of the Claude Code desktop app (for Mac and Windows) …

AI’s next bottleneck isn’t the models — it’s whether agents can think together

AI agents can connect together, but they cannot think together. That’s a huge difference and a bottleneck for next-gen systems, says Outshift by Cisco’s SVP and GM Vijoy Pandey. As he describes the current state of AI: Agents can be stitched together in a workflow or plug into a supervisor model — but there’s no …

Traza raises $2.1 million led by Base10 to automate procurement workflows with AI

For decades, procurement has been the back office that enterprise software forgot. Billions of dollars flow through vendor negotiations, purchase orders, and supplier communications every year at the largest manufacturers and construction companies in the country — and the vast majority of that work still runs on email threads, spreadsheets, and phone calls. Traza, a …
