ACM

Uncategorized

Nous Research just released Nomos 1, an open-source AI that ranks second on the notoriously brutal Putnam math exam

Nous Research, the San Francisco-based artificial intelligence startup, released on Tuesday an open-source mathematical reasoning system called Nomos 1 that achieved near-elite human performance on this year’s William Lowell Putnam Mathematical Competition, one of the most prestigious and notoriously difficult undergraduate math contests in the world. The Putnam is known for its difficulty: While a …


Cohere’s Rerank 4 quadruples the context window over 3.5 to cut agent errors and boost enterprise search accuracy

Almost a year after releasing Rerank 3.5, Cohere launched the latest version of its search model, now with a larger context window to help agents find the information they need to complete their tasks. Cohere said in a blog post that Rerank 4 has a 32K context window, representing a fourfold increase compared to 3.5. …


OpenAI’s GPT-5.2 is here: what enterprises need to know

The rumors were true, and the “Code Red” is over: OpenAI today announced the release of its new frontier large language model (LLM) family: GPT-5.2. It comes at a pivotal moment for the AI pioneer, which has faced intensifying pressure since rival Google’s Gemini 3 LLM seized the top spot on major third-party performance leaderboards …


Marble enters the race to bring AI to tax work, armed with $9 million and a free research tool

Marble, a startup building artificial intelligence agents for tax professionals, has raised $9 million in seed funding as the accounting industry grapples with a deepening labor shortage and mounting regulatory complexity. The round, led by Susa Ventures with participation from MXV Capital and Konrad Capital, positions Marble to compete in a market where AI adoption …


Creating a glass box: How NetSuite is engineering trust into AI

Presented by Oracle NetSuite

When any company tells you it is their biggest product release in almost three decades, it’s worth listening. When the person saying it founded the world’s first cloud computing company, it’s time to take note. At SuiteWorld 2025, Evan Goldberg, founder and EVP of Oracle NetSuite, did just that when he …


The 70% factuality ceiling: why Google’s new ‘FACTS’ benchmark is a wake-up call for enterprise AI

There’s no shortage of generative AI benchmarks designed to measure the performance and accuracy of a given model on completing various helpful enterprise tasks — from coding to instruction following to agentic web browsing and tool use. But many of these benchmarks have one major shortcoming: they measure the AI’s ability to complete specific problems …


How Google’s TPUs are reshaping the economics of large-scale AI

For more than a decade, Nvidia’s GPUs have underpinned nearly every major advance in modern AI. That position is now being challenged. Frontier models such as Google’s Gemini 3 and Anthropic’s Claude 4.5 Opus were trained not on Nvidia hardware, but on Google’s latest Tensor Processing Units, the Ironwood-based TPUv7. This signals that a viable …


OpenAI report reveals a 6x productivity gap between AI power users and everyone else

The tools are available to everyone. The subscription is company-wide. The training sessions have been held. And yet, in offices from Wall Street to Silicon Valley, a stark divide is opening between workers who have woven artificial intelligence into the fabric of their daily work and colleagues who have barely touched it. The gap is …


Quilter’s AI just designed an 843‑part Linux computer that booted on the first try. Hardware will never be the same.

A San Francisco-based startup has demonstrated what it calls a breakthrough in hardware development: an artificial intelligence system that designed a fully functional Linux computer in one week — a process that would typically consume nearly three months of skilled engineering labor. Quilter, which has raised more than $40 million from investors including Benchmark, Index …


How Hud’s runtime sensor cut triage time from 3 hours to 10 minutes

Engineering teams are generating more code with AI agents than ever before. But they’re hitting a wall when that code reaches production. The problem isn’t necessarily the AI-generated code itself. It’s that traditional monitoring tools generally struggle to provide the granular, function-level data AI agents need to understand how code actually behaves in complex production …
