ACM

Non classé

Samsung AI researcher’s new, open reasoning model TRM outperforms models 10,000X larger — on specific problems

The trend of AI researchers developing new, small open source generative models that outperform far larger, proprietary peers continued this week with yet another staggering advancement. Alexia Jolicoeur-Martineau, Senior AI Researcher at Samsung’s Advanced​ Institute of Technology (SAIT) in Montreal, Canada,​ has introduced the Tiny Recursion Model (TRM) — a neural network so small it …

Samsung AI researcher’s new, open reasoning model TRM outperforms models 10,000X larger — on specific problems Read More »

AI21’s Jamba Reasoning 3B Redefines What “Small” Means in LLMs — 250K Context on a Laptop

The latest addition to the small model wave for enterprises comes from AI21 Labs, which is betting that bringing models to devices will free up traffic in data centers.  AI21’s Jamba Reasoning 3B, a “tiny” open-source model that can run extended reasoning, code generation and respond based on ground truth. Jamba Reasoning 3B handles more …

AI21’s Jamba Reasoning 3B Redefines What “Small” Means in LLMs — 250K Context on a Laptop Read More »

OpenAI Dev Day 2025: ChatGPT becomes the new app store — and hardware is coming

In a packed hall at Fort Mason Center in San Francisco, against a backdrop of the Golden Gate Bridge, OpenAI CEO Sam Altman laid out a bold vision to remake the digital world. The company that brought generative AI to the mainstream with a simple chatbot is now building the foundations for its next act: …

OpenAI Dev Day 2025: ChatGPT becomes the new app store — and hardware is coming Read More »

Google’s AI can now surf the web for you, click on buttons, and fill out forms with Gemini 2.5 Computer Use

Some of the largest providers of large language models (LLMs) have sought to move beyond multimodal chatbots — extending their models out into “agents” that can actually take more actions on behalf of the user across websites. Recall OpenAI’s ChatGPT Agent (formerly known as “Operator”) and Anthropic’s Computer Use, both released over the last two …

Google’s AI can now surf the web for you, click on buttons, and fill out forms with Gemini 2.5 Computer Use Read More »

Has this stealth startup finally cracked the code on enterprise AI agent reliability? Meet AUI’s Apollo-1

For more than a decade, conversational AI has promised human-like assistants that can do more than chat. Yet even as large language models (LLMs) like ChatGPT, Gemini, and Claude learn to reason, explain, and code, one critical category of interaction remains largely unsolved — reliably completing tasks for people outside of chat. Even the best …

Has this stealth startup finally cracked the code on enterprise AI agent reliability? Meet AUI’s Apollo-1 Read More »

IBM claims 45% productivity gains with Project Bob, its multi-model IDE that orchestrates LLMs with full repository context

For many enterprises, there continue to be barriers to fully adopting and benefiting from agentic AI. IBM is betting the blocker isn’t building AI agents but governing them in production. At its TechXchange 2025 conference today, IBM unveiled a series of capabilities designed to bridge the gap: Project Bob, an AI-first IDE that orchestrates multiple …

IBM claims 45% productivity gains with Project Bob, its multi-model IDE that orchestrates LLMs with full repository context Read More »

OpenAI unveils AgentKit that lets developers drag and drop to build AI agents

OpenAI launched an agent builder that the company hopes will eliminate fragmented tools and make it easier for enterprises to utilize OpenAI’s system to create agents. AgentKit, announced during OpenAI’s DevDay in San Francisco, enables developers and enterprises to build agents and add chat capabilities in one place, potentially competing with platforms like Zapier. By …

OpenAI unveils AgentKit that lets developers drag and drop to build AI agents Read More »

From Silicon Valley to Nairobi: What the Global South’s AI leapfrogging teaches tech leaders

When I write about the cognitive migration now underway, brought about by the rapid advance of gen AI, I do so from the perspective of someone who has spent four decades in the technology industry. My own journey runs from coding business applications in Fortran and COBOL to systems analysis and design, IT project management, …

From Silicon Valley to Nairobi: What the Global South’s AI leapfrogging teaches tech leaders Read More »

OpenAI announces Apps SDK allowing ChatGPT to launch and run third party apps like Zillow, Canva, Spotify

OpenAI’s annual conference for third-party developers, DevDay, kicked off with a bang today as co-founder and CEO Sam Altman announced a new “Apps SDK” that makes it “possible to build apps inside of ChatGPT,” including paid apps, which companies can charge users for using OpenAI’s recently unveiled Agentic Commerce Protocol (ACP). In other words, instead …

OpenAI announces Apps SDK allowing ChatGPT to launch and run third party apps like Zillow, Canva, Spotify Read More »

Stopping breaches at machine speed demands agents, not alerts

Presented by DXC Technology The sheer volume and sophistication of incoming threats today has dwarfed attacks from just six months ago, let alone two years ago, because adversaries have leveled up with AI. Naturally, security operations and analysts are under pressure, facing mounting alert volumes and false positives, while organizations scramble to support them amidst …

Stopping breaches at machine speed demands agents, not alerts Read More »