ACM

Non classé

OpenAI launches GPT-5.4 with native computer use mode, financial plugins for Microsoft Excel, Google Sheets

The AI updates aren’t slowing down. Literally two days after OpenAI launched a new underlying AI model for ChatGPT called GPT-5.3 Instant, the company has unveiled another, even more massive upgrade: GPT-5.4. Actually, GPT-5.4 comes in two varieties: GPT-5.4 Thinking and GPT-5.4 Pro, the latter designed for the most complex tasks. Both will be available …

OpenAI launches GPT-5.4 with native computer use mode, financial plugins for Microsoft Excel, Google Sheets Read More »

Databricks built a RAG agent it says can handle every kind of enterprise search

Most enterprise RAG pipelines are optimized for one search behavior. They fail silently on the others. A model trained to synthesize cross-document reports handles constraint-driven entity search poorly. A model tuned for simple lookup tasks falls apart on multi-step reasoning over internal notes. Most teams find out when something breaks. Databricks set out to fix …

Databricks built a RAG agent it says can handle every kind of enterprise search Read More »

Microsoft built Phi-4-reasoning-vision-15B to know when to think — and when thinking is a waste of time

Microsoft on Tuesday released Phi-4-reasoning-vision-15B, a compact open-weight multimodal AI model that the company says matches or exceeds the performance of systems many times its size — while consuming a fraction of the compute and training data. The release marks the latest and most technically ambitious chapter in the software giant’s year-long campaign to prove …

Microsoft built Phi-4-reasoning-vision-15B to know when to think — and when thinking is a waste of time Read More »

Black Forest Labs’ new Self-Flow technique makes training multimodal AI models 2.8x more efficient

To create coherent images or videos, generative AI diffusion models like Stable Diffusion or FLUX have typically relied on external “teachers”—frozen encoders like CLIP or DINOv2—to provide the semantic understanding they couldn’t learn on their own. But this reliance has come at a cost: a “bottleneck” where scaling up the model no longer yields better …

Black Forest Labs’ new Self-Flow technique makes training multimodal AI models 2.8x more efficient Read More »

EY hit 4x coding productivity by connecting AI agents to engineering standards

Coding agents can generate thousands of lines of code in minutes. The problem: most of it can’t be deployed. It breaks internal standards, fails compliance checks, or creates more cleanup work than it saves. “You can generate a ton of code, but it doesn’t mean really anything, right? It’s got to be code that is …

EY hit 4x coding productivity by connecting AI agents to engineering standards Read More »

Pentagon vendor cutoff exposes the AI dependency map most enterprises never built

The federal directive ordering all U.S. government agencies to cease using Anthropic technology comes with a six-month phaseout window. That timeline assumes agencies already know where Anthropic’s models sit inside their workflows. Most don’t today. Most enterprises wouldn’t, either. The gap between what enterprises think they’ve approved and what’s actually running in production is wider …

Pentagon vendor cutoff exposes the AI dependency map most enterprises never built Read More »

Did Alibaba just kneecap its powerful Qwen AI team? Key figures depart in wake of latest open source release

Alibaba’s Qwen team of AI researchers have been among the most prolific and well-regarded by international machine learning community — shipping dozens of powerful generalized and specialized generative models starting last summer, most of them entirely open source and free. But now, just 24 hours after shipping the open source Qwen3.5 small model series—a release …

Did Alibaba just kneecap its powerful Qwen AI team? Key figures depart in wake of latest open source release Read More »

GPT-5.3 Instant cuts hallucinations by 26.8% as OpenAI shifts focus from speed to accuracy

OpenAI’s GPT-5.3 Instant — the company’s most widely used model — reduces hallucinations by up to 26.8% compared to its predecessor, prioritizing accuracy and conversational reliability over raw performance gains, OpenAI says. GPT-5.3 Instant, which is essentially the default and is the most used model for ChatGPT users, also improves on tone, relevance and conversation …

GPT-5.3 Instant cuts hallucinations by 26.8% as OpenAI shifts focus from speed to accuracy Read More »

Google releases Gemini 3.1 Flash Lite at 1/8th the cost of Pro

Google’s newest AI model is here: Gemini 3.1 Flash-Lite, and the biggest improvements this time around come in cost and speed, especially for enterprises and developers seeking to leverage powerful reasoning and multimodal capabilities from the U.S. search and cloud giant. Positioning it as the most cost-efficient and responsive model in the Gemini 3 series, …

Google releases Gemini 3.1 Flash Lite at 1/8th the cost of Pro Read More »

Endor Labs launches free tool AURI after study finds only 10% of AI-generated code is secure

Endor Labs, the application security startup backed by more than $208 million in venture funding, today launched AURI, a platform that embeds real-time security intelligence directly into the AI coding tools that are reshaping how software gets built. The product is available free to individual developers and integrates natively with popular AI coding assistants including …

Endor Labs launches free tool AURI after study finds only 10% of AI-generated code is secure Read More »