ACM

Uncategorized

The AI that scored 95% — until consultants learned it was AI

Presented by SAP

When SAP ran a quiet internal experiment to gauge consultant attitudes toward AI, the results were striking. Five teams were asked to validate answers to more than 1,000 business requirements completed by SAP’s AI co-pilot, Joule for Consultants, a workload that would normally take several weeks. Four teams were told the …


Mistral launches powerful Devstral 2 coding model including open source, laptop-friendly version

French AI startup Mistral has weathered a rocky year of public questioning and emerged, in December 2025, with new, crowd-pleasing models for enterprise and indie developers. Just days after releasing its powerful open source, general-purpose Mistral 3 LLM family for edge devices and local hardware, the company returned …


Databricks’ OfficeQA uncovers disconnect: AI agents ace abstract tests but stall at 45% on enterprise docs

There is no shortage of AI benchmarks in the market today, with popular options like Humanity’s Last Exam (HLE), ARC-AGI-2 and GDPval, among numerous others. AI agents excel at solving abstract math problems and passing PhD-level exams that most benchmarks are based on, but Databricks has a question for the enterprise: Can they actually handle …


Brand-context AI: The missing requirement for marketing AI

Presented by BlueOcean

AI has become a central part of how marketing teams work, but the results often fall short. Models can generate content at scale and summarize information in seconds, yet the outputs are not always aligned with the brand, the audience, or the company’s strategic goals. The problem is not capability. The problem …


Tracking every decision, dollar and delay: The new process intelligence engine driving public-sector progress

Presented by Celonis

The State of Oklahoma discovered its blind spots the hard way. In April 2023, a legislative report revealed its agencies had spent $3 billion without proper oversight. Janet Morrow, Director of Oklahoma’s Risk, Assessment and Compliance Division, set out to track thousands of monthly transactions across dozens of disconnected systems. The Sooner …


Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI, also known as Z.ai, has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and high-efficiency deployment. The release includes two models in “large” and “small” sizes: GLM-4.6V (106B), a 106-billion-parameter model aimed at cloud-scale inference, and GLM-4.6V-Flash (9B), a smaller model …


Anthropic’s Claude Code can now read your Slack messages and write code for you

Anthropic on Monday launched a beta integration that connects its fast-growing Claude Code programming agent directly to Slack, allowing software engineers to delegate coding tasks without leaving the workplace messaging platform where much of their daily communication already happens. The release, which Anthropic describes as a “research preview,” is the AI safety company’s latest move …


Booking.com’s agent strategy: Disciplined, modular and already delivering 2× accuracy

While many enterprises weren’t even thinking about agentic behaviors or infrastructures, Booking.com had already “stumbled” into them with its homegrown conversational recommendation system. This early experimentation has allowed the company to take a step back and avoid getting swept up in the frantic AI agent hype. Instead, it is taking a disciplined, layered, modular approach …


Design in the age of AI: How small businesses are building big brands faster

Presented by Design.com

For most of history, design was the last step in starting a business, something entrepreneurs invested in once the idea was proven. Today, it’s one of the first. The rise of generative AI has shifted how small businesses imagine, launch, and grow, turning what used to be a months-long creative …


Why AI coding agents aren’t production-ready: Brittle context windows, broken refactors, missing operational awareness

Remember this Quora comment (which also became a meme)? (Source: Quora) In the pre-large language model (LLM) Stack Overflow era, the challenge was discerning which code snippets to adopt and adapt effectively. Now, while generating code has become trivially easy, the more profound challenge lies in reliably identifying and integrating high-quality, enterprise-grade code into production …
