ACM

Uncategorized

Tracking every decision, dollar and delay: The new process intelligence engine driving public-sector progress

Presented by Celonis

The State of Oklahoma discovered its blind spots the hard way. In April 2023, a legislative report revealed its agencies had spent $3 billion without proper oversight. Janet Morrow, Director of Oklahoma’s Risk, Assessment and Compliance Division, set out to track thousands of monthly transactions across dozens of disconnected systems. The Sooner …

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI, also known as Z.ai, has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and high-efficiency deployment. The release includes two models in “large” and “small” sizes: GLM-4.6V (106B), a larger 106-billion-parameter model aimed at cloud-scale inference, and GLM-4.6V-Flash (9B), a smaller model …

Anthropic’s Claude Code can now read your Slack messages and write code for you

Anthropic on Monday launched a beta integration that connects its fast-growing Claude Code programming agent directly to Slack, allowing software engineers to delegate coding tasks without leaving the workplace messaging platform where much of their daily communication already happens. The release, which Anthropic describes as a “research preview,” is the AI safety company’s latest move …

Booking.com’s agent strategy: Disciplined, modular and already delivering 2× accuracy

When many enterprises weren’t even thinking about agentic behaviors or infrastructures, Booking.com had already “stumbled” into them with its homegrown conversational recommendation system. This early experimentation has allowed the company to take a step back and avoid getting swept up in the frantic AI agent hype. Instead, it is taking a disciplined, layered, modular approach …

Design in the age of AI: How small businesses are building big brands faster

Presented by Design.com

For most of history, design was the last step in starting a business, something entrepreneurs invested in once the idea was proven. Today, it’s one of the first. The rise of generative AI has shifted how small businesses imagine, launch, and grow, turning what used to be a months-long creative …

Why AI coding agents aren’t production-ready: Brittle context windows, broken refactors, missing operational awareness

Remember this Quora comment (which also became a meme)? (Source: Quora) In the pre-large language model (LLM) Stack Overflow era, the challenge was discerning which code snippets to adopt and adapt effectively. Now, while generating code has become trivially easy, the more profound challenge lies in reliably identifying and integrating high-quality, enterprise-grade code into production …

AI denial is becoming an enterprise risk: Why dismissing “slop” obscures real capability gains

Three years ago, ChatGPT was born. It amazed the world and ignited unprecedented investment and excitement in AI. Today, ChatGPT is still a toddler, but public sentiment around the AI boom has turned sharply negative. The shift began when OpenAI released GPT-5 this summer to mixed reviews, mostly from casual users who, unsurprisingly, judged …

GAM takes aim at “context rot”: A dual-agent memory architecture that outperforms long-context LLMs

For all their superhuman power, today’s AI models suffer from a surprisingly human flaw: They forget. Give an AI assistant a sprawling conversation, a multi-step reasoning task or a project spanning days, and it will eventually lose the thread. Engineers refer to this phenomenon as “context rot,” and it has quietly become one of the …

The ‘truth serum’ for AI: OpenAI’s new method for training models to confess their mistakes

OpenAI researchers have introduced a novel method that acts as a “truth serum” for large language models (LLMs), compelling them to self-report their own misbehavior, hallucinations and policy violations. This technique, “confessions,” addresses a growing concern in enterprise AI: Models can be dishonest, overstating their confidence or covering up the shortcuts they take to arrive …

Anthropic vs. OpenAI red teaming methods reveal different security priorities for enterprise AI

Model providers want to prove the security and robustness of their models, releasing system cards and conducting red-team exercises with each new release. But enterprises can find it difficult to parse the results, which vary widely and can be misleading. Anthropic’s 153-page system card for Claude Opus 4.5 versus OpenAI’s 60-page GPT-5 system …
