ACM

Non classé

This new, dead simple prompt technique boosts accuracy on LLMs by up to 76% on non-reasoning tasks

In the chaotic world of Large Language Model (LLM) optimization, engineers have spent the last few years developing increasingly esoteric rituals to get better answers. We’ve seen “Chain of Thought” (asking the model to think step-by-step and often, show those “reasoning traces” to the user), “Emotional Blackmail” (telling the model its career depends on the …

This new, dead simple prompt technique boosts accuracy on LLMs by up to 76% on non-reasoning tasks Read More »

DeepSeek’s conditional memory fixes silent LLM waste: GPU cycles lost to static lookups

When an enterprise LLM retrieves a product name, technical specification, or standard contract clause, it’s using expensive GPU computation designed for complex reasoning — just to access static information. This happens millions of times per day. Each lookup wastes cycles and inflates infrastructure costs.  DeepSeek’s newly released research on “conditional memory” addresses this architectural limitation …

DeepSeek’s conditional memory fixes silent LLM waste: GPU cycles lost to static lookups Read More »

Why Sakana AI’s big win is a big deal for the future of enterprise agents

In an impressive feat, Japanese startup Sakana AI’s coding agent ALE-Agent recently secured first place in the AtCoder Heuristic Contest (AHC058), a complex coding competition that involves complicated optimization problems — and a more difficult and perhaps telling challenge than benchmarks like HumanEval, which mostly test the ability to write isolated functions, and which many …

Why Sakana AI’s big win is a big deal for the future of enterprise agents Read More »

Why Egnyte keeps hiring junior engineers despite the rise of AI coding tools

Egnyte, the $1.5 billion cloud content governance company, has embedded AI coding tools across its global team of more than 350 developers — but not to reduce headcount. Instead, the company continues to hire junior engineers, using AI to accelerate onboarding, deepen codebase understanding, and shorten the path from junior to senior contributor. The approach …

Why Egnyte keeps hiring junior engineers despite the rise of AI coding tools Read More »

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

Salesforce on Tuesday launched an entirely rebuilt version of Slackbot, the company’s workplace assistant, transforming it from a simple notification tool into what executives describe as a fully powered AI agent capable of searching enterprise data, drafting documents, and taking action on behalf of employees. The new Slackbot, now generally available to Business+ and Enterprise+ …

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI Read More »

Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required

Anthropic released Cowork on Monday, a new AI agent capability that extends the power of its wildly successful Claude Code tool to non-technical users — and according to company insiders, the team built the entire feature in approximately a week and a half, largely using Claude Code itself. The launch marks a major inflection point …

Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required Read More »

Nvidia Rubin’s rack-scale encryption signals a turning point for enterprise AI security

Nvidia’s Vera Rubin NVL72, announced at CES 2026, encrypts every bus across 72 GPUs, 36 CPUs, and the entire NVLink fabric. It’s the first rack-scale platform to deliver confidential computing across CPU, GPU, and NVLink domains. For security leaders, this fundamentally shifts the conversation. Rather than attempting to secure complex hybrid cloud configurations through contractual …

Nvidia Rubin’s rack-scale encryption signals a turning point for enterprise AI security Read More »

How DoorDash scaled without a costly ERP overhaul

Presented by NetSuite Most companies racing from startup to an industry leader face a choice: limp along with scrappy early systems or endure a costly platform migration. DoorDash did neither. The local-commerce giant scaled from its 2013 founding through IPO and global expansion — acquiring the Helsiniki-based technology company Wolt in 2022 and UK-based Deliveroo …

How DoorDash scaled without a costly ERP overhaul Read More »

Why your LLM bill is exploding — and how semantic caching can cut it by 73%

Our LLM API bill was growing 30% month-over-month. Traffic was increasing, but not that fast. When I analyzed our query logs, I found the real problem: Users ask the same questions in different ways. “What’s your return policy?,” “How do I return something?”, and “Can I get a refund?” were all hitting our LLM separately, …

Why your LLM bill is exploding — and how semantic caching can cut it by 73% Read More »

Anthropic cracks down on unauthorized Claude usage by third-party harnesses and rivals

Anthropic has confirmed the implementation of strict new technical safeguards preventing third-party applications from spoofing its official coding client, Claude Code, in order to access the underlying Claude AI models for more favorably pricing and limits — a move that has disrupted workflows for users of popular open source coding agent OpenCode. Simultaneously but separately, …

Anthropic cracks down on unauthorized Claude usage by third-party harnesses and rivals Read More »