ACM

Non classé

Developers can now debug and evaluate AI agents locally with Raindrop’s open source tool Workshop

Observability startup Raindrop AI’s new open source, MIT Licensed “Workshop” tool, launched today, gives developers something that they’ve likely wanted, perhaps subconsciously, since the agentic AI era kicked off in earnest last year: a local debugger and evaluation tool specifically designed for AI agents, allowing devs to see all the traces of what their agent …

Developers can now debug and evaluate AI agents locally with Raindrop’s open source tool Workshop Read More »

Cerebras stock nearly doubles on day one as AI chipmaker hits $100 billion — what it means for AI infrastructure

Cerebras Systems, the Silicon Valley chipmaker that built the world’s largest commercial AI processor, erupted onto the Nasdaq on Wednesday, opening at $350 per share — nearly double its $185 IPO price — and rocketing past a $100 billion market capitalization in its first hours of trading. The debut instantly crowned Cerebras as one of …

Cerebras stock nearly doubles on day one as AI chipmaker hits $100 billion — what it means for AI infrastructure Read More »

Enterprises can verify who their agents are. They cannot control what those agents do.

Anthony Grieco, Cisco’s SVP and chief security and trust officer, did not hesitate when VentureBeat asked whether rogue agent incidents are reaching Cisco’s customer base. “A hundred percent. We see them regularly,” Grieco told VentureBeat in an exclusive interview at RSAC 2026. “I’ve heard some that I can’t repeat, but they do get to the …

Enterprises can verify who their agents are. They cannot control what those agents do. Read More »

Claude Code’s ‘/goals’ separates the agent that works from the one that decides it’s done

A code migration agent finishes its run, and the pipeline looks green. But several pieces were never compiled — and it took days to catch. That’s not a model failure; that’s an agent deciding it was done before it actually was. Many enterprises are now seeing that production AI agent pipelines fail not because of …

Claude Code’s ‘/goals’ separates the agent that works from the one that decides it’s done Read More »

Enterprises can now train custom AI models from production workflows — no ML team required

Every query an enterprise AI application processes, every correction a subject matter expert makes to its output — that interaction is training data. Most organizations are not capturing it. The production workflows companies have already built are generating a continuous signal that improves AI models, and it is disappearing. San Francisco-based Empromptu AI on Thursday …

Enterprises can now train custom AI models from production workflows — no ML team required Read More »

AI IQ is here: a new site scores frontier AI models on the human IQ scale. The results are already dividing tech.

For decades, the IQ test has been one of the most familiar — and most contested — yardsticks for human intelligence. Now, a startup project called AI IQ is applying the same metaphor to artificial intelligence, assigning estimated intelligence quotients to more than 50 of the world’s most powerful language models and plotting them on …

AI IQ is here: a new site scores frontier AI models on the human IQ scale. The results are already dividing tech. Read More »

Anthropic reinstates OpenClaw and third-party agent usage on Claude subscriptions — with a catch

Good news, OpenClaw fans — you can once again use your Claude AI subscription to power the hit, open source, autonomous AI agentic harness! But, there’s a big catch with how it’s being enacted. A few hours ago, Anthropic announced via its official developer communications account on X, @ClaudeDevs, that it is changing its Claude …

Anthropic reinstates OpenClaw and third-party agent usage on Claude subscriptions — with a catch Read More »

Anthropic finally beat OpenAI in business AI adoption — but 3 big threats could erase its lead

For the first time since the AI race began, more American businesses are paying for Anthropic’s Claude than for OpenAI’s ChatGPT. Adoption of Anthropic rose 3.8% in April to 34.4% of businesses, according to the May 2026 release of the Ramp AI Index. OpenAI’s adoption fell 2.9% to 32.3%. Overall AI adoption among businesses rose …

Anthropic finally beat OpenAI in business AI adoption — but 3 big threats could erase its lead Read More »

Frontier AI models don’t just delete document content — they rewrite it, and the errors are nearly impossible to catch

As large language models become more capable, users are tempted to delegate knowledge tasks where models process documents on their behalf and provide the finished results. But how far can you trust the model to stay faithful to the content of your documents when it has to iterate over them across multiple rounds? A new …

Frontier AI models don’t just delete document content — they rewrite it, and the errors are nearly impossible to catch Read More »

Perceptron Mk1 shocks with highly performant video analysis AI model 80-90% cheaper than Anthropic, OpenAI & Google

AI that can see and understand what’s happening in a video — especially a live feed — is understandably an attractive product to lots of enterprises and organizations. Beyond acting as a security “watchdog” over sites and facilities, such an AI model could also be used to clip out the most exciting parts of marketing …

Perceptron Mk1 shocks with highly performant video analysis AI model 80-90% cheaper than Anthropic, OpenAI & Google Read More »