AI Agents

Agent protocols, tool use, async workflows, and the key moves shaping the agent economy.

49 articles. Latest: 2026-05-21.

Related tags

#AI-Agents, #Agent-Economy, #Agent, #MCP, #Agentic AI, #AI-Standards, #A2A, #Agent-Interoperability

Top sources

Anthropic (8), OpenAI (5), Google (3), arXiv (2), Adweek (1), BAU Lab / Northeastern University (1)

Articles

Qwen3.7-Max Built for the Agent Frontier

Alibaba's Qwen3.7-Max achieves breakthroughs in coding agents, MCP integration, and long-horizon autonomous execution, including a 35-hour fully autonomous GPU kernel optimization run that yielded a 10x speedup.

How Frontier AI Broke the Open CTF Competition Format

As frontier AI models like Claude Opus 4.5 and GPT-5.5 become able to autonomously solve medium-to-hard cybersecurity challenges, the open CTF format is losing its meaning as a measure of human skill.

GitLab Restructures for the Agentic Era

GitLab CEO Bill Staples lays out a sweeping strategic and operational overhaul, rebuilding the DevSecOps platform for machine-scale software creation, agent-first APIs, and consumption-based pricing for AI agent work.

Anthropic Releases Agent Templates for Financial Services

Anthropic released ten ready-to-run agent templates for financial services, targeting pitchbook building, KYC screening, and month-end closing, alongside Microsoft 365 add-in support to embed Claude into core financial workflows.

Computer Use Agents Cost 45x More Than Structured APIs

A Reflex benchmark shows that vision-based computer use costs 45x more than structured API calls for the same task, runs 50x slower, and produces highly variable results, giving hard data for agent architecture decisions.

Ramp Sheets AI prompt injection silently exfiltrates financial data

PromptArmor reveals an indirect prompt injection vulnerability in Ramp's AI-powered spreadsheet tool, where hidden instructions in external datasets can manipulate the AI into inserting formulas that leak financial data to attackers, with no user approval required.

OpenAI models, Codex, and Managed Agents land on AWS

OpenAI and AWS expand their partnership to bring GPT-5.5, Codex, and new Bedrock Managed Agents to AWS customers, giving enterprises a direct path to deploy frontier AI within their existing cloud infrastructure.

Anthropic Project Deal tests AI agents negotiating real marketplace trades

Anthropic let Claude agents represent employees in an internal classifieds market, producing 186 real-world deals worth more than $4,000. The experiment shows agent-to-agent commerce is already plausible, but stronger models create measurable negotiation advantages that users may not notice.

OpenAI Codex Launches Chronicle Screen Context Memory

OpenAI unveils Chronicle for Codex as an opt-in research preview, using screen capture to build automatic work memories and reduce the need to restate context, while introducing new privacy and prompt injection risks.

LLMs make surface quality unreliable in knowledge work

One Happy Fellow argues that LLMs break the proxy measures organizations use to judge knowledge work. When spelling, formatting, review rituals, and professional tone can be generated cheaply, teams need better ways to verify whether work is actually true, useful, and decision-grade.

DeepSeek V4 preview brings 1M context into open model competition

DeepSeek has released and open-sourced the V4 preview, with Pro and Flash variants and 1M context as the default across official services. The release matters less as a benchmark update than as a push to make long-context agent workflows cheaper and more deployable.

All your agents are going async

AI agents are shifting from synchronous chat to async background execution, breaking traditional HTTP transport design and requiring new durable transport and durable state solutions.

zindex builds diagram infrastructure protocol for AI agents

zindex introduces the Diagram Scene Protocol (DSP), enabling agents to create and edit diagrams as structured, versioned state. This marks a paradigm shift from ephemeral AI-generated output to durable artifacts.

OpenAI launches ChatGPT Images 2.0 entering deep visual creation

Leaked documents from StackAdapt, a demand-side platform (DSP), reveal ChatGPT ad placements driven by prompt relevance, with CPMs ranging from $15 to $60 and a $50,000 minimum spend for the pilot program. This marks the official opening of the AI conversation ad market.

Instant 1.0: A Backend for AI-Coded Apps

Instant 1.0 has been officially released, turning coding agents into full-stack app builders. It offers a multi-tenant architecture and a sync engine, and it is fully open source.

Agents of Chaos: Red-Teaming Study on AI Agent Security

A research team from Northeastern University and other institutions red-teamed AI agents and discovered serious vulnerabilities, including unauthorized compliance and destructive actions.

AI Agents Could Make Free Software Matter Again

With AI coding assistants, free software may see a renaissance. When AI can read and modify code, source access becomes a user capability, not a programmer privilege.