<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Agent Economy - AI Infra</title><description>Compute, chips, data centers, and developer infrastructure powering the agent era.</description><link>https://agenteconomy.cn/</link><language>en-us</language><lastBuildDate>Tue, 19 May 2026 00:02:49 GMT</lastBuildDate><item><title>Modal cuts inference cold start times by 40x, pushing serverless GPU limits</title><link>https://agenteconomy.cn/en/blog/modal-cuts-inference-cold-starts/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/modal-cuts-inference-cold-starts/</guid><description>Modal details its engineering approach combining cloud buffers, custom filesystems, process checkpointing, and CUDA checkpointing to slash inference cold starts from minutes to tens of seconds.</description><pubDate>Tue, 19 May 2026 00:02:49 GMT</pubDate></item><item><title>AI Is Infrastructure, Not a Product</title><link>https://agenteconomy.cn/en/blog/ai-is-technology-not-product/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/ai-is-technology-not-product/</guid><description>John Gruber pushes back against the notion that Apple needs a &apos;killer AI product,&apos; arguing that AI is more like wireless networking — pervasive infrastructure, not a standalone product category.</description><pubDate>Mon, 18 May 2026 00:02:48 GMT</pubDate></item><item><title>Apple Silicon Local LLM Inference Costs 3x More Than Cloud APIs</title><link>https://agenteconomy.cn/en/blog/apple-silicon-costs-more-than-openrouter/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/apple-silicon-costs-more-than-openrouter/</guid><description>A data-driven analysis shows running local LLM inference on an M5 Max MacBook Pro costs ~3x more per million tokens than cloud inference via OpenRouter, while being 3-7x slower.</description><pubDate>Mon, 18 May 2026 00:02:48 GMT</pubDate></item><item><title>The US Is Winning the AI Commercialization Race — 
Infrastructure and Platform Ecosystems Are the Decisive Factors</title><link>https://agenteconomy.cn/en/blog/us-winning-ai-commercialization-race/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/us-winning-ai-commercialization-race/</guid><description>A widely discussed analysis argues that US AI leadership comes not from paper counts or engineers, but from full-stack integration spanning chips, data centers, cloud platforms, and developer ecosystems.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>Google Launches Googlebook AI-Native Laptop Line</title><link>https://agenteconomy.cn/en/blog/google-googlebook-ai-laptop/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/google-googlebook-ai-laptop/</guid><description>Google unveils Googlebook, a laptop series designed for Gemini Intelligence with Magic Pointer AI cursor, AI widget generation, and deep Android phone integration, shipping Fall 2026.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>Local AI Needs to Be the Norm</title><link>https://agenteconomy.cn/en/blog/local-ai-needs-to-be-norm/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/local-ai-needs-to-be-norm/</guid><description>Over-reliance on cloud AI APIs is creating fragile, privacy-invasive, and costly applications. 
On-device AI is not just feasible — it&apos;s a better path to trustworthy software.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>Anthropic Partners With SpaceX for 220,000+ NVIDIA GPU Compute Capacity</title><link>https://agenteconomy.cn/en/blog/anthropic-spacex-compute-deal/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/anthropic-spacex-compute-deal/</guid><description>Anthropic signs a deal with SpaceX to use all compute capacity at the Colossus 1 data center — over 300 megawatts and 220,000+ NVIDIA GPUs — while doubling Claude Code rate limits and raising Opus API caps.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>Computer Use Agents Cost 45x More Than Structured APIs</title><link>https://agenteconomy.cn/en/blog/computer-use-45x-cost-comparison/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/computer-use-45x-cost-comparison/</guid><description>A Reflex benchmark shows vision-based computer use costs 45x more than structured API calls for the same task, runs 50x slower, and produces highly variable results — hard data for agent architecture decisions.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>OpenAI Details Low Latency Voice AI Architecture at Scale</title><link>https://agenteconomy.cn/en/blog/openai-low-latency-voice-ai/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/openai-low-latency-voice-ai/</guid><description>OpenAI&apos;s engineering team published a technical deep-dive on rearchitecting their WebRTC stack with a Relay + Transceiver split architecture to serve real-time voice AI to over 900 million weekly active users.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>Google deepens its Anthropic bet to own both model access and compute demand</title><link>https://agenteconomy.cn/en/blog/google-anthropic-40-billion-bet/</link><guid 
isPermaLink="true">https://agenteconomy.cn/en/blog/google-anthropic-40-billion-bet/</guid><description>Google plans to invest up to $40 billion in Anthropic, with $10 billion up front and the rest tied to performance milestones. The bigger story is how the deal binds equity, cloud distribution, and TPU demand into a single infrastructure value chain.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>Google launches TorchTPU to make PyTorch migration smoother</title><link>https://agenteconomy.cn/en/blog/google-torchtpu-pytorch-native-tpu/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/google-torchtpu-pytorch-native-tpu/</guid><description>Google introduces TorchTPU to tie PyTorch ergonomics, XLA compilation, and TPU hardware more tightly together, with the explicit goal of reducing migration friction for developers.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>Deep learning may finally be approaching a real scientific theory</title><link>https://agenteconomy.cn/en/blog/scientific-theory-of-deep-learning/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/scientific-theory-of-deep-learning/</guid><description>A new arXiv review argues that deep learning is converging toward a falsifiable, quantitative theory centered on training dynamics, which the authors call learning mechanics. 
For the AI industry, that could shift model development from empiricism toward more predictable engineering.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>Google unveils eighth-generation TPUs with a dual-chip bet on the agent era</title><link>https://agenteconomy.cn/en/blog/google-eighth-generation-tpu-agentic-era/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/google-eighth-generation-tpu-agentic-era/</guid><description>Google’s TPU 8t and TPU 8i split training and inference into clearer product paths, reflecting how agent-era infrastructure now demands deeper specialization and system-level optimization.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>AI demand drives RAM shortage that could last for years</title><link>https://agenteconomy.cn/en/blog/the-ram-shortage-could-last-years-the-verge/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/the-ram-shortage-could-last-years-the-verge/</guid><description>According to Nikkei Asia, even as suppliers ramp up DRAM production, manufacturers are only expected to meet 60 percent of demand by the end of 2027.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>Mintlify ChromaFs: Virtual Filesystem for AI Assistants</title><link>https://agenteconomy.cn/en/blog/mintlify-chromafs-virtual-filesystem/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/mintlify-chromafs-virtual-filesystem/</guid><description>Reduced doc assistant boot time from 46s to 100ms, marginal cost from $0.0137 to $0. 
Virtual filesystem built on just-bash and Chroma DB.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>Project NOMAD: Free Open-Source Offline AI Server</title><link>https://agenteconomy.cn/en/blog/project-nomad-offline-ai-server/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/project-nomad-offline-ai-server/</guid><description>Free open-source offline server to run AI on your own computer. Perfect for emergency prep, off-grid living, or self-hosting.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item><item><title>TinyBox: Deep Learning Supercomputer Now Shipping</title><link>https://agenteconomy.cn/en/blog/tinygrad-tinybox/</link><guid isPermaLink="true">https://agenteconomy.cn/en/blog/tinygrad-tinybox/</guid><description>Tiny Corp launches TinyBox deep learning supercomputer with 4x 9070 XT for $12,000, now shipping.</description><pubDate>Fri, 15 May 2026 00:02:50 GMT</pubDate></item></channel></rss>