NVIDIA Vera CPU Launched for Agentic AI Workloads

NVIDIA has introduced the Vera CPU, a processor built specifically for agentic AI and reinforcement learning workloads. According to NVIDIA, it delivers results with 50% faster performance and twice the efficiency compared to traditional rack-scale CPUs.
Technical Specifications
The Vera CPU features 88 custom NVIDIA-designed Olympus cores, each capable of running two tasks using NVIDIA Spatial Multithreading. It includes a high-bandwidth memory subsystem built on LPDDR5X memory and uses the second-generation NVIDIA Scalable Coherency Fabric for faster agentic responses under high utilization conditions.
System Configurations
- New Vera CPU rack integrates 256 liquid-cooled Vera CPUs
- Sustains more than 22,500 concurrent CPU environments running independently at full performance
- Built using NVIDIA MGX modular reference architecture
- Part of NVIDIA Vera Rubin NVL72 platform with NVIDIA GPUs connected via NVIDIA NVLink-C2C interconnect
- Provides 1.8 TB/s of coherent bandwidth (7x PCIe Gen 6 bandwidth)
- Also serves as host CPU for NVIDIA HGX Rubin NVL8 systems
- Systems integrate NVIDIA ConnectX SuperNIC cards and NVIDIA BlueField-4 DPUs
Adoption and Partners
Customers collaborating with NVIDIA to deploy Vera CPU include Alibaba, ByteDance, Meta, Oracle Cloud Infrastructure, CoreWeave, Lambda, Nebius, and Nscale. Manufacturing partners include Dell Technologies, HPE, Lenovo, Supermicro, ASUS, Compal, Foxconn, GIGABYTE, Pegatron, Quanta Cloud Technology (QCT), Wistron, and Wiwynn.
Target Workloads
Vera systems are designed for reinforcement learning, agentic inference, data processing, orchestration, storage management, cloud applications, and high-performance computing. Systems partners provide both dual and single-socket CPU server configurations.
According to Jensen Huang, NVIDIA's CEO, "The CPU is no longer simply supporting the model; it's driving it. With breakthrough performance and energy efficiency, Vera unlocks AI systems that think faster and scale further."
📖 Read the full source: HN AI Agents
👀 See Also

Pentagon to adopt Palantir AI as core US military system
The Pentagon plans to adopt Palantir's AI technology as a core system for the US military, according to a memo. The Reuters article generated 47 points and 2 comments on Hacker News.

Meta to capture employee mouse movements and keystrokes for AI training
Meta plans to start capturing employee mouse movements and keystrokes for AI training data, according to a Reuters report. The article has generated discussion on Hacker News with 33 points and 7 comments.

1-Bit Bonsai Image 4B: On-Device Image Generation via Binary/Ternary FLUX.2
PrismML releases Bonsai Image 4B, a binary (1.125-bit) and ternary (1.71-bit) FLUX.2 Klein 4B variant that shrinks the diffusion transformer to 0.93 GB / 1.21 GB, enabling 512x512 image generation on iPhone 17 Pro Max in 9.4 seconds.

Wikipedia's AI Policy: LLMs Banned for Article Creation, Exceptions for Copyediting and Translation
Wikipedia prohibits using LLMs to generate or rewrite articles, with narrow exceptions for basic copyediting and translation. Violations can lead to speedy deletion (G15) and removal of AI-generated comments from talk pages.