Hardware

56 Posts

Aerial view of expansive data centers with orange roofs, representing massive AI infrastructure projects.

OpenAI On the Road to Trillion-Dollar Spending: OpenAI partners with Oracle, Nvidia, Softbank, and more to build out 20 gigawatts of data center capacity

A flurry of announcements brought into sharper focus OpenAI’s plans to build what may amount to trillions of dollars of global computing capacity.

Swarms of military drones flying in formation over a field, showing autonomous AI-driven coordination in warfare.

Hardware

Drone Swarms Go to War: Ukraine experiments with small groups of low-contact, high-autonomy drones that strike on initiative

Swarms of drones that coordinate with one another autonomously have become a battlefield staple in Ukraine.

Diagram of Qwen3-Next architecture with Mixture of Experts, Gated Attention, and Gated DeltaNet layers.

Hardware

Qwen3-Next Accelerates: Alibaba’s new model uses hybrid attention layers and a sparse MoE architecture for speed and performance

Alibaba updated its popular Qwen3 open-weights models with a number of fresh, speed-boosting tweaks.

Diagram comparing sliding window attention and ATLAS memory, showing wider context tracking in ATLAS.

Hardware

10 Million Tokens of Input Context: ATLAS, a transformer-like architecture, can process a context window as large as ten million tokens

An alternative to attention enables large language models to track relationships among words across extraordinarily wide spans of text.

Hand holding a Pixel 10 smartphone using Google’s Magic Cue AI assistant, which suggests a location reply in a text message conversation.

Hardware

Proactive AI Assistance for Phones: Inside Magic Cue, Google’s new AI assistant for Pixel 10

Google’s latest smartphone sports an AI assistant that anticipates the user’s needs and presents helpful information without prompting.

Aerial view of a large, partially constructed data center surrounded by parked vehicles and red soil in Abilene, Texas

Hardware

OpenAI Turns to Oracle for Compute: A new $30 billion, 4.5 gigawatt data center offshoot of the Stargate Project

OpenAI is working with Oracle to build its next chunk of processing power, a $30 billion outgrowth of the partners’ $500 billion Stargate project and a sign of OpenAI’s ongoing thirst for computation.

Illustration of Nvidia H20 and Huawei Ascend 910 AI chips on wood with a Chinese flag, symbolizing China’s shift from U.S. AI processors to domestic development amid geopolitical tensions.

Hardware

China Reconsiders U.S. AI Processors: Nvidia and AMD must reassure China their high-end GPUs don't pose security risk

Nvidia and AMD, having obtained the U.S. government’s permission to resume selling AI processors in China, received a cool welcome there.

Meta Aria Gen 2 smart glasses for AI research, equipped with cameras, microphones, and other sensors for real-time data capture.

Hardware

Meta’s Smart Glasses Come Into Focus: Meta reveals further details of Aria Gen 2 smart glasses for multisensory AI research

Meta revealed new details about its latest Aria eyeglasses, which aim to give AI models a streaming, multisensory, human perspective.

Hardware

Amazon’s Constellation of Compute: Amazon plans to spend tens of billions on AI infrastructure with Project Rainier

Amazon revealed new details of its plan to build a constellation of massive data centers and connect them into an “ultracluster.” Customer Number One: Anthropic.

BitNet b1.58 matrix multiplication shows ternary weights enabling faster neural network computation.

Hardware

Low Precision, High Performance: Researchers at Microsoft and Tsinghua researchers propose 1.58-bit AI model that rivals full-precision competitors

Reducing the number of bits used to represent each parameter in a neural network from, say, 16 bits to 8 bits shrinks the network’s size and boosts its speed. Researchers took this approach to an extreme: They built a competitive large language model whose weights are limited to three values.

Bar chart comparing electricity use by various text-generation models: very small, small, medium-sized, large MoE, and large reasoning.

Hardware

AI Uses Energy, AI Saves Energy: The International Energy Agency examines the energy costs and potential savings of the AI boom

AI’s thirst for energy is growing, but the technology also could help produce huge energy savings over the next five to 10 years, according to a recent report.

DeepSeek computation diagram showing transformer blocks, multi-head attention, and routing, using FP8 and BF16 precision.

Hardware

How DeepSeek Did It: Researchers describe training methods and hardware choices for DeepSeek’s V3 and R1 models

DeepSeek made headlines late last year, when it built a state-of-the-art, open-weights large language model at a cost far lower than usual. The upstart developer shared new details about its method.

Diagram of FP4 training scheme showing BF16 tensor quantization and FP4 tensor core processing for efficient computation.

Hardware

4-Bit Efficiency, 16-Bit Accuracy: Microsoft researchers show that heavily quantized versions of Llama can perform as well as near-full-precision

Using an 8-bit number format like FP8 during training saves computation compared to 16- or 32-bit formats, but it can yield less-accurate results. Researchers trained models using 4-bit numbers without sacrificing accuracy.

U.S. and Saudi flags waving against a microchip background

Hardware

U.S. to Supply Middle Eastern AI Hubs: Nvidia, AMD, Amazon, and others strike deals with Saudi Arabia’s Humain and G42 in the UAE

The United States government announced sweeping agreements to sell tens of billions of dollars worth of AI technology and services to Saudi Arabia and the United Arab Emirates.

Person interacting with a humanoid robot using virtual reality headset and controllers.

Hardware

Hugging Face Rolls Out Open Robot: Hugging Face acquires Pollen Robotics, launches Reachy 2 robot for open-source research

Hugging Face has made a name by providing open AI models. Now it’s providing an open robot.

Hardware

OpenAI On the Road to Trillion-Dollar Spending: OpenAI partners with Oracle, Nvidia, Softbank, and more to build out 20 gigawatts of data center capacity

Drone Swarms Go to War: Ukraine experiments with small groups of low-contact, high-autonomy drones that strike on initiative

Qwen3-Next Accelerates: Alibaba’s new model uses hybrid attention layers and a sparse MoE architecture for speed and performance

10 Million Tokens of Input Context: ATLAS, a transformer-like architecture, can process a context window as large as ten million tokens

Proactive AI Assistance for Phones: Inside Magic Cue, Google’s new AI assistant for Pixel 10

OpenAI Turns to Oracle for Compute: A new $30 billion, 4.5 gigawatt data center offshoot of the Stargate Project

China Reconsiders U.S. AI Processors: Nvidia and AMD must reassure China their high-end GPUs don't pose security risk

Meta’s Smart Glasses Come Into Focus: Meta reveals further details of Aria Gen 2 smart glasses for multisensory AI research

Amazon’s Constellation of Compute: Amazon plans to spend tens of billions on AI infrastructure with Project Rainier

Low Precision, High Performance: Researchers at Microsoft and Tsinghua researchers propose 1.58-bit AI model that rivals full-precision competitors

AI Uses Energy, AI Saves Energy: The International Energy Agency examines the energy costs and potential savings of the AI boom

How DeepSeek Did It: Researchers describe training methods and hardware choices for DeepSeek’s V3 and R1 models

4-Bit Efficiency, 16-Bit Accuracy: Microsoft researchers show that heavily quantized versions of Llama can perform as well as near-full-precision

U.S. to Supply Middle Eastern AI Hubs: Nvidia, AMD, Amazon, and others strike deals with Saudi Arabia’s Humain and G42 in the UAE

Hugging Face Rolls Out Open Robot: Hugging Face acquires Pollen Robotics, launches Reachy 2 robot for open-source research

Subscribe to The Batch