Throughput

Criminals flip Hexstrike-AI, shrinking zero-day exploits to minutes

A cybersecurity tool designed to help defenders was flipped by criminals within hours, collapsing zero-day exploit timelines from weeks to minutes. The Hexstrike-AI weaponization signals a fundamental shift in attack capabilities.

Google keeps Chrome as judge bets on AI competition

Judge spares Google from breakup but forces data sharing with rivals. The twist: AI has fundamentally changed search competition since the DOJ sued in 2020, leading to a remedy that bets on technological disruption over structural fixes.

Category: Hardware & Infrastructure

Definition

Throughput measures the number of AI inference requests a system can process per unit time, typically expressed as queries per second or tokens per second.

How It Works

Throughput depends on model size, batch processing efficiency, and hardware capabilities. Systems optimize throughput through batching, parallel processing, and efficient scheduling.

Load balancing across multiple GPUs or instances increases aggregate throughput for production systems.

Why It Matters

High throughput reduces serving costs and enables AI systems to handle millions of users. It's the key metric for production AI deployments.

Improving throughput by 10x can make previously uneconomical AI applications viable at scale.

← Back to Hardware & Infrastructure | All Terms

AI Accelerates Everything (Including the Bad Stuff)