AMD Instinct MI355X Achieves MLPerf Inference v6.0 Gains with Over 1 Million Tokens per Second and Supports Scalable ROCm Stack
2 Articles
2 Articles
[Digital Daily Reporter Kim Moon-ki] AMD announced on the 1st (local time) that it has proven its technological prowess by breaking the barrier of processing 1 million tokens per second in the latest AI inference benchmark, 'MLPerf 6.0,' using its next-generation GPU, the 'Instinct MI355X.' For this benchmark, AMD deployed the 'Instinct MI355X' GPU, based on the 3nm process and CDNA 4 architecture. This product supports 288GB of HBM3E memory an…
AMD Instinct MI355X Achieves MLPerf Inference v6.0 Gains with Over 1 Million Tokens per Second and Supports Scalable ROCm Stack
AMD has released its MLPerf Inference v6.0 results, positioning the Instinct MI355X GPU as a scalable inference platform across single-node, multinode, and heterogeneous deployments. The submission extends beyond incremental gains by adding new workloads, demonstrating cluster-scale throughput exceeding 1 million tokens per second, and validating reproducibility across a growing partner ecosystem. CDNA 4 Architecture Targets High-Capacity Infere…
Coverage Details
Bias Distribution
- There is no tracked Bias information for the sources covering this story.
Factuality
To view factuality data please Upgrade to Premium

