Machine learning is taking giant leaps at an unprecedented pace to advance innovations. The results of MLPerf Training v3.1, the latest round of MLPerf Training and HPC Benchmark, show 49X performance gains in just 5 Years. As a member of MLCommons, QCT also contributed to this progress with two submissions in the closed division. QCT’s submissions included tasks in Image Classification, Object Detection, Natural Language Processing, Speech Recognition, and Recommendation, all of which were successfully achieved by meeting the prescribed quality targets (see below) using its QuantaGrid D54U-3U and QuantaGrid D74H-7U.
Area | Benchmark | Dataset | Quality Target | Reference Implementation Model | Latest Version Available |
Vision | Image classification | ImageNet | 75.90% classification | ResNet-50 v1.5 | v3.1 |
Vision | Image segmentation (medical) | KiTS19 | 0.908 Mean DICE score | 3D U-Net | v3.1 |
Vision | Object detection (light weight) | Open Images | 34.0% mAP | RetinaNet | v3.1 |
Vision | Object detection (heavy weight) | COCO | 0.377 Box min AP and 0.339 Mask min AP | Mask R-CNN | v3.1 |
Language | Speech recognition | LibriSpeech | 0.058 Word Error Rate | RNN-T | v3.1 |
Language | NLP | Wikipedia 2020/01/01 | 0.72 Mask-LM accuracy | BERT-large | v3.1 |
Commerce | Recommendation | Criteo 4TB multi-hot | 0.8032 AUC | DLRM-dcnv2 | v3.1 |
Fig. 1. MLPerf Training v3.1 benchmarks that QCT submitted
The QuantaGrid D74H-7U is an 8-way GPU server equipped with the NVIDIA HGX H100 8-GPU Hopper SXM5 module, making it an ideal choice for compute-intensive AI training. With innovative hardware design and software optimization, the QuantaGrid D74H-7U server consistently delivers cutting-edge performance in training results.
The QuantaGrid D54U-3U, powered by 4th Gen Intel Xeon Scalable processors, is a 3U system featuring the capacity to accommodate up to four dual-width accelerator cards or up to eight single-width accelerator cards, along with 32 DIMM slots. This provides a comprehensive and flexible architecture that can be tailored to optimize various AI/HPC applications. In this round, the QuantaGrid D54U-3U Server, configured with four NVIDIA H100-PCIe-80GB accelerator cards, achieved outstanding performance.
QCT remains committed to delivering comprehensive hardware systems, solutions, and services to academic and industrial users. Moreover, we are dedicated to maintaining transparency by openly sharing our MLPerf results with the public, encompassing both training and inference benchmarks.For more detailed information, visit the official MLPerf Training results.