Skip to content

Specifications

FieldValue
SuperchipNVIDIA GB10 Grace Blackwell
CPU20-core Arm: 10× Cortex-X925 + 10× Cortex-A725 (MediaTek IP)
GPU architectureBlackwell
CUDA cores6,144
Tensor Cores5th generation
RT Coresyes
Compute capabilitysm_121
Peak AI performanceup to 1 petaFLOP FP4 (theoretical, sparsity enabled)
FieldValue
System memory128 GB LPDDR5X
Memory modelunified and coherent across CPU and GPU
CPU↔GPU interconnectNVLink-C2C, ~6× PCIe Gen5 bandwidth
Storage4 TB NVMe
FieldValue
High-speeddual ConnectX-7 200GbE, QSFP56, ethernet-only
Management1GbE
WirelessWi-Fi
Approved CX-7 cablesAmphenol NJAAKK-N911, NJAAKK0006 (0.5 m); Luxshare LMTQF022-SD-R
FieldValue
Operating systemDGX OS (Ubuntu base)
ArchitectureARM64 / aarch64
Container runtimeNVIDIA Container Runtime for Docker
ConfigurationUnified memoryApprox. model size
Single Spark128 GBup to ~200B params
Two Sparks (200GbE)256 GBup to ~405B params
Quad stack512 GBlargest open-weight models; 4,000 TFLOPS FP4
PrecisionBytes/paramExample: 70B weights
FP162~140 GB
FP8 / 8-bit1~70 GB
FP4 / 4-bit0.5~35 GB

KV cache is additional and grows with context length and batch size. Budget for it on top of the weights.