In a data center, what is the purpose and benefit of a DPU?
Correct Answer: D
Explanation:
A Data Processing Unit (DPU) is a programmable processor that offloads, accelerates, and isolates infrastructure workloads like networking, storage, and security from the CPU. This enhances performance, reduces CPU overhead, and improves security by segregating tasks, benefiting AI data centers. It doesn’t handle backups or physical infrastructure directly, focusing instead on compute efficiency.
In training and inference architecture requirements, what is the main difference between training and inference?
Correct Answer: B
Explanation:
The primary distinction between training and inference lies in their operational demands. Training necessitates large amounts of data to iteratively optimize model parameters, often involving extensive datasets processed in batches across multiple GPUs to achieve convergence. Inference, however, is designed for real-time or low-latency processing, where trained models are deployed to make predictions on new inputs with minimal delay, typically requiring less data volume but high responsiveness. This fundamental difference shapes their respective architectural designs and resource allocations.
(Reference: NVIDIA AI Infrastructure and Operations Study Guide, Section on Training vs. Inference Requirements)
Demo Practice Mode
You are viewing only the questions marked as Demo.