Container
vLLM is a fast and easy-to-use library for LLM inference and serving. The NVIDIA vLLM NGC Container is optimized for GPU acceleration, and contains a validated set of libraries that enable and optimize GPU performance.
26.03.post1-py3
Signed
This image has a digital signature verifying that it has not been altered or corrupted since its signing.
Scanned
No malware was found in this artifact.
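Once the image path for this tag has been copied from the catalog page, pulling the image might look like the following minimal sketch. It only assembles a standard `docker pull` command for the tag listed above. The helper name `build_pull_command` and the path `registry.example.com/vllm` are hypothetical placeholders, not the real image path, which is not reproduced on this page.

```python
import shlex

# Tag listed on this page.
TAG = "26.03.post1-py3"


def build_pull_command(image_path: str, tag: str = TAG) -> list[str]:
    """Return the argument list for `docker pull <image_path>:<tag>`.

    `image_path` is the repository path without a tag. The caller supplies
    it; this sketch does not know the real path.
    """
    # Reject an empty path or one whose last segment already carries a tag.
    if not image_path or ":" in image_path.rsplit("/", 1)[-1]:
        raise ValueError("image_path must be a repository path without a tag")
    return ["docker", "pull", f"{image_path}:{tag}"]


if __name__ == "__main__":
    # Hypothetical placeholder: substitute the path copied from the catalog.
    cmd = build_pull_command("registry.example.com/vllm")
    print(shlex.join(cmd))
```

The command is returned as a list so it can be passed directly to `subprocess.run` without shell quoting concerns; printing it first makes it easy to check the path and tag before pulling.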