This is a technical quick-start gist for the latest Red Hat AI Inference Server (RHAIIS) preview image, featuring NVIDIA Nemotron v3 Nano 30B-A3B models on vLLM.
Preview image tag (this release):
registry.redhat.io/rhaiis-preview/vllm-cuda-rhel9:nvidia-nemotron-v3
Upstream model family (Hugging Face):
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16