NVIDIA NGC
The NGC catalog hosts containers for the top AI and data science software for HPC applications and data analytics. NGC catalog containers provide powerful and easy-to-deploy software proven to deliver the fastest results, allowing users to build solutions from a tested framework, with complete control.
The goal of this page is to show how to build and run NVIDIA GPU Cloud (NGC) containers on Discovery.
Building NGC Container
To build an NGC container, follow these steps:
- Create an account at the NVIDIA NGC - SignUp page.
- Generate an NGC API key at the NVIDIA NGC - Setup page.
- Configure your working environment on Discovery so that Apptainer can find your NGC container registry credentials (the user name and the API key generated in the previous step). To do so, execute the export commands below in your terminal, replacing <API_key> with your own key. The user name is the literal string $oauthtoken; the single quotes keep the shell from expanding it.
  export APPTAINER_DOCKER_USERNAME='$oauthtoken'
  export APPTAINER_DOCKER_PASSWORD=<API_key>
- Pull a container image from NVIDIA NGC. To do so, go to the Container section at NVIDIA NGC - Containers, then browse the list of containers to find the image you want. Once you have found it, click on it, and at the top right of your screen click the Pull tag button to copy the pull command to the clipboard. For this tutorial, the NVIDIA CUDA container image has been selected.
- Finally, execute the command below in your terminal so that Apptainer retrieves the Docker image from NGC and converts it into a SIF image. For example, to pull and build the PyTorch NGC container, run:
  apptainer build pytorch.sif docker://nvcr.io/nvidia/pytorch:21.09-py3
For more information, see also the NGC Private Registry User Guide.
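The authentication and build steps above can be sketched as a small shell script. This is only a minimal sketch under stated assumptions: NGC_KEY_FILE, the ~/.ngc_api_key location, and the ngc_uri and ngc_build helpers are hypothetical names introduced here for illustration. Only the APPTAINER_DOCKER_* variables, the nvcr.io registry, and the apptainer build command come from the steps above.

```shell
# Sketch: authenticate to nvcr.io and build a SIF image with Apptainer.
# NGC_KEY_FILE, ngc_uri, and ngc_build are hypothetical names, not part of NGC or Apptainer.

# Assumed location of a file holding only the API key (keeps the key out of shell history).
NGC_KEY_FILE="${NGC_KEY_FILE:-$HOME/.ngc_api_key}"

# The user name is the literal string $oauthtoken; single quotes prevent expansion.
export APPTAINER_DOCKER_USERNAME='$oauthtoken'

# Load the API key from the file if it exists and is readable.
if [ -r "$NGC_KEY_FILE" ]; then
    APPTAINER_DOCKER_PASSWORD="$(cat "$NGC_KEY_FILE")"
    export APPTAINER_DOCKER_PASSWORD
fi

# Compose a docker:// URI from a repository path ($1) and a tag ($2).
ngc_uri() {
    printf 'docker://nvcr.io/%s:%s\n' "$1" "$2"
}

# Build <output>.sif ($1) from a repository path ($2) and a tag ($3).
ngc_build() {
    if ! command -v apptainer >/dev/null 2>&1; then
        echo "apptainer not found in PATH" >&2
        return 1
    fi
    apptainer build "$1.sif" "$(ngc_uri "$2" "$3")"
}

# Example (same image as above): ngc_build pytorch nvidia/pytorch 21.09-py3
```

Reading the key from a file with restrictive permissions (for example, chmod 600) is one way to avoid pasting it into the terminal; exporting it by hand as shown in the steps works equally well.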
Running NGC Containers
To integrate NGC containers into a Slurm job and run them on Discovery, refer to Using Containers.
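As a rough illustration of what such a job might look like, the sketch below writes a minimal Slurm batch script that runs a command inside pytorch.sif. The helper name write_ngc_job, the job name, the time limit, and the GPU request are assumptions made for this example. Partition, account, and module settings specific to Discovery are omitted, so treat Using Containers as the authoritative reference.

```shell
# Hypothetical helper: writes a minimal Slurm batch script to the path given as $1.
write_ngc_job() {
    cat > "$1" <<'EOF'
#!/bin/bash
# Assumption: one GPU is requested through the generic resource (gres) mechanism.
#SBATCH --job-name=ngc-pytorch
#SBATCH --gres=gpu:1
#SBATCH --time=00:10:00
#SBATCH --output=ngc-%j.out

# --nv makes the host NVIDIA driver and GPUs available inside the container.
apptainer exec --nv pytorch.sif python -c 'import torch; print(torch.cuda.is_available())'
EOF
}

# Example: write_ngc_job ngc_job.sh, then submit with: sbatch ngc_job.sh
```

The quoted heredoc delimiter keeps the shell from expanding anything inside the generated script, so the file contains the text exactly as written.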