You can purchase NVIDIA DGX systems through NVIDIA’s Partner Network. Install the DGX system in a hosting environment that has network access to the additional host and storage infrastructure required to run Domino.
If this is a new (greenfield) deployment of Domino:
- Install and configure a Kubernetes cluster that meets Domino’s cluster requirements, including network policies that allow secure communication between the pods that host Domino’s platform services and compute grid.
- Register the DGX with your Kubernetes API server as a worker node, with a node label consistent with your chosen naming conventions. The default node label for GPU-based worker nodes is default-gpu.
- Add the appropriate taints to your DGX node. Taints keep pods that do not tolerate them off the node, which reserves the DGX for GPU-based workloads running on Domino.
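The labeling and tainting steps can be sketched as a small Python helper that builds the corresponding `kubectl` commands. This is only an illustration under stated assumptions: the label key `dominodatalab.com/node-pool`, the taint `nvidia.com/gpu=true:NoExecute`, and the node name `dgx-01` do not come from the text above and should be checked against the cluster requirements for your Domino version. The second function shows the node selector and toleration that a GPU workload would need in order to be scheduled onto such a node.

```python
import shlex

# Assumed values, not taken from the text above: confirm them against the
# cluster requirements for your Domino version before use.
NODE_POOL_LABEL_KEY = "dominodatalab.com/node-pool"  # assumed label key
GPU_TAINT = "nvidia.com/gpu=true:NoExecute"          # assumed taint


def dgx_node_commands(node_name, pool="default-gpu"):
    """Return kubectl commands (as argument lists) that label and taint a node."""
    label_cmd = ["kubectl", "label", "nodes", node_name,
                 f"{NODE_POOL_LABEL_KEY}={pool}", "--overwrite"]
    taint_cmd = ["kubectl", "taint", "nodes", node_name,
                 GPU_TAINT, "--overwrite"]
    return [label_cmd, taint_cmd]


def gpu_pod_scheduling(pool="default-gpu"):
    """Return the pod-spec fields that match the label and tolerate the taint."""
    key, rest = GPU_TAINT.split("=", 1)
    value, effect = rest.split(":", 1)
    return {
        "nodeSelector": {NODE_POOL_LABEL_KEY: pool},
        "tolerations": [
            {"key": key, "operator": "Equal", "value": value, "effect": effect}
        ],
    }


if __name__ == "__main__":
    # "dgx-01" is a hypothetical node name used only for illustration.
    for cmd in dgx_node_commands("dgx-01"):
        print(shlex.join(cmd))
```

The helper only prints the commands so that they can be reviewed before they are run against a cluster. The scheduling fragment is included to show how the label and taint work together; it is not meant to describe how Domino itself constructs its pods.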
After the DGX is added to your API server and labeled properly, you can configure hardware tiers from within Domino’s Admin application.
Domino provides governance features from within this interface, supporting LDAP/AD federation or SSO-based attributes for managed access control and user execution quotas. Domino has also published a series of best practices to manage hardware tiers in your compute grid.