domino logo
5.1
  • About Domino
  • Architecture
  • Domino infrastructure
  • Kubernetes infrastructure
  • Installation
  • Configuration reference
  • Security and compliance
  • Authentication and authorization
  • Projects
  • Manage data
  • Models
  • Environments
  • Monitoring
  • Audits and logs
  • Control Center
  • Notifications
  • Disaster recovery
  • Backup and restore
  • Troubleshooting
  • Get help
  • Run the admin toolkit
  • MongoDB console
domino logo
About Domino
Domino Data LabKnowledge BaseData Science BlogTraining
>
Admin guide
>
Models
>
Create Model Resource Quotas

Create Model Resource Quotas

You set the resource quotas for the hardware specifications of the pods that host Model APIs. A resource quota determines the CPU and memory resources available to the Model that uses it. Resource quotas are different than the hardware tiers Domino Runs use.

Create and edit resource quotas
  1. From the admin home, go to Advanced > Resource Quotas.

  2. Click New to create a resource quota, or click Edit to modify an existing resource quota or set a default resource quota. Resource quotas cannot be permanently deleted.

  3. When you create or edit a Resource quotas, set the following properties:

    • Visible: Leave this checkbox selected for a resource quota to be listed in the menu for users publishing Models. Clear the checkbox to make a resource quota unavailable for use.

    • Memory (GB) - Request: The amount of RAM reserved for a model with this quota.

    • Memory (GB) - Limit: If the hosting node has RAM available, a model running this quota can make use of additional memory up to this limit.

    • CPU (# of Cores) - Request: The number of cores reserved for a Model with this quota.

    • CPU (# of Cores) - Limit: If the hosting node has idle cores available, a model running this quota can use additional cores up to this limit.

    • Default: The resource quota with this set to true will be used for all newly published Models by default.

    Your users can now set the Resource Quotes for their deployments. To do this, they must go to the Model’s deployment page. Then, go to the Compute resources per instance menu in the Resource Quota section to set resource quotas.

    + deployment configuration

  4. Click Create.

Domino Data Lab
Knowledge Base
Data Science Blog
Training
Copyright © 2023 Domino Data Lab. All rights reserved.