Train a model with AutoML

Learn how to train a low-code model with Domino AutoML. Select your dataset and specify the prediction task, AutoML generates code to prepare the data, test multiple algorithms, and optimize hyperparameters to find the best model for you.

Data scientists can use AutoML to speed up repetitive tasks and generate starting code for more advanced projects. Data analysts can use it to run initial ML experiments before working with data scientists to fine tune experiments.

Domino leverages FLAML (Fast and Lightweight AutoML), an open-source Python library that automates model selection and hyperparameter tuning.

Prerequisites

Transparent scaling with Domino AutoML

Unlike other AutoML tools, Domino provides immediate code access, eliminating the black-box approach. AutoML offers the following benefits:

  1. Time saving - Streamline model development by automating tasks like experiment setup and hyperparameter tuning.

  2. Accessibility - Empower data analysts to perform early-stage machine learning without extensive data science knowledge.

  3. Reproducibility and transparency - Maintain control over the entire process with full code visibility and ownership.

  4. Collaboration - Use code-first collaboration tools like Git, leveraging the code provided by AutoML.

Load sample data

  1. Start a Jupyter workspace to use AutoML.

  2. Hover over the Code Assist icon icon to show a popup menu.

  3. Select Load data > QUICK-START > iris to load the iris dataset.

Code Assist modal popup

Create a training experiment

AutoML currently supports classification, regression, and forecasting machine learning tasks.

  1. In the next cell of the Jupyter notebook, hover over the assistant icon icon and select AutoML > Classifications.

  2. Configure the AutoML experiment:

    1. Data frame - Select the dataframe variable that contains your dataset. In this case, select df.

    2. Predict value - Select the variable that you want to predict. In this case, select species.

    3. Features - Specify the data features that you want to train your model on. Remove species_id to make sure you don’t train on data that already contains the correct value.

      AutoML experiment options
  3. Expand the Training and Advanced settings to customize run parameters, algorithm selection, and hyperparameter tuning.

    AutoML advanced options, showing classification algorithm candidates
  4. Click RUN to add the training code to the notebook and run it.

View the experiment

Code Assist automatically adds MLflow tracking to let you view results and monitor progress. It uses the default experiment to track results. However, you can specify a different experiment by editing the experiment_name variable.

  1. To view the experiment, click open experiment. Note the Run-name indicated.

    You will see the parent iteration together with the child iterations.

    Graph view of the experiment run
  2. Click on the experiment name you noted earlier in the RUN column to view details of the experiment including parameters, metrics, outputs, and logs.

Edit the experiment

To edit the experiment, modify the code in the cell and rerun the code, or choose the Edit item from the Domino Code Assist context menu.

For more advanced edits, see the FLAML docs to learn more.