Column Summary with Code Assist

The Transformations widget generates summary data for all columns in the dataset. These summaries are generated on the fly and then cached.

For this tutorial we’ll use the Palmer Penguins data.

  1. Launch the Transformations widget.

  2. Hover over one of the column names at the top of the data preview. A kebab menu appears. Hover over the kebab menu. A summary is generated and displayed.

  3. If the column contains categorical or string data (like island) then the summary will contain a count of (i) the total number of values, (ii) the number of missing values and (iii) the number of missing values. A small bar chart shows the relative frequency of each of the values.

    Summary of string or categorical columns
  4. For numerical columns (like bill_length_mm) the summary also contains the mean, standard deviation, minimum and maximum values along with a histogram of the distribution.

    Summary of numeric columns
  5. At the bottom of the summary there’s a drop column button.

  6. Pressing the drop column button removes the corresponding column from the data.