For this tutorial we’ll use the Titanic data.
-
The Titanic data have been assigned to a variable named
df
. -
Hover over the
icon. Select the Transformations item from the popup menu. -
The Transformations dialog appears. Select the target variable from the dropdown menu and preview the data.
-
Click on the kebab icon to the right of any of the
nan
values in theCabin
column. Then click on the Filter values like this popup button. -
A dialog appears with fields to choose a column, an operator and a value. The value is set to
nan
by default. -
Since we are wanting to exclude records with missing values, we change the operator from
==
to!=
. Press the button. -
The required code is inserted into the notebook and immediately executed.
![jupyter load data datasets titanic transformations filtered](/images/code-assist/transformations/python/jupyter-load-data-datasets-titanic-transformations-filtered.png)
-
Under the Column selector choose
Embarked
. SetOperator
to==
andValue
toS
. Turn theas string
toggle on. Press the button. -
The preview is updated to show only those records where
Embarked
is"S"
. -
Under the Column selector choose
Pclass
. SetOperator
to<
andValue
to3
. Press the button. -
The preview is updated to show only those records where
Embarked
is"S"
andPclass
is less than 3. Press the button. -
The required code is inserted into the notebook and immediately executed.
![jupyter titanic filter manual insert code](/images/code-assist/transformations/python/jupyter-titanic-filter-manual-insert-code.png)
-
Group and aggregate records to generate summary data.