Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Info

View in Help Center (registration required)

Before building a predictive model it is recommended that recommended that you split the dataset into subsets.

...

The following datasets can be created:

  • the training set, used to identify patterns in the data and build the model,

  • the test set, used to assess the accuracy of the model and

  • the optional validation set, which can be used for tuning the model parameters.


Splitting methods

There are two distinct tasks for splitting datasets in Rulex:

Task name

Icon

Description

Corresponding page

Split Data

Image Removed

Splits the dataset randomly or sequentially.

Splitting Data with the Split Data Task

Data Manager

Image Removed

Splits datasets according to specified criteria.

Splitting Data with the Data Manager