Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Grouping consists in identifying subsets of cases having the same value of a certain variable or set of variables.

Grouping can be coupled with Apply operations (see Applying Operations on Data in the Query Manager) in order to apply operators on each group individually.

...

Group Mode

Description

Icon

Contracted

Each group is shown as a single record, namely the first one (i.e. that having the lowest value of row index)

Image Removed

Expanded

All the rows are shown, divided in groups.

Image Removed


Info

Changes you make can be committed run-time or on request.

...

  1. Drag and drop the attribute you want to group by onto any cell of the Group column: when the attribute is dropped on the Group column the subdivision in groups in automatically computed. More than one attribute at a time can be selected.

    Image Removed


    Image Added

  2. To toggle between the contracted and expanded grouping mode, right-click one of the attributes in the Group column, and select/deselect Expand

  3. Save and compute the task.

...

The following example is based on the Adult dataset.

Info

Sample Datasets

Scenario data can be found in the /wiki/spaces/UM0302/pages/299794555 folder in your Rulex installation.

We have grouped the data according to these variables: 

...

The number of groups is 2219 since each possible combination of age, workclass and native-country represents a group. For example, the first group consists of 39 year-old people working as State-gov and born in the United-States and so on. Adding more attributes will result in a higher number of groups.

Image RemovedImage Added