An instance filter that discretizes a range of numeric attributes in the dataset into nominal attributes. Discretization is by simple binning. Skips the class attribute if set.
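As a rough illustration of what "simple binning" means in the equal-width case, the sketch below computes cut points for one numeric attribute and maps a value to a bin. It is not the filter's actual source: the class name `EqualWidthSketch` and its methods are hypothetical, and the choice to place a value that falls exactly on a cut point in the lower bin is an assumption.

```java
import java.util.Arrays;

// Hypothetical illustration of equal-width binning; not taken from the Weka source.
class EqualWidthSketch {

    /** Returns the (bins - 1) interior cut points that split [min, max] into equal-width intervals. */
    static double[] cutPoints(double[] values, int bins) {
        if (bins < 1 || values.length == 0) {
            throw new IllegalArgumentException("need at least one bin and one value");
        }
        double min = Arrays.stream(values).min().getAsDouble();
        double max = Arrays.stream(values).max().getAsDouble();
        double width = (max - min) / bins;
        double[] cuts = new double[bins - 1];
        for (int i = 0; i < cuts.length; i++) {
            cuts[i] = min + width * (i + 1);
        }
        return cuts;
    }

    /** Maps a value to a zero-based bin index; a value equal to a cut point goes to the lower bin (assumption). */
    static int binIndex(double value, double[] cuts) {
        int index = 0;
        while (index < cuts.length && value > cuts[index]) {
            index++;
        }
        return index;
    }

    public static void main(String[] args) {
        double[] values = {0, 2.5, 5, 7.5, 10};
        double[] cuts = cutPoints(values, 4);
        System.out.println(Arrays.toString(cuts));
        for (double v : values) {
            System.out.println(v + " -> bin " + binIndex(v, cuts));
        }
    }
}
```

Each bin index would then become one value of the resulting nominal attribute. Equal-frequency binning differs only in how the cut points are chosen: they are placed so that each interval holds roughly the same weight of instances.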


The table below describes the options available for Discretize.

Option Description
attributeIndices Specifies the range of attributes to act on, as a comma-separated list of attribute indices; "first" and "last" are also valid values. Specify an inclusive range with "-", e.g. "first-3,5,6-10,last".
bins Number of bins.
desiredWeightOfInstancesPerInterval Sets the desired weight of instances per interval for equal-frequency binning.
findNumBins Optimizes the number of equal-width bins using leave-one-out estimation. Does not work for equal-frequency binning.
ignoreClass The class index will be unset temporarily before the filter is applied.
invertSelection Set attribute selection mode. If false, only selected (numeric) attributes in the range will be discretized; if true, only non-selected attributes will be discretized.
makeBinary Make resulting attributes binary.
useEqualFrequency If set to true, equal-frequency binning will be used instead of equal-width binning.
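The attributeIndices syntax and the effect of invertSelection can be sketched as follows. This is only an illustration of how a string such as "first-3,5,6-10,last" might be resolved, assuming indices are 1-based; the class `AttributeRangeSketch` and its methods are hypothetical and are not part of the filter. The real filter additionally restricts the selection to numeric attributes and skips the class attribute, which this sketch does not model.

```java
import java.util.TreeSet;

// Hypothetical illustration of the attributeIndices syntax; not the filter's own parser.
class AttributeRangeSketch {

    /** Resolves one token ("first", "last", or a 1-based number) to a 1-based index. */
    static int resolve(String token, int numAttributes) {
        String t = token.trim();
        if (t.equals("first")) {
            return 1;
        }
        if (t.equals("last")) {
            return numAttributes;
        }
        return Integer.parseInt(t);
    }

    /** Expands a range string into the sorted set of selected 1-based indices, optionally inverted. */
    static TreeSet<Integer> select(String spec, int numAttributes, boolean invert) {
        TreeSet<Integer> chosen = new TreeSet<>();
        for (String part : spec.split(",")) {
            // "6-10" yields two ends; a single index such as "5" is both ends at once.
            String[] ends = part.split("-");
            int lo = resolve(ends[0], numAttributes);
            int hi = resolve(ends[ends.length - 1], numAttributes);
            for (int i = lo; i <= hi; i++) {
                chosen.add(i);
            }
        }
        if (!invert) {
            return chosen;
        }
        // Inverted selection: every index that the range string did not name.
        TreeSet<Integer> rest = new TreeSet<>();
        for (int i = 1; i <= numAttributes; i++) {
            if (!chosen.contains(i)) {
                rest.add(i);
            }
        }
        return rest;
    }

    public static void main(String[] args) {
        System.out.println(select("first-3,5,6-10,last", 12, false));
        System.out.println(select("first-3,5,6-10,last", 12, true));
    }
}
```

For a dataset with 12 attributes, the example string selects attributes 1 to 3, 5 to 10, and 12; with the selection inverted, only attributes 4 and 11 remain.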


The table below describes the capabilities of Discretize.

Capability Supported
Class String class, Binary class, Relational class, Missing class values, Date class, Empty nominal class, No class, Numeric class, Nominal class, Unary class
Attributes Nominal attributes, Numeric attributes, Missing values, Relational attributes, Empty nominal attributes, Unary attributes, Date attributes, String attributes, Binary attributes
Min # of instances 0

This documentation is maintained by the Pentaho community, and members are encouraged to create new pages in the appropriate spaces, or edit existing pages that need to be corrected or updated.
