Hitachi Vantara Pentaho Community Wiki
Child pages
  • Resample (unsupervised)
Skip to end of metadata
Go to start of metadata

Package

weka.filters.unsupervised.instance

Synopsis

Produces a random subsample of a dataset using either sampling with replacement or without replacement. The original dataset must fit entirely in memory. The number of instances in the generated dataset may be specified. When used in batch mode, subsequent batches are NOT resampled.

Options

The table below describes the options available for Resample.

Option

Description

invertSelection

Inverts the selection (only if instances are drawn WITHOUT replacement).

noReplacement

Disables the replacement of instances.

randomSeed

The seed used for random sampling.

sampleSizePercent

Size of the subsample as a percentage of the original dataset.

Capabilities

The table below describes the capabilites of Resample.

Capability

Supported

Class

Date class, Binary class, Missing class values, No class, Nominal class, String class, Empty nominal class, Relational class, Unary class, Numeric class

Attributes

Missing values, String attributes, Numeric attributes, Empty nominal attributes, Binary attributes, Unary attributes, Nominal attributes, Date attributes, Relational attributes

Min # of instances

0

  • No labels