Package

weka.filters.unsupervised.instance

Synopsis

Produces a random subsample of a dataset using reservoir sampling (Algorithm R, as described by Vitter). The original dataset does not have to fit into main memory, but the reservoir does.
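
As a rough illustration of why only the reservoir has to fit in memory, the following is a minimal sketch of Algorithm R in plain Java. It is not the Weka source code: the class name ReservoirSketch and the method sample are hypothetical, and the parameters sampleSize and randomSeed are named only to echo the filter's options. The sketch assumes the stream holds fewer than Integer.MAX_VALUE items.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Random;

// Illustrative sketch only; this is NOT Weka's ReservoirSample implementation.
final class ReservoirSketch {

    // Draws up to sampleSize items uniformly at random from a stream whose
    // length is not known in advance. Only the reservoir is held in memory.
    static <T> List<T> sample(Iterator<T> stream, int sampleSize, long randomSeed) {
        if (sampleSize < 0) {
            throw new IllegalArgumentException("sampleSize must be non-negative");
        }
        Random random = new Random(randomSeed);
        List<T> reservoir = new ArrayList<>(sampleSize);
        int seen = 0; // number of items read so far (assumed to stay below Integer.MAX_VALUE)

        while (stream.hasNext()) {
            T item = stream.next();
            seen++;
            if (reservoir.size() < sampleSize) {
                // Fill phase: the first sampleSize items go straight into the reservoir.
                reservoir.add(item);
            } else {
                // Replacement phase: the item numbered 'seen' is kept with
                // probability sampleSize / seen, overwriting a uniformly chosen slot.
                int slot = random.nextInt(seen);
                if (slot < sampleSize) {
                    reservoir.set(slot, item);
                }
            }
        }
        return reservoir;
    }
}
```

In this sketch, a stream shorter than sampleSize is returned in full, and calling sample twice with the same randomSeed on the same input yields the same subsample.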

Options

The table below describes the options available for ReservoirSample.

Option       Description
randomSeed   The seed used for random sampling.
sampleSize   The size of the subsample (reservoir), i.e. the number of instances.

Capabilities

The table below describes the capabilities of ReservoirSample.

Capability           Supported
Class                Relational class, Binary class, No class, Date class, Numeric class, Nominal class, String class, Empty nominal class, Unary class, Missing class values
Attributes           Date attributes, Empty nominal attributes, Missing values, Unary attributes, String attributes, Nominal attributes, Relational attributes, Binary attributes, Numeric attributes
Min # of instances   0
