Skip to main content

Reservoir Sampling

Description

Reservoir Sampling is a step in the Statistics Plugin for Process Studio Workflows. The reservoir sampling step samples a specified number of rows from an input stream. The step uses uniform sampling; all incoming rows have an equal chance of being selected

Configurations

No.Field NameDescription
S Tab
1Step nameSpecify the name of the step as it appears in the workflow workspace. This name has to be unique in a single workflow.
2Sample sizeSelect how many rows to sample from an incoming stream.Setting a value of 0 will cause all rows to be sampled; setting a negative value will block all rows.
3Random seedChoose a seed for the random number generator. Repeating a workflow with a different value for the seed will result in a different random sample being chosen.