Reservoir Sampling
Description
Reservoir Sampling is a step in the Statistics Plugin for Process Studio Workflows. The reservoir sampling step samples a specified number of rows from an input stream. The step uses uniform sampling; all incoming rows have an equal chance of being selected
Configurations
No. | Field Name | Description |
---|---|---|
S Tab | ||
1 | Step name | Specify the name of the step as it appears in the workflow workspace. This name has to be unique in a single workflow. |
2 | Sample size | Select how many rows to sample from an incoming stream.Setting a value of 0 will cause all rows to be sampled; setting a negative value will block all rows. |
3 | Random seed | Choose a seed for the random number generator. Repeating a workflow with a different value for the seed will result in a different random sample being chosen. |