Skip to main content

Unique Rows (HashSet)

Description

Unique Rows (HashSet) is a step in the Transformation Plugin for Process Studio Workflows. The Unique Rows (HashSet) step removes duplicate rows and leaves only unique row occurrences. The Unique Rows (HashSet) step tracks duplicates in memory and does not require a sorted input to process duplicate rows.

Configurations

No.Field NameDescription
1Step nameName of the step as it appears in the workflow workspace. This name has to be unique in a single workflow.
2Compare using stored row valuesStores values for the selected fields in memory for every record. Storing row values requires more memory, but it prevents possible false positives if there are hash collisions.
3Redirect duplicate rowProcesses duplicate rows as an error and redirect rows to the error stream of the step. Requires you to set error handling for this step.
4Error descriptionSets the error handling description to display when duplicate rows are detected. Only available when Redirect duplicate row is checked.
5Fields to compare tableLists the fields to compare---no entries means the step compares an entire row