Skip to main content

Sort Rows

Description

Sort rows, is a step in the Transformation Plugin for Process Studio Workflows. The Sort rows step sorts rows on the fields specified and on whether they should be sorted in ascending or descending order.Process studio sorts rows using temporary files when the number of rows exceeds the default limit of 1 million rows. In case you get an out of memory exception (OOME), you need to lower this limit threshold or change your available memory.

Configurations

No.Field NameDescription
1Step nameName of the step as it appears in the workflow workspace. This name has to be unique in a single workflow.
2Sort directoryThe directory in which the temporary files are stored in case when needed; the default is the standard temporary directory for the system
3TMP-file prefixChoose an easily recognized prefix so you can identify the files when they show up in the temp directory.
4Sort sizeThe more rows you store in memory, the faster the sorting process because fewer temporary files must be used and less I/O is generated.
5Free memory threshold (in %)If the sort algorithm finds that it has less available free memory than the indicated number, it will start to page data to disk.

Note: This is not exact science, because:

- This is checked every 1000 rows. Depending on the row size and other steps within complex workflows this could still lead to an OutOfMemoryError.

- In a Java Virtual Machine it's not possible to know the exact amount of free memory. As such it is not recommended to use this step for very complex workflows with other steps and processes that use up a lot of memory.

6Compress TMP FilesCompresses temporary files when they are needed to complete the sort.
7Only pass unique rows?Enable if you want to pass unique rows only to the output stream(s).
8Fields tableSpecify the fields and direction (ascending/descending) to sort. You can specify whether to perform a case sensitive sort (optional)
9Get FieldsClick to retrieve a list of all fields coming in on the stream(s).