Skip to main content

Merge Join

Description

Merge Join combines two sorted input data streams into a single output based on one or more common key fields. Use this step when you need to match and consolidate records from two sources — for example, linking customer orders with shipment details or enriching transaction data with reference information. It supports Inner, Left Outer, Right Outer, and Full Outer join types to control which rows appear in the result. Both input streams must be sorted on the join key before reaching this step.

Configurations

Field NameDescription
Step nameSpecify the name of the step. The step name should be unique within the workflow.
First StepUsed to specify first input step to the merge join (left).
Second StepUsed to specify second input step to the merge join (right).
Join TypeSelect the type of Join.

- INNER-Only rows having the same key in both datasets are included in the result.

- LEFT OUTER-All rows from the first dataset are included in the result, along with empty values for non-matching keys in the second dataset.

- RIGHT OUTER- All rows from the second dataset are included in the result, along with empty values for non-matching keys in the first dataset.

- FULL OUTER- All rows from both datasets are included in the result, along with empty values for non-matching keys in both datasets.

Keys for 1st stepUsed to specify key fields on which the incoming data is sorted.
Keys for 2nd stepUsed to specify key fields on which the incoming data is sorted.
Buttons:
Get key fieldsWhen clicked, it retrieves a list of fields from the step specified.