Merge Join
Description
Merge Join combines two sorted input data streams into a single output based on one or more common key fields. Use this step when you need to match and consolidate records from two sources — for example, linking customer orders with shipment details or enriching transaction data with reference information. It supports Inner, Left Outer, Right Outer, and Full Outer join types to control which rows appear in the result. Both input streams must be sorted on the join key before reaching this step.
Configurations
| Field Name | Description |
|---|---|
| Step name | Specify the name of the step. The step name should be unique within the workflow. |
| First Step | Used to specify first input step to the merge join (left). |
| Second Step | Used to specify second input step to the merge join (right). |
| Join Type | Select the type of Join. - INNER-Only rows having the same key in both datasets are included in the result. - LEFT OUTER-All rows from the first dataset are included in the result, along with empty values for non-matching keys in the second dataset. - RIGHT OUTER- All rows from the second dataset are included in the result, along with empty values for non-matching keys in the first dataset. - FULL OUTER- All rows from both datasets are included in the result, along with empty values for non-matching keys in both datasets. |
| Keys for 1st step | Used to specify key fields on which the incoming data is sorted. |
| Keys for 2nd step | Used to specify key fields on which the incoming data is sorted. |
| Buttons: | |
| Get key fields | When clicked, it retrieves a list of fields from the step specified. |