Both Sorter and Joiner transformations are used in ETL data processing, but they serve different purposes.
The Sorter Transformation is used to sort data in ascending or descending order based on specified key columns.
* Active Transformation → Can change the number of rows, because it discards duplicate rows when the Distinct option is enabled.
* Allows Sorting on Multiple Columns → You can prioritize sorting by multiple fields.
* Distinct Sorting Option → Can remove duplicates if configured.
* Uses Disk Storage for Large Data → If memory is insufficient, it spills over to disk.
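The Sorter behavior listed above can be illustrated with a small Python sketch. This is not the ETL tool's own implementation; the function name `sorter`, its `keys` and `distinct` parameters, and the sample rows are hypothetical names chosen for illustration. It assumes rows are dictionaries with hashable values, and it does not model spilling to disk.

```python
from typing import Iterable

Row = dict[str, object]


def sorter(rows: Iterable[Row], keys: list[tuple[str, bool]], distinct: bool = False) -> list[Row]:
    """Sort rows on several key columns, optionally dropping duplicate rows.

    keys is a list of (column, ascending) pairs in priority order.
    This is an in-memory illustration only (hypothetical helper).
    """
    out = list(rows)

    if distinct:
        # Keep the first occurrence of each fully identical row.
        seen = set()
        unique = []
        for row in out:
            fingerprint = tuple(sorted(row.items()))
            if fingerprint not in seen:
                seen.add(fingerprint)
                unique.append(row)
        out = unique

    # Python's sort is stable, so sorting from the lowest-priority key
    # to the highest yields a correct multi-column ordering.
    for column, ascending in reversed(keys):
        out.sort(key=lambda r: r[column], reverse=not ascending)
    return out


rows = [
    {"dept": "B", "salary": 100},
    {"dept": "A", "salary": 200},
    {"dept": "A", "salary": 300},
    {"dept": "A", "salary": 200},
]

# Sort by dept ascending, then salary descending.
sorted_rows = sorter(rows, [("dept", True), ("salary", False)])

# Same sort with duplicates removed: the output has fewer rows than the input.
distinct_rows = sorter(rows, [("dept", True), ("salary", False)], distinct=True)
```

With `distinct=True` the output has three rows instead of four, which is why the Sorter counts as an active transformation.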
The Joiner Transformation is used to combine data from two different sources based on a common key, similar to SQL joins.
* Active Transformation → Can change the number of rows, because rows that do not satisfy the join condition can be dropped.
* Supports Different Types of Joins → Normal and Outer (Left, Right, Full) joins.
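The join types can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the tool's implementation: the function `joiner`, the `join_type` values (`"normal"`, `"left"`, `"right"`, `"full"`), and the sample `customers` and `orders` data are all hypothetical, and the two sources are assumed to share a single key column.

```python
from collections import defaultdict


def joiner(left_rows, right_rows, key, join_type="normal"):
    """Join two lists of dict rows on a common key column (hypothetical helper)."""
    if join_type not in {"normal", "left", "right", "full"}:
        raise ValueError(f"unsupported join type: {join_type}")

    left_rows, right_rows = list(left_rows), list(right_rows)
    left_cols = {c for row in left_rows for c in row}
    right_cols = {c for row in right_rows for c in row}

    # Index the right source by key so each left row finds its matches quickly.
    index = defaultdict(list)
    for row in right_rows:
        index[row[key]].append(row)

    matched_keys = set()
    out = []
    for left in left_rows:
        matches = index.get(left[key], [])
        if matches:
            matched_keys.add(left[key])
            out.extend({**left, **right} for right in matches)
        elif join_type in ("left", "full"):
            # Keep the unmatched left row; right-side columns become None.
            out.append({**dict.fromkeys(right_cols), **left})

    if join_type in ("right", "full"):
        for right in right_rows:
            if right[key] not in matched_keys:
                # Keep the unmatched right row; left-side columns become None.
                out.append({**dict.fromkeys(left_cols), **right})
    return out


customers = [{"id": 1, "name": "Ann"}, {"id": 2, "name": "Bob"}]
orders = [{"id": 1, "total": 50}, {"id": 3, "total": 75}]

normal = joiner(customers, orders, "id")           # matching rows only
left = joiner(customers, orders, "id", "left")     # plus unmatched customers
right = joiner(customers, orders, "id", "right")   # plus unmatched orders
full = joiner(customers, orders, "id", "full")     # plus unmatched rows from both
```

In this sketch a normal join drops every unmatched row, while the outer variants keep unmatched rows from one or both sources and fill the missing columns with `None`.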
| Feature | Sorter Transformation | Joiner Transformation |
|---|---|---|
| Purpose | Sorts data based on specified keys. | Joins data from two sources based on a common key. |
| Type | Active Transformation (can drop duplicate rows). | Active Transformation (can drop unmatched records). |
| Output | Returns sorted records. | Returns combined records from two sources. |
| Data Sources | Works on a single data source. | Works on two different data sources. |
| Key Feature | Sorts in ascending or descending order. | Supports Normal and Outer (Left, Right, Full) joins. |