Description
The Split and Fold Columns activity is used to normalize a column that contains multiple values separated by a delimiter (e.g., comma, pipe). It splits the values into individual rows and duplicates the remaining column values for each new entry.
This transformation is helpful when your data contains lists or multi-valued fields that need to be processed or visualized independently.
Use this activity to:
- Convert comma-separated tags or categories into individual rows
- Prepare data for filtering or aggregation
- Normalize structured fields for downstream processing
Use case:
A dataset contains a “Tags” column with values like "Red,Blue,Green"
. Using this activity, each color becomes a separate row, making it easier to filter or analyze tag-specific data.
Type | Description |
---|
Data | A dataset where one column contains delimited values to be split into multiple rows. |
Output
Type | Description |
---|
Data | Transformed dataset with the selected column split into multiple rows, and other columns duplicated. |
Configuration Fields
Field Name | Description |
---|
Column To Split | Select the column whose values should be split into separate rows. |
Separator | The delimiter used to separate values within the selected column. Example: , for comma-separated values. |
ID | Name | Tags |
---|
101 | John | Red,Blue,Green |
102 | Alice | Yellow,Orange |
103 | Mark | Black |
104 | Emma | White,Gray,Silver |
105 | Bob | Pink |
Sample Configuration
Column To Split | Tags |
---|
Separator | , (comma) |
Sample Output
ID | Name | Tags |
---|
101 | John | Red |
101 | John | Blue |
101 | John | Green |
102 | Alice | Yellow |
102 | Alice | Orange |
103 | Mark | Black |
104 | Emma | White |
104 | Emma | Gray |
104 | Emma | Silver |
105 | Bob | Pink |