Remove Duplicate Rows
Description
This activity removes duplicate rows from the dataset based on the specified column. If a duplicate value is found in the selected column, all occurrences except the first one will be removed from the output data.
Input
Data only
Output
Transformed data
Configuration Fields
Column Name The column used to identify duplicate values. Rows containing duplicate values in this column will be removed, retaining only the first occurrence.
Sample Input
ID | Name | Age | City |
---|---|---|---|
101 | John | 25 | New York |
102 | Alice | 30 | Chicago |
103 | John | 25 | New York |
104 | Bob | 40 | Boston |
105 | Alice | 30 | Chicago |
Sample Configuration
Sample Output
ID | Name | Age | City |
---|---|---|---|
101 | John | 25 | New York |
102 | Alice | 30 | Chicago |
104 | Bob | 40 | Boston |