Skip to content

Remove Duplicate Rows

Description

This activity removes duplicate rows from the dataset based on the specified column. If a duplicate value is found in the selected column, all occurrences except the first one will be removed from the output data.

Input

Data only

Output

Transformed data

Configuration Fields

Column Name The column used to identify duplicate values. Rows containing duplicate values in this column will be removed, retaining only the first occurrence.

Sample Input

IDNameAgeCity
101John25New York
102Alice30Chicago
103John25New York
104Bob40Boston
105Alice30Chicago

Sample Configuration

alt text

Sample Output

IDNameAgeCity
101John25New York
102Alice30Chicago
104Bob40Boston