Filter on Values

Description

The Filter on Values activity filters or flags rows based on specified values within selected columns. It supports a variety of matching techniques and normalization strategies to suit different data scenarios.

Use it to retain only relevant rows (e.g., keep only the “Electronics” category), flag suspect values, or exclude unwanted entries before further processing.


Input

| Type | Description |
|---|---|
| Data | Tabular dataset with one or more columns to filter |

Output

| Type | Description |
|---|---|
| Transformed Data | Filtered or flagged data based on value-matching criteria |

Configuration Fields

| Field Name | Required | Description |
|---|---|---|
| Columns | Yes | One or more columns to apply the filter condition. |
| Value | Yes | The value to match in the selected columns. |
| Matching Mode | Yes | Defines how the value is matched: Complete Value, Substring, or Regex. |
| Normalization Mode | No | Adjusts how comparisons are made: Case Sensitive, Case Insensitive, or Accent-Insensitive. |
| Flag Rows | No | If enabled, flags rows using a 0/1 indicator instead of filtering them out. |
| Flag Rows Action | Conditional | Rendered when Flag Rows is enabled. Determines flag logic: Flag if any column matches, or Flag if all columns match. |
| Flag Rows Column Name | Conditional | Name of the new column to store the flag. Required when Flag Rows is enabled. |
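
The Matching Mode and Normalization Mode options can be pictured with a small Python sketch. This is an illustration only, not the activity's actual implementation: the function names `strip_accents` and `value_matches` are hypothetical, accent-insensitive comparison is assumed to mean stripping combining marks after Unicode decomposition, and Regex mode is assumed to search anywhere in the cell.

```python
import re
import unicodedata


def strip_accents(text: str) -> str:
    """Assumed accent handling: decompose, then drop combining marks (é -> e)."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def value_matches(cell: object, value: str, matching_mode: str,
                  normalization_mode: str = "Case Sensitive") -> bool:
    """Return True if one cell matches the configured value (hypothetical helper)."""
    text = str(cell)
    if normalization_mode == "Accent-Insensitive":
        # Assumption: both sides are stripped of accents before comparing.
        text, value = strip_accents(text), strip_accents(value)
    ignore_case = normalization_mode == "Case Insensitive"

    if matching_mode == "Regex":
        # Assumption: the pattern may match anywhere in the cell.
        flags = re.IGNORECASE if ignore_case else 0
        return re.search(value, text, flags) is not None

    if ignore_case:
        text, value = text.casefold(), value.casefold()
    if matching_mode == "Complete Value":
        return text == value
    if matching_mode == "Substring":
        return value in text
    raise ValueError(f"Unknown matching mode: {matching_mode}")
```

Under these assumptions, `value_matches("electronics", "Electronics", "Complete Value", "Case Insensitive")` is true, while the same call with the default Case Sensitive mode is false.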

Sample Input

| Product ID | Product Name | Category | Price |
|---|---|---|---|
| 101 | Laptop | Electronics | 75000 |
| 102 | Phone | Electronics | 50000 |
| 103 | Chair | Furniture | 3000 |

Sample Configuration

| Field | Value |
|---|---|
| Columns | Category |
| Value | Electronics |
| Matching Mode | Complete Value |
| Normalization Mode | Case Insensitive |
| Flag Rows | Enabled |
| Flag Rows Action | Flag if any column matches |
| Flag Rows Column Name | IsElectronics |

Sample Output (Flag Rows Enabled)

| Product ID | Product Name | Category | Price | IsElectronics |
|---|---|---|---|---|
| 101 | Laptop | Electronics | 75000 | 1 |
| 102 | Phone | Electronics | 50000 | 1 |
| 103 | Chair | Furniture | 3000 | 0 |

Sample Output (Flag Rows Disabled)

| Product ID | Product Name | Category | Price |
|---|---|---|---|
| 101 | Laptop | Electronics | 75000 |
| 102 | Phone | Electronics | 50000 |
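
The two sample outputs can be reproduced with a short Python sketch over the sample input. It is a rough model rather than the activity's real logic: `filter_on_values` and its parameters are hypothetical names, only a case-insensitive complete-value match is implemented, and when Flag Rows is disabled the same any/all rule is assumed to decide which rows are kept.

```python
def filter_on_values(rows, columns, value, flag_rows=False,
                     flag_action="any", flag_column=None):
    """Filter or flag rows on a case-insensitive complete-value match (sketch)."""
    # "any" mirrors "Flag if any column matches"; "all" mirrors the other action.
    combine = any if flag_action == "any" else all
    target = value.casefold()
    result = []
    for row in rows:
        matched = combine(str(row[col]).casefold() == target for col in columns)
        if flag_rows:
            # Keep every row and record the match as a 0/1 indicator.
            result.append({**row, flag_column: int(matched)})
        elif matched:
            # Keep only matching rows.
            result.append(dict(row))
    return result


rows = [
    {"Product ID": 101, "Product Name": "Laptop", "Category": "Electronics", "Price": 75000},
    {"Product ID": 102, "Product Name": "Phone", "Category": "Electronics", "Price": 50000},
    {"Product ID": 103, "Product Name": "Chair", "Category": "Furniture", "Price": 3000},
]

flagged = filter_on_values(rows, ["Category"], "Electronics",
                           flag_rows=True, flag_column="IsElectronics")
filtered = filter_on_values(rows, ["Category"], "Electronics")

print([r["IsElectronics"] for r in flagged])  # [1, 1, 0]
print([r["Product ID"] for r in filtered])    # [101, 102]
```

With Flag Rows enabled all three rows survive and gain the `IsElectronics` column; with it disabled only products 101 and 102 remain, matching the tables above.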

Combine this with Extract Text, Normalize Text, or Lookup Replace for enhanced preprocessing before filtering values.