Find text

Description

The Find Text activity extracts specific portions of text from selected columns based on a user-defined regex pattern. It is useful for parsing structured tokens, keywords, codes, or patterns from unstructured text.

Use Case
Extract keywords such as product codes, IDs, or tags from a sentence or description field using regular expressions.


Input

| Type | Description |
| --- | --- |
| Data | Input dataset containing text columns |

Output

| Type | Description |
| --- | --- |
| Transformed Data | New columns with extracted values from patterns. |

Configuration Fields

| Field Name | Required | Description |
| --- | --- | --- |
| Columns To Find | Yes | Column(s) from which the text will be extracted using regex. |
| Pattern | Yes | Regular expression used to extract matching portions from the column text. |
| Output Columns Prefix | Yes | Prefix used when creating new output columns for extracted matches. |
| Include Original | No | If enabled, original columns will be included in the output. |

Sample Input

| ID | Description |
| --- | --- |
| 1 | This contains ABC and XYZ |
| 2 | Find CODE inside this text |
| 3 | No pattern matches here |
| 4 | Extract INFO and DATA points |
| 5 | SAMPLE test for extraction |

Sample Configuration

| Field | Value |
| --- | --- |
| Columns To Find | Description |
| Pattern | ([A-Z]{3,}) |
| Output Columns Prefix | Column_ |
| Include Original | Enabled |

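Explanation: This regex matches every run of three or more consecutive uppercase letters. Each match found in a row is written to its own output column, numbered in order of appearance (Column_1, Column_2, and so on). Rows with no match produce empty output cells.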


Sample Output

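| ID | Description | Column_1 | Column_2 |
| --- | --- | --- | --- |
| 1 | This contains ABC and XYZ | ABC | XYZ |
| 2 | Find CODE inside this text | CODE | |
| 3 | No pattern matches here | | |
| 4 | Extract INFO and DATA points | INFO | DATA |
| 5 | SAMPLE test for extraction | SAMPLE | |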
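
To make the mapping from input to output concrete, the sample can be approximated in plain Python. This is only a sketch: `find_text` is a hypothetical stand-in written for illustration, not the activity's actual implementation, and it assumes Python's `re` syntax, a pattern with at most one capturing group, empty strings for missing matches, and that disabling Include Original drops only the searched column.

```python
import re

SAMPLE_ROWS = [
    {"ID": 1, "Description": "This contains ABC and XYZ"},
    {"ID": 2, "Description": "Find CODE inside this text"},
    {"ID": 3, "Description": "No pattern matches here"},
    {"ID": 4, "Description": "Extract INFO and DATA points"},
    {"ID": 5, "Description": "SAMPLE test for extraction"},
]


def find_text(rows, column, pattern, prefix="Column_", include_original=True):
    """Hypothetical approximation of the Find Text activity."""
    regex = re.compile(pattern)
    # Collect every match per row; with one capturing group,
    # findall returns the captured text of each match.
    matches = [regex.findall(row[column]) for row in rows]
    # The widest row decides how many output columns are created.
    width = max((len(found) for found in matches), default=0)

    output = []
    for row, found in zip(rows, matches):
        if include_original:
            new_row = dict(row)
        else:
            # Assumption: only the searched column is dropped.
            new_row = {k: v for k, v in row.items() if k != column}
        for i in range(width):
            # Assumption: rows with fewer matches get empty strings.
            new_row[f"{prefix}{i + 1}"] = found[i] if i < len(found) else ""
        output.append(new_row)
    return output


result = find_text(SAMPLE_ROWS, "Description", r"([A-Z]{3,})")
for row in result:
    print(row)
```

With the sample configuration, row 1 yields `ABC` and `XYZ`, row 3 yields two empty cells, and the capitalized first letters of words such as "This" or "Find" are ignored because they are not part of a run of three uppercase letters.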

Use capturing groups such as (\d{4}) to extract four-digit numeric codes, or #(\w+) to extract hashtags.
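
The two patterns above can be tried out quickly in Python before configuring the activity. The input strings here are invented for illustration, and the snippet assumes the activity's regex engine treats these patterns the same way Python's `re` module does.

```python
import re

# Hypothetical sample strings, not taken from the product documentation.
order_text = "Order 2024 shipped with invoice 0917"
post_text = "Launch day! #release #v2_beta"

# (\d{4}) captures each run of exactly four digits it encounters.
numeric_codes = re.findall(r"(\d{4})", order_text)

# #(\w+) matches a hashtag but captures only the text after the "#".
hashtags = re.findall(r"#(\w+)", post_text)

print(numeric_codes)  # ['2024', '0917']
print(hashtags)       # ['release', 'v2_beta']
```

Note that (\d{4}) also matches the first four digits of a longer number; wrapping it in word boundaries, as in \b(\d{4})\b, restricts it to standalone four-digit codes in engines that support \b.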