Description
The Extract Numbers activity scans a text column and extracts numeric values using various extraction modes. It supports multiple formats such as plain numbers, currency amounts, scientific notation, and even expanded notations (e.g., 5K → 5000
).
Use case:
Extracting invoice amounts, order IDs, and numeric metrics from unstructured strings such as “Order #1234, Amount: $45.67” or “Salary: 120K”.
Type | Description |
---|
Data | Dataset containing a text column |
Output
Type | Description |
---|
Transformed Data | Original or enhanced data with numeric values extracted from text |
Configuration Fields
Field Name | Required | Description |
---|
Column | Yes | Name of the text column from which to extract numbers |
Extract Mode | Yes | Extraction strategy:- Several – Extract all numbers separately
- JSON Array – Store as an array
- Expand Notation – Convert notation like 1K to 1000
- Decimal Separator – Handle regional decimal formats
|
Output Column | Yes | Name of the column to store the extracted numbers |
Include Original | No | Whether to keep the original column in the final output |
Separator | Conditional | Character used to split values (only shown when Extract Mode is Decimal Separator) |
Transaction Details |
---|
Order #1234, Amount: $45.67 |
Invoice 56789 - $120.50 |
Product ID: 998, Price: $30.00 |
Sample Configuration
Field | Value |
---|
Column | Transaction Details |
Extract Mode | Several |
Output Column | Extracted Numbers |
Include Original | Yes |
Separator | Not Applicable |
Sample Output
Transaction Details | Extracted Numbers_1 | Extracted Numbers_2 |
---|
Order #1234, Amount: $45.67 | 1234 | 45.67 |
Invoice 56789 - $120.50 | 56789 | 120.50 |
Product ID: 998, Price: $30.00 | 998 | 30.00 |