Description
The Extract HTML activity extracts tabular data from HTML files and converts it into a structured dataset. This is especially useful for processing reports, web-scraped content, or embedded tables from web pages or system-generated HTML files.
Use case:
Ideal for scenarios where data is embedded in HTML tables, such as downloaded web reports, email digests, or content management system exports.
| Type | Description | 
|---|
| File | HTML document (.html, .htm) | 
Output
| Type | Description | 
|---|
| Data | Structured tabular data extracted from HTML | 
Configuration Fields
| Field Name | Required | Description | 
|---|
| Add HTML Extract | Yes | Defines extraction rule(s) to identify and parse one or more HTML tables. | 
Not applicable — input is provided via uploaded HTML files.
Sample Configuration
| Field | Value | 
|---|
| Add HTML Extract | Table selector for parsing table | 
Sample Output
| Name | Age | Country | 
|---|
| John Doe | 28 | USA | 
| Alice | 31 | Canada | 
| Bob | 25 | Australia |