Generate Big Data
Description
This activity expands the dataset by generating additional rows based on a user-defined expansion factor. The widening process utilizes specified key columns containing unique sequential values to maintain consistency. If the number of data rows is “a” and the expansion factor is “b”, then the total number of generated rows will be a × b.
Input
Data only
Output
Transformed data
Configuration Fields
- Expansion Factor Multiplier to generate additional rows based on the original dataset.
- Key Column Column used as a unique identifier to maintain consistency in the generated data.
- Include Original
- Enabled Includes the original data along with the generated data while keeping the total output row count as a × b.
- Disabled Provides only the generated data as output.
Sample Input
Transaction ID | Product | Price | Quantity |
---|---|---|---|
101 | Laptop | 800 | 1 |
102 | Phone | 500 | 2 |
103 | Tablet | 300 | 1 |
Sample Configuration
Sample Output (with Expansion Factor = 2)
Transaction ID | Product | Price | Quantity |
---|---|---|---|
101 | Laptop | 800 | 1 |
102 | Tablet | 600 | 1 |
103 | Phone | 500 | 2 |
104 | Laptop | 400 | 2 |
105 | Tablet | 358 | 1 |
106 | Phone | 462 | 2 |