Predefined Value
The Predefined Value rule checks whether the values in a column appear in a predefined set of values. A Predefined Value check looks up each value in a predefined set or dataset, such as a table or array, based on a key or criterion.
Rule configuration
The rule configuration for Predefined Value ensures data validation by comparing data against predefined static values stored in a reference table. The check can be case-sensitive. This means it differentiates between uppercase and lowercase letters to maintain accuracy and consistency in data matching and validation processes.
- Static values are predefined, unchanging values used to validate and match data. They are usually stored in a reference or lookup table and help ensure that the data meets specific consistency or correctness criteria.
- Case-sensitive determines whether the comparison between the value being looked up and the static reference values distinguishes uppercase from lowercase letters.
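The comparison described above can be sketched in a few lines of Python. This is a minimal illustration and not the product's implementation: the function name `matches_static_value` and its parameters are hypothetical, and treating a null input as a non-match is an assumption.

```python
def matches_static_value(value, static_values, case_sensitive=True):
    """Return True if value equals one of the static values.

    Hypothetical helper for illustration only.
    """
    if value is None:
        # Assumption: a null never matches; null handling is configured separately.
        return False
    if case_sensitive:
        return value in set(static_values)
    # Case-insensitive: compare case-folded forms of both sides.
    folded = {s.casefold() for s in static_values}
    return value.casefold() in folded


fruits = ["Apple", "Banana", "Mango", "Pineapple"]
print(matches_static_value("Mango", fruits))                        # True
print(matches_static_value("mango", fruits))                        # False
print(matches_static_value("mango", fruits, case_sensitive=False))  # True
```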
Success criteria
Success criteria for a Predefined Value check depend on how accurately the source data matches the values in a static reference dataset, which helps ensure consistency and correctness. The key indicators of success are a high match rate (the percentage of source data that maps to a reference value), few false positives or incorrect matches, low error rates such as missing or unmatched data, and efficient handling of exceptions.
- The success condition depends on how the Static Values field is configured.
- The success condition is met if this count satisfies the selected operator and the specified value.
- For example, if a column contains an input such as Mango that exactly matches the static value Mango, then it passes. Inputs such as pineapple fail, as does mango when case sensitivity is enabled.
Configuration fields
- Operator defines the comparison operation. You can choose from Greater Than, Less Than, Equal To, or Between. The Between operator requires specifying a start and end range.
- Value is the threshold used for the success criteria. It is required when you use the Greater Than, Less Than, or Equal To operators.
- Value range is required only when the Between operator is selected. It specifies the start and end of the range.
- Threshold type indicates whether the Value or Value Range should be treated as a percentage or an absolute count.
- Allow null values determines if null values are permitted.
- Check for match determines if data values align with predefined standards, formats, or reference values to ensure accuracy, consistency, and integrity.
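To make the interaction between Operator, Value, Value range, and Threshold type concrete, here is a rough Python sketch. The function name `within_threshold`, the operator strings, and the inclusive bounds for Between are all assumptions made for illustration; the product may evaluate these fields differently.

```python
def within_threshold(count, total, operator, value=None, value_range=None,
                     threshold_type="absolute"):
    """Hypothetical evaluation of the success criteria for one column."""
    # For a percentage threshold, compare the share of rows, not the raw count.
    if threshold_type == "percentage":
        observed = 100.0 * count / total if total else 0.0
    else:
        observed = count

    if operator == "greater_than":
        return observed > value
    if operator == "less_than":
        return observed < value
    if operator == "equal_to":
        return observed == value
    if operator == "between":
        start, end = value_range
        return start <= observed <= end  # inclusive bounds are an assumption
    raise ValueError(f"unknown operator: {operator}")


print(within_threshold(2, 5, "less_than", value=4))   # True
print(within_threshold(4, 5, "less_than", value=4))   # False
print(within_threshold(4, 5, "between", value_range=(50, 90),
                       threshold_type="percentage"))  # True (80% of rows)
```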
Sample Input
ID | Vegetables | Fruits |
---|---|---|
1 | Tomato | Pineapple |
2 | Lettuce | apple |
3 | Carrot | Mango |
4 | Onion | Apple |
5 | broccoli | Banana |
Sample rule configuration
- Static values Vegetables = Tomato, Onion, Broccoli; Fruits = Apple, Banana, Mango, Pineapple
- Case sensitivity True
Sample success criteria configuration
- Operator Less than
- Value 4
- Threshold type Absolute Count
- Allow null values False
- Check for match False
Sample Output
Column Name | Rule Name | Success Count | Failure Count | Within Threshold | Null Count |
---|---|---|---|---|---|
Vegetables | Predefined Value check | 2 | 3 | Yes | 0 |
Fruits | Predefined Value check | 4 | 1 | No | 0 |
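The sample can be traced by hand with a short Python sketch. It assumes that the count compared against the threshold is the success count, which is consistent with the table above, and it does not model the Check for match setting. The names `rows`, `static_values`, and `summarize` are hypothetical.

```python
rows = [
    {"ID": 1, "Vegetables": "Tomato", "Fruits": "Pineapple"},
    {"ID": 2, "Vegetables": "Lettuce", "Fruits": "apple"},
    {"ID": 3, "Vegetables": "Carrot", "Fruits": "Mango"},
    {"ID": 4, "Vegetables": "Onion", "Fruits": "Apple"},
    {"ID": 5, "Vegetables": "broccoli", "Fruits": "Banana"},
]

static_values = {
    "Vegetables": {"Tomato", "Onion", "Broccoli"},
    "Fruits": {"Apple", "Banana", "Mango", "Pineapple"},
}


def summarize(rows, column, allowed, limit=4):
    """Case-sensitive check with Operator = Less than, Value = limit."""
    values = [row[column] for row in rows]
    null_count = sum(v is None for v in values)
    success = sum(v in allowed for v in values if v is not None)
    failure = len(values) - success - null_count
    return {
        "column": column,
        "success": success,
        "failure": failure,
        # Assumption: the success count is what the operator is applied to.
        "within_threshold": success < limit,
        "null_count": null_count,
    }


for column, allowed in static_values.items():
    print(summarize(rows, column, allowed))
# {'column': 'Vegetables', 'success': 2, 'failure': 3, 'within_threshold': True, 'null_count': 0}
# {'column': 'Fruits', 'success': 4, 'failure': 1, 'within_threshold': False, 'null_count': 0}
```

With case sensitivity enabled, broccoli and apple count as failures because they differ in case from Broccoli and Apple.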