
Predefined Value

The Predefined Value rule checks whether the values in your data belong to a predefined set of values. The check is a lookup operation: it searches for each value in a predefined set or dataset, such as a table or array, based on a key or criterion.

Rule configuration

The rule configuration for Predefined Value validates data by comparing it against predefined static values stored in a reference table. The check can be case-sensitive, in which case it distinguishes uppercase from lowercase letters when matching values.

Static values: Predefined, unchanging values used to validate and match data. They are usually stored in a reference or lookup table, and they help ensure that the data meets specific consistency or correctness criteria.

Case-sensitive: Whether the comparison between the value being looked up and the static reference values treats uppercase and lowercase letters as different.
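The effect of the case-sensitive setting can be illustrated with a minimal Python sketch. The function name `matches_predefined` and its parameters are hypothetical and chosen only for illustration; they are not part of the product, and the case-insensitive branch assumes that both sides are normalized before comparison.

```python
def matches_predefined(value, static_values, case_sensitive=True):
    """Return True if value is one of the static values (illustrative only)."""
    if case_sensitive:
        # Exact comparison: "mango" and "Mango" are different values.
        return value in set(static_values)
    # Assumed behavior when case sensitivity is off: normalize both sides.
    return value.casefold() in {s.casefold() for s in static_values}


fruits = ["Apple", "Banana", "Mango", "Pineapple"]
exact = matches_predefined("Mango", fruits)
wrong_case = matches_predefined("mango", fruits)
relaxed = matches_predefined("mango", fruits, case_sensitive=False)
print(exact, wrong_case, relaxed)
```

With case sensitivity enabled, only the exact spelling `Mango` is found in the set. The lowercase `mango` is found only when the comparison ignores case.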

Success criteria

Success criteria for a Predefined Value check in data quality depend on how accurately values from a static reference dataset match the source data. This helps ensure consistency and correctness. The key indicators of success include a high match rate, which is the percentage of source data that successfully maps to a reference value, minimal false positives or incorrect matches, low error rates such as missing or unmatched data, and efficient handling of exceptions.

  • The success condition depends on how the Static Values field is configured.
  • The rule produces a count by comparing the column values with the static values. The success condition is met if this count satisfies the selected operator and the specified value.
  • For example, with the static value Mango, an input of Mango matches exactly and passes. An input of pineapple fails because it does not match, and mango also fails if case sensitivity is enabled.

Configuration fields

  • Operator defines the comparison operation. You can choose from Greater than, Less than, Equal to, or Between (which requires specifying a start and end range).

  • Value is the threshold used for the success criteria. It is required when you use the Greater than, Less than, or Equal to operators.

  • Value range is required only when the Between operator is selected. It specifies the start and end range.

  • Threshold type indicates whether the Value or Value Range should be considered as a percentage or an absolute count.

  • Allow null values determines if null values are permitted.

  • Check for match determines if data values align with predefined standards, formats, or reference values to ensure accuracy, consistency, and integrity.
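The way the operator, value, value range, and threshold type could combine into a single pass/fail decision is sketched below in Python. This is an assumption-laden illustration, not the product's implementation: the function `within_threshold`, its parameter names, the string labels, the inclusive bounds for Between, and the percentage calculation (count divided by total rows) are all hypothetical.

```python
def within_threshold(count, total, operator, value=None, value_range=None,
                     threshold_type="Absolute Count"):
    """Evaluate a count against the configured success criteria (sketch)."""
    observed = count
    if threshold_type == "Percentage":
        # Assumption: percentage is taken relative to the total row count.
        observed = 100.0 * count / total if total else 0.0

    if operator == "Greater than":
        return observed > value
    if operator == "Less than":
        return observed < value
    if operator == "Equal to":
        return observed == value
    if operator == "Between":
        start, end = value_range
        # Assumption: both ends of the range are inclusive.
        return start <= observed <= end
    raise ValueError(f"Unknown operator: {operator}")


absolute_ok = within_threshold(2, 5, "Less than", value=4)
absolute_fail = within_threshold(4, 5, "Less than", value=4)
percent_ok = within_threshold(4, 5, "Between", value_range=(50, 90),
                              threshold_type="Percentage")
print(absolute_ok, absolute_fail, percent_ok)
```

In the last call, 4 out of 5 rows is 80 percent, which lies inside the 50 to 90 range under the inclusive-bounds assumption.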

Sample Input

ID | Vegetables | Fruits
1 | Tomato | Pineapple
2 | Lettuce | apple
3 | Carrot | Mango
4 | Onion | Apple
5 | broccoli | Banana

Sample rule configuration

  • Static values: Vegetables = Tomato, Onion, Broccoli; Fruits = Apple, Banana, Mango, Pineapple
  • Case-sensitive: True

Sample success criteria configuration

  • Operator: Less than
  • Value: 4
  • Threshold type: Absolute Count
  • Allow null values: False
  • Check for match: False


Sample Output

Column Name | Rule Name | Success Count | Failure Count | Within Threshold | Null Count
Vegetables | Predefined Value check | 2 | 3 | Yes | 0
Fruits | Predefined Value check | 4 | 1 | No | 0
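The sample output can be reproduced with a short Python sketch that applies the sample configuration (case-sensitive matching, Less than 4, absolute count) to the sample input. The helper `evaluate` is hypothetical. The sketch also assumes that the count compared against the threshold is the success count, which is consistent with the table above but is not stated explicitly in the rule description.

```python
rows = [
    (1, "Tomato", "Pineapple"),
    (2, "Lettuce", "apple"),
    (3, "Carrot", "Mango"),
    (4, "Onion", "Apple"),
    (5, "broccoli", "Banana"),
]
static_values = {
    "Vegetables": {"Tomato", "Onion", "Broccoli"},
    "Fruits": {"Apple", "Banana", "Mango", "Pineapple"},
}
THRESHOLD = 4  # Operator: Less than, Threshold type: Absolute Count


def evaluate(column_values, allowed):
    """Count case-sensitive matches, failures, and nulls for one column."""
    nulls = sum(v is None for v in column_values)
    success = sum(v in allowed for v in column_values if v is not None)
    failure = len(column_values) - nulls - success
    return {
        "success": success,
        "failure": failure,
        "null": nulls,
        # Assumption: the success count is what the operator is applied to.
        "within_threshold": success < THRESHOLD,
    }


report = {}
for index, name in enumerate(["Vegetables", "Fruits"], start=1):
    report[name] = evaluate([row[index] for row in rows], static_values[name])
    print(name, report[name])
```

Under these assumptions, Vegetables has 2 matches (Tomato, Onion) and 3 failures (Lettuce, Carrot, and the lowercase broccoli), so 2 is less than 4 and the column is within the threshold. Fruits has 4 matches and 1 failure (the lowercase apple), so 4 is not less than 4 and the column is not within the threshold.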