Formula Integrity

The formula integrity check verifies that the formulas used in a dataset (such as calculations, aggregations, or derived fields) are correctly implemented and yield accurate results. It ensures that the logic behind each formula is consistent, reliable, and free from errors, which helps maintain data accuracy.

Rule configurations

The rule configuration defines data validation logic using a formula expression and an operator. Operators like equals, not equals, greater than, and others determine how data values are compared or evaluated.

Formula Expression A logical or mathematical expression that determines how data should be evaluated or manipulated.

Operator A comparison or logical symbol used to establish the relationship between values in the formula expression.

Equals Checks if two values are identical.

Not Equals Checks if two values are different.

Greater than Checks if one value is larger than another.

Greater than or Equals Checks if one value is larger than or equal to another.

Less than Checks if one value is smaller than another.

Less than or Equals Checks if one value is smaller than or equal to another.
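As an illustration of how a formula expression and an operator combine into a row-level check, here is a minimal Python sketch. The names `OPERATORS`, `row_passes`, `actual`, and `expected` are hypothetical and are not part of the product; the sketch only mirrors the operator list above.

```python
import operator

# Map the operator names listed above to Python comparison functions.
# This mapping is an illustrative assumption, not the product's implementation.
OPERATORS = {
    "Equals": operator.eq,
    "Not Equals": operator.ne,
    "Greater than": operator.gt,
    "Greater than or Equals": operator.ge,
    "Less than": operator.lt,
    "Less than or Equals": operator.le,
}


def row_passes(actual, expected, operator_name):
    """Return True when `actual <operator> expected` holds for one row.

    `actual` is the value stored in the dataset and `expected` is the value
    produced by the formula expression (both names are hypothetical).
    """
    compare = OPERATORS[operator_name]
    return compare(actual, expected)


# A stored result of 15 checked against the formula 10 + 5 with Equals.
print(row_passes(15, 10 + 5, "Equals"))
```

A lookup table keeps the comparison logic in one place, so supporting another operator would only require one more entry.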

Success criteria

The success criteria for the formula integrity check are met when the outcome of the formula evaluation satisfies the configured success criteria operator and threshold value (or value range).

  • The success condition depends on how the Formula Expression and Operator are configured.
  • In the sample configuration, the rule checks whether the stored result equals the sum of Value1 and Value2, using the formula result=Value1+Value2. The success criteria treat values less than 50% of the threshold as failures.

Configuration fields

  • Operator Defines the comparison operation (Greater than, Less than, Equal to, or Between).

  • Operator options

    Greater than

    Less than

    Equal to

    Between (requires specifying a start and end range)

  • Value The threshold value used for success criteria. Required for Greater than, Less than, and Equal to operators.

  • Value range Required only when the Between operator is selected, specifying the start and end range.

  • Threshold type Indicates whether the Value or Value range is interpreted as a percentage or as an absolute count.
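The configuration fields above can be read as a small decision procedure. The following Python sketch shows one possible interpretation. The function `within_threshold` and its parameters are hypothetical. Which count is compared against the threshold (for example, the success count or the failure count) is not stated here, so the sketch takes a generic `observed` count, and treating Between as inclusive at both ends is also an assumption.

```python
def within_threshold(observed, total, op, value=None, value_range=None,
                     threshold_type="percentage"):
    """Evaluate a success criteria configuration (illustrative sketch only).

    observed       -- the count being judged (which count is an assumption)
    total          -- total number of rows evaluated
    op             -- "Greater than", "Less than", "Equal to", or "Between"
    value          -- threshold for the three single-value operators
    value_range    -- (start, end) tuple, used only with "Between"
    threshold_type -- "percentage" or "absolute"
    """
    if threshold_type == "percentage":
        if total == 0:
            raise ValueError("cannot compute a percentage of zero rows")
        metric = 100.0 * observed / total
    elif threshold_type == "absolute":
        metric = observed
    else:
        raise ValueError(f"unknown threshold type: {threshold_type}")

    if op == "Between":
        if value_range is None:
            raise ValueError("Between requires a start and end range")
        start, end = value_range
        # Inclusive bounds are an assumption of this sketch.
        return start <= metric <= end

    if value is None:
        raise ValueError(f"{op} requires a threshold value")
    if op == "Greater than":
        return metric > value
    if op == "Less than":
        return metric < value
    if op == "Equal to":
        return metric == value
    raise ValueError(f"unknown operator: {op}")


# 2 of 5 rows is 40%, which is less than a 50% threshold.
print(within_threshold(2, 5, "Less than", value=50))
```

Converting the observed count to a percentage before comparing lets the same comparison code serve both threshold types.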

Sample Input

ID  Value1  Value2  Value1 + Value2
1   10      5       15
2   20      3       53
3   15      7       22
4   30      12      22
5   25      8       43

Sample rule configuration

  • Formula expression result=Value1+Value2
  • Operator Equals

Sample success criteria configuration

  • Operator Less than
  • Value 50%
  • Threshold type Absolute Count

Sample output

Column Name  Rule Name                Success Count  Failure Count  Within Threshold  Null Count
result       Formula Integrity Check  2              3              Yes               0
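The success and failure counts in the sample output can be reproduced with a short Python sketch that applies the sample rule (result=Value1+Value2 with the Equals operator) to the sample input. The function name `count_results` and the dictionary layout are hypothetical. The column split for row 2 (20, 3, 53) is an assumed reading of the sample table, and counting a missing result as a null rather than a failure is also an assumption.

```python
# Sample input rows. The split of row 2 into 20, 3, 53 is an assumed reading.
rows = [
    {"ID": 1, "Value1": 10, "Value2": 5, "result": 15},
    {"ID": 2, "Value1": 20, "Value2": 3, "result": 53},
    {"ID": 3, "Value1": 15, "Value2": 7, "result": 22},
    {"ID": 4, "Value1": 30, "Value2": 12, "result": 22},
    {"ID": 5, "Value1": 25, "Value2": 8, "result": 43},
]


def count_results(rows):
    """Return (success_count, failure_count, null_count) for the sample rule."""
    success = failure = nulls = 0
    for row in rows:
        if row["result"] is None:
            # Assumption: a missing result is reported as a null, not a failure.
            nulls += 1
        elif row["result"] == row["Value1"] + row["Value2"]:
            success += 1
        else:
            failure += 1
    return success, failure, nulls


print(count_results(rows))  # (2, 3, 0)
```

Rows 1 and 3 satisfy the formula, while rows 2, 4, and 5 do not, which matches the Success Count of 2 and the Failure Count of 3.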