Skip to content

Execute Data Quality

Description

The Execute Data Quality activity runs a selected Data Quality Profile, which includes a set of data quality validation rules applied to a dataset. These rules help ensure that the dataset complies with defined standards, such as completeness, consistency, and accuracy.

Each rule targets specific columns and performs checks like null constraints, pattern matching, range validation, etc. The result is a structured output that summarizes rule-level performance and data health.

Use Case:
Use this activity to validate raw or processed data before it enters downstream workflows like reporting, transformation, or export, ensuring early error detection.


Input

Not Applicable


Output

TypeDescription
DataList of rule-level execution results for each column

Configuration Fields

FieldRequiredDescription
Data QualityYesThe predefined Data Quality profile to execute. Selected via DataQualityId.

Sample Configuration

FieldValue
Data QualityCustomer Data Profile

Sample Output

Column NameRule GroupRule NameRule TypeSuccess(%)Total RowsRule Description
ageRangeAge Between 18–60Validation97.51000Validates if age falls between 18 and 60
emailPatternValid Email FormatValidation99.21000Checks if email matches standard pattern
idUniquenessUnique IdentifierUniqueness100.01000Ensures values are unique per row

Supported Rule Types

Rule NameDescription
Value Range ValidationEnsures numbers fall within a specified minimum and maximum range.
Formula Integrity CheckValidates formulas and computed field integrity within the dataset.
Not Null CheckVerifies that required fields are not left empty.
Consistent CasingEnsures text casing (e.g., UPPER, lower, Title Case) is consistent.
Custom SQL ValidationRuns user-defined SQL queries for specialized checks.
Recent Data ValidationChecks if date values fall within a recent or defined date period.
Distinct Record CheckEnsures records are unique across all specified columns.
Column UniquenessVerifies that a single column’s values are unique.
Pattern MatchingValidates values against a regular expression pattern.
String Length ValidationEnsures text values meet length constraints (e.g., min/max length).
Reference Data ValidationValidates values against a reference dataset or lookup list.
Predefined Value CheckConfirms that values exist within a defined static list.
String Value MatchingChecks for exact matches with expected string values.
SQL Pattern MatchingValidates values using SQL LIKE patterns (e.g., %abc%, abc_).
Monotonic Sequence CheckEnsures values are increasing or decreasing consistently.

Notes

  • This activity does not require input data directly but operates based on a previously defined data quality configuration.
  • If no rules are defined in the selected profile, the execution will still complete successfully but return no results.
  • Supports both exact-match and pattern-based validations, including date formats, SQL, and regex.
  • All validations are performed in bulk and summarized in the output table.