Starts With Check Rule

Description

The Starts With check rule is a data quality validation that ensures data values begin with a specific set of characters, text, or pattern.

Rule Configuration

A Starts With check rule is configured with a start string, which is the prefix that each data value must begin with. If the rule is case-sensitive, a value passes only on an exact match, where both the characters and their capitalization align with the given start string. A value is considered successful when it satisfies both the start string and the case-sensitivity setting.

  • Start String The specific sequence of characters or pattern that the data is required to begin with. It is the prefix, or initial part of the data, that the rule verifies against.
  • Case-Sensitive When enabled, the comparison between the starting portion of a value and the given prefix must match exactly, including capitalization.
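The prefix comparison described above can be sketched in a few lines of Python. This is a minimal illustration, not the product's implementation: the function name `starts_with` and its parameters are hypothetical, and the case-insensitive branch assumes that disabling Case-Sensitive means both sides are compared with case ignored.

```python
def starts_with(value: str, start_string: str, case_sensitive: bool = True) -> bool:
    """Return True when value begins with start_string (hypothetical helper)."""
    if case_sensitive:
        # Exact match: characters and capitalization must both agree.
        return value.startswith(start_string)
    # Assumption: a case-insensitive rule ignores capitalization on both sides.
    return value.casefold().startswith(start_string.casefold())


print(starts_with("testman", "test"))                        # case-sensitive match
print(starts_with("Testman", "test"))                        # capital T differs
print(starts_with("Testman", "test", case_sensitive=False))  # case ignored
```

With Case-Sensitive enabled, only the first call succeeds; the third succeeds only because case is ignored.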

Success Criteria

The Starts With check identifies and validates entries whose value starts with a specified prefix. Its success criterion is determined as follows:

  • The success condition depends on how the Start String is given.

  • The success condition is met if it satisfies the given operator and value.

  • For example, if the start string is “test” and the rule is case-sensitive, it will only match names that start with exactly “test” (lowercase ‘t’, ‘e’, ‘s’, ‘t’). So, names like “testman” or “test123” would pass, but names like “Testman” or “TEST123” would not, because the case of the letters is different.

    Configuration Fields

    • Operator Options

      • Greater than
      • Less than
      • Equal to
      • Between (requires specifying a start and end range)
    • Operator Defines the comparison operation (Greater Than, Less Than, Equal To, or Between).

    • Value The threshold value used for success criteria. Required for Greater than, Less than, and Equal to operators.

    • Value Range Required only when the Between operator is selected, specifying the start and end range.

    • Threshold Type Indicates whether the Value or Value Range is interpreted as a percentage or an absolute count.

    • Allow Null Values Determines if null values are permitted.
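The way these fields combine can be sketched as a small Python function. This is an illustrative sketch under stated assumptions, not the product's logic: the name `within_threshold` and its argument spellings are hypothetical; it assumes the threshold is compared against the success count (which is consistent with the sample output on this page), that a percentage is the success count divided by the total row count, and that Between includes both ends of the range. Allow Null Values is not modeled here.

```python
from typing import Optional, Tuple


def within_threshold(
    success_count: int,
    total_count: int,
    operator: str,
    value: Optional[float] = None,
    value_range: Optional[Tuple[float, float]] = None,
    threshold_type: str = "absolute_count",
) -> bool:
    """Evaluate a success criterion (hypothetical helper)."""
    if threshold_type == "percentage":
        # Assumption: percentage of rows that passed the check.
        measured = 100.0 * success_count / total_count if total_count else 0.0
    elif threshold_type == "absolute_count":
        measured = float(success_count)
    else:
        raise ValueError(f"unknown threshold type: {threshold_type}")

    if operator == "between":
        if value_range is None:
            raise ValueError("Between requires a value range")
        low, high = value_range
        # Assumption: the range is inclusive at both ends.
        return low <= measured <= high

    if value is None:
        raise ValueError(f"{operator} requires a value")
    if operator == "greater_than":
        return measured > value
    if operator == "less_than":
        return measured < value
    if operator == "equal_to":
        return measured == value
    raise ValueError(f"unknown operator: {operator}")
```

For instance, `within_threshold(4, 7, "greater_than", value=2)` evaluates an absolute count of 4 against a threshold of 2, while passing `threshold_type="percentage"` would compare roughly 57.1 instead.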

Sample Input

ID | Data      | Content
1  | testname  | Engineering
2  | Tester    | CivilEngineer
3  | Null      | EngineeringBooks
4  | donetest  | MechanicalEngineer
5  | Developer | Engineers
6  | Manager   | engineer
7  | testtype  | Engineer

Sample Rule Configuration

  • Start String Data = test; Content = Engine
  • Case-Sensitive True

Sample Success Criteria Configuration

  • Operator Greater than
  • Value 2
  • Threshold Type Absolute Count
  • Allow Null Values True

Sample Output

Column Name | Rule Name                 | Success Count | Failure Count | Within Threshold
Data        | Data Starts With Check    | 2             | 5             | No
Content     | Content Starts With Check | 4             | 3             | Yes
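The sample output can be reproduced with a short Python sketch that applies the sample configuration (case-sensitive, Greater than 2, Absolute Count) to the sample input. The helper name `evaluate` is hypothetical, and the sketch assumes the "Null" entry in row 3 is treated as a value that does not match the prefix, which agrees with the failure count shown for the Data column.

```python
# Sample input rows as (Data, Content); "Null" is kept as shown in the table.
rows = [
    ("testname", "Engineering"),
    ("Tester", "CivilEngineer"),
    ("Null", "EngineeringBooks"),
    ("donetest", "MechanicalEngineer"),
    ("Developer", "Engineers"),
    ("Manager", "engineer"),
    ("testtype", "Engineer"),
]

columns = {
    "Data": [data for data, _ in rows],
    "Content": [content for _, content in rows],
}
start_strings = {"Data": "test", "Content": "Engine"}


def evaluate(values, start_string, threshold=2):
    """Count case-sensitive prefix matches and test 'Greater than threshold'."""
    success = sum(1 for v in values if v.startswith(start_string))
    failure = len(values) - success
    return success, failure, success > threshold


results = {name: evaluate(columns[name], start_strings[name]) for name in columns}
for name, (success, failure, ok) in results.items():
    print(name, success, failure, "Yes" if ok else "No")
# Data 2 5 No
# Content 4 3 Yes
```

The Data column falls outside the threshold because its success count of 2 is not strictly greater than 2.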