Starts With Check Rule
Description
The Starts With check rule in data quality refers to a validation or condition applied to data values to ensure they begin with a specific set of characters, text, or pattern.
Rule Configuration
The Rules Configuration for a starts with check rule is determined by whether the data begins with a specified start string, which acts as the prefix to be validated. If the rule is case-sensitive, the success is based on an exact match, where both the characters and their capitalization must align perfectly with the given start string. The data is considered successful if it adheres to both the specified sequence and case sensitivity conditions.
- Start String The start string refers to the specific sequence of characters or pattern that the data is required to begin with. It’s the prefix or initial part of the data that the rule verifies against.
- Case-Sensitive A case-sensitive in starts with check rule means that the comparison between the starting portion of a string and the given prefix must match exactly, including capitalization.
Success Criteria
The success criterion for Starts With check is met when the input successfully identifys and validate entries where the string starts with a specified prefix or value.
-
The success condition depends on how the
Start String
is given. -
The success condition is met if it satisfies the given
operator
andvalue
. -
For example, if you check if a start string is “test,” it will only match names that start with exactly “test” (lowercase ‘t’, ‘e’, ‘s’, ‘t’). So, names like “testman” or “test123” would pass, but names like “Testman” or “TEST123” would not, because the case of the letters is different.
Configuration Fields
-
Operator Options
Greater than
Less than
Equal to
Between
(requires specifying a start and end range)
-
Operator Defines the comparison operation (Greater Than, Less Than, Equal To, or Between).
-
Value The threshold value used for success criteria. Required for
Greater than
,Less than
, andEqual to
operators. -
Value Range Required only when the
Between
operator is selected, specifying thestart
andend
range. -
Threshold Type Indicates whether the
Value
orValue Range
to be considered as percentage or an absolute count. -
Allow Null Values Determines if null values are permitted.
-
Sample Input
ID | Data | Content |
---|---|---|
1 | testname | Engineering |
2 | Tester | CivilEngineer |
3 | Null | EngineeringBooks |
4 | donetest | MechanicalEngineer |
5 | Developer | Engineers |
6 | Manager | engineer |
7 | testtype | Engineer |
Sample Rule Configuration
- Start String Data = test Content = Engine
- Case-Sensitive True
Sample Success Criteria Configuration
- Operator Greater than
- Value 2
- Threshold Type Absolute Count
- Allow Null Values True
Sample Output
Column Name | Rule Name | Success Count | Failure Count | Within Threshold |
---|---|---|---|---|
Data | Data Starts With Check | 2 | 5 | No |
Content | Content Starts With Check | 4 | 3 | Yes |