0% found this document useful (0 votes)
10 views2 pages

Data Quality Metrices

The document defines 16 metrics for measuring different aspects of data quality: completeness, accuracy, consistency, validity, timeliness, integrity, duplication, uniqueness, conformity to business rules, data profiling scores, format consistency, precision, and others. Each metric includes the calculation method and a description of what aspect of data quality it measures.

Uploaded by

Ambarish Pandey
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views2 pages

Data Quality Metrices

The document defines 16 metrics for measuring different aspects of data quality: completeness, accuracy, consistency, validity, timeliness, integrity, duplication, uniqueness, conformity to business rules, data profiling scores, format consistency, precision, and others. Each metric includes the calculation method and a description of what aspect of data quality it measures.

Uploaded by

Ambarish Pandey
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Completeness:

 Metric: Percentage of missing values.


 Calculation: (Number of non-null values / Total number of expected values) * 100.
 Description: Measures the extent to which data is complete and doesn't have missing
values.

Accuracy:

 Metric: Error rate or percentage of inaccurate records.


 Calculation: (Number of inaccurate records / Total number of records) * 100.
 Description: Indicates the proportion of records that contain errors or inaccuracies.

Consistency:

 Metric: Percentage of inconsistent data.


 Calculation: (Number of inconsistent records / Total number of records) * 100.
 Description: Measures the degree to which data is consistent across different
datasets or sources.

Validity:

 Metric: Percentage of valid values.


 Calculation: (Number of valid values / Total number of values) * 100.
 Description: Assesses the proportion of data values that conform to predefined rules
or standards.

Timeliness:

 Metric: Time lag between data creation and availability for analysis.
 Calculation: Time taken for data to be available for use.
 Description: Measures how quickly data is made available for analysis after it is
created or updated.

Integrity:

 Metric: Referential integrity violations.


 Calculation: Number of records violating referential integrity constraints.
 Description: Checks the consistency of relationships between tables and the accuracy
of foreign key values.

Duplication:

 Metric: Percentage of duplicate records.


 Calculation: (Number of duplicate records / Total number of records) * 100.
 Description: Measures the extent to which identical records exist in the dataset.
Uniqueness:
 Metric: Percentage of unique values.
 Calculation: (Number of unique values / Total number of values) * 100.
 Description: Measures the proportion of unique values in a dataset.

Conformity to Business Rules:

 Metric: Adherence to business rules and logic.


 Calculation: Number of records conforming to business rules / Total number of
records.
 Description: Assesses whether data aligns with specific business rules or logic.

Data Profiling Scores:

 Metric: Various profiling scores assigned based on statistical analysis.


 Calculation: Automated profiling tools generate scores for different aspects of data
quality.
 Description: Provides an overall assessment of data quality based on statistical
analysis.

Format Consistency:

 Metric: Adherence to data format standards.


 Calculation: Number of records following the specified format / Total number of
records.
 Description: Assesses whether data adheres to predefined formats or patterns.

Precision:

 Metric: Number of decimal places or significant figures.


 Calculation: Count of decimal places or significant figures in numerical data.
 Description: Ensures that numerical values are recorded with the appropriate level of
precision.

You might also like