0% found this document useful (0 votes)
134 views11 pages

TDWI DataQuality Maturity Model Assessment Guide 2024 Web

The TDWI Data Quality Maturity Model Assessment Guide outlines the importance of data quality in organizations and presents a framework for assessing data quality maturity across five stages: Nascent, Early, Established, Comprehensive, and Advanced/Visionary. It emphasizes the need for effective data management practices, tools, and organizational commitment to ensure high-quality data, which is essential for accurate analytics and decision-making. The guide also discusses evolving trends in data quality management, including automation, new algorithms, and monitoring techniques to address the complexities of modern data environments.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
134 views11 pages

TDWI DataQuality Maturity Model Assessment Guide 2024 Web

The TDWI Data Quality Maturity Model Assessment Guide outlines the importance of data quality in organizations and presents a framework for assessing data quality maturity across five stages: Nascent, Early, Established, Comprehensive, and Advanced/Visionary. It emphasizes the need for effective data management practices, tools, and organizational commitment to ensure high-quality data, which is essential for accurate analytics and decision-making. The guide also discusses evolving trends in data quality management, including automation, new algorithms, and monitoring techniques to address the complexities of modern data environments.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

2024

TDWI Data Quality


Maturity Model
Assessment Guide

By Fern Halper
TDWI RESEARCH

2024

TDWI Data Quality Table of Contents


What Is Data Quality and Why Is It So Important? . . . . . . . . . 3
Maturity Model Areas of Data Quality Complexity . . . . . . . . . . . . . . . . . . 3
Assessment Guide Evolving Trends in Data Quality Management . . . . . . . . . . . . 3
The Data Quality Maturity Model . . . . . . . . . . . . . . . . . . 4
Stages of Maturity . . . . . . . . . . . . . . . . . . . . . . . . . 5
Overview of Stages . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Stage 1: Nascent . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Stage 2: Early . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Stage 3: Established . . . . . . . . . . . . . . . . . . . . . . . . 6
Stage 4: Comprehensive . . . . . . . . . . . . . . . . . . . . . . 7
Stage 5: Advanced/Visionary . . . . . . . . . . . . . . . . . . . . 7
Evaluating Assessment Scores . . . . . . . . . . . . . . . . . . . 8
Scoring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
Interpretation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

© 2024 by TDWI, a division of 1105 Media, Inc. All rights reserved.


Reproductions in whole or in part are prohibited except by written permission.
Email requests or feedback to [email protected]. Product and company names
mentioned herein may be trademarks and/or registered trademarks of their
respective companies.

This report is based on independent research and represents TDWI’s findings;


reader experience may differ. The information contained in this report was
obtained from sources believed to be reliable at the time of publication.
Features and specifications can and do change frequently; readers are
encouraged to visit vendor websites for updated information. TDWI shall not be
liable for any omissions or errors in the information in this report.

TDWI RESEARCH   1


TDWI Data Quality Maturity Model Assessment Guide

About the Author


FERN HALPER, PH.D., is VP and senior director of TDWI Research for advanced analytics, focusing
on predictive analytics, social media analysis, text analytics, cloud computing, and other “big data”
analytics approaches. She has more than 20 years of experience in data and business analysis, and
has published numerous articles on data mining and information technology. Halper is co-author
of “Dummies” books on cloud computing, hybrid cloud, service-oriented architecture, and service
management, and Big Data for Dummies. She has been a partner at industry analyst firm Hurwitz &
Associates and a lead analyst for Bell Labs. Her Ph.D. is from Texas A&M University. You can reach
her at [email protected], @fhalper on Twitter, and on LinkedIn at
https://www.linkedin.com/in/fbhalper/.

About TDWI Research


TDWI Research provides research and advice for data professionals worldwide. TDWI Research
focuses exclusively on data management and analytics issues and teams up with industry thought
leaders and practitioners to deliver both broad and deep understanding of the business and technical
challenges surrounding the deployment and use of data management and analytics solutions. TDWI
Research offers in-depth research reports, commentary, inquiry services, and topical conferences as
well as strategic planning services to user and vendor organizations.

TDWI RESEARCH   2


TDWI Data Quality Maturity Model Assessment Guide Introduction

What Is Data Quality and Why Is It So Important?


Organizations understand the importance of data for deriving strategic insights; it is a critical
foundation of modern business. Today’s competitive landscape necessitates using data for all kinds of
analysis, which may include traditional structured data as well as unstructured and semistructured
data for more advanced analytics (such as machine learning and AI). For instance, in business, high-
quality, accurate, and complete data is needed for understanding customer behavior, churn analysis,
predictive maintenance, and financial planning. Managing this data is critical for advancing analytics.
Data quality refers to the overall suitability of a data set to serve its intended purpose. It’s a measure
of how well the data meets the requirements and expectations of its users, particularly in terms of
accuracy, completeness, reliability, and relevance. It involves the procedures, policies, and processes
that an enterprise uses to maintain trustworthy data. Poor-quality data can lead to incorrect
conclusions, resulting in financial losses and strategic missteps.
Modern data quality management involves scaling data quality, keeping up with evolving regulations,
and enabling self-service data practices—but within controls. It involves determining what
constitutes high data quality for new data types. It is no wonder that fewer than half of respondents
to TDWI surveys are satisfied with the quality of their data. It also explains why, in TDWI surveys,
data governance—of which data quality is an important part—is often at the top of the list of
priorities for organizations in data management.
Areas of Data Quality Complexity
Although organizations are trying to maintain the quality of their data, the reality is that data
quality management is continuous and complex. For instance, as organizations strive to better
compete, they are often using more advanced analytics. Organizations will need to ensure that
inputs to their predictive models have high integrity, as do the outputs. This is not something
enterprises may have addressed before.
Additionally, organizations will be dealing with new data types. For instance, they may deploy natural
language processing techniques on unstructured data to extract entities, sentiments, or concepts.
With the advent of generative AI, more organizations are making use of text data about their products
or services to help fine tune pre-trained large language models (LLMs). All of this will require new
processes to ensure the data they are working with is of high quality and maintains that quality level.
There will also be compliance regulations arising from AI that will necessitate trusted data. Already
the EU has passed the AI Act, which is intended to ensure the safety of AI systems. The U.S. has
introduced a number of federal and state regulations, some of which will surely pass in 2024. These
will, no doubt, involve data quality for modern analytics such as AI.
Another factor that adds complexity is the move to new platforms to support new data types.
Organizations are moving to cloud data warehouses, cloud data lakes, and cloud data lakehouses.
Although many enterprises are trying to unify their data architecture, they still have siloed systems
that can cause problems with data consistency and quality. For instance, they will need to ensure
data pipelines maintain data integrity both during migration and in production to support analytics
and applications.
Evolving Trends in Data Quality Management
To maintain data quality across numerous systems, high volumes of data, and highly diverse data
types, organizations will need new approaches—and particularly new technology approaches—to
help them. A few of these include:

TDWI RESEARCH   3


TDWI Data Quality Maturity Model Assessment Guide The Data Quality Maturity Model

• NEW ALGORITHMS FOR DATA QUALITY. Already new algorithms, approaches, and metrics are
emerging to address the data quality of new data types and analytics. One example is BLEU,
which stands for Bilingual Evaluation Understudy, a metric used for evaluating the quality of
text that has been machine-translated from one language to another. Open source platforms
provide tools to detect bias and toxicity. Some vendors are providing confidence scores for the
output of generative AI models or allowing for the building of predictive models to assess for
hallucinations.
• AUTOMATION AND AUGMENTATION. Given the volume of new data being used by organizations, a
major trend driving data quality management is the introduction of automated and augmented
tools. Modern data quality tools automate profiling, monitoring, parsing, standardizing,
matching, merging, correcting, cleansing, and enhancing data for delivery into enterprise data
warehouses and other downstream repositories. Additionally, AI technology is being infused
into the technology to augment it and enable tasks such as identifying sensitive information
or detecting outliers. These tools are found in data pipeline products as well as data catalogs,
among others. TDWI research has found that organizations that succeed with data quality are
more likely to use automated tools to manage the complexity.
• MONITORING AND OBSERVABILITY. Metrics are key for success with data quality. Data
observability is a relatively new method of ensuring data is healthy in real time and data
governance objectives are being met. It uses automated techniques to enforce consistency,
accuracy, and reliability of data. It can act as a single pane of glass across platforms to monitor
and manage data quality. TDWI sees observability tools gaining market traction.

The Data Quality Maturity Model


The TDWI Data Quality Maturity Assessment has approximately 50 questions across the five
categories that form the dimensions of the TDWI Data Quality Maturity Model (see Figure 1).

Organizational Roles and DQ Management Assurance Tools


Commitment Responsibilities and Impact

• Awareness • Accountability • Data trustworthiness • Data integrity • Scope of tools


• Strategy • Roles and structures • Data quality for • Risk mitigation • Automation
• Funding • Training analytics • Metrics • Monitoring and
• Review • Remediation • Compliance observability
• Scope • Metadata management
• Responsible data
and AI

Figure 1. TDWI Data Quality Maturity Model dimensions.

These dimensions are:


• ORGANIZATIONAL COMMITMENT: Organizational factors can make or break a data quality effort.
This dimension assesses maturity of awareness, strategy, and funding. It looks at how aware the
organization is about data quality and how serious they take it. For instance, is data quality
discussed at data governance meetings? Is there a strategy in place that all stakeholders have
agreed to? Does the scope of the effort include all types of data?

TDWI RESEARCH   4


TDWI Data Quality Maturity Model Assessment Guide Overview of Stages

• ROLES AND RESPONSIBILITIES: This dimension assesses the roles in place for data quality and
whether someone is accountable for data quality. It includes questions about both technical
and business staff responsibilities for data quality. Is there training for data quality? Are there
processes in place so people know if their data is of high quality?
• DATA QUALITY MANAGEMENT: This dimension examines the tools and processes in place to make
sure data is accurate, reliable, and relevant. It examines established processes for exposing and
remediating poor data quality. It also examines the importance of data quality in analytics and
responsible data and analytics.
• ASSURANCE AND IMPACT: This dimension examines how well data quality processes are working
vis-à-vis data integrity. It includes questions about data trust, compliance, and risk mitigation. It
also examines how the organization is measuring data quality.
• TOOLS: Organizations rely on a range of tools to manage and improve data quality. This
dimension tracks the availability and adoption of tools for data quality and how advanced they
are. Such tools include automated solutions as well as monitoring and observability solutions
that look at the health of the data.
Stages of Maturity
The TDWI Data Quality Maturity Model consists of five stages: Nascent, Early, Established,
Comprehensive, and Advanced/Visionary. As organizations move through these stages, they should
gain more value from their data and analytics investments. Figure 2 illustrates these stages.

Figure 2. Stages of maturity in the TDWI Data Quality Maturity Model.

This guide provides a brief overview of each of the stages of the TDWI Data Quality Maturity
Model. This description provides a context for interpreting your scores when taking the assessment.

Overview of Stages
Stage 1: Nascent

In the Nascent stage, organizations have no company-wide data quality strategy, funding, or
awareness by executives about why data quality is vital.
Companies in this stage typically do not maintain an inventory of data assets to aid in
standardization and understanding. The organization may be implementing data quality teams,
processes, and platforms but these are usually in pockets with no overall data quality management
practice anywhere in the organization. Where data quality processes exist, they are focused on
traditional data types.

TDWI RESEARCH   5


TDWI Data Quality Maturity Model Assessment Guide Overview of Stages

Although many in the organization may not be satisfied with the quality of their data, the value of
data quality management is not communicated throughout the organization and it hasn’t become a
priority. There are minimal processes to ensure accurate, reliable, relevant, complete, or timely data.
Data quality management tools are virtually nonexistent. Many in the company may not understand
what ensuring high-quality data entails and the enterprise may report problems with data accuracy,
consistency, integrity, compliance, and relevance.
The nascent organization risks falling into a situation where it may become a victim of its own lack of
attention to data quality and have trouble moving forward with analytics and other efforts.
Stage 2: Early

In the Early stage, the organization is starting to think about a more formal data quality program. It
is putting objectives in place and starting to make executives aware of the value of data quality. This
may be because an event has occurred where poor data quality impacted revenue (for instance, an
inaccurate forecast).
The company is trying to put the foundations of data quality management in place to ensure
complete and accurate data—primarily for traditional structured data. In addition to putting
processes in place for data quality, the organization is also starting to think about data quality tools,
typically for data profiling and cleansing. Stakeholders are beginning to emerge as the value of data
quality begins to be better communicated across the company.
The Early-stage organization is starting to take the steps needed to establish basic data quality
management.
Stage 3: Established

During the Established stage, there is an organizational strategy and plan in place for data quality.
This may also include preliminary metrics about what constitutes high-quality data in terms of
accuracy, completeness, timeliness, and other core data quality parameters. These metrics might be
in place for traditional structured data. Accountability and data standards are established and the
importance of data quality is being communicated across the organization.
The enterprise will most likely use tools to profile and cleanse data, but these tools may not yet be
automated. They are researching other tools. Likewise, the organization may have started to implement—
or at least investigate—tools such as data catalogs and data lineage solutions to help standardize data,
provide metadata, and better understand where problems with data could have occurred.
Those organizations in the Established phase will need to keep evolving their data quality strategy
because they will also most likely be in a position to start to collect and analyze more diverse data
and greater data volumes. This will necessitate new processes and tools, including automation.

TDWI RESEARCH   6


TDWI Data Quality Maturity Model Assessment Guide Overview of Stages

Stage 4: Comprehensive

During the Comprehensive phase, the organization has implemented a sound data quality program
and the results are measured. Data quality is an agenda item in data governance meetings. Tools are
in place for core data quality metrics. Data quality has become a strategic imperative. The enterprise
is now building on what it has to take it to the next level with new data types, such as text, machine,
and real-time data.
These organizations may have moved forward in their analytics efforts, including building machine
learning models. That means they are also dealing with assessing both the data quality of the input
to models and what comes out of the models. Enterprises are determining what constitutes high-
quality data for new data types and implementing processes and new algorithms for determining this.
They are also working on identifying ethical considerations for data, such as data bias and fairness.
Organizations with comprehensive data quality programs are also investing in tools to help automate
different parts of the data quality and data pipeline process. This includes augmented tools to alert
them to issues in their data and observability tools so enterprises can measure the health of their data.
All of this is important as their data ecosystem becomes increasingly complex.
Stage 5: Advanced/Visionary

Very few organizations reach the Advanced/Visionary stage of data quality. That is partly because
data quality management is, in many ways, a moving target. Visionary companies have put
standard processes in place to evolve their data quality strategies as they utilize more diverse data.
Stakeholders in both business and IT are accountable for data quality, and team structures are in
place. Collaboration occurs across the organization on data quality.
These visionary companies are also using automated and augmented tools for data quality. These
tools may be part of a larger solution, such as a data catalog or a data observability platform. They
may be in the data pipelines. Metrics associated with data quality tie to business goals and these are
consistently measured and publicized.

TDWI RESEARCH   7


TDWI Data Quality Maturity Model Assessment Guide Evaluating Assessment Scores

Evaluating Assessment Scores


The TDWI Data Quality Maturity Assessment has approximately 50 questions across the five
categories that form the dimensions of the Maturity Model. Of course, organizations can be at
different stages of maturity in these five categories, and most are.
Scoring
Questions are weighted differently depending on their relative importance. Each dimension has a
possible maximum score of 20 points. Because organizations can be at different levels of maturity
in the five dimensions, we score each section separately and provide an overall score. The assessment
output is a score in each dimension and the total score.
Interpretation
Once you complete the survey, you will see a report that breaks down your scores for each
dimension as follows:

Score Per Dimension Stage

5 or less Nascent

6-10 Early

11-14 Established

15-18 Comprehensive

19-20 Advanced/Visionary

For instance, if you receive a score of 11 in the Organizational Commitment dimension of the
assessment, you are in the Established stage for that dimension. You should expect to see your scores
vary across the different dimensions. Data quality programs don’t necessarily evolve at the same rate
across all these dimensions.
When you complete the assessment, you might see scores such as this:

DIMENSION SCORE STAGE

Organizational commitment 7 Early

Roles and responsibilities 5 Nascent


Data quality management 5 Nascent

Assurance and impact 6 Early

Tools 5 Nascent

This means you are more mature in your Organizational Commitment category but less mature in
some of the other areas. Understanding your relative strengths and weaknesses will help you establish
goals and, in turn, target your efforts and allocate resources.

TDWI RESEARCH   8


TDWI Data Quality Maturity Model Assessment Guide Summary

Summary
The TDWI Data Quality Maturity Assessment provides a quick way for organizations to assess their
maturity in data quality. The assessment is based on the TDWI Data Quality Maturity Model, which
consists of five maturity stages.
The assessment serves as a relatively coarse measure of your data quality maturity. The approximately
50 questions across five categories merely touch the surface of the complexities involved in building
your data quality program. To gauge precisely where you are, it may also make sense to work with an
independent source to validate your progress.

TDWI RESEARCH   9


TDWI Research provides research and advice for data
professionals worldwide. TDWI Research focuses exclusively
on data management and analytics issues and teams up with
industry thought leaders and practitioners to deliver both
broad and deep understanding of the business and technical
challenges surrounding the deployment and use of data
management and analytics solutions. TDWI Research offers
in-depth research reports, commentary, inquiry services, and
topical conferences as well as strategic planning services to
user and vendor organizations.

A Division of 1105 Media


6300 Canoga Avenue, Suite 1150
Woodland Hills, CA 91367

E [email protected] tdwi.org

You might also like