Create An Enterprise Vision For Data Quality and Observability Whitepaper
Executive summary
Data quality: the extent to which data represents what it purports to represent
and the extent to which data satisfies a specific requirement.

As organizations turn their focus to better leveraging their growing volumes
of data, key business and technical stakeholders are working through the
long, arduous process of making the case to formalize various data capabilities
and investing in the related technologies.

A common reason to invest in a data strategy is an overall need for better
data understanding and easier access to quality and trusted data to support
operational and analytical activities. Given that, why is an enterprise data quality
tool commonly an afterthought or put at the end of a wish list?
It’s likely because talking about data quality issues or quality management can
be overwhelming or too theoretical. Those seeking to establish a successful
enterprise data quality function need a concrete path to build confidence
and enthusiasm when making the case to invest in resourcing and a new tool.
This white paper outlines the critical components of a successful data quality
function and considerations on how to get there.
By identifying the impact that the data capability can make, you can create
a plan to deliver value to the company as quickly as possible while building
a sustainable practice.
• Why are they critical, and what are their expected outcomes?
• Is that data “fit for use” and able to support the outcome? If not, why?
• Can you determine the impact of poor data quality on other key processes?
Any organization’s goals are going to change over time, and this line of
questioning should be revisited frequently at the enterprise, department and
data domain levels to ensure alignment and confirm that progress is being made.
Following are the building blocks for establishing a data quality function:
• Data Quality Lead/Manager, who sets the vision and strategy for the data
quality function, executes the strategy and is accountable for measurable
enterprise-level impact and progress. An organizational change management
background is required for this individual to be successful. This role focuses
on promoting awareness, adoption and expansion.
• Data Quality Analyst(s), the experts who understand all aspects of data
quality and typically never want to do anything but profile data, talk about data
dimensions, translate the findings, or find new rules or more efficient ways to
apply the rules broadly across the data landscape.
Data quality management: Practice of defining expectations of data,
monitoring for conformance to expectations and correcting non-conformance.

The number of data quality analysts also depends on the maturity of this
functional area and the volume of use cases and data. At least one skilled
data quality analyst is required to get the function going, either through
cross-training or as a new hire. These individuals also monitor results and
help to identify quality issues and ensure remediation occurs.

In the past, a leading measurement of data quality function growth was
expanding the team by bringing in another quality analyst. Collibra’s
automated machine learning rule identification is helping data analysts scale
monitoring of data sources, as well as bring data stewards to the
decision-making table, to review and approve the automated rule recommendation.
Collibra’s new-generation data quality tool has had a positive impact on
productivity through self-service and embedding quality rules more broadly.
When developing a new report or testing out a new machine learning model,
many hours go into knowing the data needed to be successful and introducing
quality controls within the code, as well as standardizing, storing and creating
access paths to the data to deliver the anticipated results. Getting to the
right results typically requires collaboration between business and IT, among
individuals with varying skill sets who manage multiple responsibilities.
For example, when a new report or model is migrated into production, the initial
team remains dedicated to a successful launch. Output is monitored closely
and, often, the IT project team is still available to help find and fix any production
issues. The cracks start to appear when the data asset (e.g., report or model)
goes into maintenance mode and is handed off to a production support team.
The seven steps below outline activities needed to ensure data is assessed,
monitored and measured against expectations for use. A project team (such
as the example above) typically performs the first four activities very well with
or without an automated data quality technology, resulting in a successful
implementation. Once in maintenance and without a predictive technology,
such as Collibra Data Quality & Observability, many of these steps are highly
manual and require individuals to watch over the data flow. As a result, quality
issues are likely to go uncaught.
Output view of the profiling step, which can be valuable to determine results of the
assessment.
Users can also view a sample dataset with results per column, with the option to
mask the data in the case of sensitive columns.
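The profiling output and masked sample described in the captions above can be approximated with a minimal Python sketch. It is not Collibra's implementation: the function names (`profile_column`, `profile_table`, `mask`, `masked_sample`), the sample records and the choice of statistics are all assumptions made for illustration.

```python
from collections import Counter


def profile_column(values):
    """Summarize one column: row count, nulls, distinct values, top value."""
    non_null = [v for v in values if v not in (None, "")]
    top = Counter(non_null).most_common(1)
    return {
        "rows": len(values),
        "nulls": len(values) - len(non_null),
        "distinct": len(set(non_null)),
        "most_common": top[0][0] if top else None,
    }


def profile_table(rows):
    """Profile every column of a list of dict records."""
    columns = list(rows[0].keys()) if rows else []
    return {c: profile_column([r.get(c) for r in rows]) for c in columns}


def mask(value, keep=2):
    """Replace all but the last `keep` characters with asterisks.

    Values no longer than `keep` are masked completely.
    """
    text = str(value)
    visible = text[-keep:] if 0 < keep < len(text) else ""
    return "*" * (len(text) - len(visible)) + visible


def masked_sample(rows, sensitive, limit=5):
    """Return up to `limit` records with sensitive columns masked."""
    return [
        {
            c: mask(v) if c in sensitive and v not in (None, "") else v
            for c, v in r.items()
        }
        for r in rows[:limit]
    ]


# Hypothetical sample data, invented for this sketch.
sample_rows = [
    {"customer_id": "C001", "ssn": "123-45-6789", "state": "CO"},
    {"customer_id": "C002", "ssn": "987-65-4321", "state": ""},
    {"customer_id": "C003", "ssn": "555-11-2222", "state": "CO"},
]

if __name__ == "__main__":
    for column, stats in profile_table(sample_rows).items():
        print(column, stats)
    for record in masked_sample(sample_rows, sensitive={"ssn"}):
        print(record)
```

Treating empty strings as nulls is a design choice in this sketch; a real profile would usually report blanks and nulls separately.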
Scorecard of a specific job that runs daily and the number of issues caught by type (outlier, dupe, rule breaks)
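A scorecard like the one captioned above boils down to counting findings by type for each job run. The sketch below shows one simple way to do that in Python. It uses fixed, hand-written detectors (an exact-key duplicate check, a modified z-score outlier test and boolean rules), not the machine learning approach the paper attributes to Collibra. All names, thresholds and sample records are assumptions.

```python
from statistics import median


def find_duplicates(rows, key):
    """Return records whose `key` value was already seen earlier in the run."""
    seen, dupes = set(), []
    for row in rows:
        if row[key] in seen:
            dupes.append(row)
        else:
            seen.add(row[key])
    return dupes


def find_outliers(rows, field, threshold=3.5):
    """Flag records using the modified z-score (median absolute deviation)."""
    values = [row[field] for row in rows]
    if not values:
        return []
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        return []  # no spread, so nothing can be called an outlier
    return [r for r in rows if 0.6745 * abs(r[field] - med) / mad > threshold]


def find_rule_breaks(rows, rules):
    """Return (rule_name, record) pairs for every failed rule check."""
    return [(name, r) for r in rows for name, check in rules.items() if not check(r)]


def scorecard(rows, key, field, rules):
    """Count issues by type for one job run."""
    return {
        "dupe": len(find_duplicates(rows, key)),
        "outlier": len(find_outliers(rows, field)),
        "rule_break": len(find_rule_breaks(rows, rules)),
    }


# Hypothetical daily load, invented for this sketch.
daily_orders = [
    {"order_id": "O1", "amount": 100, "currency": "USD"},
    {"order_id": "O2", "amount": 102, "currency": "USD"},
    {"order_id": "O3", "amount": 98, "currency": "usd"},
    {"order_id": "O4", "amount": 101, "currency": "USD"},
    {"order_id": "O4", "amount": 99, "currency": "USD"},
    {"order_id": "O5", "amount": 5000, "currency": "USD"},
]
order_rules = {"currency_is_usd": lambda r: r["currency"] == "USD"}

if __name__ == "__main__":
    print(scorecard(daily_orders, "order_id", "amount", order_rules))
```

Storing one such dictionary per run makes it possible to trend issue counts over time, which is what a daily scorecard is meant to surface.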
6. Data Issue Remediation – Identify and correct what caused the data issue; for example:
• Conduct a root-cause analysis to determine the reason for the data exception.
• Determine corrective action, which may include correcting a software defect,
implementing a business process change and providing business-user training,
correcting an input file, or potentially adding a new data quality rule as part of monitoring.
• Test and implement the corrective action.
• Remediate the erroneous data in the impacted databases.
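The remediation loop above (find the cause, add a rule to monitoring, test the fix, correct the stored data) can be illustrated with a small Python sketch. The scenario is hypothetical: an input file delivers state codes in inconsistent case. The rule names, the `run_rules` helper and the records are assumptions, not part of any product API.

```python
def run_rules(rows, rules):
    """Return {rule_name: [failing records]} for rules with any failure."""
    failures = {}
    for name, check in rules.items():
        bad = [row for row in rows if not check(row)]
        if bad:
            failures[name] = bad
    return failures


def is_two_upper_letters(value):
    """True when `value` is a two-letter upper-case alphabetic code."""
    return (
        isinstance(value, str)
        and len(value) == 2
        and value.isalpha()
        and value.isupper()
    )


def remediate_state(row):
    """Corrective action for stored data: trim and upper-case the state code."""
    fixed = dict(row)
    fixed["state"] = str(row["state"]).strip().upper()
    return fixed


# Rules already in monitoring before the issue was found.
monitoring_rules = {"id_present": lambda r: bool(r.get("id"))}

# New rule added as an outcome of the root-cause analysis.
monitoring_rules["state_is_two_upper_letters"] = (
    lambda r: is_two_upper_letters(r.get("state"))
)

# Hypothetical records from the faulty input file.
records = [
    {"id": 1, "state": "CO"},
    {"id": 2, "state": "ny "},
    {"id": 3, "state": "tx"},
]

# Test the corrective action: failures before, none after remediation.
before = run_rules(records, monitoring_rules)
after = run_rules([remediate_state(r) for r in records], monitoring_rules)

if __name__ == "__main__":
    print("before:", before)
    print("after:", after)
```

Keeping the new rule in monitoring after the data is corrected means a recurrence of the same defect is caught on the next run instead of by a report consumer.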
Conclusion
Now that you know the critical components of a successful data quality
function, create the vision for your organization. First, write the story of your
organization’s data quality journey. What is its current state? Where do you
need to get to and why? Seize the data-focused momentum and make your
case for data quality. Operationalize the function with the seven steps to ensure
efficiency and consistency across the organization.
FSFP has been a trusted Collibra Partner since 2012. In 2020, FSFP received
Collibra’s Honorable Mention Partner of the Year commendation.
About Collibra
Since 2008, Collibra has been uniting organizations by delivering trusted data
for every use, for every user, and across every source. Our Data Intelligence
Cloud brings flexible governance, continuous quality and built-in privacy to all
types of data. The Global 2000 relies on Collibra to create the critical alignment
that accelerates workflows and delivers better results faster. We have a diverse
global footprint, with offices in the U.S., Belgium, Australia, Czech Republic,
France, Poland and the U.K. To learn more, visit collibra.com, follow @Collibra
on Twitter or follow us on LinkedIn.