Ipc2022-87872 Structured, Systematic Threat Based Approach To Evaluate and Improve
IPC2022
September 26-30, 2022, Calgary, Alberta, Canada
IPC2022-87872
1 © 2022 by ASME
GIS Geographic Information System
ILI In-Line Inspection
ISO International Organization for Standardization
KPI Key Performance Indicator
QA Quality Assurance
QC Quality Control
SME Subject Matter Expert
SoA System of Access
SOP Standard Operating Procedures
SoR System of Record
SoT Source of Truth
1. INTRODUCTION
The importance of having complete, consistent, and reliable
data for operations and risk assessment, particularly as the
industry moves towards a quantitative risk assessment approach,
is significant. Two essential components of data quality
management are continuous data quality assessment and
maturity of data quality processes and capabilities. The data
quality process maturity (Process maturity) and data quality are
expected to have a cause-and-effect relationship, i.e., mature
processes are likely to lead to higher quality data in an
organization.

High-quality data leads to confident decision-making related to
risk assessment and integrity investing across the organization.
In addition, as organizations are progressively driven by
information and analytics, mature data quality processes enhance
confidence in prioritizing the "right work."

This paper focuses on the approach developed to assess the data
quality and Process maturity for a gas pipeline operator
(Operator). The approach was centered on the external corrosion
(EC) threat. The process, as developed, is fundamentally based
on the guidance provided in ISO (8000-8) [3] for data quality
assessment and DNVGL-RP-0497 [4] for Process maturity
evaluation.

2. BACKGROUND
During a review of pipeline risk assessment results, the
pipeline operator (Operator) found that risk results for a
particular pipeline were driven by the mainline coating type
being listed as "un-coated." However, further review of the
records showed that the pipeline, in fact, was coated.

One of the Operator's foundational principles is 'data as an asset'.
Thus, the Operator understands the critical impacts, from
financial to public safety impacts, of such data inconsistencies
across many potential receptors.

Data governance defines the policies, processes, roles, and
responsibilities required for continuous monitoring and
improvement in data quality. FIGURE 1 shows a typical data
governance hierarchy for an organization. As shown in FIGURE
1, data quality is part of data management, and data management
is part of data governance.

FIGURE 1: TYPICAL DATA GOVERNANCE HIERARCHY FOR
AN ORGANIZATION

Thus, data governance holds the key to ensuring efficient and
effective use of data within the organization. FIGURE 1 shows
that data quality management is achieved with data governance.

The approach presented in this paper was founded on two
essential components of data quality management:
• Data quality assessment
• Process maturity evaluation

3. METHODOLOGY
The approach includes the following steps:
1. Step 1: Baseline data quality assessment: This
step involves performing a baseline data quality
assessment that includes developing Data Quality
Indices (DQIs) or Data Quality Key Performance
Indicators (KPIs).
2. Step 2: Process maturity evaluation: This step
involves the development of a framework to assess
all processes, capabilities, and governance required
to ensure high data quality within an organization.
3. Step 3: Implement process maturity
improvements (Future Steps): This step involves
identifying action items for Process maturity
improvements and executing the specified action
items.
4. Step 4: Periodic reassessment of data quality
(Future Steps): This step involves a periodic
reassessment of data quality to measure the
effectiveness of Process maturity improvements.

An organization's Process maturity evaluation provides a
measure of the processes, capabilities, and governance required
to ensure high data quality. Ideally, improvements in the maturity
level of an organization should translate to improvements in its
data quality assessment results. An efficient approach to do so
could comprise properly sequenced improvements in Process
maturity and data quality. Advances in Process maturity are
likely to result in improved results from data quality
assessments. A periodic data quality assessment could assess the
effectiveness of Process maturity improvements. Data quality
metrics discussed further in this paper could be used to capture
the performance of continuous improvements in Process
maturity using trend lines, and activities could be adjusted
according to negative or positive trends.

4. BASELINE DATA QUALITY ASSESSMENT
Data is defined in ISO (8000-8) [3] as:
"Reinterpretable representation of information in a
formalized manner suitable for communication, interpretation,
or processing. The ability to create, collect, store, maintain,
transfer, process, and present information and to support
business processes in a timely and cost-effective manner requires
both an understanding of the characteristics of the information
and data that determine its quality, and an ability to measure,
manage and report on information and data quality."

Information and data quality are defined and measured according
to the following categories:
• Syntactic quality is the degree to which data conforms
to its specified syntax, i.e., requirements stated by the
metadata.
• Semantic quality is the degree to which data
corresponds to what it represents.
• Pragmatic quality is the degree to which data is found
suitable and worthwhile for a particular purpose.

Source of Truth (SoT): The reference to which data
users can turn when they want to ensure they have the
correct version of a piece of information.

The SoR was determined for all 329 EC data elements. In
addition, a comprehensive system map was created that
demonstrated the flow of information between the different SoRs
within the Operator's data systems.

The third task in the baseline data quality assessment was to
assign a priority from one (1) through three (3) to all 329 EC
data elements. The priorities were assigned based on the impact
of the data element. For example, data elements that are required
for regulatory compliance or essential for evaluating the threat
were given a priority of one (1), whereas data elements that are
nice-to-have information and would provide increased accuracy
and granularity were assigned a priority of three (3). The priority
of the data elements was evaluated with respect to six EC
processes: In-Line Inspection (ILI), Excavation, Cathodic
Protection (CP), External Corrosion Direct Assessment (ECDA),
Geographic Information System (GIS), and Risk Process.

The results of the above three (3) tasks are presented in FIGURE
2. In addition, a list of observations was prepared to identify
action items that can help in an improved understanding of the
characteristics of the 329 EC data elements.
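
The priority assignment described above for the EC data elements can be sketched as a simple rule. The Python below is a minimal illustration under stated assumptions, not the Operator's actual procedure: the flag names (`regulatory_required`, `essential_for_threat`, `nice_to_have`) and the example elements are hypothetical, and the rule for priority two (2), which the text does not spell out, is assumed to be "everything in between."

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataElement:
    """One EC data element; all flags are hypothetical illustrations."""
    name: str
    regulatory_required: bool = False
    essential_for_threat: bool = False
    nice_to_have: bool = False


def assign_priority(element: DataElement) -> int:
    """Return a priority from 1 (highest) to 3 (lowest)."""
    # Priority 1: required for regulatory compliance or essential
    # for evaluating the threat, as stated in the text.
    if element.regulatory_required or element.essential_for_threat:
        return 1
    # Priority 3: nice-to-have information that adds accuracy and granularity.
    if element.nice_to_have:
        return 3
    # Priority 2: assumed here to cover everything in between.
    return 2


# Hypothetical elements, used only to exercise the rule.
elements = [
    DataElement("coating_type", essential_for_threat=True),
    DataElement("install_date", regulatory_required=True),
    DataElement("soil_resistivity"),
    DataElement("coating_vendor", nice_to_have=True),
]
priorities = {e.name: assign_priority(e) for e in elements}
print(priorities)
```

In practice the priority was evaluated per EC process, so a fuller version would likely key the result by both data element and process.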
measurements to be performed for the selected dimensions.
Some examples of dimensions are Accuracy, Conformance to
metadata/schema, Precision, Timeliness, Format/structural
Consistency, and Completeness. FIGURE 3 shows a
representative relationship between metrics and dimensions.

Dimension: Completeness
FIGURE 4: COMPLETENESS DATA QUALITY EVALUATION
ON A DEMONSTRATION DATABASE

An organization's internal requirements, nature of business,
culture, and priorities affect how its data quality activities are
designed, built, operated, and monitored. As a result, an
organization's Process maturity varies from Level 1 (Initial) to
Level 5 (Optimized). FIGURE 5 shows the characteristics of
Process maturity levels for an organization.

A1. Governance
A2. Organization & People
A3. Data Standards, Requirements & Metrics
A4. Process Efficiency
A5. Technology & Tools
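
As an illustration of the Completeness dimension named in FIGURE 4, a completeness metric could be computed as the fraction of records in which a data element is populated. The sketch below is one possible formulation, not the metric definition used in the paper: the record layout, the field names, and the choice to treat empty strings and "unknown" as missing are all assumptions.

```python
from typing import Any, Mapping, Sequence

# Assumed markers for a missing value; a real schema would define these.
MISSING = (None, "", "unknown")


def completeness(records: Sequence[Mapping[str, Any]], field: str) -> float:
    """Fraction of records whose `field` holds a non-missing value."""
    if not records:
        raise ValueError("no records to evaluate")
    populated = 0
    for record in records:
        value = record.get(field)
        if isinstance(value, str):
            # Normalize text so " Unknown " and "unknown" compare equal.
            value = value.strip().lower()
        if value not in MISSING:
            populated += 1
    return populated / len(records)


# Hypothetical demonstration records.
records = [
    {"segment": "S1", "coating_type": "FBE", "install_year": 1998},
    {"segment": "S2", "coating_type": "", "install_year": 2004},
    {"segment": "S3", "coating_type": "Unknown", "install_year": None},
    {"segment": "S4", "coating_type": "Coal tar", "install_year": 1975},
]
dqi = {f: completeness(records, f) for f in ("coating_type", "install_year")}
print(dqi)
```

A check of this kind would not flag a populated but incorrect value, such as a coated pipeline recorded as "un-coated"; that case falls under the Accuracy dimension and needs a comparison against the records.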
6. IMPLEMENT PROCESS MATURITY IMPROVEMENTS (FUTURE STEPS)
A review of the Operator's Process maturity evaluation
results with the respective data owners for every EC process
resulted in the identification of opportunities for improvement in
all five evaluation areas across the six EC processes.
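
Whether implemented improvements are working is meant to be judged through the periodic reassessment in Step 4 of the methodology, with data quality metrics tracked as trend lines. A minimal sketch of such a trend check follows. It assumes equally spaced assessments and fits an ordinary least-squares slope; the function names, the tolerance, and the history values are hypothetical placeholders, not results from the paper.

```python
from statistics import mean
from typing import Sequence


def trend_slope(values: Sequence[float]) -> float:
    """Least-squares slope of values against assessment index 0, 1, 2, ..."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two assessments")
    xs = range(n)
    x_bar, y_bar = mean(xs), mean(values)
    num = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, values))
    den = sum((x - x_bar) ** 2 for x in xs)
    return num / den


def classify_trend(values: Sequence[float], tol: float = 0.005) -> str:
    """Label a metric history; `tol` is an assumed dead band around zero."""
    slope = trend_slope(values)
    if slope > tol:
        return "improving"
    if slope < -tol:
        return "declining"
    return "flat"


# Placeholder completeness scores from four successive reassessments.
history = [0.62, 0.66, 0.71, 0.75]
slope = trend_slope(history)
print(round(slope, 3), classify_trend(history))
```

A "declining" or "flat" label would be the cue to revisit the improvement actions for the affected process.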
Maturity Level L2   0.84   0.79   0.64   0.81   0.58
Maturity Level L3   0.26   0.34   0.13   0.23   0.26
FIGURE 8: EXAMPLE OF BAYESIAN NETWORK MODEL FOR
PROCESS MATURITY IMPROVEMENTS
(A3) is now higher in priority than improving L2 in A5. This
reversal of priorities between Scenarios 1 and 2 illustrates how
non-trivial the optimal prioritization problem is: the
solution may depend on the details of any evidence that may become
available. Generally, the BN model can update the overall
recommendations considering any evidence or relevant
information that may become available.
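
The way new evidence can reverse a priority ranking can be illustrated with a single application of Bayes' rule. This is a toy sketch, not the paper's BN model: it treats each area as an independent binary state ("meets L2" or not), and the priors, the audit likelihoods, and the ranking rule are all made-up placeholders chosen only to show the mechanism.

```python
from typing import Dict, List


def posterior(prior: float, p_obs_if_true: float, p_obs_if_false: float) -> float:
    """P(state is true | observation) by Bayes' rule for a binary state."""
    evidence = prior * p_obs_if_true + (1 - prior) * p_obs_if_false
    if evidence == 0:
        raise ValueError("observation has zero probability under the model")
    return prior * p_obs_if_true / evidence


def ranking(p_meets: Dict[str, float]) -> List[str]:
    """Areas ordered from least to most likely to already meet the level."""
    return sorted(p_meets, key=lambda area: p_meets[area])


# Placeholder priors: probability that each area already meets level L2.
p_meets_l2 = {"A3": 0.70, "A5": 0.60}
before = ranking(p_meets_l2)

# Hypothetical evidence: an audit of A3 reports a gap. Assume such a report
# occurs with probability 0.2 if A3 meets L2 and 0.8 if it does not.
updated = dict(p_meets_l2)
updated["A3"] = posterior(p_meets_l2["A3"], 0.2, 0.8)
after = ranking(updated)

print(before, after, round(updated["A3"], 3))
```

Under these placeholder numbers A5 ranks first before the audit and A3 ranks first after it. A full BN would additionally propagate the evidence through dependencies between areas and levels, which this sketch ignores.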
REFERENCES
[1] Kjærulff, U. B., and Madsen, A. L., Bayesian Networks
and Influence Diagrams: A Guide to Construction and
Analysis (2nd Edition), Springer, 2012
[2] BayesFusion, LLC provides artificial intelligence
modeling and machine learning software based on
Bayesian Networks: https://www.bayesfusion.com/
[3] ISO 8000-8 Data Quality – Part 8: Information and data
quality: Concepts and measuring, 2015
[4] DNVGL-RP-0497 Data Quality Assessment
Framework, January 2017