2007 Piarc Evaluating Automated Cracking Equipment
2007 Piarc Evaluating Automated Cracking Equipment
EQUIPMENT
We would like to thank the following organisations and their representatives that provided their time
and shared their precious expertise to the accomplishment of this World Road Association (PIARC)
document.
We also want to thank the persons who contributed directly or indirectly to the redaction of the paper:
The translation into French of the original version was produced by the Quebec Ministry of
Transportation (Quebec-Canada).
Pietro Bumma (Belgium), Paul Harbin (Canada) and Mathieu Grondin (Canada-Quebec) were
responsible within the Technical Committee of the quality control for the production of this report.
The Technical Committee was chaired by Bjarne Schmidt (Danemark). Guy Descornet (Belgium) and
Mathieu Grondin (Canada – Quebec) were respectively the English and the French-speaking
secretaries.
-2-
TABLE OF CONTENTS
SUMMARY ....................................................................................................................................................... 4
INTRODUCTION.............................................................................................................................................. 5
1. EVALUATION METHODOLOGY CONTEXT..................................................................................................................................... 5
1.1 WHAT IS A CRACK? .......................................................................................................................................................................... 5
1.2 UNCONTROLLABLE VARIATIONS IN MEASURING (DETECTING) CRACKS ...................................................................... 5
1.3 TESTING PROCEDURES ................................................................................................................................................................... 6
2. INCREASING THE REPRODUCIBILITY OF THE CRACKS MEASUREMENTS....................................................................... 7
2.1 DELIMITATION OF THE ANALYZED ZONE................................................................................................................................. 7
2.2 CRACK DESCRIPTION AND TYPE ................................................................................................................................................. 9
2.3 CRACK SEVERITY LEVEL ............................................................................................................................................................. 11
2.4 CRACK EXTENT............................................................................................................................................................................... 12
2.4.1 Research level validation test..................................................................................................................................................... 12
2.4.2 Project level validation test........................................................................................................................................................ 12
2.4.3 Network level validation test ...................................................................................................................................................... 13
3. SETTING THE METHOD TO ASSESS THE AUTOMATED CRACK MEASUREMENT DEVICES PERFORMANCES.... 13
3.1 RESEARCH LEVEL VALIDATION TEST...................................................................................................................................... 13
3.2 PROJECT LEVEL VALIDATION TEST.......................................................................................................................................... 15
3.3 NETWORK LEVEL VALIDATION TEST....................................................................................................................................... 17
4.0 CONCLUSION AND RECOMMENDATIONS .................................................................................................................................. 20
5.0 BIBLIOGRAPHY.................................................................................................................................................................................... 21
APPENDICE 1 DETAILED METHODOLOGY FOR THE RESEARCH LEVEL VALIDATION TEST ......................... 22
-3-
SUMMARY
Crack type, severity and extent are one of the most important data to select maintenance and repair
actions. These data are in fact, the basic data of a pavement management system.
The conclusions of a worldwide survey carried out by the PIARC’s former Committee C1 in the
previous cycle (2004-2007) revealed that performance tests of the automated crack measurement
systems are only conducted infrequently, and these are very often incomplete and non-uniform. In
addition, all of the experiments involving comparison of the results obtained using automated
equipment with those obtained by the manual method faced the same definitional problems relating to
distresses. In light of this, Working Group D of Committee TC 4.2 “Road/Vehicle Interaction” set as
its principal objectives the development:
of a detailed method for identification of surface distress (and more specifically of pavement
cracking);
of a method allowing evaluation of the reliability of the automatic pavement cracking detection
equipment.
First, the paper presents a series of more objective and more detailed rules to measure and classify
cracks. Some have the objective of increasing the reproducibility of measurements and allowing more
reliable comparison of the cracking data gathered by different equipment. A monitored zone has been
delimited as well as a better definition of a crack and a methodology to measure the crack severity and
extent.
Similarly, three methods of evaluation of the reliability of automated crack measuring equipment are
also proposed. The research level validation test is used to rate the sensor’s resolution and sensitivity
regarding the capability to capture raw pavement data (images or 3D data). The project level
validation test allows road administrations to rate the equipment’s bias and repeatability to precisely
measure cracks on some small controlled reference track which presents specific and various
conditions. While the last validation test, entitled network level validation test, differs from the
preceding one by the scope of the measurement campaign and by the precision level sought.
Performed on 50-100 km of road, the equipment can be tested on varied surfaces that are more
representative of the network reality.
The objective of this technical report is to help road administrations choose and select automated
systems in regard to their needs, therefore increasing the quality of the crack data used in pavement
management systems. The concepts presented in this document are not international standards, only a
summary of the experiences exchanged between world experts.
-4-
INTRODUCTION
The present working group is undertaking to pursue this goal, which breaks down into two stages:
This report presents the main conclusions of the work carried out by the members of the working
group D of the TC 4.2 committee on these two objectives. The objective of this technical report is to
help road administrations evaluate and select automated systems in regard of their needs and to
increase the quality of the crack data used in pavement management systems. The concepts presented
in this document are not international standards but only a summary of the experience exchanges
between world experts.
The purpose of this section is to set the background for the discussion of issues with respect to
automated crack detection to be presented in this document. In particular, what it is and why it is, that
there is such concern over the apparent and often proven differences between the many systems that
claim the automated detection of cracks. What are the issues?
One would think that this is a relatively simple question, however, there is no definitive definition of a
crack in objective engineering terms. Nowhere is it stated that a crack is a discontinuity in the road
surface that has a minimum length, width and depth.
Until such a common and widely accepted definition can be written, the real question to be answered
is: how do we define a crack? The answer will depend on the observer. Although this point may be
applied more to the classification of cracking than to the crack itself, the dilemma still remains the
same. For example, a series of small disjointed cracks could be considered as a series of cracks or only
one crack.
Although the context in which we are thinking is in terms of automated crack detection, what the
computer can see and how it is interpreted, one could easily argue that human pattern recognition is
also quite ambiguous. Different opinions will always exist.
There are many factors that can influence the capability and reliability of the automated crack
measurement systems. These factors can affect both the collection and the processing of the raw data.
Throughout the world there are many different pavement types and mixtures. A lot of road
construction depends on the local availability of the resources needed for road construction.
-5-
Geographic regions throughout the world, within a country or even a state or province will dictate the
requirements of road construction and surface properties. Because there is such a wide variety one
would ponder the feasibility of developing a singular test.
As there are different roads in different regions there will also be different maintenance programs for
these roads. As such, the requirements or information required from an agency for these roads from an
automated crack detection system will be different. The proposed testing and evaluation methods will
need to reflect these needs and ensure that the correct properties are being evaluated.
Road markings, whether they are intentional paint lines, oil patches, skid marks, bleeding or other
abnormal surface irregularities will cause havoc with the automated detection of cracking. Should
these be designated as false positives and be removed or is it sufficient to include the false positives
and discount the reported cracking by a variable amount? This type of decision may not be addressed
in the context of system evaluation but it must certainly be considered in a complete crack detection
program.
This section outlines the several different validation tests we propose for the evaluation of automated
crack detection equipment.
• A “Research level” test has been defined in order to rate the sensor’s resolution and sensitivity
regarding their capability to capture raw pavement data (images or 3D data). The purpose of this
test is to validate the manufacturer’s equipment and to determine what it can and cannot do. The
intent is to run the equipment on a short specific test grid (section of road) with predefined and
pre-cut cracks of various widths, lengths and depths. If the cracks can be seen in the raw data as
a visual image, then the equipment will have passed the equipment validation test.
This “Research level” test is normally carried out by the equipment manufacturer and it is done
for the specific purpose of making sure the sensors are performing to their design and
performance expectations. This validation is designed to reduce, to a minimum, the variability
associated with sources other than the variation of the lasers or cameras and thus, to focus on the
possibilities/performance of the system components. A client could perform this test on new
equipment according to his needs or accept the specifications provided by the manufacturer.
• A “Project level” test has been defined in order to perform a more detailed evaluation of the
equipment. This test would be the result of operating the crack detection equipment on sections
of in-service roads. It could be a single section (several kilometres), or several shorter ones, but
usually having some specific distresses of interest for the road administration. The road sections
would have to be ground truthed in order to perform an evaluation of the equipment. This test
could also be the basis of a monthly quality plan program.
• A “comprehensive network level survey” on 50-100 km of road can also be performed. This test
would need to be prepared according to the agency’s result expectations. This last step is likely
to be the most controversial of any test because the acceptance of the ground truth values will
always come into question.
At a conceptual level, the above referenced testing and evaluation steps seem quite objective and
simple, however, obtaining a reliable and repeatable ground truth, which include all of the particular
cracking patterns and pavement types an agency expects to come across is not simple. Finally, it must
be considered that it may be prudent to have separate tests for the evaluation of the collection and
processing technology. As there are many collection platforms there are many processing platforms as
well. It should be required, as part of the testing process, that crack detection applications are capable
of processing the raw data from other platforms in order to have a more standard evaluation process.
This just may lead to defining a standard format for the output of the raw sensors. Is this appropriate or
would it be considered too restrictive on the manufacturers of such equipment, limiting their ability to
advance technology?
-6-
2. INCREASING THE REPRODUCIBILITY OF THE CRACKS MEASUREMENTS
Excellent pavement distress identification manuals are available in many countries. In general, the
manuals do not cover all particular aspects and conditions that are regularly present when roadways
are assessed. For example, a transversal crack is defined in many manuals as a crack that is generally
perpendicular to the roadway surveyed. Few manuals specify a minimum length and orientation
(angle). There is still no accurate, universal method for measuring the width of cracks. This ambiguity
is quite frequent and exists for most defects measured. Whether automated or not, the analysis of
defects under these conditions may produce considerable variations depending on the operators and
the automated equipment used [1].
In order to properly evaluate automated equipment, we must be very careful not only with the test
procedures, but also, and mostly, with the description of cracks and their geometric characteristics, so
that one can distinguish them. This is the aim of this chapter.
With the aim to improve measurement repeatability, it is very important to precisely define the zone
where the systems have to detect and analyse cracks. A common way to do this is the worldwide
approach of strips [2].
The proposed method divides the surveyed lane into five strips. In the direction of travel, strips are
numbered from left to right (see picture below). Strips 2 and 4 match with wheel paths and their width
is 0.75 meter. The width of the central strip, number 3, is 1.00 meter. The width of the outside strips,
number 1 and 5, varies with the width of the lane. A typical lane width is 3.60 meters.
-7-
As shown in the figure 1, the surveyed area of the lane is limited by the inside of the lane markings. If
there are no lane markings for a distance ≥ 100 metres, the default lane width to be used is 3.60
metres, positioned at the centreline of the survey vehicle. The suggested widths are just
recommendations. They can be adapted to national or regional needs and requirements.
-8-
2.2 CRACK DESCRIPTION AND TYPE
The table 1 below gives a detailed description of some of the most frequent cracks and distresses
measured automatically.
Voie auscultée
Sens du traffic
-9-
Agglomeration of pavement cracks in the form of
a grid, with at least 3 pieces in each direction, and alligator cracking (d ≤ 300 m)
where the diameter of each piece is less than 300
Alligator
mm.
cracking
If the diameter of the pieces is greater than 300
mm, then the cracks are considered as distinct.
- 10 -
2.3 CRACK SEVERITY LEVEL
Cracks can have several appearances. Apart from changes in orientation and in length, width can also
change along a single crack. It also happens quite often that cracks are spalled (loss of material) or
branched out. Of course, this makes their characterization more difficult. When processing crack data,
the simultaneous presence of these phenomena, surface colour, changes of macrotexture or road
deformation, all amplify the problem. In order to improve the repeatability and the reproducibility of
measurements of degrees of severity, some rules are proposed below. These rules can be applied to all
kind of measured cracks, regardless of their orientation and position. They can either be used in
automated analysis algorithms, or in visual identification methods [3].
Each uniform crack segment is evaluated. The minimum segment length considered is 1 meter. The
representative degree of severity for a segment corresponds to the maximum width encountered on the
longest (more than 25%) portion of the segment. Spalled cracks are automatically considered to be of
high severity. Multiple cracks are considered to be of moderate or high severity (see multiple cracks),
and alligator cracks are considered to be of high severity. The thresholds that define the categories of
severity are not covered in this document, and each administration has to determine its own thresholds.
The figure 2 below represents a summary of the suggested method for the measurement of the degree
of severity of a crack.
- 11 -
2.4 CRACK EXTENT
This phase aims to check precision and bias of automated systems when measuring cracks under
controlled conditions. Tests are made on a track, where cracks are artificially simulated by sawing the
pavement in such a way that the length, orientation and width of the cracks are precisely defined. As
far as transversal cracks are concerned, only their number is counted. For all the other kind of cracks,
the extent is the actual length.
This phase aims to check the repeatability and reproducibility of automated systems when measuring
cracks on roads at project level. Tests are made on well known sections of in-service roads where all
the defined cracks exist at several degrees of severity.
Keeping in mind the proposed strips, it is furthermore recommended to divide the test area into 1
meter long subsections, as shown in the figure 3. The road surface will thus be divided into cells. The
method consists of recording whether there are cracks in each of the cells or not. When a cell is
“touched” by a crack, then it is an “allocated” cell. This will be the extent of the cracking. This
technique gives a kind of digital picture of the road surface, with pixels “allocated” or not.
Once a cell is defined as “allocated”, a degree of severity can also be added; this will add another
criterion to the analysis. If more than one crack touches a cell, its degree of severity will correspond to
the maximum of the considered cracks.
- 12 -
2.4.3 Network level validation test
This phase aims to verify whether the automated crack detection system is capable of identifying
lengths on a network that has both high and low levels of cracking. To do this, the surveyed area is
“overlaid” with a square grid; then any grid tile containing a crack is taken into account. The extent of
the crack to be considered is the total area of grid tiles containing a crack within a sub-section of
survey data. The dimensions of the tiles depend on the resolution of the tested device (typically 0,5 m
x 0,5 m). The recommended sub-section length is 50 meters.
The multiple step method proposed is intended as a suggested method of equipment validation. In
most cases the purpose of the testing is to evaluate specific equipment for use within an agency’s PMS
for their road network. It is up to the agency to determine which steps in the suggested three step
process need to be carried out.
Certain agencies such as the Highways Agency (HA) in the U.K. use a network level approach to
validate proper operation of equipment. This single step approach has been used for several years and
is considered sufficient as the HA is specifically interested in the big picture results for input into the
calculation of their key performance indicators.
As suggested, the three step evaluation methodology is not intended to be a set of mandatory steps but
a suggested process in the complete evaluation of a cracking system.
The first level of tests consists of evaluating the performance of the system components and
configuration. The intention is to begin by ensuring that the lasers or cameras used have the precision
and repeatability required to perform the desired work. The main objective of this test track is to
provide an objective way to evaluate the reliability of the automated equipment by controlling all the
subjective aspects of the measurements. All the bias and the lack of repeatability can then be attributed
to the equipment.
This test consists of evaluating the bias and repeatability of the systems for measuring certain targeted
crack characteristics. This stage would be carried out under what are reported as controlled conditions:
on a marked track with intentionally created cracks, using a statistical sampling plan that makes it
possible to isolate various targeted characteristics of the cracks, such as their width, position
(longitudinal strip), depth, length, and orientation. The cracks can be created using saw cuts. A guide
is presented below, which presents the method of building the track, the survey procedure to be
followed, and the method for calculating bias and repeatability.
- 13 -
The proposed artificially cracked tracks are divided transversally, based on the longitudinal strips
defined in section 2.1, and longitudinally every 10 metres. Each defined cell can then be associated
with a specific crack, however, some cells are left empty (without cracks) in order to avoid a
systematic apparition of cracks every ten meters. The artificially cracked track proposed in table 2
would have the following characteristics:
Appendix 1 presents a typical data file resulting from a research validation test as well as a more
detailed explanation on how to manage and analyse the data. By using this methodology, road
administrations can easily identify if the tested equipment has a problem and associate this problem to
a specific road characteristic (type, width, position, type of crack…).
Classification thresholds
The results obtained (repeatability and bias) can now be used to classify equipment in terms of
precision and reliability according with the tables 3 and 4. The classification thresholds used can be
determined on the basis of their effect on a known cracking index (cracking rate or other). The
highway administration can then make its equipment choice based on its management needs.
The speed range of a surveying vehicle on the test site must be recorded. The classification of vehicles
could only be valid for this speed range and will depend on the collecting process principle of the
survey vehicle. The following conditions of the test site could also influence the cracking index
provided by the survey van:
- 14 -
These conditions will influence the results, therefore the comparison of the results of different test
sites. This type of track can then be used to evaluate the influence of these factors on the system
reliability if the passes are performed in these different conditions.
The main objective of this validation set of tracks is to regularly test the quality of the equipment. For
example, a road agency with several systems can define a set of ten tracks according with the table 5
for validation and force each system to go through the validation process every month in order to
ensure that each system always satisfies a given class criteria. This test is carried out on several tracks
containing natural cracks. These tracks should be carefully selected in order to ensure that all of the
characteristics of the cracks to be measured in network mode would be represented.
The measurement protocol is simplified as much as possible. As presented in section 2.1, the surveyed
lane is divided into 5 longitudinal strips. It is then subdivided longitudinally every 1 meter. This gives
a continuous grid pattern that provides clearly-defined transversal and longitudinal positions, which
eliminates localization errors.
- 15 -
The choice of 1 meter sections is used to minimize the probability for a cell to contain more than one
crack. This length typically results in a meaningful and relevant cracking rate. In order to perform the
1 m validation, the systems tested should have some form of automatic triggering device to ensure that
the measurement starts and stop at the same location.
The use of 10 tracks is chosen to make sure that we obtain several cracking characteristics and to
ensure that the sample is representative of the most important characteristics fund on the road network.
The number and the type of cracks are variable since the validation process is performed on naturally
fissured tracks. A reference measure of the cracking rate must be obtained manually for all ten tracks.
As the survey van reliability could be affected by the speed, it is recommended to measure the speed
influence. Then, it is recommended that the van perform 5 passes on each track per each tested speed.
In its simplest form, presence or absence of cracks in each cell, the data to be collected consist of a
series of 0 (cell without crack) and 1 (cell with at least one crack). The data file would then contain
250 lines for each track and only one column (with zeros and ones) in addition to the column
identifying the location of the cells. The cracking rate is then obtained by summing the number of 1s
(ones) and dividing by 250 (the total number of cells).
In addition to the cracking rate, the concordance of the results will also be considered. The
classification is then determined by the proportion of cells with the same conclusion (presence or not
or crack in cells) for both the referenced and the measured cells.
When crack severity is considered, no cracking rate is calculated, only concordance is used for
classification. In order to evaluate the concordance, each cell is given the value of severity associated
with the most severe crack in the cell.
This methodology can be used without considering the crack type, or it can be used for each type of
crack. For example, one can give the value of 1 in each cell as soon as one crack of any type is
present and thus measure a general cracking rate, or one can give the value of one in each cell only
when a transversal crack is present and thus measure a transversal cracking rate, etc.
The repeatability of the measurement is also considered and the equipment classification is based on
the coefficient of variation of the cracking rate and of the proportion of concordant cell conclusions
Classification thresholds
The results obtained can be used to classify equipment in terms of precision and reliability according
with the tables 6 and 7. The classification thresholds used can be determined on the basis of their
effect on a known cracking index (cracking rate or other). The highway administration can then make
its equipment choice based on its management needs. This classification is limited to systems
evaluated on the same set of ten tracks. Since the tracks will vary from country to country (or agency
to agency), the equipment validation classification in different countries (or agencies) can not directly
be compared.
- 16 -
TABLE 6: CRITERIA PROPOSED FOR DETECTION AND BIAS
For the cracking rate we compare the reference cracking rate with the measured cracking rate and the
difference determines the classification. For example, if the reference cracking rate is 80% and the
measured cracking rate is 83% then the equipment is class A. As stated earlier these classifications are
associated to each set of ten tracks.
For crack severity, each cell is given the value of severity associated to the most severe crack in the
cell. The classification is then determined by the proportion of cells with the same severity for both the
referenced and the measured results. For example, if a piece of equipment produces the same
conclusion on severity for 86% of the 2500 cells, the equipment would be class A.
In the network level tests the goal is to asses the capacity of the equipment to measure cracks over all
the types of road that will be found on the network. Therefore, these tests cover sites up to 100 km in
total length and compare the results delivered by the crack measurement device with the true level of
cracking present on the sites, using a simple approach, as described below.
Different surface types or surface texture (there could be many of these on the network);
Longitudinal and transverse joints;
Fretting;
Fatting;
Road Markings;
- 17 -
Patches (sealed and unsealed);
Ironwork;
Longitudinal and transverse grooves (on rigid pavements).
It is probable that the test sites will be located in many different places on the network. These sites
should be defined in terms of road sections for the purposes of the tests.
The network tests would cover a wide range of challenges for the equipment. It is a further advantage
that a wide range of environmental conditions will also be covered (e.g. sun angles, surface state),
thereby providing a more comprehensive test. The tests will need to be carried out on many sites
which may have to be visited over a period of a few days by the equipment.
To test the automatic crack detection system the reference data is reported as the total area of grid tiles
containing crack within 50 m sub sections of survey data. The data is then normalised by calculating
the average area recorded over all of the sub-sections and dividing the area for each sub-section by this
average to obtain the Normalised Reference Area for each of the 50 m sub-sections. (Note: the overall
average value is 1).
For this network level test we need to know that the crack detection system is capable of identifying
lengths on the network that have high and low levels of cracking, and therefore the Normalised
Reference Area in each 50 m subsections is examined to classify the subsections as follows:
Sub-sections with a Normalised Reference Area greater than 1.75 are defined as sub-sections
containing high levels of cracking;
Sub-sections with a Normalised Reference Area less than 0.2 will be defined as sub-sections
containing low levels of cracking.
The automatic crack detection equipment surveys the network sites and the crack data provided is
processed in the same way. A grid is placed over the automated crack data and any grid squares in
which the automatic crack detection equipment has reported a crack is filled in. This is used to obtain
the area of cracking over each 50 m sub-section in the automated crack data. Again, the average
automated cracking area recorded over all of the sub-sections is calculated and the Normalised
Automatic Area for each of the 50 m sub-sections is obtained. The Normalised Automatic Area in
each 50 m subsections is examined to classify the lengths as follows.
Sub-sections with a Normalised Automatic Area greater than 1.75 are defined as sub-sections
containing high levels of cracking.
Sub-sections with a Normalised Automatic Area less than 0.2 will be defined as sub-sections
containing low levels of cracking.
Classification thresholds
The two datasets (reference and automatic) are then compared to determine if the system is able to
reliably identify cracking at the network level. In the evaluation the 50 m sub-sections containing a
high and low level of cracking in the Reference Data are compared with those reported in the
Automatic Data to contain a high and low level of cracking, and performance is assessed as follows:
- 18 -
If the Automatic Data reports a high level of cracking over at least 75 % of the 50 m sub-sections that
the Reference Data shows to have high level of cracking;
AND
The Automatic Data shows a low level of a cracking over at least 75 % of the 50 m sub-sections that
the Reference Data shows to have a low level of cracking,
THEN
The system is considered suitable for network level assessment of cracking. If the automatic system is
unable to meet this standard then it is likely that the system has a problem working on the range of
different surfaces and conditions encountered on the network. The developer of the automatic crack
detection equipment may need to investigate the reasons, which could include.
A further option for the network level tests is to assess the reproducibility of different crack
identification systems. In this test we assume that many systems have undertaken surveys of the test
sites and that there are several sets of Automatic Crack data available (one from each system).
A single average Cracking Area is obtained from this data set for each 50 m sub-section from the
Cracking Area provided by each set of survey equipment by averaging the results obtained from each
system over each 50 m length. The above tests are then repeated on the data provided by each system
to determine if they have a satisfactory level of performance when compared with each other. This test
can also include visual comparison of the crack areas provided by each system to identify any
significant local differences. The test also enables authorities to estimate sensitivity factors for
different systems so that the Cracking Areas reported by different types of equipment can be
correlated with each other.
- 19 -
4.0 CONCLUSION AND RECOMMENDATIONS
Cracks type, severity and extent are one of the most important data to select maintenance and repair
actions. These data are in fact, the basic data of a pavement management system. Much progress has
been made by some road administrations, service providers and researchers to develop automated
systems to collect the data. Since these technologies are available, few studies and publications have
been produced by the world scientific community in order to help road administrations to insure the
reliability of the automated crack measurements.
The quality of a manual survey at the network level depends on many factors such as the time of the
day when the analysis is carried out, the fatigue and the understanding and the subjectivity of the
operators. The automated systems are not affected by the same factors. Experience shows that
automated systems are more constant when it comes to surveying cracks than the manual method.
These systems can record and analyse many kilometres with the same good precision, especially if the
road conditions (texture, rut, sun etc…) do not change. The main problem is that at the network level,
the survey conditions as well as many road surface characteristics change at the same time, combined
with the lack of knowledge of the equipments performance, false detections and areas of inadequate
survey results are difficult to localize and to identify. These road characteristics that can affect the
analysis have to be taken into account when defining the validation test. Road administrations should
take time with the constructor or the service provider to establish a list of possible road conditions
affecting the results and try to test the survey vehicle in these conditions. The validation tests are
expected to evolve year after year and will depend on the knowledge gained of the system’s reliability.
These tests should become more precise and less time consuming over time.
Road administrations also have to identify their needs in terms of pavement management. This first
step will help them to identify which validation test to perform and the level of detail needed. It is
generally recognized that the more detailed the crack data are the more the PMS will recommend
precise and specific maintenance and repair interventions. Road managers should also fix the type and
the precision of the required performance curves they have or they will have in advance, which will
affect considerably the mid and long term strategies.
The concepts presented in this technical paper represent a summary of the discussions and the answers
received from a world wide survey of the road surveys and pavement management experts on this
subject. PIARC 4.2 Technical Committee members hope the concepts presented in this technical paper
will be viewed as real progress that aims at increasing the quality level of crack data measurements in
order to help road administrations to better manage their road networks.
- 20 -
5. BIBLIOGRAPHY
[2] ASSTHO Designation: PP44-01, Standard Practice for Quantifying Cracks in Asphalt Pavement Surface,
AASHTO Provisional Standards, May 2002.
[3] MINISTÈRE DES TRANSPORTS DU QUÉBEC, Guide de mesure et d’identification des dégradations des
chaussées flexibles, 2007.
[4] PETRA OFFREL, LEIF SJÖGREN AND ROLF MAGNUSSON, Repeatability in Crack Data Collection on
Flexible Pavements: Comparison between Surveys Using Video Cameras, Laser Cameras, and a Simplified
Manual Survey, Journal of Transportation Engineering, July 2005
- 21 -
APPENDICE
- 22 -
Here is an example of a typical data file resulting from a research level validation process:
TABLE A1. TYPICAL DATA FILE FOR THE RESULTS OF A RESEARCH LEVEL VALIDATION PROCESS
Table A1 shows that each line corresponds to a crack (the sampling unit) and, as we can only have one
crack per band per 10m, it is quite easy to make the association from repetition to repetition of the
same crack. Since the transversal cracks affect several bands the analysis criterion is based on the fact
that we can have only one single crack per line, we suggest to record transversal crack as being in
band #9.
At this stage, three parameters are studied: the proportion of detected cracks, the crack length and its
severity. For each of them, the results are presented in a specific table form. The following tables
present some examples for a better comprehension.
- 23 -
For the proportion of detected tracks, the table shows for each band and each repetition, the
proportion of cracks correctly identified. One can easily identify if a piece of equipment has a
problem with some specific bands (for example band #5 in table A2).
For crack length, the table shows, for each crack, the location of each crack, the reference length
value measured manually, the mean of the 5 repetitions, the standard deviation of the 5
repetitions, and the coefficient of variation (calculated from the reference value), the absolute
bias and the relative bias. The analysis can easily point out if the equipment has some
difficulties with some cracks, or with crack with given characteristics (for example in band #1 in
the cell starting at 140m, in table A3).
For crack severity, the structure of the table is identical as for crack length once again, the
analysis can easily show if the equipment has some difficulties with some cracks, or with crack
with given characteristics (for example in band #1 in the cell starting at 40m, in table A4).
The classification is directly determined from the table results. For example in table A2 (proportion
of detected cracks), the equipment must be classified AA for bands #1, #2, #3, #4 and #9 but
classified B in band #5 for longitudinal cracks. Notice that transverse cracks are recorded as being in
band #9. If a classification is given for all bands, it must be AA since the equipment detects more
than 90% in all bands. As far transversal cracks are concerned, the equipment would be classified as
AAA. For crack length and severity the classification thresholds are determined by counting the
proportion of cracks satisfying the relative bias or coefficient of variation (C.V.) criteria.
- 24 -
TABLE A3: CRACK LENGTH (M) (EXAMPLE)
- 25 -