Derivation and Modelling Hue Uniformity and Development of The IP
Derivation and Modelling Hue Uniformity and Development of The IP
Theses
7-1-1998
Recommended Citation
Ebner, Fritz, "Derivation and modelling hue uniformity and development of the IPT color space" (1998).
Thesis. Rochester Institute of Technology. Accessed from
This Thesis is brought to you for free and open access by the RIT Libraries. For more information, please contact
[email protected].
DERIVATION AND MODELUNG OF HUE
UNIFORMITY
by
Fritz F. Ebner
July 1998
COLLEGE OF SCIENCE
CERTIFICATE OF APPROVAL
Date
9/z'I/tff
11
DISSERTATION RELEASE PERMISSION
COLLEGE OF SCIENCE
CHESTER F. CARLSON
Title of Dissertation: Derivation and modeling of hue uniformity and development of the
I, Fritz F. Ebner , hereby grant permission to the Wallace Memorial Library of R.I.T.
to reproduce my thesis in whole or in part. Any reproduction will not be for commercial use
or profit.
Signature:
Date:
III
DERIVATION AND MODELLING OF HUE
UNIFORMITY
by
Fritz F. Ebner
Submitted to the
Chester F. Carlson
Center for Imaging Science
College of Science
in partial fulfillment of the requirements
ABSTRACT
Metric color spaces have been determined to be significantly non-uniform in the hue attribute
of color appearance. Several independent sources have confirmed the non-uniformity. A data
set was obtained during the course of this thesis work that contains the largest sampling of
color space to date which can be used to compare models of color appearance. The data set
obtained was compared to existing data sets and found to correspond closely.
Lookup table
methods were employed to test significant differences between data sets. A simple
modeling
approach was taken based on commonly understood color space models and knowledge of the
visual system. Several color spaces be derived using the simple model, and one was
can
chosen that models hue uniformity very well and has other desirable attributes. This new color
space is named IPT. Many visual data sets were plotted in the IPT color space and all show
IV
ACKNOWLEDGEMENTS
I time from
wish to acknowledge all the observers who selflessly (but not without reward) took
their busy schedules to stare at a CRT screen and make hard decisions. Names are listed in
arbitrary order.
Doug Kreckel
I would like to acknowledge the following people for going well beyond the call of
duty (or a
free lunch) bv making many observations, providing valuable feedback on the experimental
design and presentation to the observers, helpful discussions, and general support: Jacqueline
Holmes, Karen Braun , Dorothy Jaehn, Lee Cass, and Dave Birnbaum. I would also like to
acknowledge the
help of Dr. Robert Chapman from University of Rochester's Center for
Visual Sciences, for help with the analysis of semantic differential scale data.
DEDICATION
my life. Although she never knew it, she showed me that it is possible to be courageous in the
face of incredible adversity, and that you can keep your sense of humor through it all.
VI
Table of Contents
INTRODUCTION 1
BACKGROUND 3
vu
3.2.2.3.2 Model and Coefficients 146
3.2.2.3.3 Color Functions 147
Matching
3.2.2.3.4 Constant Hue data sets 149
3.2.2.3.5 Munsell Value 5 data 152
3.2.2.3.6 Neutral Lightness Response 153
3.2.2.3.7 Chromatic Lightness Response 154
3.2.2.3.8 MacAdam (PGN) Discrimination Ellipses and Spectral Locus 159
3.2.2.3.9 Suprathreshold Color Difference Ellipses 160
3.2.2.3.10 OSA Color System medium gray (L = 0) constant lightness plane 161
3.3 Experiments E2. Verification Experiments 162
5.5.7 Comparing CIELAB to Hung & Bents and Ebner data sets 762
3.3.1.1 Introduction 162
3.3.1.2 Experimental 163
3.3.1.3 Results 165
3.3.2 Comparing IPT color space to Hung & Berns and Ebner data sets 766
CONCLUSIONS 170
APPENDICES 173
REFERENCES 179
vm
List of Figures
sub-class) 41
Figure 10. Example images from PI experiment. Abstract and Data representative images. .. 41
Figure 11. Variance accounted for by factors from PCA 43
Figure 12. User interface for experiment P2 (sub-experiment 1) 51
Figure 13. Images presented to the observers for experiment P2 (sub-experiment 1) 52
hab=303
Figure 14. Example hue leaf of CRT and Printer gamut shapes at 56
Figure 15. Histogram of
AE*ab for exact match exercise for entire observer
group including
outliers 58
Figure 16. Median match values for 25 colors in a*b*
Filled arrow heads indicate model predictions. Hue leaves shown are for the hue angle
Filled arrow heads indicate model predictions. Hue leaves shown are for the hue angle
heads indicate observer results. Filled arrow heads indicate Katoh and Ito results. Hue
leaves shown are for the hue angle of the predicted match 70
Figure 22. Comparison of Observer results with best 2d enhanced Katoh optimization
parameters. Open arrow heads indicate observer results. Filled arrow heads indicate
model results. Hue leaves shown are for the hue angle of the observer match 72
Figure 23. Comparison of Observer results with best 3d enhanced Katoh model optimization
parameters. Open arrow heads indicate observer results. Filled arrow heads indicate
model results. Hue leaves shown are for the hue angle of the model prediction 73
Figure 24. Images used in experiment P2, sub-experiment 2 77
Figure 25. Histograms of colors where the null hypothesis was rejected between sub-contexts78
Figure 26. Mean mapping vectors for each of the 5 sub contexts 79
Figure 27. Mean mapping vectors of Text sub-context (open arrow
heads) and the mean of all
IX
Figure 29. Munsell renotation colors with extrapolation boundary and CRT gamut projection
Figure 34. Histograms of hue matches for 90 observations at reference hue angle of 0 degrees.
Histograms are plotted in the respective locations in the L*-C*ab plane of the reference
colors 95
Figure 35. Average and standard deviation of 95% confidence limits of mean hue matches as a
function of reference hue angle. Units of y axis are in degrees (CIELAB Ahab) 96
Figure 36. Histogram over the entire data set (306 colors X 30 observation means =
9180) of
Hue leaves are labeled by the CIELAB reference hue angles for comparison 103
Figure 43. Loci of constant hue in CIECAM97s C cos(h)-C sin(h) at different lightness levels.
Dots represent reference colors 104
Figure 44. Quantification of average hue non-uniformity for CIELAB and CIECAM97s 105
Figure 45. Quantification of maximum hue non-uniformity for CIELAB and CIECAM97s. 106
Figure 46. Constant hue surfaces shown in a linear opponent space (Y,X-Y,Y-Z). Numbers
represent CIELAB reference hue angles 107
Figure 47. Constant hue surfaces shown plotted in CIELUV color space. Numbers represent
0.45, G2 =
0.6. Right plot is f2 with parameters Cl =
1, a2 =
1. Note that at the origin,
Figure 55. Two dimensional "slider that controlled the pretransformation matrix 131
Figure 56. Examples of CIE 1931 color
matching functions transformed through
Figure 58. Example hue lines from Guth type model. The first image is rotated at an angle of
0 degrees, the second is at 45 degrees. Note the shape difference of the constant hue
loci 135
Figure 59. Nonlinear controls for visualization tool. Six parameters correspond to the positive
and negative parts of each of the three channels. Functions take the form of
Notice that the mid chroma colors are more purple than the constant hue locus.
Dotted lines are
Hung and Berns constant hue loci 144
Figure 62. Path of a gradient
sweep from gray to blue primary in IPT P-T space. Notice that
the mid chroma colors are more purple than the constant hue locus. Dotted lines are
locus 159
Figure 74. RIT-DuPont visual color-difference ellipses plotted in IPT (left) and CIELAB
(right) 160
Figure 75. OSA color system medium gray constant lightness plane plotted in IPT (left) and
XI
Figure 78. Scale values for judged uniformity between IPT and the 2 constant hue data sets.
H&B are
Hung and Berns data, E&F are Ebner and Fairchild's data 167
Figure 79. Mean scale values for colors and descriptive terms for abstract representative image
type 173
Figure 80. Mean scale values for colors and descriptive terms for data representative image
type 174
List of Tables
List of Equations
xn
1 Introduction
The progression of this thesis work that led to the eventual characterization of hue umformity
and development of uniform hue color spaces was not linear. In fact, it was more
evolutionary, in the sense that the fittest paths to completion survived, whereas the paths that
offered less opportunity were abandoned. This dissertation is intended to capture all of the
work that was conducted from the completion of the thesis proposal to the completion of the
The background section covers the study preceding the research proposal, as well as two
experiments that were performed subsequent to completion of the proposal. These subjects
are related to the study of two things. Firstly, the study of color
meaning in different contexts
as defined by image content, using semantic differential scaling. Secondly, the study of
find surfaces of constant hue in CIELAB color space. Fifteen equally spaced hue angles in
CIELAB color space were sampled. Thirty observers performed a hue matching task three
times each over 306 colors covering the full gamut of a CRT display. A computer program
and
accuracy of color appearance phenomena. A new color appearance space is proposed that
has acceptable hue uniformity, models several other visual attributes as well as or better than
Figure 1 shows the progression of the work performed throughout the research period. The
first two experiments, labeled PI, and P2 (P stands for proposal, or preliminary), were part of
the initial proposal that was signed off by the advisory committee. PI did not yield results that
were deemed significant enough to follow, thus the path was abandoned. P2 was published in
the
non-uniformity in hue in CIELAB color space were so bad, the color name was changed
when
using the color space to perform gamut mapping. The available data sets for constant
perceived hue were found to be insufficient, so the subsequent experiments (El and E2) were
1998.2
designed around hue uniformity. El was presented at EI98 in San Jose in January
Study of hue uniformity has been significant enough to warrant the completion of the degree
X
El: Finding constant
Dead End
hue surfaces
Derivation of constant
E2: Verification
experiments
Graduate
2 Background
The background section is comprised of two parts. First, a literature search is discussed on the
subjects of color order, color meaning, color harmony, and color preference. Secondly,
Each
A is way to describe the relationship between
color order system a colors. color order
system attempts to create a space within which colors can be specified and compared in an
intuitively clear way. The fact that there are so many such systems implies that there may not
be a most appropriate
way to specify colors for every situation and within
every context.
From a color
harmony point of view, there are a few important color order systems that
respective theories are based upon. Color order systems of Munsell, Ostwald, Hard and Sivik,
and Moon and Spencer are thus described along with the basic foundations of their respective
2.1.1.1 Munsell
Munsell3,4
Albert H. was bom in Boston, Massachusetts in 1858. Fie attended college at the
Massachusetts Normal Art School. Following this, he attended the Julien Academy in Paris on
a graduate fellowship. He won second prize in the Beaux Arts competition for his painting
"The Ascension of Elijah". Between 1898 and 1905, he developed his ideas on color order,
Although many others before Munsell have suggested ways to arrange colors, he was the first
Hue. The hue circle is arranged so that there is an equal perceptual distance between each
major hue category. Munsell's major, or simple hue categories were red, yellow, green, blue,
and purple. Purple is included here because Munsell thought that there was a larger perceptual
distance between red and blue than there was between red-yellow, yellow-green, and green-
blue. Intermediate hues, or compound hues, are defined between each of the major hue
fold for each simple hue. The value 5 is assigned to each simple hue and 10 to each
compound hue. The notation for a given color is defined by a triplet of hue, chroma, and
value. The value axis goes from 0-black, to 10-white. The chroma scale goes from 0-neutral,
to an open ended high chroma number, although larger values have been assigned
by some.
On the physical atlas of color, some pigments are naturally stronger than others. Therefore,
the strongest pigment of blue-green only goes to chroma of 5 whereas the strongest pigment
The Munsell notation for a particular color is <hue designation> <value>/<chroma>. For
example, a pink color might have a Munsell notation of 5R 7/3. Neutral colors are denoted N.
Munsell's system does not directly correspond to any theory of color vision, as do other
systems. Munsell was interested solely in the perception of color and did not attempt to
and color. He attempted, through heunstics and design tenets, to specify rules for creating
harmony and balance in color. Balance, as defined by Munsell, is visual comfort. Balance
means that a color scheme is neither too light or too dark, too weak or too strong, too hot or
The simplest balance of two colors is a balance about neutral gray. This means that two
colors that sit on a line drawn through N5 and are equidistant from N5 will be balanced.
Area of color combinations is important. A strong red will balance a larger field of blue
green of small chroma. "The stronger the color we wish to employ, the smaller must be its
2
larger
chroma."
area, while the the area, the grayer or weaker the Area is related to value
and chroma by the rule that the product of value and chroma should be inversely
proportional to the respective areas of the two colors in question. For example, 5R 7/6
Colors of the same hue can be made to balance if they are created with different values
A small amount of discord is not always undesirable since it makes the scene more
interesting.
There are 9 chief features balance described Faber Birren in "Munsell, A
of as
by grammar of
Color"
. These are:
1. Gray colors harmonize best when neatly and spaced. Its best to center around N5.
evenly
2. Harmonies with
only one hue look best when they are arranged and around a
neady evenly
4. Opposite colors of equal value but of different chroma should follow the above
5. Opposite colors of the same chroma but different value should fall on a straight line
6. Opposite colors of different value and different chroma should follow the area law.
7. Munsell believed, as did others, that color combinations most liked by people comprise
either colors that are closely related or colors that are in contrast.
"Neighboring colors
should be neatly stepped as to value and should find sequence at middle value 5. They
should be of the same chroma for good balance. One color could have strong chroma and
the other could have weak chroma". The area rule should
apply to these colors.
8. There is harmony in dirninishing sequences. These are sequences that start with high value
and chroma and each successive color is reduced in chroma and value, and
possibly
9. Munsell liked the path of an ellipse through color space from one color to its compliment.
All Munsell's balance centered around his color system and
of notions about harmony and
simple geometric relations between colors in this space. These relations were drawn from
expenence as an artist and designer, and as such should be viewed to have more
validity than
Spencer13
on the hue circle, and characterized them as most strongly contrasting. He also defined them
as two colors that made neutral when spun on a Maxwell disk. He goes on to say that an
gray
admixture of two colors of equal proportions makes a color that on the Munsell diagram
any
lies at the center of a line connecting the colors. This confusion between perception and
stimuli is interesting because it is the only place in his writings that he infers psychological
2. 1. 1.2 Ostwald
Ostwald6,7
Wilhelm was around 60 years of age before he became interested in color. Before
this, he had worked in the fields of mathematics, chemistry, and physics. Ostwald's system is
"fullcolor"
spectral "block dye". There are enough fullcolor descriptions to span the hue circle, thus
enabling a 3 dimensional solid to be formed. Ostwald created his system before the CIE 1931
standard observer was adopted, and before Stevens started work on his power law (1953) .
physical atlas from the use of disk colonmetry by mixing proportions of black, white and a
physical
analog of a "fullcolor". All colors that sit in a given triangle are compnsed of a
mixture of black, white and a given fullcolor. White and black content are denoted by lower
so
only two coordinates need be known to define the position in the triangle. The triangle is
set
up such that colors of equal white content fall on a line that is normal to a line that
intersects the white point, is normal to the equal white content line, and is parallel to the line
connecting black and fullcolor. The analogous rule applies for colors of equal black content.
Colorimetrically, the Ostwald system neither spans the entire visual space (as defined by the
denned all colors as mixtures of black, white, and a pure color, his system fits well with how
graphical artists deal with, and think about, colors i.e. in tints, tones, and shades. Faber Birren,
in his analysis of the Ostwald system, said "When contemplated in the light of average human
appreciation, colors are seen not so much in terms of lightness or darkness, but rather in terms
harmonious."
2.1.1.3 NCS
Sivik9
The Natural Color System designed Hard is a Swedish standard
(NCS) was
by and and
for color notation. The NCS is intended to be a system that can be used without a physical
atlas for reference, although one was created for it for illustrative purposes. NCS provides a
way to describe
allows the
color without reference to a particular viewing situation, thus ability
to describe the perception without regard to a particular stimulus. The NCS space is arranged
red and green. NCS notation is similar to Ostwald's notation in that a given color lies in a
triangle whose vertices are white black and a maximum chroma color. Colors are described by
having a resemblance to black, white, and to up to 2 of the 4 primary hues. Within the NCS
triangle, whiteness and blackness are described in the same manner as the Ostwald system, and
the equation s+w+c=100 is conspicuously similar, where the letter s denotes. The notation
for the NCS system is a triplet, where the first number is blackness, the second number is
chromaticness, defined as the distance from the achromatic axis, and the third element is an
alphanumeric that denotes hue angle. The hue circle is broken into quadrants of yellow, red,
blue, green, and the hue notation describes the angle from one primary to the next.
They
make the point over and over again that "subjective phenomena can only be meaningfully
measured
using subjective methods... Thus, the NCS, as a system for denoting colors as and
when
they appear to man, is general and not based on the availability of the NCS Color
Adas."
10
Hard and Sivik have criticized
many previous workers in color
harmony for creating long lists
of dogmatic rules that are
mostly opinion and not based on general
rules.10
any underlying
tncks"
entirely on what
they call "stimulus instead of perception (for instance that two colors
harmony of Geothe, Munsell, Moon and Spencer, and Chevreul have been shown to be too
Hard and Sivik have proposed a model of color combinations in which there are 9
dimensions.11
These are
"subdimensions"
Interval indicates the contrast between colors. Distinctness of border describes the borderline
between color surfaces and how sharp the edge is perceptually. Kind denotes where in space
the colors of the pair reside. Size is the distance the colors are from one another. The interval
describes how individual colors are experienced together. Complexity describes the number of
main attnbutes (the six elementary colors) that are represented in a combination. Content says
11
which mam attributes are in the combination. Type denotes what others may have called
complementary colors. It specifies whether colors on the hue circle are opposite one another
"Harmonics"
combination. Color similarities describes what aspects of the colors in combination are the
same, i.e. which attributes of each color are similar. For example, colors could be all of the
same blackness, or whiteness, or chromaticness, etc. Area relations describes the relative sizes
of the colors in question. Rhythm has to do with the regularity of a pattern or texture. A
This nine dimensional approach to color combinations seems excessive, and the dimensions
are
hardly orthogonal in concept. These attributes of perception, however, are
interesting and
of color and form when viewed in the real world. The basic elements of their color gestalt
12
are:
1. Color elements: Described by the shape of the contour that separates the color from its
surround and the actual color of the shape. There is no mention of smooth gradations
from highlight to shadow, but these presumably are part of the same color element.
particular area.
12
3. Overall Form: The pattern that the color elements and line network form. There may or
may not be an overall form. This depends on if there is a rhythm in the area in question.
4. Color Character: The color character can change if the color elements change, even if the
distinctness of border remains the same. The color character complements the form
The study of distinctness of border is of interest to both Hard and Sivik's theory of color
ideas.10
combinations and of their gestalt
They show that distinctness of border between
Swedish) was shown to be a one dimensional function. The research on the NCS lightness
1567
s = 100-w =
100
(r+56)
which says that blackness (or whiteness) is a hyperbolic function of CIE Y. Although
functionally different, the shape corresponds roughly to that of Stevens or Fechner. The
13
interesting part of this is that a constant difference in GT along the achromatic scale
corresponded to a constant blackness (or whiteness) difference. This makes NCS blackness
perceptually uniform with both blackness difference and with border distinctness, GT. This
relationship also holds with chromatic color pairs where the hue and chromaticness were kept
constant and the blackness varied. An equation was derived through multiple linear regression
coefficient. Furthermore, it was found that GT was not an additive quantity, i.e. if GT
between A and B is the same as GT between B and C (on the achromatic axis), the GT
frequency edges. The assumption is made here also that the borders between colors is sharp.
Very different results would have been found if the borders between the stimuli were even
slightly blurred.
Spencer13
Perry Moon and Domina took the CIE specification of the Munsell Color Order
System and created their "metric color space", which is basically just a of the CIE
warping
14
1. Any arrangement of colors that can be sensed as an orderly combination will be pleasing.
Thus, in the new G5 space, simple geometric figures connecting colors will result in a
pleasing harmony.
2. The interval (Euclidean distance in the W space) between any two colors is unambiguous.
be disharmonious. From this they created their areas of in their color space. For
ambiguity
similarity
Regions of identity, similarity, and contrast are assumed to be pleasing, whereas the regions of
the first and second ambiguities are assumed to be displeasing. The same regions are found in
lightness/chroma at constant hue values. Moon and Spencer (also referred to as M and S) go
names and denotations in Munsell terms. For harmony in area M and S state
corresponding
15
"A pleasing balance among n color patches is obtained when the scalar moments about the
patches"14
Pope15
in his criticism of the Moon and Spencer work points out that they have greatly
oversimplified the problem, that classification of harmonies into groups of simple figures looks
is divided
measure1
mathematical equation, since it was intended more as a basic principle for guidance of design.
context more
generally associated with finer art, and not with graphic design, or page layout.
Pope sums
up his feelings in the quote "geometric significance in the symbolic representation
does
significance."13
In Granger's experiment on area balance in color harmony, Moon and Spencer's predictions
with regard to area balance showed no predictive value, although Munsell's empirical rule
regarding area balance was shown to account for 35% to 53% of the total variance. Strangely,
there was a very high inter-observer correlation for area balance, between .67 and .73. This
may suggest that there is a predictor for preference of area balance of two color mixes.
2.1.1.5 Others
al.,18
derived, through linear regression, an equation to predict harmonies. Then he predicted the
16
aesthetic measures of 9890 two-color harmonies from the equation. From the predicted
aesthetic measures, his data suggested that two color harmonies of same or similar hues tend
be harmonious harmonize
contrasting hues.
to more than or Also, colors
complementary
better with N-9 (white), than with N-2 (blackish), and that aesthetic measure is affected
mostly
by value and value differences than other dimensions in color space. The experiment was
In summary, it seems that most color harmony models or systems, at best, provide some
feeling to designers for a more systematic approach to synthesizing color harmonies. No one
system has been shown to be general enough to apply tenets to all situations. To try to design
an analysis system that employed one or more of the rules to combinations of color would
both be a prohibitively large undertaking, and would not guarantee any useful results. The
only way to glean useful information about color harmonies is to study observer's reactions to
color combinations, both within the context it is to be used in, and over a wide
sampling of
the space.
Only from results from study of particular context and application can rules be
world. Although viewing conditions can change drastically in the natural environment (and
man made buildings), to a great extent colors of objects appear to remain the same, or at least
quite similar. This phenomenon, called color constancy, is greater if colors are associated with
17
The ability to see colors as belonging to objects, in spite of the differences of (to a great extent)
the actual physical stimulus reaching the eye allows humans to use color as a tool to help them
survive in their environment. The ability to communicate these colors and to assign
whereby the language must pass through successive stages of evolution to get to the next
level1
It has been shown that of the nearly 100 languages surveyed, a large majority of them
purple
gray
they have the word for red, and that the language will have a word for red before it has a word
for either yellow or green. A language may have, after red, the word for either green or for
yellow, but it must have both before it has a word for blue. These rules sound more like
arbitrary synthesis rules, but they have been derived from study of a large number of languages
in various stages of evolution. In the English language, there are an indefinite number of
18
1. The meaning of the color is not predictable from the of its parts, e.g. blue-green.
meaning
2. Its significance is not included in that of other color term. e.g. crimson is a kind of
any
red.
3. Its application cannot be restricted to a narrow class of objects, e.g. blonde may be used
to describe perhaps
only hair, complexion, and furniture.
Names20
The ISCC-NBS Dictionary of Color uses a similar attitude when
assigning color
names to volumes in the Munsell color solid. The hue circle is defined in terms of basic color
names descnbed above, with the addition of violet and olive. Intermediate hues are described
by compound color names, e.g. reddish-orange. A slice of constant hue (value vs.
chroma) is
Naming of colors is an outcome of the evolution of a language. As the lexicon grows, more
Dictionary of Color Names illustrates, in English, there are a huge number of color names.
The meaning associated with, or evoked by a particular color is a less well defined problem,
however. Connotation of
meaning of a particular color depends on context, experience,
culture, time, and other factors. There seems to be one invariant. Black seems to be the
universal color of mourning and death. Most people dislike black. Ancient cultures had
world"1
various and somewhat arbitrary meanings for the colors in their To Leonardo da
Vinci, yellow stood for earth, green for water, blue for air, red for fire, and black for total
darkness. Colors have been used to symbolize royalty. The royal color of the Sung Dynasty
19
(China) was brown, of the
Ming Dynasty was green, and of the Ch'ing Dynasty was yellow.
wedding, whereas the Chinese bride wears, and is surrounded by, red.
Physiological and emotional responses to colors have been noted and studied by psychologists.
Deutch21
For instance, Felix found that color brought about reflex actions in the involuntary
actions such as pulse rate, breathing, and blood pressure. He found, however, that red may
calm one person and excite another. There seems to be no general rule to attribute a response
to a given color stimuli. Deutch reasoned that physiological and emotional changes were
brought about
by associations, so on people reacted to
following order. Blue, red, green, violet, orange, yellow. However, this ranking was for color
subjects'
20
where the subject would fill in the most appropriate circle related to their response. Sivik and
la^y-
sick-healthy, cultured-uncultured, feminine-masculine, complicated-simple, stimulating-dull, cold-hot,
resultant data, five major factors were found. These were denoted Excitement, Evaluation,
Potency, Temperature, Activity. Since the experiment was performed to be "context free", the
the stage was set in a more narrow sense. Given this, the inter-observer variance showed to be
fairly low. Further studies have been performed that set the context. In "Color connotations
difference
colors,"26
of exterior the above experiment was repeated with the being that subjects
were shown pictures of colors on buildings instead of abstract patches. The list of variables
shown, and 67 colors were used that fairly evenly sampled the color space. 168 observers took
part in the study. Analysis on the data showed three major factors; emotional (or pleasantness)
evaluation, social evaluation, and a spatial factor. The spatial factor is somewhat obvious, as
there are four variables that are very closely related to spatial aspects. In addition to factor
21
in NCS
mapping"
space. This technique is interesting as it allows visualization of the meanings in planes of the
space, although the details as to the mechanics of creating the mappings are somewhat vague.
Whereas temperature was separated as a factor {warm-cold and la%y-energetic) in the context free
color study, it was found to be included in the factor called emotional evaluation. For color in
the building context the concepts of beauty and warmth are highly correlated, but for color in
not the case in many other parts of the world. The issue that meanings of words are context
dependent was also discussed. If the meaning of a semantic variable changes as a function of
validity of the variable is questionable. This illustrates the need for well chosen variables that
are
unambiguously understood as opposite within the context of study. For the comparison of
individual variables of
meaning between color in and out of context, warm-cold, hilarious-serious,
named evaluative in both studies were found to have very low correlation. Thus, this factor
Scale"
Kobayashi has created a "Color Image that assigns semantic descriptors to perceptual
dimensions of color. He uses warm-cool for hue, soft-hard for value, and clear-grayish for
chroma. These attributes were derived from a factor analysis. The color space was also
22
segmented by hue, and
by tone, which is a name
corresponding to the ISCC-NBS color
dictionary tone descriptors of a given hue. Kobayashi states that the three adjectival bipolar
context, personal taste, and other environmental factors, and therefore can be used in a general
setting. He goes on to
map the space and assign different meanings to different parts of space.
This seems to be a gross overgenerakzation, and although the adjectives that re-describe the
color space
may be fairly invariant, there is no reason to believe that this is an exhaustive use
Colorimetry enables color matches to be predicted when original and reproduction are
measured under identical viewing conditions. Color appearance models attempt to enable
models could
exactly predict matches across differing viewing conditions and media types,
they may or
may not
specify a preferred reproduction. Additionally, if viewing conditions are
constrained to be identical, and the original image is free from defect, a colorimetnc
reproduction
may not necessarily be preferred. Evidence suggests that, for several types of
Froelich28
In 1948 Buck and studied human complexion under a number of standard light
sources.
They determined the average complexion of the three major races (Caucasian,
23
assess the
quality of color reproduction, it is a very important volume of color space. They
found that "...the preferred source emphasizes the material in brightness and saturation
without marked change in hue. Thus, in each case, the preference was for the lamp whose
This led
viewed."
the design of the soft white fluorescent lamp, which has a spectral response that is close to
Bartleson29
preferred when
viewing portraits of people. In 1960 studied the memory colors of
familiar objects
using color patches. He found that the memory color of flesh was closest to
the measured colors than for other familiar objects such as grass, sky, and brick. It also had
the least interobserver variance. His overall results indicated that memory colors were
generally more saturated than the real object and that the hue shifts of the memory colors were
toward the most impressive chromatic attribute of the color in question. This result was also
He found that memory matching, that is successive matching, yielded higher variability, shorter
matching times, higher remembered purities and somewhat higher remembered luminances
Bartleson31,32
experiments by both and
by Hunt33. Primarily, they were interested in the most
prevalent colors that existed in natural scenes, specifically grass, sky and skin color.
24
'
Bartleson showed that the preferred color of skin in a reproduction was close to the
very
memory color of skin, and that it departs significantly from the true color of skin when
measured. The mean measured skin color from his subjects was Munsell 10R with a chroma
Bartleson32
3.5. later
chroma of showed that the
relationship between memory and preferred
hue. Preferred sky was higher in chroma, while preferred grass was of lower chroma.
Bartleson observed that while the preferred grass was of lower chroma, it may have been due
to the fact that the available gamut of the process they used could not
adequately produce
Hunt33
picked
up on this and performed an experiment similar to Bartleson's in 1974 when the
process capability was greater for making good greens. He found that the preferred color of
skin tended to be yellower and of around the same chroma as did Bartleson. For sky,
preferred and measured colors were of similar hue, but the preferred chroma tended to be
higher. For grass, preferred color had similar chroma as mean measured, but tended to be
yellower. Thus Hunt showed that Bartleson's results for green grass were
probably due to
25
have investigated it
al.34
from offset publications. They found that skin colors in publications were very similar to
fall in
u'v'
preferred skin color from their own experiments. Plotted in space, skin colors
ellipses that have small radii in hue and larger radii in chroma. As expected, Negro skin color
is darker than Asiatic, which is darker than Caucasian, while the hue angle of all skin tones
were quite close. Preferred skin color as a function of relative size of faces and color
background were studied. There was little difference in the results from differing sizes in the
color.
"punchier,"
reproductions that are i.e. more colorful and
possibly lighter and higher in
contrast, tend to be preferred over reproductions that attempt to faithfully render color. An
Fedorovskaya.35
attempt to
quantify this was done by It was found that for four natural scenes
CIELUV space) affected preferred image quality. Images where the average chroma was
increased by 5 to 10 units were found to be the most preferred. Gamut mapping was done by
clipping in chroma.
They found a
very close
relationship between naturalness and
quality as
well, which is
hardly surprising.
"colorimetric"
than reproduction. This may stem from the fact that most of the tone research
was done within the photographic field in a time when control over processes was determined
26
solely by exposure and development and chemistry. When the mode of reproduction
specification of color changed from densities and absorbance to CIE based metrics, it
may
Jones36
the source and the output luminance ranges. As pointed out in 1944, a proportional
reduction of luminance contrasts to fit the dynamic range of the output system creates a far
typically
brightest white and a CIE Y of 0.0 should correspond to the darkest black attainable. This, in
effect, is exactiy what Jones warned was inappropriate; a linear scaling of the available
luminance range. Unfortunately, without absolute data about the source luminance range, it is
impossible to know exactly how to tune the tone reproduction. However, some assumptions
may be made to guess what a typical source luminance range might be for an average outdoor
scene and an average indoor scene. Jones states that the average brightness scale, which is the
ratio of maximum to minimum object luminance, is 160, or 2.2 log units. If this is used as the
luminance range for digitized pictorial images, then we can estimate the amount of luminance
compression for a given output process. For instance, the output luminance range of a typical
color laser printer is close to 60, or about 1.78 log units. This suggests that a tone curve that is
more sigmoidal might give a more preferred reproduction than a linear scaling. Figure 3
27
Fig. 9. Tone reproduction
y^y curves t
iS/y?
,y//
>/// A, "exact
B, optimum
reproduction"
attainable with
semimatte paper
////
C, proportional reduction of
./'
luminance contrasts to fit
8*
matte paper
'y '7
D, tone reproduction suitable
^'/
~
for
lights
scene in which
are predominant
high
/
E, tone reproduction suitable
/ when only shadow details
are important.
2 9
*)0 0
Figure 3. MacAdam's tone compression curves for tone reproduction with limited
dynamic range.
Roufs38
approached this problem in a somewhat circuitous manner in a
study on perceptual
image quality. He used gamma functions instead of sigmoidal tone curves. Perceptual quality
as a function of gamma was studied. It was found that for different images, the gamma
corresponding to the preferred print was found to be different. He then scaled subjective
brightness contrast as a function of gamma and then related perceptual quality to subjective
and so then, less dependent on image content. I think that subjective contrast of the image
was a function of where the detail was in the tone scale, as suggested by Jones. The images
with the most detail in the shadow regions would find perceptually best reproduction
using a
gamma function that gave the most dynamic range to that part of the tone scale and where the
28
slope of the tone scale was similar to that of the luminance gradient of the original scene. As
part of the same study, perceived sharpness was scaled as a function of both cutoff frequency
and gamma. It was found that within a reasonable range of gamma functions, there is a
negligible effect on perceived sharpness. However, when blurred with a gaussian filter, the
perceived sharpness is reduced. The study did not include sharpening the images using digital
methods such as laplacian filtering, unsharp masking, or error diffusion. Both tone
reproduction and sharpness have a significant impact on the preferred reproduction of natural
29
2.2 Preliminary Experiments
The preliminary experiments were the intended set of experiments from the thesis proposal.
There were to be four experiments centered around the meaning of color in the context of
business graphics, and the relationship of color meaning to gamut mapping for preferred
approach.
That is,
above."
various
mapping techniques have been hypothesized and implemented, images were then
various hypotheses to determine which approach worked best under the conditions of the
This
experiment.39,40
is
above"
obtained. One primary reason that gamut mapping work has been done "from that
explicitly mapping each pixel of a scanned image whose color is out of gamut is a prohibitively
number of subjects. Further, even if the number of colors to be explicitly mapped is small
30
A fundamental drawback "from is
above"
is essentially infinite for all practical purposes: It is the set of three dimensional mappings that
can be made from a source gamut to a destination gamut. In order to bound the search space,
"reasonable"
assumptions are made for a
mapping approach
using heuristics based on
These
researchers'
experience. assumptions
may not always be valid for all uses of color
image dependent, both on color content and on the context to which the image belongs.
of gamut points found in a given region of the color space of an image. However, image
context has not been taken advantage of as a means to make gamut mapping decisions. This
stems
partially from the fact that most of the work in gamut mapping has been done on
Such may not be the case with computer synthesized graphical images that are commonly used
in business and the graphic arts. Typical graphical objects can fall into one of a small number
given below. It first makes the distinction between foreground and background (also called
form or fills. The fills are typically gradient fills and progress from a
fairly
of textures, patterns,
31
light color to white across the page for paper applications (under bright lighting conditions).
For applications where the medium is viewed under dark surround (overhead projectors or
film projectors) the background is typically quite dark, and the foreground objects are lighter.
(marble, wood), or
repeating patterns. Backgrounds can be more complex, for example
comprised of a scanned image. However this is less common as it may be confusing, thus,
Foreground images fall into one of at least six categories. The categories heretofore identified
are:
1. Uterally representative images, i.e. images created specifically in reference to the image of a
naturally occurring object such as a face, hands, a cow, a computer, a tiger, etc. Images
this
category are typified by the "clip software packages. Images
like"
of this category "look their real life counterparts. This is an
extremely broad category
and spans a similar amount of content as scanned images from the real world. As such,
results found from a given image cannot be generalized to the entire category.
2. Figuratively representative images, i.e. those images that unambiguously represent either
like"
something out of the natural world or a concept (such as "STOP"), but don't "look
the image of that thing. One example of this is the stick figures that denote men's and
32
ladies'
restrooms. This category is quite broad as well, and further categorization would
3. Externally imposed associations to familiar graphical objects. These are the images that don't
necessarily represent objects from the natural world, yet are so prevalent that they are
digital red X. Another example is a particular corporate color that is in standard use for
internal documents.
4. Data representative images, or the graphical communication of numbers. Charts and graphs
5. Simple geometrical objects. The colors that are attnbutes of images in this category are the most
abstract in terms of concrete association, and are probably more prone to symbolism from
personal experience.
Colored text may have significantly different connotations than other objects
similarly
colored.
33
Taxonomy of computer synthesized graphical imagery
Foreground Background
For each of the above mentioned categories of images and for images within each category,
images that are supposed to look like from the natural world
objects
(Literally representative
bizarre.
34
Figure 5. Different colors for the face. Face colors that are unnatural look
"wrong"
A study of
meaning of the color of these faces that are so unnatural has dubious value.
clearly
Of course, if the image is of a car then the coloration can take on a much wider range without
"natural"
Figuratively representative, data representative, and simple abstract objects have less obvious
some
thing or some idea, but don't look enough like what
they represent to be literal, the
"acceptable"
latitude of colors is greater. Take, for example, the following images of a teacher
lecturing to two students. The fact that the teacher figure is different colors
may have subde
differences in the perceived meaning of the image, but none of the images is any more absurd
than another.
35
m *_m tm m
the associations of the colors contained in them. For example, the same color
may have a
different connotation when it is associated with a stick figure than when it is associated with a
For this reason, one image selected from the of figurative representative images will
category
not be representative of the entire group. Because of the need for further category
segmentation based on image content, the results from an experiment with a small number of
difference in response to the same concept (e.g. car) when presented both as literal and as
figurative objects.
36
Data representative objects can take
many forms. However, all charts and graphs exist to
(minimizing perceptual
distance) of individual colors will create a useful
mapping
transforms using the matching method. The color matches could be modeled with a
number of simple models that are developed in P2. They remain to be verified with
psychophysical experiments, but the author is confident that with the proper set of
2. The change in the meaning of colors across contexts (same color in different image types),
conducted. This experiment showed that while there are some significant differences in
to those found in other studies, it was difficult to glean from that useful instances of task
descriptions.
37
be different (or both)
matching decisions
3. Gamut mapping color will when either the type
of image presented to the observer changes or the task description changes. Pilot
experiments were performed to verify these hypotheses. I concluded from these pilot
experiments (both comprise P2) that while the technique can be used to create interesting
models for gamut mapping, there was not a significant difference between color
matching means across images tested. Because of the failings of the PI pilot study,
the differences in task descnption between the two P2 experiments was the difference
define). On comparing the two experiments where the only difference was task
through looking at differences in the parameter values that went into the gamut mapping
model. No statistical
study was done of differences, but all model parameters were
very
close in value.
hypothesis 3 forced a change in direction. From the gamut mapping experiments (P2), it was
clear that hue uniformity was a significant problem. From these data, and the lack of a
significant
body of other data in the field, it was decided that hue uniformity be further
studied.
38
2.2. 1 Experiment P1 (not completed). Semantic differential scaling of colors in the context
of business
graphics
Osgood42
semantic differential scaling of colors. did pioneering work in the area of meaning,
that one could derive those dimensions from a factor analysis applied to semantic differential
scaling of the concept in question. A semantic differential scale is one where polar adjectives
(descriptive terms of opposite meaning) sit between a 7 step scale. The task is to judge the
concept against where it falls on the scale between the polar adjectives. Osgood repeatedly
dimensions he called Evaluation (e.g. good-bad), which was almost always accounts for the
largest variance (about twice as much as the next 2), Potency (e.g. hard-soft), and
Activity (e.g.
active-passive).
Potency and activity on average accounted for about the same amount of
vanance. Sivik and others applied Osgood's techniques to the meaning of color (as described
earlier on page 20-22), and found additional major factors, the most notable of them
being
could be derived from semantic differential scaling as it applied to the meaning of colors in the
39
The abstract and data sub-contexts were chosen to be evaluated as a first attempt. (The
figurative, literal, and text representative images were going to be added following verification
that meaningful factors could be extracted. The experiment was halted before this was
achieved.) Fifteen semantic differential scales were used which were copies of the ones Sivik
used in reference 22. An example script is shown (reduced in size) in figure 9. For the data
representative image, the script read: "Don't judge the color as such, judge it as the coloration
representative images types are shown in figure 10. Each of the image types was printed using
10 colors:
Sky Blue (printer Cyan), Red, Purple, Lime Green, Gray, Blue, Violet, Yellow,
Green, and Orange. For each image type (abstract and data), for each color (10 of them), 10
sampling is not sufficient to be significant, it seemed appropriate at the time as a quick test of
the method to see how and if useful information could be drawn from the analysis techniques.
40
Judge each color with respect to the scales shown below.
Interesting 3 ? Q Q 3 3 Q Boring
Easy 3 3 3 3 3 3 3 Difficull
Dangerous 3 3 3 3 3 3 3 Safe
Warm 3 3 3 3 3 0 3 Cold
LikeQ 3 3 3 3 3 3 Dislike
Serious 3 3 3 3 3 3 3 Hilarious
Beautiful 3 3 3 3 3 3 3 Ugly
Soothing 3 3 3 3 Q ? Q Exciting
Positive 3 3 Q 3 3 3 3 Negative
Complicated 3 3 3 3 3 3 3 Simple
Real 3 3 3 3 3 3 3 Fake
Passive Q 3 3 ? ? ? 3 Active
Expensive 3 3 3 3 3 3 ? Cheap
Success 3 3 3 3 3 3 3 Failure
Figure 9. Reduced version of task and semantic differential scales for experiment PI (abstract
sub-class)
12 3 4 5 6 7 8
Figure 10. Example images from PI experiment. Abstract and Data representative images.
41
Means were calculated for each combination of color and semantic differential scale. Because
there are 10 colors and 15 scales per image, there are 150 means that need to be analyzed.
Each mean is comprised of 10 samples. For this paltry subset of the experiment, 3000
observations had to take place. Figures showing the mean and standard deviation for each
color and scale are quite large, so are shown in appendix A: Mean data plots from experiment
PI. As can be seen from the plots, the data are quite noisy as error bars indicate standard
deviation of the mean. The only way that useful results would be gleaned from this
experiment was if the data indicated that for different image types, there was a systematic shift
in meaning that was uniform across all colors. As can be seen from the plots, there are
significant differences in the mean values of some scales for some descriptive terms, but they
are not uniform across all colors. For instance, the mean value for the descriptive term
"interesting", for the color orange is about -0.4 for the abstract image type whereas it is about
-1.6 for the data image type. The significance is judged by the mean of one scale
being
beyond the error bar (1 standard deviation) of the mean from the other image type1. Though
there are some significant differences between means of colors and descriptive terms across
or data), the trends are not systematic or uniform enough for the results to be used for
subsequent experimentation.
This is not strictly correct, as the standard error of the mean should have been used to judge significance.. However, the
standard error of the mean would result in a smaller error bar since it is calculated from 1.96 *
standard deviation divided by
the square root of number of observers (10 in this case). Since 1.96/sqrt(10) is less than one, then the standard deviation is
larger than the standard error.
42
To test whether the results from this experiment were similar to those done by Sivik, and
by
entire data set (pooled sub-contexts), and to the abstract and data sub-contexts individually.
Because none of this work was continued, only the pooled results will be discussed.
The PCA found three factors that were significant (eigenvalues >
1) which accounted for
about 63% of the variance of the data. Figure 11 shows the first 10 factors (of 15) and the
1
3
fl fl
a
"> 0.8
n n
'5
o
8 0.6
? cumulate
c
c a
(0 individual
re 0.4
I >
re
3 0.2
E
3
o 0 __JL-L-lTJiT-i-r-*1--ir-B1--T-n
1 23456789 10
Osgood43 Sivik44
Sivik's results included 4 factors, and accounted for 85% of the total variance of the system.
dimensional "semantic
space"
43
which the descriptive terms used in the experiment can be mapped as vectors in the space.
determine along which dimension a scale is most aligned. Table 1 show the scales arranged
To apply Osgood's terminology for the three major semantic dimensions, factor 1 would be
be named the
Potency factor. Quite few loadings
"clean,"
a of the are not that is, they don't
44
factors. This analysis does show, however, that the fifteen scales can be reduced to a small
number of dimensions that account for a significant portion of the variance of the system.
Although somewhat interesting, the applicability of the experimental results to determining the
best task description for gamut was deemed unfeasible, so further analyses and
mapping
45
2.2.2 Experiment P2. Gamut mapping through the use
of color matching (Gamut mappingfrom below)
This experiment was started concurrendy with experiment PI, and was intended to test the
1 . To create a gamut mapping model where the parameters can be fitted to psychophysical
2. To test if there is a significant difference in the mean color matches across sub-contexts
(image types).
3. To test if there is a significant difference in the mean color matches across task description
perceptual"
This experiment was split into 2 sub-experiments. The first experiment concentrated on
testing differences between sub-contexts, and the development of a general model to use to
create vector directions that make a color mapping similar to the color
matching mean
closest perceptual color matches of 25 colors when an exact match was not allowed. An
46
a Xerox 4920 color laser printer. Each of 21 observers performed color matches between out
of gamut colors and those on the artificial gamut's edge. Each observer made color matches
on 4 different images. The images represented some of the categories that business graphic
images can fall into. Between the different image types, there were no multidimensional
(MAN OVA) statistically significant differences at the 10% confidence level in any of the 25
colors tested. The mapping vectors showed that 1) observers don't make simple matches as
assumed
by most gamut mapping experiments done to date, 2) the influence of image content
for simple graphical images tested does not have a large effect when the task is to make closest
type gamut mapping is proposed. Results are compared to predictions of a new gamut
mapping technique that minimizes weighted color difference between the target color and the
gamut boundary.
definitions of gamut mapping. The definition assumed here is: Gamut mapping is a solution
approaches have been examined, and some have proved more useful than others. Stone, et
al.45
introduced the concept of gamut mapping, but lumped viewing conditions transforms into
Stone,46
the mapping, and used CIEXYZ space to alter color information. Later, Wallace and
47
Gentile, et
al.,47
such as CIELAB color space. Wolski, Allebach and have created transforms that
L*
use different types of mappings in different parts of color space (above and below of 50).
Berns49
Hoshino and provide a good summary of
existing techniques for gamut mapping, all
mapping is done in
that the a uniform color appearance type space.
assuming perceptually
Fairchild50
Montag and have recently completed a large systematic evaluation of existing gamut
mapping techniques, including clipping transforms, and compression transforms along lines of
constant lightness, constant chroma and constant saturation. Test images were simple spheres
functionally, then tested. Many functions have been proposed for mapping a point outside a
towards a central point in the gamut (e.g. 50,0,0 in CIELAB), and , 5) minimum distance or
taken based on location in color space and in certain sequences. A new gamut mapping
and will be discussed in more detail later. While linear and non-linear compression
48
techniques have been investigated, the direction of the compression either preserves lightness
and hue, or preserves saturation (or it's analog in CIELAB) and hue.
While many of the above transforms work well under certain conditions, the search space for
mapping a three dimensional point to another point in three space is essentially infinite. The
functions for transforming color data are based on heuristics learned through experience, and
from that point are not invalid. However, this does not guarantee that any transform tested is
With this in mind, an experiment was designed to find the perceptually smallest distance from
a point in color space to a closed surface of color (gamut boundary). The focus here was on
gamut
mapping of business-graphics images, although the results may be generally applied to
all image types. Additionally the experiment tested whether image type affected observer's
gamut mapping decisions. The task descnption was the same for all trials:
"Adjust the color of the image on the left hand side of the screen to make it look as close as
Three different image types were presented: Abstract (a colored square), data (a colored
and figurative (a cartoon car). An additional image was used (a colored square with a
chart),
49
allowed observers to pick colors from a two dimensional image of a simulated gamut edge to
gamut"
match colors that were presented in an image. Twenty five "out of colors were
2.2.2.1.3 Experimental
colors from CRT to hardcopy print, or from any source color gamut/ viewing condition to any
destination color
gamut/viewing condition. These are
1) accounting for differences in viewing
condition, i.e. viewing mode (self luminous vs. hard copy), chromatic adaptation, luminance
adaptation, etc. and 2) accounting for gamut mismatch. These experiments assume that
problem
1) has been solved
adequately and concentrate on
investigating problem 2).
The following sections explain the details of the experimental design. The user interface tool
was designed and implemented. The test images were selected. The viewing conditions were
set
up and the CRT was calibrated. The simulated gamut edge was created. The task
top right. The image that is manipulated to make a match was presented at the
top left. The
50
gamut edge (set of available colors from which to and a zoomed version of the gamut
choose)
White background
Ta rget
Irr tage
(outo f gamut)
next
done
"gamut" "zoom"
To use the tool the observer picks a color in the area. This changes the area
"zoom" "gamut"
to make the center color in the area the same as the color picked in the area.
"zoom"
When a point in the area is clicked, the color of the parts of the test image that are
alterable are changed to have the color selected. The colors can be updated
very quickly and
"zoom" "gamut"
change as the user drags the pointer around in the area. The area is a two
dimensional representation of a gamut edge in CIELAB L*C*abh color space. The x axis
represents hue and the y axis represents lightness. The color at the point x,y is the h=x, L*=y,
and C*^ =
the chroma limit at the corresponding L*,h coordinates. There are 36,000 distinct
colors in the gamut area, enough to create the appearance of a continuous gradation of color
51
Any target image can be used in this test. The parts of the image that are updated by the user
"contexts"
Four test images were selected to represent three different image The images
shown are simple computer generated graphical type images that represent the abstract, data,
and figurative image context groups. Not studied here were representative images from the
literal and text contexts due to limitations in the availability of software tools to generate the
proper polygon sets to represent the alterable pixels. Fig. 13 shows examples of the three
"contexts" "abstract"
selected image that were used in the experiment. Two contexts were
-P
12 3 4 5 6 7 8
Figurative Abstract no border Abstract Data
52
this work
may be used. The CRT and computer were set
up under controlled
lighting with a
chromatic adaptation issues. A mixture of fluorescent D50 and D75 lamps were used because
the color
rendering index of available simulated D65 source is quite low. A Minolta Chroma
Meter CS-100 was used to make all the color measurements. A PTFE
pressed puck
(Halon)
and paper measurements were taken at the same angle as the CRT screen. The xyY values of
the surfaces are shown in the first table in appendix B. The AE*^ calculated under D65 for
the difference between the measured white of left and right positions on the CRT was 3.5.
The CRT calibration was done in two steps. First the white point digital counts were selected,
18"
then the system was characterized. A Sun Sparc 5, CRT was used. There is no
way to
directly alter the gain of the monitor, so the digital count maxima must be altered to affect the
white point. The chosen white point was R=255 G=240 B=220, with xyY value of (.305 .321
79.1). The chromaticity of paper white under the selected illurnination was (.320 .333
80.3)
which is between D65 and D55. The gray background of the matte board at (.307 .321
13.3) is
quite close in chromaticity to the CRT white. Based on these measurements it was thought
that there would be little confounding from differences in surround luminance or chromaticity.
Once the white point was calibrated, the digital count to luminance relationship was measured
53
digital counts separately. A spline
fitting routine was used to create smooth curves that
mapped digital count to luminance of separate channels. Then XYZ measurements were
taken of R, G, and B maximum values, and a 3x3 matrix was used to create the transformation
to XYZ from RGB primaries. Flare was measured but not included in the CRT model. The
Once the CRT was characterized, forward and reverse models were coded and a 3 dimensional
LUT was made. Each of the test images was processed through this LUT to make the neutrals
The reverse CRT model was used to find the gamut of the CRT in CIELAB space. A
16x16x16 sampling of CRT device space was converted to 16x16x16 samples in CIELAB
space. The edge of this device cube (16x16x16 14x14x14 values) was used to make a
piecewise planar surface in CIELAB that represents the gamut edge. A two dimensional
gamut
boundary description was created
by using gamut software that performed
mapping
"leaves"
chroma with constant hue and lightness. Flue first
clipping were created
by sampling
CIELAB L*C*abh space at 100 lightness levels and 360 hue partitions with a chroma that is
higher than any device chroma everywhere (128 was used). The gamut software was
mapping
then used to
clip the chroma of each lightness level of each hue leaf. The resulting data can be
represented as a 2D image of L*C*abh values, or as a sequence of hue leaves around the hue
circle.
54
A second gamut was created from CIELAB values obtained from sampling output of a color
laser printer (Xerox 4920). A combination of the CRT and printer gamut was used to create
the simulated destination gamut edge for the experiment. A gamut boundary was created
by
taking the minimum chroma of the two gamut boundaries at each L*,h point. This creates an
"intersection"
gamut. A preliminary gamut edge chosen was an intersection gamut with
chroma scaled
by 90%. The resulting gamut edge was quite under saturated, though and it was
"intersection"
decided to change to an unsealed gamut edge. It was felt also, that the
intersection gamut is the most reasonable one for simulating real-life conditions.
"gamut"
scaled to fit in the window space of the color matching tool. Fig. 14 shows an
303
example of the CRT gamut and the printer gamut at a hue angle of degrees.
55
Figure 14. Example hue leaf of CRT and Printer gamut shapes at hab-303.
mapping that could be tested was where the CRT gamut is larger than the printer gamut. It is
well known that many printers can create colors the CRT is incapable of making, especially in
cyans and greens of mid lightness, and in bright yellows. Because of the limitations imposed
Twenty five colors were selected from the surface of the CRT gamut as test colors to match
"intersection"
with colors on the reduced gamut boundary. They were chosen to span the
color space
fairly uniformly. Representatives from the 11 most basic color terms were
hue non
uniformity in CIELAB space is in the blue region). Out of gamut colors were chosen
in parts of the color space where there was a large mismatch in volumes. This means that in
56
blues and magentas, the target colors were
significantly darker than in the greens, cyans,
yellows and reds. Because there is no large volume mismatch in light blues and magentas, nor
in dark greens, cyans, yellows, or reds, the sampling is sparse in those regions.
The colors selected for the gamut match test are shown in the second table in
mapping
appendix B. Twenty five colors seem to be close to an upper limit for a session
matching
without a break. Since most observers performed two tests in a given sitting, 50 colors could
25th
have been chosen with a break at the color. The mapping task is quite a bit more
frustrating than the exact match exercise though, so it was important not to make the task too
The determine
observers'
57
Frequency of Delta E
Figure 15. Histogram of AE*b for exact match exercise for entire observer
group including
outliers.
The average error is on the same order as the color error between the left and right areas on
the CRT screen that presented the target and test color patches. Much larger errors can be
Four observers were found to have average errors over 7. These observers were not allowed
to continue, as their average errors were too high. The remaining 21 observers had average
were presented to each observer over four different sessions. The image sets were presented
in the same order for each observer. Observers typically matched two sets of images in one
58
sitting.
Including the exact match set, an average of three sittings was necessary to complete
five sets of
matching tasks. The four sets of images were named abstract, data, fig, and
noborder. These sets corresponded to an image of a square (with a black border), an image of
a data graph, an image of a cartoon car, and an image of a square with no black border.
Pictures of these images are shown in Fig. 13. The list of colors was randomized once and
Each of the 21 observers performed one set of matches for each of the four image types. The
data set contains, for each color, for each observer, for each image type, one color match
vector. Each color match vector is composed of three independent numbers and must be
Interesting means include the mean mapping vector for each color and the mean vector for
the means of color vectors for each image type are statistically significant. If there is no
statistical difference in the means, they can be pooled to create a better estimate of the overall
For each of the 25 colors, a multivariate ANOVA was performed to test whether mean (three
dimensional) mapping vectors from the 4 image types were significantly different from one
=
HO: fj.1 =
(12 =
jX3 (14 where
59
(Il =
mean of context 1 (abstract)
p.2 =
mean of context 2 (data)
p.3 =
mean of context 3 (figurative)
|I4 =
mean of context 4 (no border abstract)
b*
There are 21 subjects X 4 contexts of data for each color for each of L*, a*, and vectors.
Since the mapping vectors lie in three dimensions, it is appropriate to use multivariate
techniques to analyze the difference of the means (MAN OVA). Three dimensional
MANOVA was calculated for each of the 25 colors tested. The SYSTAT statistics software
Wilks'
package was used to do all the analysis. Lambda, Pillai trace and
Hotelling-Lawley trace
were all performed to calculate f-statistics. The critical F value for these tests was calculated
based on 9/189, 9/240, 9/230 degrees of freedom respectively. The critical value for all tests
is equivalent to 9/ degrees of freedom. The critical F value at the 10% level is 1.6315.
The null hypothesis was not rejected at the 10 percent level for any of the 25 colors tested for
independent
colors'
dimensions did reject the null hypothesis. Based on these data, it was concluded that there is
60
Some comments of observers are relevant deserve Subjects
though, and comment. noted that
bright cyan and bright green, both device CRT maximum were the most difficult and
colors,
colors also appeared to be self luminous, or "fluorescent", and impossible to match with the
very light, brilliant greens and cyans, where printer the colors are
highly chromatic at much
darker colors.
The gamut boundaries shown are for the hue angle that the point mapped to, not the hue leaf
"constant"
The general trend seems to be a
(perceptual) hue mapping with lightness and
L*
the volume, although not
exactly at 50, but quite close in many parts of the color space.
In the yellow-green quadrant, the vectors all point toward greater than L*=50, while in the
L*
yellow red quadrant, the mapping vectors all point toward less than =
50. This may be due
to the yellow-green hue leaves being convex where there are more high chroma colors near the
target lightness. The yellow-red hue leaves are all concave, so more lightness must be
sacrificed to get reasonable chroma. The centroid behavior of the mapping vectors is not
61
ao 100 120
"match"
If the task description was to make a preferred or to make a match that preserves the
intent or
meaning of the color in the context of the specific image shown, this behavior may
change significantly.
Every color match vector can be thought of as a combination of component vectors that
originate at the target color and end somewhere on the surface of the gamut. A reasonable set
L*
1) constant vector,
62
2) centroid vector (points toward 50,0,0),
3) minimum distance to gamut surface vector in the L*-C*ab plane (at fixed hue angle).
that, as is the case in the experimental data set, the match vector is a combination of at least
component.
perceptually smallest
match vector
gamut
edge
The slope P of the line (in the L* C*^ plane) from the target color to the gamut boundary is a
* *
P =
(Lw *
lightness.y + Cw centroid.y + Mw
*
mindist.y) / (Lw
lightness.x + Cw * centroid.x + Mw * mindist.x)
for weighted component vector gamut mapping.
Equation 1. Slope calculation
63
Lightness, centroid, mindist are two dimensional unit vectors calculated for each target color.
The .x and
.y
suffix for each unit vector denote x and
y axes. The x axis corresponds to
chroma and the y axis corresponds to lightness. Lw, Cw, and Mw are scalar multipliers. The
mapped color C is a point that sits on the gamut boundary and intersects a line with slope P
The three dimensional extension to the model is to find the minimum AE*ab between target
color and mapped color P in a range of hue angles about the hue angle of the target color.
The parameter Hw determines how many hue leaves will be searched for a n^nirnurn AE*^
between the target color and the color P. Hw is an integer that is the extent above and below
A model was computed in both the two and the three dimensional cases. In order to find
model vectors and observer matches, as a function of the scalar parameters, Lw, Cw, Mw, and
"amoeba" C52
FIw (in the 3d case). The routine from Numerical Recipes in was used to find
minima. The routine implements an N dimensional simplex that crawls toward minima by
using reflection, expansion and contraction. Table 2 shows the results for two and three
64
choices preserved perceptual hue. Figs. 18 and 19 show the resultant vectors compared to the
observer matches. Note that the regions of greatest error are in the blues and in the concave
hue leaves of the red-yellow region. The hue leaves shown are for the hue angle of the
observer match for the 2d case and for the model prediction in the 3d case.
80 100 120 10 20 30 40 50 60
20 40 60
Figure 18. 2D weighted vector gamut mapping. Open arrow heads indicate observer results.
Filled arrow heads indicate model predictions. Hue leaves shown are for the hue angle of the
observer match.
65
100
r ^Op<4
\\l 80 jffli 5
^S1 /^53.9
60
e^ fes.
&3 SO 75
40
20
4>v
\
-100
Figure 19. 3D weighted vector gamut mapping. Open arrow heads indicate observer results.
Filled arrow heads indicate model predictions. Hue leaves shown are for the hue angle of the
predicted match.
Constant hue loci data from Hung and Berns were used to test hue nonuniformity of
mapping vectors. If the constant hue loci could be used to correct the hue nonuniformity of
the mapping vectors, it may be concluded that observers are performing color matches that
66
incoming hues that sat on constant hue loci, to the reference hues that were used to find the
loci. In between these 12 hue loci, linear interpolation was used to fill the rest of the space.
The constant lightness data set created a that corrected hue vectors
mapping significantly
toward 0 for the blue region, although the cyan region was unchanged and green was made
slightly worse (yellower). Fig. 20 shows the original matches, along with the "hue
correc
The data here strongly suggest, at the very least in the blue and cyan regions, that CIELAB
color space is not uniform enough to do constant metric hue gamut mapping. Additional data
must be drawn from constant hue loci experiments to straighten the hue metric before it will
67
b plot, L marks arrow tails
"corrected"
Figure 20. Hue angle matches on left, and hue angle matches on right (using Hung's
Constant Lightness data).
minimum weighted color difference in CIELAB space. The weighting of the color difference
68
A2 A2
AL*Y (AC* (AH*
?b
AE =
+1 +
ab
Kl Kc Kh
)
Kl, Kc, and Kh can be altered to change the shape of an ellipsoid like surface of constant
error.
Mapping vectors are defined by the target color and the point at which the smallest
ellipsoid intersects with the gamut volume (with the target color being at the center of the
ellipsoid). In a discrete sense (and how it was done in this work), the vector is found by
point on the gamut boundary. 36000 data points were used to represent the gamut boundary,
100 lightness levels at 360 hue angles. The mapped point is that which has the smallest
weighted error. Katoh and Ito tested their model with computer generated images and found
original images. This may be interpreted as lightness is more important to preserve than
Fig. 21 shows a comparison of Katoh and Ito's best parameter mappings against the observer
results found in this experiment. Katoh and Ito's results are based on the gamut of the printer
used and is likely to be somewhat different from that used in this experiment. There are
they
marked differences in hue in the blue and green regions. The largest differences in chroma
also appear in the blue region. This is not surprising as the blue region is most non uniform in
69
hue. However, the L*-C*A fit is quite close in a large part of the space. The total AE*^ error
between the predictions and the observer matches is 318. The average error is 12.72.
L*
b pi jr_, rr,.
100
5*103.
60
40
20
00
^^-20 7
30
60 ^K^ ^r-7T9 7
40
20
C
20 40 60 80 100 20 40 60 80 100 120 10 20 30 40 50 60
Figure 21. Comparison of Observer results with Katoh and Ito 1:2:2 vectors. Open arrow
heads indicate observer results. Filled arrow heads indicate Katoh and Ito results. Hue leaves
shown are for the hue angle of the predicted match.
model was made that gave better fit to the experimental data. This enhancement allows the
L*
ellipsoid to be rotated in the
C*A plane. The enhanced model has the form in equation 3.
70
( (
L*-50 L*-50
A =
AC * Sin Kt ArcTan + AL*
Cos Kt ArcTam
C* C*
( t
B= AC*
L *
-50 Vl L*-50
Cos Kt ArcTan AL *
Sin Kt ArcTan
C* C*
B\2 (AH*A2
AE
Kl) +{Kc) +{ Kh
L*
The and
C*^ in equations A and B are elements of the target color. Kt is the parameter
Optimized model parameters were found for both Katoh and Ito model and the enhanced
71
The results from the optimization show that the best fits occur when the coefficient of chroma
is greater than that for lightness, but coefficient for hue is less. This implies that observers are
more sensitive to lightness changes than to chroma changes, but they are more sensitive to
Ito's results that showed less sensitivity to hue shifts. The rotation parameter Kt shows that
rotating the ellipsoid can fit the observer match data better, but it complicates the
interpretation.
100
100
^103.
100
/ J-*53.3
60 60
60 V^^yr-^^STl 9 1 .
40 40
40
20 20
20
- C* c*
. ,
c
20 40 60 80 100 120 20 40 60 80 100 20 40 60 80 100
20 40 60 80 100 120 L0 20 3n 40 50 60
Figure 22. Comparison of Observer results with best 2d enhanced Katoh optimization
parameters. Open arrow heads indicate observer results. Filled arrow heads indicate model
results. Hue leaves shown are for the hue angle of the observer match.
Figs. 22 and 23 show the best results from the enhanced model. Fig. 22 shows the results of
2D matches. Notice that this fit is close in all regions of the space. This
very implies that the
72
model
may work
very well when a
perceptually uniform hue color space is developed.
Whereas this enhanced model, as well as the original model fit the data adequately, there are
the highest chroma values in a given region (colors lying beyond the tip of the hue leaf). This
"flat"
will tend to create spots on an image. A weighted vector
mapping has less tendency to
create
many to one mappings since it is influenced more by static directions in space. Note
that the weighted vector approach can fit the observer match data better than the Katoh and
L*
a b plot, narks arrow calls
100
^103.
SO
^0\gilp-^-.]
\\l ry 80
49
60
*>
.'
"7S V: SO 75
"Iki 40
a 20
-100
100
^S^-20.7
80
60 ^Sr^9-7
40
20
20 40 60 80 100 120 10 20 30 40 50 60
20 40 60 80 100
Figure 23. Comparison of Observer results with best 3d enhanced Katoh model optimization
parameters. Open arrow heads indicate observer results. Filled arrow heads indicate model
results. Hue leaves shown are for the hue angle of the model prediction.
73
These results show that both types of gamut mapping models can be used to fit the observer
1. Observers don't make simple matches of the kind that are assumed in-gamut mapping
experiments to date. However, three simple models were optimized against the observer
match data and found to fit the match vectors acceptably, and may be used to create a
in any of 25 gamut
mapping decisions over the four different images shown. The entire
data set has been treated as if it came from the same population.
3. CIELAB, arguably the most popular color space for performing gamut mapping
hue) matches.
The experiment
clearly showed that CIELAB is nonuniform in hue. An experiment needed to
be designed to characterize the nonuniformity over more of the color space in a simple,
74
macroscopic manner that can use results to create a transform to a color space whose metric
75
2.2.2.2 Preserving meaning in color matches (sub-experiment 2)
The above work was repeated almost exactly, with the exception of the addition of two new
"In the last experiment, if you did it, you attempted to find the closest perceptual match between two colored
images. The criterion for matching was to find the color that made the images look as close to one another as
possible.
In this experiment, you are presented with the same images and colors, but instead of attempting to make a
color match between images, you are to adjust the color of the image on the left to try to make the image
For instance, you may be making a presentation in Microsoft PowerPoint, and the color you want to use is the
one in the image onthe right. Your task is to pick a color from the available color palette that will convey the
same message, or the most similar message as that conveyed in the image on the right. Please don't make an
"name"
adjustment that is too radical to the color of the image on the left, i.e. the basic color or color category
shouldn't change (reds should remain reds, greens should remain greens, etc.).
As an example, I may be presented with a red color as a target color to match. But, the perceptually closest
color is a bit less saturated. In order to convey what I might want to convey with a saturated red (for instance,
danger or attention), I may want to find a color near red that doesn't match in hue as much, but is more
saturated.
In this experiment, you are presented with the same images and colors, but instead of attempting to
make a color match between images, try to find a best substitute for the target color. This may or may
not be the same as the closest perceptual color.
One approach to the problem is to first find a closest perceptual match. Once this is done, try to find a
color that you would use in place of the target color without
changing the color category (or color
name)."
Some observers had difficulty understanding the task. I discussed the intention of the task
76
12 3 4 5 6 7 8
The results from statistical analysis showed that for the multivariate ANOVA (MANOVA),
level. Further analysis showed that the text sub-context was the most different from the other
contexts. When the MANOVA analysis was re-run with the text data set removed, the null
hypothesis was rejected for only 5 of the 25 colors. A univariate analysis on differences of data
between individual sub-contexts shows that most univariate rejections happen between the
text sub-context and others. Figure 25 shows a graphical depiction of the colors rejected
across sub-contexts.
77
12
10
n
Abstract Data
Figure 25. Histograms of colors where the null hypothesis was rejected between sub-
contexts
Although the null hypothesis was rejected for several of the colors across sub-contexts, as
figure 26 shows, the actual value differences of the mean mappings for each context does not
show a
readily identifiable systematic trend. However, when
plotting the pooled mean of all
sub-contexts except text against the text sub-context, several comments can be made. Figure
rest of the subcontexts pooled (filled arrow heads). For many colors, the text
mapping is to a
lighter color than the pooled mapping, and in the green color, the text
mapping is directly
pointing at the origin, whereas the pooled mapping is yellower. The lighter mappings of the
"T"
text sub-context may be due to assimilation effects where the image of the is blended with
78
100
^^13.7
90
/ 5
^^-48.
60 y^^f?o.6
40
20
100
^\^.-21.2
80
60 &efrsi;l->>
6~'
40 ""-* -39.5
20
C
20 40 60 80 100 20 40 60 80 100 120 10 20 30 40 50 60
Figure 26. Mean mapping vectors for each of the 5 sub contexts.
~'
>^*U
80 -5^"73
'5
60
40
20
40 7 60
70.4
40
54.7
-56 5 20
Figure 27. Mean mapping vectors of Text sub-context (open arrow heads) and the mean of all
79
2.2.3 Conclusions hading to body of thesis work
Results from these experiments (the P2 experiments) showed that, although color
matching is
an
interesting approach to the gamut mapping problem, it is likely limited in the granularity of
the exercise. Restated, this approach was useful to derive a general gamut mapping model and
to show that CIELAB color space is very non-uniform in hue in the blue region. However,
Based on the conclusions drawn from experiments PI and P2, it was felt that attacking the hue
Thus, experiments were designed to more fully explore the nature of hue uniformity in color
abandoned.
80
3 Approach and Results
A search on data sets of constant perceptual hue was performed to sec if there were bodies of
the data set generation. Several sources were researched, but very little data was found that
was useful. The most obvious place to start is to investigate the Munsell
renotatio
colors.
"circles"
CIELAB color space. Concentric denote colors of
visually equal chroma and curves
emanating from the origin like the spokes on a wheel denote colors of
visually equal hue. At
-2QQ
400-
actually scaled
by observers. The colors outside of the squiggly line are all extrapolated (in xy
chromaticity space!). Additionally, figure 29 shows the projection of a typical CRT gamut onto
scaled data from the Munsell renotation. From these data, it was concluded that the Munsell
data set was deemed insufficient to use to judge perceptual hue uniformity. One explanation
for the low chroma limit is that the Munsell color chips were manufactured to be very durable
and fade resistant, and were made with a matte finish (which limits chroma). High chroma
paints that were available in 1943 were not durable and fade resistant enough to use for the
82
400-
* -200
.Q
450-
Figure 29. Munsell renotation colors with extrapolation boundary and CRT gamut
CIELAB
projection onto
a"-b"
plane.
There did exist one data set that was specifically derived to measure hue constancy in CIELUV
3
color space. This was the collection of two data sets from Hung and Berns. Two separate
experiments were performed to find loci of constant perceived hue in CIELUV color space.
The data sets were named CL (constant lightness) and VL (variable lightness). Figure 30
L* L*
chroma to % of max chroma (all of equal CIELUV lightness, with same as the of the
83
max chroma color). The VL data comprise 9 points per hue angle equally spaced in CIELUV
L*
from 20 to 90. In both cases, the reference color was the maximum chroma color of that
hue angle.
a*
(CD
a"
(VL)
-100 -50 0 50 -100 -50 0 50
100 100
1
i
\ i (
50
s
N
\
\
v
*
\
/
/
t
/
/
/
50
N
\ 1
/
/
b*
o
:-:i17 b*
o
11>
-50
-
i \ \
N
s
-50
-P* ^ v V
N
\
-100
N
\
s
-100
H
Figure 30. Hung and Berns loci of constant perceived hue. Left plot is CL data, right
plot is VL data.
Note that the CL data are smooth, while the VL data appear to be noisy. Notice, in particular,
that there is a crossover in the hue loci of blue and cyan-blue (lower left quadrant). This
implies that the data found are inconsistent, and were therefore judged to be unusable. This
conclusion is supported
by the variance measures reported by Hung and Berns in their article.
This analysis of available data from which to derive surfaces of constant hue in color space left
84
space occupied
by a CRT display. Thus, an experiment was conducted to find surfaces of
constant perceived hue. The goal of the experiment was to create a database large enough to
be able to derive and test a color space which has a metnc hue angle that represents constant
perceived hue.
85
3.1 Experiment El. Finding surfaces of constant hue in color space
3.1.1 Abstract
A colorimetrically calibrated CRT display was used to measure constant perceptual hue
surfaces in color space. 306 points over fifteen equally spaced hue angles
(every 24 degrees) in
each. Intra-observer variation was used to weight mean observer hue matches for each of 306
colors. Analysis of perceived hue uniformity was performed in CIELAB and CIECAM97s
color spaces. Other constant hue experimental results are analyzed and compared to data
obtained here.
3.1 .2 Introduction
The goal of a color appearance space, among other things, is to provide a structured
way to
model perception. That is, it should have appearance attributes that accurately reflect the
hue.56
saturation and
The most studied and best-understood perceptual attributes are lightness and brightness,
especially of neutral colors. This is not surprising, as lightness is the perception that defines
shapes and textures to the greatest extent and it is the most basic. Indeed, the first two colors
86
brightness are
fairly well understood phenomena. have
However, other perceptual attributes
important not to change the perceptual attributes of that hue. At the least, the hue name
very
Hung and Berns showed that for four color appearance spaces, none have accurate
representations of hue neither with lightness nor chroma. Additionally, color spaces have
different nonuniformities. For instance, for CIELAB the cyan-blue to blue-purple region is
the least uniform, whereas for Nayatani's space, the red region shows the least uniformity. It
enough in the blue region to unambiguously cross color name boundaries between blue and
If surfaces of constant perceived hue that correctly modeled human perception existed,
presumably a color space could be derived that fit the data. Unfortunately, such a set of data
does not exist in the literature. Munsell and NCS color order systems data were scaled
only for
colors with relatively low chroma because of material limitations. Other experiments have
87
been done to find constant hue surfaces over a larger gamut, but have been too cumbersome
and difficult to get significant results. Hung and Berns give a more detailed analysis
statistically
of these drawbacks.
Berns'
While the Flung and experimental results added more data to the available set, it did not
cover enough of the gamut to enable good modeling. Across 12 hue angles, their color
sampling was limited to 3 colors of equal lightness as the reference color plus 8 colors of
varying lightness along the gamut edge. While the experiment showed that perceived hue is a
function of both lightness and chroma, the relationship of hue with the other two dimensions
was not
fully examined.
In order to complete the description of full surfaces of constant perceived hue, an experiment
was performed that sampled an average of 20 colors over 15 equally spaced hue angles in
elements that might sacrifice precision. Thirty observers performed the experiment three
times each.
3.1.3 Experimental
number of colors to be tested. An alternate approach was designed that gave the observer
precision of .5 degrees at all lightness and chroma values. This allowed observers to match the
306 colors of one trial run of the experiment in an average of 90 minutes. Observers were
allowed to make as
many or as few matches as
they wanted in a given sitting. Test stimuli were
3.1.3.1 Script
Each observer was given a script with the same instructions (it was part of the user interface
"Make the hue of the test patch the same as the hue of the reference patch. The same hue
blueness."
means the color has the same amount of redness, greenness, yellowness, or
For the first session each observer was allowed to get acclimated with the interface and ask
questions until
they were comfortable with the task. Some observers had difficulty
between the test stimuli and the reference. This did not pose a problem in general, as the
confined to the CIELAB hue dimension. There were conditions due to gamut limitations,
four subjects who had this confusion despite verbal attempts to descnbe the task and to warn
then-
about the difference. These subjects also had large intra-observer variance, so
89
3.1.3.2 User interface
20"
The resolution was 1152x900 with a refresh rate of 76 Hz and a 0.31mm-phosphor trio pitch.
through July, 1997) to minimize drift from power cycling. The layout of the interface is shown
in figure 31. The stimulus squares for both test and reference subtended an angle of 4.2
degrees.
The white border had a luminance of 71 cd/m2. The luminance factor of the background gray
was .35 (25 cd/m2). The anchor colors showed what the test stimulus would look like if the
slider bar was pulled as far as it could go in either direction. The hue extent of all colors was
+-50 degrees. This amount was a good tradeoff between sensitivity and range of hue angle
based on
preliminary experiments. The user was instructed to make as good a match as
possible, then proceed by pressing the next button. If desired, the user could press the
previous button to see matches that were already made. The user could quit at
any time and
the state would be saved, so that the next session would begin where the last one left off (if the
90
^
10.5"
white border
gray background
reference
175"|
slider bar
gray
anchor
-?
'
( next )
C save j ( previous)
The topmost patch is the reference patch. There were 15 such patches, but they changed at
"next"
every color presentation because the colors were randomized for each trial. The middle
patch is the test color. This is the color that changes when the slider bar is moved. The colors
update as soon as the slider bar is moved, and remain where the user leaves the slider. The
"next" "previous"
patch is a
gray reference patch that has the same lightness as the current test patch. This patch
was included to be more compatible with the Hung and Berns interface, which also had a
gray
patch. The purpose of the gray patch was confusing to most observers and would not be
91
3.1.3.3 Viewing conditions
experiment took place under dark surround conditions. The walls of the room were covered
with black felt to eliminate the possibility of reflection from the CRT.
3.1.3.4 Calibration
Gain-Offset-Gamma58
The CRT was calibrated using a variant of the model. First the white
point was found through a search method that altered the digital count of white (it could not
be altered
by analog methods). Then the maximum red, green, blue channels were measured
to find the chromaticities of the primary colors. The gray scale was then measured to find the
luminance-digital count relationship. Ten-bit tone-reproduction curves were made from spline
and the model was tested for inversion error and colorimetric accuracy. The inversion error of
53
the model was less than .001 AE*^. The average AE*ab for a
sampling of colors was
times over the course of the experiment when the average color error became greater than 2.5.
306 colors were selected over 15 equally spaced hue angles in color space. Figure 32 shows an
example
sampling of a hue leaf at 0 degrees.
The reference color was chosen for each hue leaf to have moderate to high chroma, and to be
close to the center of the area of the hue leaf. It was felt that the reference color should not be
92
too far from any test color. This in Berns
is contrast to
Flung and who chose the reference
color to have the highest chroma. Figure 33 shows a projection onto the a*-b*
plane of the
Colors were chosen to span as much of the available color gamut as possible.
Sampling was
done as
uniformly as possible, but gamut restrictions made on a uniform grid
sampling
impossible.
Instead, within the gamut of a given hue leaf, the colors are uniformly sampled,
but at the edges the samples follow the gamut edge more closely. It was also felt that the
3.1.3.6 Observers
30 observers, 18 males and 12 females, between the ages of 27 and 59 participated in the
entice so
many willing and qualified observers. Each observer was given a brief color test with
the Ishihara color vision plates. Using this prescreening, four adult males were identified to
being very experienced color observers to complete novices. Knowledge of the objective of
observers'
93
100-,
90-
?
80 -
70- ? ? ? ?
60- ?
50- ? ? ?
? ?
40 -
30 -
? ? ? ?
20-
?
? ?
10 -
n - 1-
H 1 1 1
0 20 40 60 80 100
Figure 32. Hue leaf sampling at 0 degrees. The large square is the reference color.
^tOO-
89-
60.
f
s .40*r
*. 20
?I I * .?
DO 3D _?/!>. 50 100
-
-^-20
*-40
\
-60
-80
409-
colors.
94
3.1.4 Results and Discussion
CIELAB Ahj, space. Histograms of the 90 samples of each color were made. Figure 34
D c It
3e ;a@ !B
l an C to
ZjM
1
:E=JiJ-
:^ilr at
=jt=
~M :fft\ =^=
j jfl ::|^rfiH
ifillai
.i.l.l.l.lll..
nJr
_ilIILi.
Figure 34. Histograms of hue matches for 90 observations at reference hue angle of 0 degrees.
Histograms are plotted in the respective locations in the L*-C*ab plane of the reference colors.
the most part, the tails are symmetric. Confidence limits at 95% (1.96 times standard
error)
were calculated for each color from the entire data set of 90 observations. The standard error
of the sample mean is the standard deviation of the sample mean, and is calculated from the
sample standard deviation divided by the square root of the number of samples (fj/V90). This
95
is an estimate of how different the means would be if the experiment were to be run again.
Figure 35 shows the average and standard deviation of 95% confidence limits for each hue
surface.
24 48 72 96 120 144 168 192 216 240 264 288 312 336 0 deg
Odeg
deg deg deg deg deg deg deg deg deg deg deg deg deg deg
Figure 35. Average and standard deviation of 95% confidence limits of mean hue matches as a
Observers noted that for the cyan region (192 degrees 264 degrees) there was a large degree
of
uncertainty about where to make a match. Many hue locations looked very similar.
Many
also commented that the purple-blue colors were very easy to match and
they had a high
degree of confidence in making those matches. This can be seen from the average confidence
limit plot. The large average confidence limit regions show the most
variability in the match
data, since it was harder to make a match. The 312-degree hue surface is the blue purple in
If there is more
variability in the data, there is a larger area over which a match is acceptable.
96
These data be in
might useful
helping determine suprathreshold hue tolerances, and is in some
Overall weighted mean data were calculated. Weights were derived from
individuals'
inverse
observers all the time. Two metrics were investigated, sample variance and absolute
difference. A good
weighting metric should include more influence from internally consistent
observers without
completely excluding others. The inverse of absolute difference was chosen
over variance, since it was more equitable in weighting observers. Inverse variance
weighting
the mean. Figures 36 and 37 show the histograms of weights for inverse absolute difference
100.00%
80.00%
60.00%
--
40.00%
20.00%
.00%
o o o oooooooooooo
Figure 36. Histogram over the entire data set (306 colors X 30 observation means -
9180) of
97
100.00%
+ 80.00%
60.00%
40.00%
20.00%
.00%
oooooooo^
ooooooooo
Figure 37. Histogram over the entire data set (306 colors X 30 observation means
-
9180) of
If the mean was calculated without weighting, each sample would be normalized
by the inverse
the intra-observer
samples, in this 30, the mean of
of the number of case since we are taking
For the inverse absolute difference case (figure 36), roughly 45% of weights are above 0.0333,
whereas in the inverse variance case (figure 37), only 27% of the weights are above 0.0333.
Weighted mean data with confidence limits are shown in figure 38. The figure is a 2
dimensional representation of the mean hue matches around the hue circle. The x-axis shows
reference hue angle, and the y-axis represents the hue match with +- 95% confidence limits.
The data for each hue is sorted first by lightness (from dark to light), then by increasing
chroma. This means that the overall slope of the set of points that make up a constant hue
surface represents the variation of hue with lightness. The variation of hue with chroma can
be seen as points that make a sawtooth pattern within a set of constant hue points. Note that
98
the 95% confidence limits are smaller than most of the trends on the constant hue
significantly
surfaces.
Note at zero degrees the slope of the points is essentially 0. From 0 degrees to 72 degrees, the
slope of the points becomes increasingly positive, then decreases back to 0 at 144 degrees.
increasing lightness. From 144 degrees to 312 degrees the slope of the constant hue points
becomes increasingly negative, then tends back to zero as 0 degrees is approached. The
constant hue chroma relationships are most apparent at 264 and 288 degrees where the set of
constant data points deviate most from a straight line. Chroma relationships with hue are
visualized in a more intuitive way later. Figure 38 is primarily for visualization of the variation
99
350
Figure 38. Weighted mean hue matches in CIELAB space with +-95% confidence limits.
100
100
120 96
144 72
48
50
24
7
^k. ^ itH M jff
168
^^
0
b*
0
"^
192
216
-50
240
64^
W. 336
288
312
-100
L* L* L*
=
10 =
20 =
30
L* L* L*
=
40 =
50 =
60
1^
^ ^>
L* L*
L* =
70 =
80 =
90
reference colors.
101
It is clear that the blue region has the most chroma nonuniformity in CIELAB space. These
CIECAM97s6
in the proposed color appearance model (from the CIE TC1-34). The
recently
350
300
250 -
*v-
,VA
J3
"
200
a)
E
**s.
150
100
50
o mm/mm
24 48 72 96 120 144 168 192 216 264 288 312 336
Figure 41. Weighted mean hue matches plotted in CIECAM97s color appearance space. The x-
The in calculating
cd/m2
of luminance of white), Yb=35, X^=95.01, Yw=100, Zw=108.81, and the conditions for dark
surround: c=0.525, Nc=0.8, FL[=1.0, F=0.9. Note that, similar to but not as drastic as
CIELAB, the blue region has significant chroma non-uniformity. The lightness relationship
with hue in CIECAM97s can be seen to be very similar in trend to the CIELAB data.
102
The degree of
nonuniformity can be quantified in many ways. One way is to calculate the
mean color difference from the mean hue angle for each hue surface. This gives a convenient
and simple metric for cross color space comparison. This assumes that there is a reasonable
similarity between Euclidean distances in both color spaces, and that they are equally
discriminative throughout the entire space. While these assumptions are not strictly correct,
the metric gives a feel for a first order approximation of non-uniformity. Figure 44 shows the
mean distance from the mean hue angle for each hue surface for CIELAB and CIECAM97s
color spaces.
100
C sin(h) 0
-100
100
103
J=10 J=20 J=30
Figure 43. Loci of constant hue in CIECAM97s C cos(h)-C sin(h) at different lightness levels.
Dots represent reference colors.
The CIECAM mean distance is greater than the CIELAB mean distance for 10 of the 15 hue
surfaces. CIELAB distance is greater than CIECAM97s for 2 hue surfaces, and
they have
CIELAB is more uniform, but that would ignore the fact that the 288 degree hue surface has
104
-CIELAB
-CIECAM 97s
DOtMtDO-q-OOCMtD
iDGi-<--<t<OfD-T-n
*-i-CMC\IC\1CVJOC1
CIECAM97s.
Another way to quantify hue non-uniformity would be to measure the hue angle difference
between the greatest hue deviations from the mean in both directions. This would give a
105
CIELAB
CECAM97S
CIECAM97s.
Note that in figure 45, the entire region from 192 degrees to 288 degrees has higher maximum
error in CIELAB.
Figure 46 shows the projection of hue surfaces onto the opponent axis of linear opponent
Y-Z. Notice that the curvature for the blue region is significantly reduced in a non-
compressed color space. Although this is interesting, it is not suggested that one work in this
106
100
*
0 (Y-Z) 100
100
Figure 46. Constant hue surfaces shown in a linear opponent space (Y,X-Y,Y-Z). Numbers
represent CIELAB reference hue angles.
Notice that the blue curvature is lessened in this space, although visual inspection of the color
space shows clear color name change from blue to purple along a constant metric hue angle
107
120 96
4<
100
i 72
48
1 24
-100
:% 28?
\
312
3 16
100
100
Figure 47. Constant hue surfaces shown plotted in CIELUV color space. Numbers represent
Figure 48 shows
Hung and Berns constant hue loci for constant lightness (CL) and variable
lightness (VL) experiments superimposed over the results found in this research. For the CL
loci, the chroma non-uniformity trends are very similar. The respective CRT gamuts are
somewhat different, but the overall agreement appears to be very good. This should not be
surprising, as both experimental designs and presentation medium were quite close. The
varying lightness hue loci were derived for 8 lightness levels on the edge of the CRT gamut at
each reference hue. The VL data match to current findings is not as good as the CL data. In
due to the fact that Flung and Berns only had 9 observers, and as such had a higher degree of
108
1100 100
v \
1
/
f J /
1
1
50 \\ VI I'M JV
50
b* b*
0 0
fc^*.^:
*
^tff^^Jffi
-50 -50
>
*
-100
1^ -100
Left graph compares CL (constant lightness) data. Right graph compares VL (variable
lightness) data. Hung and Berns data are shown in bold dotted lines.
work was done. An experiment was conducted to compare constant metric hue surfaces for
perceptual uniformity. These experiments are described in a later section entitled "verification
experiments."
CRT screen. It is not clear if the data also applies to bright surround conditions and
viewing
109
Another concern has to do with the gamut limitations of the experimental medium, namely the
CRT gamut. To make these results general, some method of extrapolation should be done to
map the surfaces out to larger chroma values. It would be desirable to extrapolate to chroma
with extrapolation comes the need to verify accuracy of the resultant data set.
3.1.5 Conclusion
An experiment was conducted to find surfaces of constant perceived hue in color space. 306
colors were tested over 15 equally spaced hue angles in CIELAB color space. Results show
fairly well with Hung and Berns constant lightness data loci of constant perceived hue.
One of the goals of this work was to gather enough data to enable the modeling of a constant
perceptual hue color space that is functionally related to CIEXYZ or an equivalent color
space. This would allow users to transform their colorimetric data to constant hue color space
110
3.2 Development of constant hue color spaces
result in a metric hue angle consistent with perceived hue. The two approaches that yielded
useful results were the development of neural net based correction, and the development of a
visualization tool that was used to optimize simple, well known transformations. The neural
net approach was useful for generating three dimensional lookup tables that were used to
create transforms between CIELAB and a constant hue color space. The visualization tool
find for
enough"
in general and hue uniformity in particular within a well understood structure for color space
transformations.
It was initially thought that neural net ttaining could be used as a mechanism to find a general
purpose, invertable color space description that was uniform in perceived hue. This turned out
not to be the case, but neural networks were useful in creating table based hue correction
The only completely invertable (with no null space) neural net is one in which the hidden layer
has the same number of nodes as the input and output layers. A neural net with one hidden
layer with 3 nodes, and an input and output layer with 3 nodes each is shown in figure 49.
Ill
Nodes i Nodes k
Nodes j
The equation for computation of the numerical value for each of the hidden layer nodes is the
equation H .
,
=
/( V wi Xt + /? ) .
where i is the index for each of the input nodes, and
j is
the index for each of the hidden nodes. The function /was chosen to be tanh, a sigmoidal
function that has a slope of 1 at the origin. This function can be used to model compressive,
expansive, and linear function types depending on where in the domain the input data lies.
Figure 50. Tanh function is expansive in the negative range, linear near the origin, and
compressive in the positive range.
112
The bias term B can shift the domain to the appropriate part of the curve to match the nature
of the nonlineanty. The output node is computed as a weighted sum of hidden layer nodes,
Ok =
Zj wjk *Hj- Th15 is the general model for a neural net functional form, except that the
special case of
having 3 hidden layers This be
makes it invertable. can shown
by modeling the
neural net
by the equivalent functions in matrix form. The matrix form of the 3 node hidden
w, wn W13 x. B,
H2 w, w22 W23
*
*2 +
B2
w
31 W32 w33 *, B,
Ox W, w\2 W'n "#,
o2 W
21 W'l2 W'23 H2
O, w
31 w\2 "V H3
Equation 4. Matrix form of forward computation of 3 node hidden layer neural network.
The inverted form is obvious from equation 4, so is not shown. The matrices W and W must
To test whether the neural net approach was viable, it was tested on a known transform set.
The 306 color points that were the starting hue colors for the constant hue surfaces
experiment (El) were used to test if the neural network approach could be used to model the
transform from CIEXYZ (D65) to CIELAB color space. The same colors were used to train
and test the network. After training the network for several minutes, the training was
113
completed, and the network was tested. The methods used to build the network and
training
1.59,60'61
algonthms are discussed in detail in references 59-6 The AE^ error from testing the
trained data was 0.07, with a maximum error of 1.52. This shows that a neural network of the
type shown in figure 49 can effectively model the CIEXYZ to CIELAB transformation. This
should not be surprising since the transformation from CIEXYZ can be generalized to be a
3x3 followed followed by another 3x3. The bias term in this trained
by a nonlinearity case was
to fall on the part of the tanh curve that models a compressive function.
3.2.1 .3
Modeling the constant hue data set
The problem in which one knows the function to use to create the transformation between the
tristimulus values and the uniform color space (e.g. CIEXYZ to CIELAB) can be used to test
whether the training algorithms can accurately approximate the function. The problem that
remains once this is shown is to find the coordinates of the uniform color space that
correspond to the tristimulus values of the constant hue data. Immediately the issue arises of
trying to pick appropriate coordinate values of the uniform color space. We know that the
data that are in the same hue surface should lie on a plane, but the correlates of lightness and
"push"
mechanism could be found for the training algorithm to the data toward a convergent
solution that had good correlates of lightness, chroma and hue. This was not the case for the 3
node hidden layer network. The nature of the neural network is to try to fit the coefficients
(weights) of the model to minimize the error between the coordinates of the input and the
coordinates of the output. But there is no mechanism to tell the output coordinates where
114
they should go to minimize the error.
Additionally, there are no extra degrees of freedom to
use to force the model to account for errors of the output as there would be if the
coordinates,
angle of the hue planes, and used the predicted model to influence the placement of the
coordinates for the next iteration. None of these approaches yielded results. The
interesting
invertable neural network was thus abandoned for use in color space model development.
The more general neural network is not invertable, but is useful in creating a forward
making it 366 points. The neural network consisted of 6 input nodes, three of which were the
input nodes. The network had 1 hidden layer with 10 nodes, and the output layer had 3 nodes.
inverse directions. The lookup table was made using techniques similar to those in reference
62. These techniques are well known in the industry, and can be purchased off the shelf with a
commercial color correction software package. The AE*ab of the LUTs are shown in table 4.
115
LUT type Average AE*ab error Maximum AE*ab error
sampled planes in hue corrected space to CIELAB for display on the CRT monitor.
coordinates of the uniform color space, then to train a simple network to derive the
trying
parameters of the model that fit the transform. The flaw in this approach is that with a simple
coordinates must be very carefully placed in order to be able to model the transform with no
extra degrees of freedom. This means that the body of points that comprise the coordinates in
the uniform color space must all move together in a systematic way to conform to the
116
The alternative approach to
solving the problem (creation of a simple, invertable mapping
altered and the transformed coordinates are visualized in real time, then intuition can be
developed about which parameters make sense to change, and in which directions. This was
the approach that was used in the creation of the uniform hue color space. This is a viable
approach to use to model many color appearance phenomena and attributes, as it gives
immediate feedback, and can be applied to arbitrary sets of transformation data (i.e. data that
has some perceptual characteristic such as constant hue, and corresponding fundamental color
coordinate specification).
Two major types of color model were approximated with the visualization tool, the CIELAB
type model, and the CIELUV type model. These models are fundamentally different m the
calculation of the opponent data, so they must be separated. The CIELAB type model is
multiplication
by the lightness channel.
117
L XD65
M =RxOxPx Y
1
D65
S z 065
_ .
L'=
f{L)
M'=f(M)
S'=f(S)
~P~ "
M'
D = Tx
S'
Elements of the model R,0,P, and T are all 3x3 matrices with specific goals.
They are
separated in the first multiplication for purposes of clarity and to separate functions. The
fundamental cone tristimulus space. Many of these transformations exist, and most are linear
where the opponent step (calculation of luminance and chrominance channels from tristimulus
values) is performed before the nonlinearity, as in Guth's ATD color model63. The R matrix
allows the opponent transformed space to be rotated through 0 degrees through the use of the
matrix in equation 6.
10 0
0 Cos{9) Sin(0)
0 -Sin{9) Cos{9)
Equation 6. Matrix to rotate second and third row elements for the opponent response.
118
This assumes that the lightness correlate is made to be in the first coordinate, and the
chroma type compression (convert the second and third coordinates to polar
coordinates, and
fl=Lal;L>=0
/I=-(-L)<T2;L<0
(trt + 1)^
,2
al + L
Ja^H
f2
a2-L
/3=/2(VM2+52)
the second and third (M and S) coordinates, compressing with a hyperbolic function, then
converting back to M and S by taking C sm(0) and C cos(0) where C is the chroma value after
transforming through the nonlinear function. Figure 51 shows example plots of functions f,
and f2. It should be noted that none of the nonlinear transforms can
actually model CIELAB
nonlinearities, since
they have the bias term. This was deemed as too complex for the simple
119
model, and since RLAB has been successful without the added complexity, it was decided to
keep it out.
0.5
-0.5
Figure 51. Examples of nonlinear functions. Left plot is function fi, with parameters (Jl =
0.45,
0"2 =
0.6. Right plot is t~2 with parameters al =
1, 0"2 ~
1. Note that at the origin, fi has a slope
of 0.
This model is made more specific depending on whether the opponent stage is performed
after or before the nonlinear compression stage. If the sequence of transforms is like Guth's
model, the relevant transform steps are: P O R [nonlinear], thus the tristimulus values can be
pretransformed to cone values, then the opponent stage can be calculated, then the system can
is applied. One could envision further transformmg the output of the nonlinear stage, but no
If the sequence of transforms is like the CIELAB model, the relevant transform steps are: P
120
nonlinear compression is
applied, then the opponent stage is calculated. The opponent, and
The CIELUV type model general form is shown in equation 8. The constants krk5 are
parameters that can be changed to alter the shape of the resultant space. The P matrix has the
same effect as in the CIELAB type model in that it can serve to pretransform the XYZ
tristimulus values into a fundamental cone type space. The nonlinearity can be of the form f,
or fr, but not f3, as the second 2 elements of the transformed tristimulus values don't
L XD65
M = Px Y
1
D65
S 7 D65
_ _
M'=f(M)
*' L
k5-M
k2L + k3-M + kA S
u
*'
ks
v =
T"
K-y l /Ci Ka
M'-(u'-u'n )
u*
=
M'iv'-v'J
v*
=
M'
L* =
121
The CIELUV model is somewhat cumbersome from a
complexity standpoint since it does not
fit cleanly into successive stages of linear and nonlinear transformations. Additionally, it is not
a
very intuitively pleasing model because of the translation type of adaptation (although in
hindsight, this was really not needed, as the adaptation is assumed to have been done previous
to the transformation to the uniform color space coordinate system). The inversion of the
]M'=L*
M = f~\M ')
*
v v'
v'= + "
M'
*
,
u
U YU
M'
M{kl-k5-
c
O
k2k5u'-klk3V)
k\-kAV
k5M
u'
k\V
r
XD65 L
Y
1D65
= P_1X M
7 S
- D65.
viewing of the uniformity of the color space within the gamut of the CRT used. For this to
work, both the forward and the inverse models described in equations 5 and 8 were
122
implemented. Secondly, it allowed visualization of several psychophysical data sets. Various
data sets were used in order to ensure that the creation of a uniform hue color space did not
visualization of both constant hue data sets, quantitative data were calculated. This gave a
more objective
way to compare different prospective spaces. Thirdly, the visualization tool
used a multitude of controls to alter various parameters of the model. The control of
parameters was designed to give intuitive ways to change coefficients, since there is a large
The visualization tool user interface offered three separate 2-dimensional views of color
spaces: Slices of the opponent channels plane at varying lightness levels, slices of the chroma-
lightness plane at varying hue angles, and slices of the hue-lightness cylinder mapped onto a
any parameter value for the entire model, thus one can watch the space warp as a function of
parameter values. Figure 52 shows examples of the three visualization modes of the interface.
The constant lightness plane can be seen concurrently with either the constant hue plane, or
the constant chroma cylinder. A checkbox is used to control which plane is displayed.
Note, on the constant lightness slice, that there is a circle whose center lies at the origin and a
line that projects from the origin at a 45 degree angle. These are indicators that show at which
hue and chroma values the other 2 planes cut through the space. There are slider bars to
control the lightness level, the chroma value, and the hue angle in increments of 1/100 for
123
chroma and lightness, and 1/360 for hue. There is another slider bar that controls the number
of samples used to display the images. In figure 51, it is set to the maximum, which is 100
Figures 53 and 54 show close-ups of the control frames. The constant chroma frame is not
shown in close-up, because it resides in the same space as the constant hue frame. A checkbox
allows the user to toggle between constant hue and constant chroma.
124
Figures 52. Visualization tool user interface showing the three slices of color dimension,
constant lightness, constant hue, and constant chroma.
125
get it straight
CH 1 + and -
gamma
si
1^ 0 _l
s2 43 0 =
|
CH 2 + and -
gamma
S3 10 0 J
s4 43 0 J
~
CH 3 + and -
gamma
55 10 0 J
SB 43 0 =J
o'
type compression
hyperbolic j
I power
C*
_J
Show CRT primaries
* Show HB dataset
_|
Show big dataset
_|
Show Munsell
_|
Show avg hue
_|
Show map vectors
L*
_J_| Color plane
Lightness 43 0
rotate 0 0 I
prexform opponent
-rj WB opponer
both scale 100 Xscale 100 \ri
RL 40 0 100CL 40 0 100BL 20
J =
> 10C
Ra;Yd 250 100 J
j
-
J Q1 J Q3 Q5 J Q7 J Q9 Q11 Q13
_J
J Q15
data_.t
_J
Q2 J Q4 J Q6 J Q8 J Q10 Q12
J J Q14
HungBarns
| LAB
max h deviation, max =
26.68, avg =
12.71
LUV
0 24 45 72 96 120 144 168 192 216 240 264 288 312 336
126
Show color leaves
360
Hue 238A 0 =
max C 71 20 = 100
samples 35 10 100
L*
linearity I
show ellipses )
show CMFs
127
The color planes displays were created by sampling the planes in the resultant color space,
assuming the domain 0;1, -1;1, -1;1. The coordinates that lay on those planes were then
transformed through the inverse model to CIELAB and displayed. Out of gamut colors were
tune the pretransform, some controls affect the nonlinear steps, and some controls can change
1. Pretransform controls. There is a list of hardcoded pretransforms that are XYZ to LMS
linear approximations to cone primaries from several different authors. Additionally, a set
of 2 dimensional slider bars were used to control parameters for this 3x3 matrix.
1.1. The hardcoded transforms are derived from popular cone fundamental references.
Each matrix assumes as input CIEXYZ D65 values that have been normalized by the
white value of D65 (0.9504 1.0 1.0889) such that the input to the matrices had a range
from 0 to 1 for all channels. This was determined to be the most convenient way to
deal with the pretransform matrix (P), so the rows could be constrained to sum to 1
to maintain response.
gray
final choice for the final color space, was derived from the D65 normalized 3x3
128
model was derived from multiplying the D65 diagonal
matrix
by a matrix whose
elements comprised the D65 white data. Shown in matrix form, this is:
For this matrix, all rows summed to 5, so the resultant matrix used was to
simply
-0.5091 1.4098 0.0993 This may not be strictly correct, since the
normalized matrix was not derived from a D65 normalized matrix. However,
strictness was not an issue, as these hardcoded matrices were intended to act as
1.1.3. Guth primaries were found in Wyszecki and Stiles on page 646. The matrix was
treated to have rows that sum to 1 by dividing each row by the sum of the rows.
Again, this may not have been stricdy the correct transformation to represent the
129
data properly, but it was a reasonable transform when visualized. This matrix is:
1.1.4. Vos and Walraven primaries were found in Wyszecki and Stiles on page 615.
This matrix was treated the same way as the Guth matrix. This matrix is:
0.2346 0.8213
1.1.5. Smith and Pokorny primaries were found in Wyszecki and Stiles on page 615 as
1.2. Two dimensional slider bars were implemented to give fine and intuitive control over
the transformation that altered the fundamental LMS space. For each row of the
constrained to sum to 1 to preserve gray response. Each of the slider bars allowed the
changing of one of the LMS tristimulus values (after the 3x3 matrix). Thus, with this
130
"primary"
cone response. Figure 55 shows an example picture of the 2D slider bars.
through the identity matrix (thus the result are CIE color
matching functions), then
red
green
blue
bars''
the pretransformation matrix.
Figure 55. Two dimensional "slider that controlled
131
-if
^ Color Matching functions ^ Color Matching functions
The choice of pretransform matrix has a significant effect over the resultant shape of the color
space. Take, for example, the following senes of figures, together labeled figure 57. Each of
the color
matching functions is shown next to the resultant color space shape. The only
difference in each of the color spaces is the pretransform matrix, which in turn affects the
shape of the CMF and color space. Appendix C shows the for
corresponding parameter set
132
I I
Figure 57. Three examples of CMFs along with their influence on the shape of the
133
2. The O matrix controlled the opponent step for the Guth type model. This type of model
performs the opponent calculation before the nonlinear compression stage. The opponent
transform was calculated using only 2 rows of the matrix, as the third was reserved for the
lightness response. Since, for opponent response, the corresponding rows must sum to 0
to ensure grays lie at the origin, only 2 elements of each row need to be controlled. This
control was offered through the use of 4 slider bars that changed the first 2 elements of
elements are shown in equation 6. One parameter, the angle of rotation, is needed, and
was provided for through the use of a slider bar. Rotation of the domain of the Guth type
rotation steps. The effect on the shape of the color space in this type of model is to warp
hue lines away from the axes and toward the diagonals (axis plus 45 degrees). This is
because the slope of the nonlinearity is greatest at the origin. This effect is visualized in
figure 58. This type of model was not given much consideration because it worked only
with a nonlinear compression stage that had a slope near 1 at the origin (hyperbolic type),
thus the lightness response deviate quite far from CIELAB. Additionally, it was very
difficult to find rotation and opponent parameters that showed improved hue constancy
over current spaces (CIELAB, CIELUV). Note, in the figure, the blue constant hue locus
(for Flung and Berns data set) has a different shape on the left figure than on the right.
134
This is due only to the opponent channels through 45 degrees. The
rotating an angle of
Figure 58. Example hue lines from Guth type model. The first image is rotated at an angle of 0
degrees, the second is at 45 degrees. Note the shape difference of the constant hue loci.
positive and negative parts of the curves respectively. Figure 59 shows an image of the
nonlinearity controls. Figure 51 shows an illustration of how the controls can make
various curve shapes. For the power function, the parameters would be 45 and 60 (both
are divided by 100) and for the hyperbolic function, the parameters would be 10 and 10
(both args are divided by 10 to make a reasonable scale range). The selection range on the
135
interface was 0 to 100.
CH 1 +and -gamma
0
=J
si 38A
s2 43 ,
0 =j
CH 2 + and -gamma
S3 37 ^
0 =J
s4 43 ,
0 =J
CH 3 + and -gamma
0
=J-
S5 BO sas
s6 43 ,
0 =J
o'
type compression
hyperbolic |
power
C*
Figure 59. Xonlmear controls for visualization tool. Six parameters correspond to the positive
and negative parts of each of the three channels. Functions take the form of hyperbolic, power
The opponent channel controls are intended to influence the shape of a CIELAB type
model where the opponent calculation is done subsequent to the nonlinear function. Since
rows of the opponent channels sum to 0 to make grays lie on the origin, only 2 parameters
are necessary. The 3 lightness controls allowed experimentation with how much of the
three channels should be used to create the lightness response off the gray axis.
By forcing
the sum to equal 1, the neutral response can be constrained to be simply a function of the
nonlinearity. Thus, the RLAB response can be exactly mimicked on the neutral axis as
136
long as the
nonlinearity is power function with gamma =
0.43, and rows sum to 1 for the
was the Ebner and Fairchild data set (306 points of constant hue on 15 different hue
surfaces). One could use the interface to include or exclude of the 15 hue surfaces
any
through checkboxes. Of course at least one surface is needed to provide data to the
regression engine. The regression model data needs pairs of triplets to perform the
optimization on. The first set of triplets was made by taking the CIEXYZ input data,
transforming it to the appropriate coordinate system for input. For the pretransform case,
this was simply the CIEXYZ values. For the posttransform case, the data had to be
processed through the pretransform and the nonlinearity. The second set of triplets was
made
by processing the input data through the entire model, then calculating the average
hue angle for each hue surface. The average hue angle then replaced the actual hue angle
for each color, then the data was processed through the inverse model to the appropriate
point. In the pretransform case, the data was transformed through the inverse
posttransform and the inverse nonlinearity. In the posttransform case, the data was used
directly.
Multiple linear regression was applied to the input-output triplet pairs to generate new 3x3
matrices for the model. This technique worked reasonably well if it was applied
only once
or twice, but the 3x3 matrices diverged when the regression was applied successive times.
137
3.2.2.2. 3 Quantitative metricsfor hue uniformity andpsychophysical data set visualisation
Several data sets were investigated in the development of the color space. These data sets were
used to verify that alterations made to the color space did not
adversely affect other attributes
"best"
of the space. This also makes the decision of which parameters are much more
complex, since it is not easy, or even feasible to assign quantitative metrics to all the data, let
alone optimize a space with respect to all attributes. Each of the data sets will be discussed
here along with the intent and the source. Only the constant hue data sets and the chromatic
Fairchild). Quantification of the constant hue data sets took the form of three different
metrics. For the visualization tool, the metric used was the most severe: For each set of
constant hue data, the absolute difference between the minimum and maximum hue angle was
calculated. The other 2 quantifications are holdovers from the calculation method used in the
publication of reference 2. These quantifications found the mean and maximum absolute
difference (in hue angle) from the mean hue angle for each constant hue set. Thus there are 3
Fairchild2
1. The Ebner and data set was used to derive one of the two
uniformity metrics,
and the 306 points were displayed on the interface. The data were originally derived in
normalized
by the white point coordinates (0.9504, 1.0, 1.0889). This data set can be
http://www.cis.rit.edu/people/facultv/fairchild/CAM.html.
138
Berns"
2. The Hung and constant lightness data derive
(CL) set was used to the other
uniformity metric. The 48 points were displayed on the interface, an example of which is
shown in figure 58. The data were copied out of the paper (table III), then normalized by
the white point coordinates for in the The data
use tool. original were derived under D65
surround.
3. Munsell Value 5 data were obtained from the RIT VAX computer. The Munsell data was
"circles"
used to scale the x and and to minimize of the chroma in
y axes, eccentricity the
destination color space. These are the Munsell renotation data that can be found in
Stiles65
Wyszecki and on pages 840-852 as well as in the original reference66. These data are
RLAB67
converted to CIEXYZ D65 using the most recent equations
assuming average
surround, and
hardcopy viewing (D=l, Yn=200 cd/m2, rj=l/2.3). The data were then
normalized
by the white point for use in the tool.
4. Lightness response
along the neutral axis was chosen to be nearly identical to the RLAB
color space, which is very nearly the same response as CIELAB. Lightness response was
L*
plotted as a function of CIELAB in the visualization tool.
5. Lightness of chromatic object color data were used to make the new color space more
accurately represent lightness of colors off the neutral axis. These data were copied from
the
reference68
(Table I). These 36 points were reported in CIELAB L*C*abh space, with
an illuminant near D65. The data were converted to CIEXYZ D65 assuming D65
139
illuminant. These data quantify the Helmholtz-Kohlrausch effect, which states that
chromatic colors appear lighter than neutral colors of the same luminance factor. Since
CIELAB uses only the luminance information to derive the lightness response, it cannot,
aspects of the effect can be reduced, but it was found that this simple model does not
predict the Helmholtz-Kohlrausch dependence on chroma, but can reduce the hue
6. MacAdam Ellipses were calculated for observer PGN's data. These data are
discrimination data that show threshold sensitivity to the dimensions of CIE xyY
chromaticity space. In a
perfectly uniform color space, the ellipses should resolve to
circles, although threshold information is less interesting in a uniform color space than it is
in a color difference space. These data are from Wyszecki and Stiles on page 309. These
data are defined in CIE xyY space under illuminant C. The ellipses are defined using
major and minor axis lengths, center coordinates (x and y, all data were equiluminant), and
rotation in degrees. For each ellipse, 20 samples in CIE xyY of the perimeter were made.
and b are the axes of the ellipse. The point (x'y') can be found by rotating point (x,y)
x'=x-Cos(9) +
ySin(Q);y'=-x-
Sin(6) + y-Cos(6). The resultant points can be
x"
140
y" y'
x'> Oy By 9 degrees
-
origin of the ellipse, samples can be made in the color space. The sampled points are then
the tool.
7. Suprathreshold difference
ellipses69
from (Table
color were copied the reference II, Cross
noticeable, so are
possibly more important to get closer to circular than are the MacAdam
ellipses. These 19 ellipses were sampled in a manner similar to the MacAdam ellipses.
These data were copied from Wyszecki and Stiles (reference 65), pages 870-871. The data
were expressed in CIE xyY (D65), and converted to normalized XYZ for use in the tool.
functions"
9. In addition to data sets, the "color matching of the CIE color
matching
function can be thought of as the response of some cone fundamentals that are a 3x3
matrix linear transform away from the CIE 1931 (2) color
matching functions.
10. The spectral locus is plotted in the color space for reference.
141
3.2.2.3 Properties of the chosen color space (IPT)
The IPT color space is named such that its coordinates have some degree of
relationship to
the meaning of the dimensions. The lightness dimension is denoted as I, which can be loosely
related to the word intensity which provides a clue to it's meaning. The red-green dimension is
"dominated"
denoted as P, which can be related to the fact that it is
by the red response
(protan) and is the dimension lost by protanopes. The yellow-blue dimension is denoted as T,
using the same argument for the tritan response. IPT is also short for Image Processing
with some
engineering constraints in mind. The biggest engineering constraint is that it
should, for the popular CRT gamut, not be discontinuous in hue. That is, it should not, for a
given hue angle, for a CRT type gamut, have in-gamut colors at a larger chroma than out-of-
gamut colors. This situation can happen if the color space is warped appropriately, and it can
happen with a space that accurately models constant perceived hue. It also happens to occur
in the blue region of the color space (I think coincidentally). The following scenario illustrates
the point.
coordinates of Red: 0.630 0.340, Green: 0.310 0.595, and Blue: 0.155 0.070 with a white point
142
of D65. Figure 60 shows a line connecting the blue primary to a point on the neutral axis, lets
gradient
sweep from device blue to a neutral gray, or to white, it will be immediately apparent
hue. Here, the Hung and Berns loci are used because they are simpler to visualize.
0.10.20.30.40.50.60.7
Figure 60. A line showing the path of a gradient sweep from gray to the blue primary (in CIE
xyY space).
chromaticity
CRT description in a uniform hue color space is illustrated in figure 62. This
in the gamut
143
m
100
N '
'
/ /
W /
0
Vv
100 Vv
-100 100
b*
Figure 61. Path sweep from gray blue primary in CIELAB Notice
a*
of a gradient to space.
that the mid chroma colors are more purple than the constant hue locus. Dotted lines are Hung
and Berns constant hue loci.
Note the prominent leftward curvature of this path. Note also, that it accurately describes the
perception that the path is more purple in the mid-chroma region, whereas if one would
believe that CIELAB was uniform, it would appear that the CRT primary actually becomes
bluer in the mid-chroma region. From Figure 62, it is not hard to imagine a case where the
curvature of the primary blue signal would become so extreme in certain color spaces, that, at
a line projected from the origin toward a direction near the primary blue signal, the colors
could go out of gamut, then come back into gamut as chroma is increased. This is, in fact,
exactly what happens when a color space is designed that exactly straightens the Hung and
144
angle, it was decided that the color space should be explicitly designed to not exhibit the
artifact.
0.5
-0.5
Thus, the IPT color space does not exhibit hue discontinuity at specific metric hue angles. It
does this at the expense of getting the Hung and Berns constant blue locus exacdy straight.
However, as will be discussed later, it was judged more uniform than either of the constant
Other design criteria were that the color space should be as simple as possible and still do a
good job. The final model that was decided upon is extremely simple. It is also
very amenable
145
which supports conversion
using cascaded sequences of 3x3 matrices and one dimensional
non linear steps. The decision to use a power function, as in RLAB, simplifies the model
when used in a high accuracy scenario (data not cached into quantized
lookup tables), as it
All the following visualizations and quantifications are shown with respect to CIELAB color
space. This is because CIELAB is the most popular gamut mapping color space, and the
order of
complexity between the two spaces is nearly equivalent.
The IPT forward model is shown in equation 9. Equation 10 shows the inverse model.
L'=L043;L>=0
L'=-(-L)043;L<0
M'=M0A2;M >=0
M'=-(-M)043;M <0
S'=SOAi;S>=0
S'=-(-S)OA3;S<0
L'
0.4000 0.4000 0.2000
M'
4.4550 -4.8510 0.3960
5'
0.8056 0.3572 -1.1628
146
L 1.8501 -1.1383 0.2385
M'
=
0.3668 0.6439 -0.0107
L =
L'23256;L'>=0
L =
-(-L')23256;L'<0
M =M'23256;M'>=0
M=-(-M')23256;M'<0
S =
S'23256;S'>=0
S =
-(-S')23256;S'<0
D65
1.0 -0.1139 0.1332 M
functions (xbar, ybar, zbar) transformed through the first 3x3 matrix of the model as a
function of wavelength. These can be thought of as the fundamental signals that are then
function. This function has more responsivity at the left tail, and less responsivity at the peak
than does the Hunt-Pointer-Estevez function. Note that the resultant yellow-blue response is
147
near the wavelength of the peak of the blue response. This results might be evidence that
very
the functions for transformation to the primary cone response might be improved using an
alteration of this nature, as the conversion to a (metric) uniform color space is likely to mimic
The alteration was made to the pretransform matrix because it noticeably straightened the
constant hue loci for the blue region in both constant hue data sets. For example, the Hung
and Berns absolute maximum deviation metric for the Hunt primaries (all else
remaining
constant in the model) was 10.3, while the metric was 7.9 for the IPT color space (altered
primaries). Although this may not seem like a large difference from the metric, the visual
stimulus clearly contained purple in mid-chroma with max chroma at the blue primary, while
148
HuntL
Hunt M
Hunt S
IPT L
IPTM
IPTS
co^-^-'^-'^-LnLnLncDCDCDi^t^
wavelength (nm)
Figure 63. Color matching functions of the IPT color space compared to the Hunt-Pomter-
Estevez primaries. Note the larger response on the left tail of the M (green) response function.
Figure 64 shows an overlay of both constant hue data sets. The left image shows constant hue
data sets in IPT, and the right image shows the surfaces in CIELAB. Hung and Berns
constant lightness data is shown as bold dotted lines. The IPT plot appears to have different
scale factors for the blue (lower left) Hung and Berns line vs. the Ebner and Fairchild surfaces.
This is an artifact of the the sampling was done in CIELAB space for the constant hue
way
experiment. Notice on the CIELAB plot, that the Hung and Berns line for blue (the most
farther Ebner
quadrant) lies over, but
curved line in the lower nght extends much than the and
Fairchild surface at the same location. These colors were initially sampled in a more cyan
position than the device blue, so were limited by the gamut of the CRT display.
149
1 100
I
V
> -. i
il i \
*
50 V N /
V /
^ 1 /
Il I/ y 1
f
> 50
AT *
xv w>y
\A
^-^^^A ==^s="
p** -
T*100 0 0
b*
J/, *
-50
1
-50
1
* X
s
/ Ik
-100
-50 0 50
-100 -50 0 50
P*100
(CL) a*
Figure 64. Constant perceived data sets plotted in IPT (left) and CIELAB (right).
Berns53
Constant Lightness (CL) data is in dotted lines. IPT is
Hung and set
Notice that, in the blue region, the hue loci are straighter in the IPT space than in CIELAB.
Furthermore, the hue loci in the rest of the space appear to be as straight in IPT as in
CIELAB. Quantification is shown for CIELAB, CIECAM97s (just because it was already
done, so I didn't remove it), and IPT for the mean distance from the mean hue and the
maximum absolute distance from the mean hue in figure 65. Clearly, the mean distance from
the mean hue is smaller over the entire space than is either CIELAB, or CIECAM97s. The
maximum absolute deviation from the mean hue angle is significantly smaller in IPT than in
either CIECAM97s or CIELAB in the blue region. There are 4 reference hue angles where
the IPT color space has a larger maximum deviation from the mean hue angle. These
150
red). The absolute magnitude of hue nonuniformity is quite small for 144 and 168, and while
the magnitude of the metric is larger at 312 and 336, hue nonuniformity was not observed.
Interestingly, these two reference hue angles had the lowest average confidence limits (see
figure 35). As will be seen later, IPT was not judged less uniform than either constant hue data
set at
any of the reference hue angles.
Figure 65. Quantification of hue nonuniformity for CIELAB, CIECAM97s, and IPT.
The visualization tool used a different metric to calculate hue uniformity. It used the most
calculation was done both on Ebner and Fairchild's data set (as in figure 65), and on
Hung and
151
Hung and Berns max. deviation Ebner and Fairchild max. deviation
25-
20-
DIPT a ipt
5 15-
CIB_AB_EQ
a CIB-ABEo|
10-
J 1
TfrhKHh^li TT ^fcffilfflii
V <0 (M IO O
O
R RY Y VG G GC C CB B 8M M MR
hue
Berns'
especially with
Flung and (CL) data.
Munsell renotation data is plotted to show that the chroma response of the space is not
terribly different than that of CIELAB. No quantification has been done of the eccentricities
of the constant Munsell chroma loci. Figure 67 shows the Munsell value 5 data plotted in IPT
space on the left and in CIELAB on the right. The CIELAB plot appears to be slightly more
uniform in the low chroma circles, but neither appears to be perfect, and neither appears to be
need to be careful to only use the non-extrapolated data to tune the chroma response of the
space. Other chroma data sets would be interesting to find as well to corroborate the Munsell
data base.
152
1
150
0 .75
100
i.i=V>Vin::::
:
'25-
-1-0.75-
O'.'&Q :.%$--l'<l-
0 . 5 0.75 1 -150 ~iqcr -.5.&.;-.n
?$?:; '.'loo 150
-
0 , 5 [
-100
0.75
-150
-1
L*
lightness response. CIELAB is known to be very representative of the lightness response
L*
of humans under average surround conditions. The CIELAB function was mapped as
which is 1/(2.3), or 0.43478. If desired, for coherence, the exponent could be changed to
1/(2.3) with
virtually no effect on the response of the color space. Figure 68 shows the IPT
L*
lightness response as a function of CIELAB Note the major difference in the response is
at the dark end of the curve. The difference is due to the fact that the slope of a power
153
L*
.CIELAB
-Ix 100
OTj-OOCMCM^-OOCD'^^t
OOT;'!,'oj<NCOCO'd:
i-
CM CT IO CO I"- OS
L*
CIELAB
L*
Figure 68. IPT lightness correlate as a function of CIELAB
The chromatic lightness response is modeled on data derived from Fairchild and Pirrotta. The
Wyszecki data set referenced in that paper was not investigated, but might be used in further
refinements of the model. The observed lightness response of chromatic colors was measured
by having observers make lightness matches between chromatic colors and neutral colors. The
chromatic lightness response is a measure of how well the color space predicts the lightness of
chromatic colors. To show this, a comparison is done between the lightness response (in the
color space of
interest) of the chromatic colors to the lightness response of the neutral colors
that were judged as matches. Thus, in a color space that had perfect chromatic lightness
response, the metric lightness of chromatic and neutral colors (which were judged by
observers as
having equal
lightness) would be the same. This is best explained
along with an
L*
illustration. In figure 69 the CIELAB of a neutral color that matched a chromatic stimulus
154
is plotted as a function of the CIELAB L*
of the chromatic stimulus. If CIELAB had a
perfect chromatic lightness response, the data would fall on a straight line with a slope of 1.
L*
Neutral
L*
CIELAB
30 40 50 60 70 80 90
L"
Figure 69. Chromatic lightness response of CIELAB. x axis is of chromatic stimulus, y axis is
L*
of neutral stimulus that matched in lightness (mean observer judgement). RMS error between
L*
observed lightness and of chromatic color is 7.12.
L*
The RMS error between the observed lightness of
gray stimulus and the CIELAB of
L*
Neutral
predicted
30 40 50 60 70 80 90
L*ew= L'
Figure 70. CIELAB matching L* response as a function of predicted + 0.098 C\b.
155
L*
If a linear regression is applied to the model L*new= + k C*ab, the constant k is found to be
0.098. The plot of this simple prediction function is shown in figure 70. Note that the match
points are closer to the straight line. The RMS error for this simple prediction is 5.97. By
adding enhancements to the predicted Lightness model, the RMS error can be reduced to 4.2.
The form of this most elaborate model is denoted L**, and is a function of lightness, chroma,
and hue of the chromatic color. Equation 1 1 shows the entire lightness prediction model and
its parameters.
L** =
L*+f2(T*)-f\(h)C
.. (h -90
f\(h ) -
0.116 Sin + 0.085
2.5-0.025- T
/2(L*) =
Equation 12. Final lightness predictor for chromatic colors from Fairchild and Pirrotta68. RMS
error between observed lightness and L**(L*,C*ab,h) is 4.2.
156
The same data is shown plotted in IPT in figure 71. Although the RMS error is
only slighdy
smaller than the RMS error in CIELAB space, notice that the layout of the points falls along a
path that is straighter than the CIELAB data points. This allows the creation of a more simple
hue.
Neutral I
0.9
0.8
0.7
0.6
0.5
0.4 m*
0.3
IPT I
0.3 0.4 0.5 0.6 0.7 0.8 0.9
L* L*
Figure 71. Chromatic lightness response of IPT. x axis is of chromatic stimulus, y axis is
of neutral stimulus that matched in lightness (mean observer judgement). ). RMS error between
L*
observed lightness and of chromatic color is 6.73.
157
model Inew= I + k done find k
Using the C*IPT, a regression was to the value of =
0.202. The
that is a function of L*,C*ab, and h (RMS error was 4.2). The equivalent of
2(L*) was used to
enhance the prediction, which is shown in its full form in equation 12.
C*IPT=Jp2+T2
Equation 13. Final lightness predictor for IPT color space for chromatic colors from Fairchild
and Pirrotta68. RMS error between observed lightness and predicted is 3.5.
Figure 72 shows the plot of the final model of a lightness predictor for chromatic colors in
IPT. Note that the RMS error is lower, and the model is simpler because it has no hue angle
dependency.
Neutral I
predicted
0.3 0.40.5 0.60.7 0.80.9
Figure 72. Plot of final lightness predictor for IPT color space for chromatic colors. RMS i
between observed lightness and predicted is 3.5.
158
3.2.2.3.8 MacAdam (PGN) Discrimination Ellipses and Spectral Locus
Although the intention of a uniform color space is to accurately model appearance attributes
mapped in the color space. Therefore, both discrimination (at or near color difference
threshold)and suprathreshold color difference data are investigated. Figure 73 shows the
MacAdam ellipses plotted on IPT space (left), and CIELAB space (right). Along with the
MacAdam ellipses, the spectral locus, monochromatic equienergy signals, are plotted
300
0 200 300
-3001
locus.
IPT for the blue region, models the ellipses with less
Note that the color space, especially
difference between major and minor axes. The spectral locus for both of the color spaces
159
approaches chroma of 0 as the stimuli reach the edge of the spectral sensitivity of the color
matching functions.
Supra threshold color difference ellipses are shown plotted in IPT and CIELAB in figure 74.
These color ellipses model a color difference on the order of 1 delta E*ab unit. Because the
unit difference of the reference was near gray, the color difference was
definitely above
threshold, but is still considered a small color difference. Notice that these plots exhibit a
similar behavior in the blue region as did the MacAdam ellipses. Most notably, the eccentricity
75
0.4
50
i"\
0.2 i
25
*?0
0 o
^ i*\ or . i 4*$$
'
"O (n **
-0.4
-i?.-2>-S 0.2 0.4 75 25 50 75
...*/
*
-5'O-'--^0
-^
?'
.
--C.
: jiq:2
-50
-0.4
-75
160
3.2.2.3. 10 OSA Color System (L =
lightness plane
medium gray 0) constant
Constant lightness plane from the OSA color system is shown in figure 75. Data are plotted in
IPT on the left and CIELAB on the right. This data set is included in addition to Munsell V5
? ?
* *
? i
? ? *
1-
-#
- -
? '-! -?-
1*
-0 0 5
*
Figure 75. OSA color system medium gray constant lightness plane plotted in IPT (left) and
CIELAB (right).
Note that in CIELAB, the vertical loci data are curved dramatically, whereas in IPT, though
the data are not exactly evenly spaced, the uniformity is much improved. It appears that the
curvature is slighdy reversed in IPT with respect to CIELAB. This may suggest that the color
point to a fundamental difference in perception between object colors and self luminous (CRT
displayed) colors.
161
3.3 Experiments E2. Verification Experiments
The constant hue data sets and the IPT color space were tested to measure the difference in
uniformity when entire hue leaves were viewed. This task is fundamentally different from the
task that was performed in deriving either of the constant hue data sets in that there was a
"same"
large amount of color context (the entire CRT gamut at the hue) to judge uniformity
within. Both experiments A and B were designed to be identical in procedure. The only
difference between the experiments was that the first one compared CIELAB, hue correction
based on
Hung and Berns, and hue correction based on Ebner and Fairchild, and the second
experiment compared the IPT color space to the 2 constant hue data sets.
3.3.1.1 Introduction
A paired-comparison experiment was conducted to test the hue uniformity difference between
two uniform perceived hue data sets and CIELAB color space. A colorimetrically
characterized and calibrated CRT display was used to present the stimuli.
Viewing conditions
were set to dark surround, near D65 white point, with a maximum luminance of 70cd/m2.
Berns' Braun70
table
lookup scheme that enabled forward and inverse transformation between CIELAB color
transformation between CIELAB color space and the respective uniform hue data space.
162
3.3.1 .2
Experimental
spaces, then transformed to CIELAB for display on the calibrated CRT display. Out of gamut
colors were converted to CIELAB coordinates 50,0,0. For each hue angle sampled, the three
color spaces were compared. For each hue angle, three presentations were required to
compare each color space to the other (N*(N-l)/2). For each observation, two repeats of the
data set were presented to the user to account for screen non-uniformity. Each observation
session required 90 judgements (15 hues X 3 Images X 2 repeats). The list of pairs of hue
comparisons was
pseudo-randomly shuffled for each observation session. The user interface
io.5">|r
JVhite^^. White
S
Black^^
Gray out of gamut
Blacjtx^
region
C left ) ( riant ]
( back j
Gray background
( uuil ) <-13.5"
163
The task was as follows:
'You will be shown pairs of images. For each pair of images shown, pick the image that has the best hue
uniformity. The maximum chroma color for each image is the same color. Compare the colors in each image
"
Only same base hue images were compared to each other, never were a mix of hues presented
to the observer.
Thirty observations of the entire data set were made. Nine observers took part in the
experiment. All observers had experience with color, and were familiar with the terminology,
judgement71
and the concept of hue uniformity.
Using Thurstone's law of comparative (case
V), fifteen interval scales were derived, one for each hue sampled, that compared relative hue
uniformity of the three color spaces. Confidence limits were calculated based on the number
of observations of the entire data set. The confidence limits were calculated
using the
1.96-V2
following equation: j=
,
where N =
30, the number of observations (not number of
observers). This formula is based on the assumption that each scale value is a mean of a
normal distribution where the units of the scale are V2 times the standard deviation of the
distribution. Thus, a 95% confidence interval is calculated from 1.96 times the standard
as is done in Thurstone's case V. This seems inconsistent with the calculation of the 95%
164
3.3.1.3 Results
Figure 77 shows the results from the experiment. Higher bars mean that the color space was
1.5
0.5
]J- MJ 4U -17 :
=e_ I io 'r4 T51 \i)g
1
>
-0.5
llCIELAB
0) H&B
-1
u QE&F
V)
-1.5
-2
-2.5
-3
-3.5
Figure 77. Scale values for judged uniformity between CIELAB and the 2 constant hue data
sets. H&B are
Hung and Berns data, E&F are Ebner and Fairchild's data.
The most prominent feature of this figure is the significant difference in judged uniformity at
the 264 and 288 degrees reference hue angles (these represent the blue region).
Flung and
Berns and Ebner and Fairchild data sets were judged to be not
significantly different in the
blue colors. Most of the other reference hue angles seem to have very close scale values,
165
which leads one to believe that they are equally uniform at those hue angles. There is an
the reference hues 24, 48, and 72, where the Ebner and Fairchild
interesting trend observed at
hue surfaces are judged to be less uniform than both CIELAB and the Hung and Berns hue
correction. This trend was also observed in the gamut mapping experiment conducted
by
Braun70, where the red color of the parrots head was mapped to a color that was too magenta
when
using correction based on either constant hue data set. The results from Braun's
experiment led him to make a new hue correction LUT that only affected the blue region of
CIELAB space. This solved an immediate problem, but is much less general than a functional
color space approach to the constant hue problem. The root of the judged non-uniformity of
is not well understood. While the Hung and Berns results are not, from this result, judged to
be statistically less uniform, the shape of the E&F curve (at 48 degrees) and the shape of the
red stimulus of H&B have the same trend (see figure 48, left plot ). This bend in both loci of
the red stimulus may be more apparent in spot colors, and there may be preference influences.
3.3.2 Comparing IPT color space to Hung <& Berns and Ebner data sets
The experiment described in E2, part A was repeated
using hue planes from the new IPT
color space in place of the CIELAB color space. The identical user interface, script, data
preparation, and data analysis was performed. The number of observers for this experiment
was 10, and 30 observations were made over the entire data set. Confidence limits were
166
same (since the number of observations was the same). Figure 78 shows the results for the
mean
uniformity judgements.
Figure 78. Scale values for judged uniformity between IPT and the 2 constant hue data sets.
Fairchild's data.
H&B are Hung and Berns data, E&F are Ebner and
The first characteristic to notice about this plot is that the extent of the scale values is of the
for IPT color space are greater than either of the constant hue
that the scale values uniformly
significandy different
statistically).
data set scale values (although not all
167
Clearly, the IPT color space is judged either more uniform (when the confidence limits are
beyond the mean of the other two space, such as with reference hue 48), or no less uniform
(e.g. reference hue 144) than the constant hue data sets. Experiment E2 A. has shown that for
the reference hue of 48 degrees, CIELAB was judged more uniform than either of the
constant hue data sets. The magnitude of uniformity difference between CIELAB and the
next judgement was about 0.5 units, which is contrasted to roughly 0.75 units in figure 78 (for
reference hue angle 48). From this we can conclude that the IPT space is roughly as uniform
as CIELAB at the reference hue of 48 degrees, and more uniform than CIELAB everywhere
else. Additionally, the IPT color space is judged more uniform than the constant hue data sets
The last statement is a bit confusing. That a derived color space is more uniform than data
sets that were derived to be perceptually uniform seems to be wrong. But one must consider
that in making lookup tables that model the constant hue color space, and transform between
CIELAB and the constant hue data, some error will be introduced. Even when the data used
to create the transformations are very accurately modeled in the LUT transformations (which
it is in both cases), there will be uncertainty in accuracy of the transformation in other parts of
the space. It seems that it is always better to have an analytical description of a transformation
than to use lookup table based approaches because of the above issues (noise and accuracy
between sample points). Additionally, perceptually derived data bases are always calculated as
a central
tendency of a dispersion of individual results. The average response should reflect
the nature of the tendency, but there are many ways to calculate a central tendency, none of
168
which will be
"accurate"
all the time (accurate is in quotes because its judgement relies on
individuars responses which are not the same as the mean in almost all cases).
169
4 Conclusions
This work began with an in depth literature search that describes aspects of color such as color
order systems, color harmony, color names, color meaning, and preferred color reproduction
of pictorial images. From the literature search, a set of experiments was defined that sought to
were conducted that sought to determine the influence of image context (in business graphics
business graphics.
applications. A paper
describing this and the gamut
mapping experiment was published in
found 306 points that spanned the gamut of a CRT display that found 15 surfaces of constant
170
Based on the above experimental results, a general tool for visualization of color space
transformations in real time was developed. The techniques used for visualization and the
A new model of color space named IPT was formulated from the use of the visualization tool.
Many popular color appearance data sets were used in the development of the new color
space. The IPT color space is a simple transformation from CIEXYZ D65, average surround
conditions (reference viewing conditions). The model consists of a 3x3 matrix transform, a
nonlinear power function, and another 3x3 matrix transform. This model is a generalization of
the CIELAB color space that takes advantage of a transformation to cone fundamental
tristimulus space before signal compression, then a linear transform to create the opponent
The constant hue data sets were judged against the CIELAB color space, and the IPT color
space in 2 subsequent experiments. The CIELAB color space was judged much worse in blue
superior in hue uniformity to both constant hue data sets at all hue angles tested.
has many appearance attributes that are superior to the CIELAB color space
IPT color space
171
There are several areas where the IPT color space can be utilized. The color space is both easy
to implement for Image Processing Transformations (IPT's), and is a much better predictor of
color appearance attributes than is CIELAB, and it is much more uniform in hue than any of
5 Future Work
Additional work needs to be done in two areas. Firstly, further verification must be done to
test whether the IPT color space is the preferred color space to use for gamut
mapping
well as psychometric uniformity. The second area of work is involved with refinement of the
Specifically, the relationship of rate of change of perceived hue as a function of hue angle
should be tested to give better data with which to model hue uniformity. These goals are
172
6 Appendices
orange
green
Dyellow
Oviolet
Dblue
gray
Dyel green
Dpurple
red
Dcyan
Figure 79. Mean scale values for colors and descriptive terms for abstract representative image
type.
173
1 i 1 .
=
i r^-^ (t
g ^^=S==;
1-| _^
1
'
1 "
UXUUTIUIVU i i
J- "
1 1 1
^^^==
1 1
1
i
' 1
1 h
=^ 1
H
1
l~j 3 1 1
1 i r i
cornplicatecP
BBBBBBBJ 1
=t
'
1
~"
[
1 ] 1
OBBBSHS9H 1
B orange
B green
' 1
1 1
1= ? yellow
' B violet
|
H ? blue
1 1
' Bgray
|
1 ? yel green
1 1 ? purple
,_
' Bred
t :
i
1
| 1
? cyan
1 i : | i
BBIB1BBJW 1
MU^ 1
1 i
^g
1-
|l'
1 1
1
==i 1
1 '
""
1 1=
1
t
1 |
1
1
1 1 1
!
Figure 80. Mean scale values for colors and descriptive terms for data representative image
type.
174
6.2 Appendix B: Data from experiment P2.
X
y Yfcd/m2)
D65 white point .313 .329 1
Luminance of Halon under .320 .333 90.6
source
on
on
175
Table 6. Colors chosen for gamut mapping color match.
L* b*
h degrees
a*
176
6.3 Appendix C: Three parameter files corresponding to Figure 57
"premat"
Notice that for each of the parameter files, only the second row of the entry named
premat
postmat
nlinvals
Ulargs
2 110 4
premat
oppmat
rotmat
177
0.000000 1.000000 0.000000
0.000000 -0.000000 1.000000
postmat
nlinvals
premat
nlinvals
178
7 References
1
Ebner, F., and
Fairchild, M, Gamut Mapping From Below: Finding minimum perceptual distances for colors outside the
gamut volume, Color Research and
Application, VOL, NO, PP. (1997)
2
Ebner, R, and Fairchild, M, Finding constant hue surfaces in color space, Proceedings o/SPIE, Color Imaging: Device-Independent
Color, Color Hardcopj, and Graphic Arts III, 3300, 107-117, (1998)
3
Munsell, A. FL, A Color Notation, Munsell Color Co. 12th ed. 1971
4
Munsell, A. H., A Grammar of Color, Van Nostrand Reinhold Co., 1969
5
Birren, F., Application of the Ostwald Color System to the Design of Consumer Goods,/. Opt. Soc. Am. 34 No. 7 396-399
(1944)
6
Zeishold, H., Philosophy of the Ostwald Color System,/. Opt. Soc. Am., 34, No. 7, 355-360 (1944)
7
Foss, C. and Nickerson, D., Analysis of the Ostwald Color System,/. Opt. Soc. Am., 34, No. 7,361-381
(1944)
8
Stevens, J.C., and Stevens, S.S., Brightness Function: Effects of
Adaptation, ,/. Opt. Soc. Am., 53, No. 3,375-385 (1963)
5
Hard, A. and Sivik, L., NCS-Natural Color System: A Swedish Standard for Color Notation, Color Res. and Appl 6 No. 3
(1981)
10
Hard, A. and Sivik, L., Some Reflections on
Studying Colour Combinations, Color Res. and Appl, 19, No. 4, (1994)
11
Hard, A. Sivik, L., Outiines
and of a
Theory of Colors in Combination, Man-Environment Systems, 9:217-228, (1979)
12
Hard, A. and Sivik, L., Distinctness of Border: An Alternative Concept for a Uniform Color Space, Color Res. and Appl, 11,
No. 2, (1986)
13
Moon, P. and Spencer, D., Geometric Formulation of Classical Color Harmony, ,/. Opt. Soc. Am., 34, No. 1, 46-59 ,(1944)
14
Moon, P. and Spencer, D., Area in Color Harmony, / Opt. Soc. Am., 34, No. 2, 93-103, (1944)
15
Pope, A., Notes on the Problem of Color Harmony and the
Geometry of Color Space,/ Opt. Soc. Am., 34, No. 12, 759-765,
(1944)
16
Moon, P. and Spencer, D., Aesthetic Measure Applied to Color Harmony, / Opt. Soc. Am., 34, No. 4, 234-242, (1944)
17
Granger, G.W., Area Balance in Color Harmony: An Experimental Study, Science, YI, 59-61, Jan. (1953)
18
Nayatani, Y. et. al., An Appraisal of Two-Color
Harmony by Paired Comparison Method, Acta Chromatica, 2, No. 1 (1969)
19
Berlin and Kay, Basic Color Terms, U. of California Press, (1969)
20
Kelly, K. and Judd, B., The ISCC-NBS Method of
Designating Colors and A Dictionary of Color Names, National Bureau of
Standards Circular 553, (1955)
21
Birren, F., Color and Human Response, Van Nostrand Reinhold, (1978)
22 Studies of Color
Sivik, L., Meaning, Man-Environment Systems, 5, no. 3, 155-160, (1975)
23
Taft, C, The Stability and Variability of Color Meaning Associations Across Time and Cultures, Colour 89, Proceedings of the 6th
AIC Congress, (1989)
179
26
Sivik, L., Color Meaning and perceptual color dimensions: A study of Exterior Colors, Goteborg Psychological Reports, 4, No. 1 1,
(1974)
27
Kobayashi, S., The Aim and Method of the Color Image Scale, Color Research and Application, 6, No.2, (1981)
28
Buck, G.B., and Froelich, H.C., Color Characteristics of Human Complexions, Illuminating Engineering, Jan (1948)
29
Bartleson, C.J., Memory Colors of Familiar Objects, Journal of Opt. Soc. ofAmerica, 50, No. 1, (1960)
30
Newhall, S.M., Burnham, R.W., and ClarkJ.R, Comparison of Successive with Simultaneous Color Matching, Journal of Opt.
Soc. of America, 47, No. 1, (1957)
31
Bartleson, C.J., Some Observations on the Reproduction of Flesh Colors, Photographic Science and Engineering, 3, No. 3, (1959)
32 Green-Grass
Bartleson, C.J. and Bray, C.P., On the Preferred Reproduction of Flesh, Blue-Sky, and Colors, Photographic
Science and Engineering, 6, No. 1, (1962)
http://www.icsd6.tj.chiba-u.ac.jp/ICSD6/papers/asada/skin.html, (1994)
35 Natural Scenes Transformed in
Fedorovskaya, E., Blommaert, F., de Ridder, H., Perceptual Quality of Color Images of
CIELUV Color Space, Proceedingsfrom the First IS&T and SID 's Color Imaging Conference, (1 993)
36
Jones, L., Psychophysics and Photography, Journal of Opt. Soc. ofAmerica, 34, No. 2, (1944)
37
MacAdam, D.L., Quality of Color Reproduction, Journal of the SMPTE, 56, (1951)
38
Roufs, J.A.J Perceptual Image Quality: Concept and Measurement, Philips J. Res., 47, (1992)
.,
39
RS. Gentile, E. Walowit, and J.P. Allebach, A Comparison of Techniques for Color Gamut Mismatch Compensation, /. of
y,16, No. 4, Oct. 1990
40 M.C. Stone, W.B. Cowan, J.C. Beatty, Color Gamut Mapping and the Printing of Digital Color Images, ACM Transactions on
Graphics, 7, No. 4, Oct. 1988
41 J. Meyer, B. Barth, Color Gamut Matching for Hard Copy, SID 89 Digestpp 86-89, 1989
42
Osgood, Suci, and Tannenbaum, The Measurement of Meaning, University of Illinois Press, 1978
43
Ibid, page 73. Relative importance of sematic space dimensions.
44
Same as ref. 24, page 10.
45
Stone and J. Beatty, Color Gamut Mapping and the Printing of Digital Color Images, ACM Transactions on Graphics, 7, No. 4,
249-292 (1988).
46
W. Wallace and M. Stone, Gamut Computer Generated Imagery, SPIE Vol 1460 Image Reproduction
Mapping Handling and
50
Montag, E.D. and Fairchild, M.D., Simulated Color Gamut Mapping Using Simple Rendered Images, Proc. SPIE 2658, San
Jose, 316-325(1996).
180
51 N. Katoh and M. Ito, Gamut Mapping for Computer Generated Images (II), IS&T and SID's 4th Color Imaging Conference:
ColorScience, Systems, and Applications,
126-129, (1996).
52 William H. Press, Saul A.Teukolsky, William T. Vetterling and Brian P. Numerical Recipes in
Flannery, C, Second Edition,
Cambridge University Press, ISBN 0-521-43108-5, pp.408-412, 1992.
53 A P. Hung and R.
Berns, Determination of Constant Hue Loci for CRT Gamut Their Predictions Color
a and
Using
Appearance Spaces, Color Res. Appl, 20, No. 5, 285-295, (1995).
54
Newhall, Nickerson, Judd, Final Report of the O.S.A. Subcommittee on the Spacing of the Munsell Colors, ,/. Opt. Soc. Am.,
33, No. 7,385-418(1943).
55 RW.G. Hunt, Measuring Color, Second Edition, Ellis Horwood, ISBN 0-13-567686-X, pp.142, (1991).
56 R W .G.
Hunt, "Colour Terminology", Col. Res. <&Appl, 3, No. 2, pp. 79-87, 1978.
57
B. Berlin and P. Kay, "Basic Color Terms", pp. 17-23, University of California Press, Berkeley, 1969.
58 R. S. Berns, R J. Motta, and M. E. Gorzynski, "CRT colorimetry, part I: theory and practice", Col Res. e>Appl., 18, pp. 299-
314, 1993.
59
Timothy Masters, Practical Neural Network Recipes in C++, 1993 Academic Press
60
Timothy Masters, Signal and Image Processing with Neural Networks, 1994 Wiley
61
Timothy Masters, Advanced Algorithms for Neural Networks: A C++ Sourcebook, 1995 Wiley
62 R Rolleston, "Using Shepard's Interpolation to Build Color Transformation Tables", IS&T and SID's 2nd
Color Imaging
Conference: Color Science, Systems and Applications pp. 74-77 (1994).
,
83 S. L. "Model for
Guth, color vision and light adaptation", J. Opt. Soc. Am. A, 8, No. 6, pp. 976-993 (1991)
64 Color- Appearance
M.D. Fairchild, RS.
Berns,
and "Image Specification Through Extension of CIELAB", Col Res. &AppL,
18, No. 3, pp. 178-189, 1993.
65 G. Wyszecki and W.S. Stiles, "Color Science: Concepts and Methods, Quantitative Data and Formulai", Second Edition,
Wiley Press, ISBN 0-471-02106-7, (1982).
66 S.M. OSA. Subcommittee
Newhall, D. Nickerson, D.B.Judd, "Final Report of the on the
Spacing of the Munsell Colors",
/ Opt. Soc. Am., 33, No 7, pp. 385-418, (1943).
<"
M.D. Fairchild, "Refinement of the RLAB Color Space", Col. Res. &Appl, 21, No. 5, pp. 338-346, 1996.
70 Personal communication with Gus Braun, and an abstract submission by Gus Braun, the author, and Mark Fairchild for
71 G.A.Gescheider, "Psychophysics, Method, Theory, and Application", Second Edition, Lawrence Erlbaum Associates, ISBN
0-89859-375-1, pp. 152 (1985).
181