Image Processing
Image Processing
P. Gilmore
John F.
Artificial Intelligence Branch
Artificial Intelligence Branch
Georgia
Georgia Tech
Tech Research
Research Institute
Institute
Atlanta,
Atlanta, Georgia 30332
Abstract
Image processing technology concentrates on the the development of data extraction
techniques applied toward the statistical classification of visual imagery. imagery. in classical
In classical
image processing systems,
systems , an image
image is
is [1]
[1] preprocessed
preprocessed toto remove
remove noise,
noise, [2]
[2] segmented to
produce close object boundaries, [3] [3] analyzed
analyzed to
to extract
extract aa representative feature vector,
vector,
and [4]
[4] compared
compared to
to ideal
ideal object feature vectors by by aa classifier
classifier to
to determine
determine the
the nearest
object classification and its its associated confidence level. This type of of processing
attempts to formulate aa two-dimensional
two -dimensional interpretation
interpretation of three-dimensional
of three -dimensional scenes using
scenes using
local statistical analysis, an entirely numerical process. process. Symbolic information
information dealing
relationships, object attributes, and physical constraints is
with contextual relationships, is ignored
ignored inin
such an approach. This paper describes a number of artificial intelligence techniques
which allow symbolic information to to be exploited inin conjunction
conjunction with
with numerical
numerical data to
improve object classification performance.
improve performance.
Introduction
Image processing isis aa technology
technology long
long entrenched
entrenched in
in the
the development
development of statistical
vision systems. Applications in
in the
the areas
areas of
of target
target recognition,
recognition, bio-
bio-medical
medical analysis,
scene matching, and manufacturing have spurred
spurred the development of a number of of image
processing systems specifically tailored
tailored to
to address
address individual
individual problem characteristics.
The classical
classical approach to image processing (Figure
(Figure 1)
1) adhered to by most of these systems
-
consists
consists of
of four
four processing stages:
stages: preprocessing, segmentation, feature extraction,
extraction, and
classification.
Preprocessing
Preprocessing "'! Segmentation
Segmentation Feature
Feature ** Classification
Classification joi.
Extraction
Extraction
o Noise
o Noise removal
removal o Edge
o Edge detection
detection oo Statistics
Statistics o Class
o Class separation
separation
oo Edge linking
Edge linking oo Shape
Shape metrics
metrics o Distance
o Classifiers
Distance Classifiers
Though widely used through the years, several shortcomings exist in in aa purely
statistical classification. First, most segmentation
statistical approach to object classification. segmentation algorithms
algorithms assume
assume
that
that the objects being segmented will be
be of
of aa uniform
uniform intensity
intensity nature.
nature. This assumption
assumption isis
in light of
invalid in of the
the bi
bi-modal
-modal characteristics
characteristics associated with the
the majority
majority of
of real
real world
world
object,
object, such as cars in which engines may possess radically
radically different
different intensities
intensities compared
compared
192
192 //SPIE
SPIE Vol.
Vol. 528
528 Digital
Digital Image
Image Processing (1985)
(1985)
Image
Image Understanding
Image understanding combines the data extraction techniques of imageimage processing
processing with
the information exploitation techniques of artificial intelligence in in order to fully
interpret aa three dimensional scene (Figure 2).
2) . Object recognition in in an
an image
understanding system integrates four types of processing: two-dimensional
two -dimensional analysis,
temporal processing,
temporal three-dimensional
processing, three -dimensional processing,
processing, and
and knowledge
knowledge base
base processing.
processing.
Two-dimensional
Two- dimensional analysis complements the statistical classification of of objects
previously described with information on local object interrelationships.
interrelationships. After anan object
object
has been segmented and classified, its its local
local context
context is
is analyzed
analyzed to
to provide
provide positive
positive or
negative support to to its
its initial classification. For example, consider the primary
statistical classification of an object as aa vehicle. Examining it's local
interrelationship oror context may provide information
information that the vehicle isis located
located within
within aa
2D ANALYSIS
oo feature
feature
oo interrelationships
interrelationships
4
IMAGE'
IMAGE PREPROCESSING
PREPROCESSING SCENE OBJECT
HISTORY
HISTORY RECOGNITION
RECOGNITION
oo motion
motion
3D
3D ANALYSIS
ANALYSIS KNOWLEDGE
BASE
BASE
o stereo
o stereo vision
vision
o surface
o surface orientation
orientation
oo shape
shape
oo interrelationships
interrelationships
Figure 2.
2. An Image
Image Understanding
Understanding System Prototype
SPIE
SPIE Vol.
Vol.528
528 Digital
Digital Image
ImageProcessing
Processing(1985)
(1985)// 193
194 //SPIE
194 SPIE Vol.
Vol. 528
528 Digital Image Processing (1985)
Digital Image (1985)
SPIE Vol.
SPIE Vol.528
528Digital
Digital Image
ImageProcessing
Processing(1985)
(1985)/ / 195
Natural
Language
Interface
t
Control
Structure*
Structure
o control strategy
o rule interpreter
Knowledge Base
'Knowledge Base 1 Global Sensory
Data Base Information
source
o knowledge source
o knowledgerules o symbolic o Input data
rules
o inference rules information
representation
Inexact
Inexact Reasoning
Advanced vision systems engage inin two sorts of high level activities: goal goal achievement
achievement
and belief maintenance. Goal achievement is is an active, knowledge-driven,
knowledge- driven, foreground
activity that
activity that directs
directs system processing
processing toward the
the achievement
achievement ofof predetermined goals.
goals. For
example, the
the control
control structure of aa model-
model-driven
driven vision
vision systems
systems selects
selects rule
rule firings
firings based
based
on how closely aligned the final deductions are in relation to the model being
investigated. Belief maintainence
maintainence [5]
[5] is
is a passive, data-driven,
data -driven, background that
background activity that
keeps beliefs consistent in in light
light of
of system analysis. Suggestive evidence, certainty
factors
factors and
and inexact fall into
inexact reasoning methods fall into this
this category.
category. As an example,
example, consider the
inexact
inexact reasoning
reasoning concept employed in
in the
the TESS
TESS [6]
[6] system.
system.
recursive confidence
TESS uses a recursive confidence evaluation
evaluation scheme
scheme loosely
loosely based
based on
on the
the Dempster-
Dempster-Shafer
Shafer
approach, except that
that the initial
initial seed
seed confidences
confidences are
are provided a statisical
statisical classifier.
Contextual information exists inin one
one of
of three
three forms:
forms: positive evidence
evidence which increases
increases an
object's classification confidence, negative evidence which decreases an object's
classification confidence, or zero evidence which has no effect on an object's
confidence. In
classification confidence. In TESS
TESS positive
positive evidence isis denoted byby 1,
1, negative by -1,-1, and
zero by O.
0.
Suppose that
Suppose that an
an object
object has
has been
been detect
detect in
in an
an image
image and its context has
and its has been identified
as being on a road, in
in a column, and in in motion. The object's initial statistical
is given
classification confidence is given as:
as:
(Object-13
(Object-13 is a (Vehicle .63)
. 63)
(Tank .57)
. 57)
(Clutter .30)
.30)
(Unknown .16)
.16)
196 //SP
SPIE Vol. 528
/E Vol. 528 Digital
Digital Image Processing (1985)
(1985)
(.63 ++ (.63
RES vehicle = (.63 (.63 ** (1
(1-.63) 0 ** .67))
-.63) * 0 .63
.67)) = .63
RES tank = (.57 ++ (.57
(.57 ** (1
(1-.57) * 11 ** .67))
-.57) * = .73
.67)) _ .73
RES clutter = (.30 ++ (.30
(.30 ** (1-.30) *-l
(1 -.30) * * .67))
-1 * .16
.67)) = .16
RES undefined= (.16 + (.16
(.16 ** (1
(1-.16)
-.16) *-l
* -1 * .07
* .67))= .07
This process is
is repeated
repeated for
for each
each applicable
applicable rule
rule firing
firing that
that would
would occur in the
system.
Cueing
Contextual Target Cueing
Two expert
Two expert vision
vision systems
systems which attempt
attempt to
to address
address the
the deficiencies
deficiencies of
of image processing
processing
in
in automatic target recognizers
recognizers (ATRs)
(ATRs) have been independently
independently developed. Each system
exploits
exploits the
the global
global information
information in
in aa scene
scene that
that is
is discarded
discarded in
in current
current systems
systems to
to improve
upon the
upon the performance of localized ATRs. These contextual
contextual cueing
cueing systems form the initial
application
application of
of artificial technology into
into the
the automatic
automatic target
target recognizer
recognizer problem
problem domain.
domain.
Contextual target cueing is a concept in which object object classification confidence is is
enhanced by by the positive/negative
positive /negative evidence
evidence provided
provided byby their
their context. Consider the scene
shown
shown in in figure
figure 4. 4. Using
Using existing
existing statistical
statistical methods
methods, r four
four objects
objects have
have been
been identified
identified as
as
possible vehicles and have each been assigned a vehicle's confidence measurement (V1
possible (Vl=80%
=80%,r
V2=70%,
V2 =70 %, V3=80%
V3 =80%,f V4=75%).
V4 =75%). Assuming
Assuming aa threshold
threshold of
of 75%,
75%, one
one false
false vehicle
vehicle would
would be detected
detected
(V4)
(V4) while one real real vehicle would bebe omitted
omitted (V2).
(V2). By exploiting the context of of each
object, the true true confidence level level of
of each potential
potential vehicle could be determined. For For
example, VI V1 and V3 are bothboth on
on aa road,
road, in in motion and in in aa column.
column. This is is all
all positive
contextual evidence supporting aa classification
contextual classification ofof vehicle,
vehicle, so so each confidence measurement
is
is mathematically
mathematically improved
improved to
to reflect
reflect object
object context.
context. V2V2 possesses
possesses thethe same
same characteristics
characteristics
its score improves to 85%
and its 85% thus
thus exceeding
exceeding the
the 75%
75% threshold
threshold mark.
mark. V4,
V4, on
on the
the other hand
hand
is
is in
in a a lake,
lake, isolated, and has no motion. This This information is is all negative evidence
against aa classification of of vehicle but
but positive
positive evidence
evidence toward a potential secondary
classification of rock. V4's confidence drops to 60% 60% and
and is is now substantially below
threshold. After contextual analysis has completed,
After contextual completed, allall three vehicle are detected and and the
false vehicle has been discarded.
false
Two separate context cueing systems
systems have
have been
been developed.
developed. Hughes Aircraft [7] [7] has
implemented an object-oriented
object- oriented vision
vision system approach to
to automatic target recognition.
Using aa spatial
Using spatial blackboard concept
concept [8],
[8], the system maintains
maintains scene information in in a multi-
level
level inheritance
inheritance structure (Figure
(Figure 5).
5). Knowledge propagation is is the central idea
idea behind a
blackboard vision configuration. Knowledge propagation in in aa blackboard architecture
consists of two two steps. First, high confidence
confidence knowledge
knowledge sources
sources at
at all
all levels of the
identified. A propagation effect follows in which hypotheses are
blackboard are identified. are generated
for each
each high
high confidence knowledge source.
source. Hypotheses
Hypotheses are
are then
then propagated outward toward
low confidences
confidences and resolve representation
representation conflicts.
conflicts. For example, the bottom level may
For
raw image pixels, the second level
contain raw level enhanced
enhanced pixels,
pixels, the third level contrast
contrast--
derived line segments, the fourth level closed segmentation boundaries, etc.. A
etc..
hierarchical
hierarchical linking
linking of
of each
each entity (e.g. pixel
entity (e.g. (xl,yl) is
pixel (xl,yl) is part
part of
of region
region 10
10 segment 32) is
segment 32) is
maintain for
for system traceability. Information on various types of of contextual relations
internal to an
internal (such as temporal
an image (such temporal data,
data, global context, and object structure)
global context, structure) asas well
well
as
as external
external to
to the
the image (such as
image (such as TAG
TAC RECCE
RECCE and
and digital
digital terrain
terrain data)
data) can
can be
be fused into the
fused into the
blackboard
blackboard system
system to
to form
form an
an all
all encompassing
encompassing symbolic
symbolic representation
representation ofof aa scene.
scene.
SPIE Vol.528
SPIE Vol. 528 Digital
Digital Image
Image Processing
Processing(1985)
(1985)// 197
197
4. An
Figure 4. Example Of
An Example Analysis Using
-Based Analysis
Rule-Based
Of Rule Suggestive Evidence
Using Suggestive
model objects
A knowledge base consisting of model (e.g tank, truck,
objects (e.g their
APC) and their
jeep, APC)
potential contextual relationshi ps is the focal point of
relationships of high level knowledge base
processing.
processing. Each model object is
is maintained in architecture and
in aa semantic frame architecture
the class
represents the
represents tactical objects
of tactical
class of may to
that may
objects that encountered during the processing
be encountered
to be
of a specific mission.
of models are of
mission. The knowledge base models nature similar to
hierarchical nature
of a hierarchical
the symbolic scene representat
the scene ion
representation allowing a
a hierarchica
hierarchical l classifier to
to be implemented.
The classifier
The attempts to match each unknown
classifier attempts model against known knowledge base
symbolic model
unknown symbolic
best match
models to determine aa best classification and
match classification confidences.
classification confidences.
and associated classification
Martin Marietta Aerospace [9]
Martin [9] has also implemented a model based hierarchical vision
based hierarchical
system but has taken a slightly different
different approach
approach to
to target
target recognition. Rather than
system architectur e,
integrate artificial
integrate intelligenc
artificial intelligence e at both high and
and low levels
levels in
in a
a blackboard
blackboard architecture,
Martin has separated the processes so so that they may
that they independently. The original
be worked independently.
may be
ATR has
ATR supplemented with additional algorithms
has been supplemented classification,
for global region classification,
algorithms for
motion target identificat
motion ranging, and advanced object recognition
ion, passive ranging,
identification, recognition. . When
combined with ancillary data in the form
in the of digital
form of maps, TAC
digital terrain maps, TAG RECCE, timetime-of-day
-of -day
and
and weather conditions, this information constitutes a symbolic powerful scene
representat ion which
representation is fed into an expert system context
which is context analyzer.
The context analyzer is model based
is aa model rule system capable of
based rule of reasoning about scene
context and determining true
context true object and region classification confidences through aa
region classification
positive-ne
positive- gative suggestive evidence scheme.
negative ATR data forms the to be enhanced by
classification confidence
contextual evidence classification seeds initial
confidence seeds system.
initial provided to the system. Three
types
types ofof contextual evidence exist: evidence, and neutral
evidence, positive evidence,
exist: negative evidence,
evidence.
evidence. For the fact
example, the
For example, that an object is
fact that in aa lake
is in may be
lake may for
evidence for
be negative evidence
the
the existence of
of a tank, positive evidence
evidence for
for the
the existence
existence of
of a
a boat, and neutral
evidence in
evidence in support of the object being rock.
being aa rock. The context analyzer has the the power to
feedback adjustable algorithm parameters in support of
in support hypothesis-and
of aa hypothesis- -test capability
and-test
feedback adjustable
context.
required to fully validate scene context.
required
198 //SPIE
SPIEVol. 528 Digital
Vol. 528 Processing (1985)
Image Processing
DigitalImage
SCENE
BRIDGE
OBJ 1
OBJECTS OBJ-CLASSES
GRASS 4:1)
CDLID
PLANAR
REG 10 GREEN
TRAPEZIOD
REG 11 BLUE
ECTANGLE
ACUTE r
L
J
OBTUSE
PIXELS / PXL-CLASSES
PXL- CLASSES
Figure 5.
Figure 5. An
An Example
Example Of
Of The Blackboard Knowledge Representation Scheme
Scheme
SPIE Vol.
SPIE Vol.528
528Digital
DigitalImage
ImageProcessing
Processing(1985)
(1985)/ / 199
199
Summary
Artificial intelligence offers aa variety of of enhancement techniques in the area of
in the of
identified aa number
This paper has identified
computer vision. This of existing
number of existing high applications in
level applications
high level in
image understanding, knowledge base systems,
the areas of image
the knowledge
systems, inexact reasoning and knowledge
representation. in low
Applications in level processing
low level also of
using heuristics are also
processing using value.
of value.
For example, the segmentation of infrared imagery
of infrared is currently
imagery is concept in
image concept
single image
currently aa single in
which all adjustable algorithm parameters are are predetermined. By taking a multi-frame
multi -frame
approach, heuristic rules can be created to enhance the imagery and address object
obscurations and mergings during the data extraction phase ratherrather than taxing the high
than taxing
level
level symbolic processor.
200 //SPIE
SPIE Vol
Vol.528
528Digital
DigitalImage
ImageProcessing
Processing (1985)
[1]
[1] B.
B. G.
G. Schunck,
"The Motion Constraint
"The Constraint Equation
Equation For
For Optical
OpticalFlow
Flow",
",
Seventh IJCPR, Montreal,
Seventh IJCPR, 1984.
Montreal, August 1984.
[2]
[2] C.
C. M.
M. Brown,
Brown, D.
D. H.
H. Ballard, and O.
Ballard, and 0. A. Kimball,
Kimball,
"Contraint Interaction
"Contraint InteractionInInShape-
Shape-Form-Shadowing Algorithms"
Form -Shadowing Algorithms ",,
DARPA
DARPA Image
Image Understanding Workshop, Sept.
Sept. 1982.
1982.
[3]
[3] A. R.
A. R. Bruss,
Bruss, and B.K. Horn,
Horn,
"Passive Navigation
Navigation",
",
DARPA
DARPA Image
Image Understanding Workshop, Sept 1982.
Workshop, Sept 1982.
[4]
[4] S.
S. A.
A. Shafer,
Shafer, and T.
T. Kanade,
Kanade,
"Using In Finding
"Using Shadows In Finding Surface
Surface Orientations
Orientations",
",
DARPA Image Understanding Workshop, Sept
DARPA Image Sept 1982.
1982.
[5]
[5] T. D. Garvey, J. D. Lowrance, and
J. D. and Martin
Martin A.
A. Fischler,
Fischler,
"An
"An Inference
Inference Technique for
for Integrating
Integrating Knowledge
Knowledge From
From Disparate
Disparate Sources
Sources",
",
IJCAI,
IJCAI, British
British Columbia, Canada, August 1981.
Columbia, Canada, 1981.
[6]
[6] J. F. Gilmore,
J. F.
"TESS
"TESS -The Tactical
Tactical Expert
Expert System
System",
",
Fourth International Conference On On Expert
Expert Systems,
Systems,
Avignon,
Avignon, France, 1984.
France, May 1984.
[7]
[7] Kirn,
Kim, J.
J. H.,
H., Payton, D. W.,
Payton, D. Olin, K.
W., Olin, K. E.,
E., and
and Tseng,
Tseng, D.
D. Y.,
Y.,
"A Context Dependent
Dependent Automatic
Automatic Target
Target Recognition
RecognitionSystem
System",
",
Intelligence,
Applications of Artificial Intelligence,
Proceedings
Proceedings of 485, May 1984.
of SPIE 485, 1984.
[8]
[8] Hanson,
Hanson, A.
A. R.,
R., and Riseman,
Riseman, E.
E. M.,
M.,
Computer
Computer Vision Systems,
Systems f
Press, 1978.
Academic Press, 1978.
[9]
[9] Forman,
Forman, A.
A. V.,
V., Rowland,
Rowland, P.P. J.,
J., and
and Pemberton,
Pemberton, W.
W. G.,
G.,
Analysis of
"Contextual Analysis of Tactical
Tactical Scenes
Scenes",
",
Intelligence,
Applications of Artificial Intelligence,
Proceedings
Proceedings of
of SPIE 485,
485, May 1984.
1984.
[10] D.
[10] H. Ballard, and C.
H. C. M.
M. Brown,
Computer Vision,
Englewood Cliff, Jersey,
Cliff, New Jersey,
Prentice-Hall,
Prentice -Hall, 1982.
SPIE Vol.
SPIE Vol. 528
528 Digital
Digital Image Processing
Processing(1985)
(1985)// 201
201