Image and Cognition
Image and Cognition
Piotr Kozak
BLOOMSBURY ACADEMIC
Bloomsbury Publishing Plc
50 Bedford Square, London, WC1B 3DP, UK
1385 Broadway, New York, NY 10018, USA
29 Earlsfort Terrace, Dublin 2, Ireland
Piotr Kozak has asserted his right under the Copyright, Designs and
Patents Act, 1988, to be identified as Author of this work.
Cover image: Suprematism No. 56, 1916 Kazimir Malevich 1878–1935 Russia,
USSR, Russian, Federation, © Peter Horree / Alamy Stock Photo
Bloomsbury Publishing Plc does not have any control over, or responsibility for, any
third-party websites referred to or in this book. All internet addresses given in this
book were correct at the time of going to press. The author and publisher regret any
inconvenience caused if addresses have changed or sites have ceased to
exist, but can accept no responsibility for any such changes.
A catalogue record for this book is available from the British Library.
A catalog record for this book is available from the Library of Congress.
To find out more about our authors and books visit www.bloomsbury.com and
sign up for our newsletters.
To Kasia, Tadeusz and Józef
vi
Contents
Introduction 1
Notes 185
Literature 196
Index 225
Figures
Books never have a single author. They are always the fruit of discussion. I
would like to thank Bartosz Działoszyński, Paweł Gładziejewski, Witold Hensel,
Mateusz Hohol, Małgorzata Koronkiewicz, Paweł Kozak, Mira Marcinów,
Jakub Matyja, Bence Nanay, Kristόf Nyíri, Michał Piekarski, Robert Poczobut,
Robert Rogoziecki and Nastazja Stoch for inspiring discussions and valuable
advice. I am very grateful for the anonymous reviewers for their helpful
remarks. Special thanks to Maja Białek, Krystyna Bielecka, Marcin Miłkowski
and Marek Pokropski, who read the early version of the manuscript for their
insightful comments. I am also grateful to Kamil Lemanek and Krzysztof Gajda
for proofreading.
I wrote this book during the Covid-19 pandemic and finished it when the
Russo-Ukrainian War started. It was not a good time to write books. The only
explanation why this book was written is the continuous support of my family.
Thus, I would like to express my deepest gratitude to the most thoughtful and
caring person I know, who happily is my wife, and the people I love the most,
who happen to be my children.
Chapter 2 includes material from my paper ‘The Diagram Problem’,
Diagrammatic Representation and Inference 2020. LNCS (pp. 217–24). Springer.
Chapter 6 includes parts of my paper ‘The Analog-Digital Distinction Fails
to Explain the Perception-Thought Distinction: An Alternative Account of the
Format of Mental Representation’, Studia Semiotyczne, 35 (1), 73–94.
This work was supported by the research grant ‘What Is Thinking with
Images?’, SONATA 10, granted by the National Science Centre, Poland, based on
the decision No. 2015/19/D/HS1/02426.
x
Introduction
The main research question of this book is this: What is thinking with images?
The question is analogical to such questions as ‘What is thinking with language?’
It means that if we can ask whether we can think in or with language, then we
can ask whether we can think in or with images.1
The question follows from a commonsensical observation: when we ask how
many windows are in the flat, someone will probably form and inspect a mental
image of the flat and count the windows. If an architect designs a house, then
they design the house with the help of drawings. One may use a map if one tries
to get from point A to point B.
The aforementioned examples are instantiations of what can be called
‘thinking with images’. However, listing examples of imagistic thinking is a
relatively easy task. The difficult task is to say what thinking with images is.
Imagistic thinking is understood here in three ways: as a faculty, an act and a
mental state or an event. The faculty of imagistic thinking refers to the capacity
to use images in thinking. The act of imagistic thinking is exercising this faculty.
The mental state called ‘imagistic thought’ is a product of such an act.
These three understandings of the term ‘thinking’ are interconnected, for
we can only study the faculty of thinking through its expressions in the acts
of thinking. The acts of thinking are interpretable only through studying their
products. Similarly, we can understand what thoughts are only if we understand
what the acts and the faculty of thinking are. Thus, a full-fledged theory of
imagistic thinking should provide a theory concerning the faculty, the act and
the mental state of imagistic thinking.
Imagistic thinking is commonly contrasted with thinking with words (e.g.
Slezak, 2002b; Zhao et al., 2020), for, as is commonly held, not everything
that is depicted or represented in an image can be described in language-like
and propositional form. However, that is only a negative description of the
phenomenon.
The main difficulty stems from the fact that thinking with images is not
a case of trying to solve a clear-cut problem where we need answers to some
ready-made questions. It is the opposite: we need a basic reconsideration and
reorganization of the issue. The fact that we think with images is intuitive but
poorly understood. It is partly caused by the fact that the concept of thinking
with images has always been formulated in various ways, often just in ordinary
language. The problem with spelling out what the expression ‘thinking with
images’ might mean is a crucial part of what thinking with images is. Yet it is
easier to point out what kind of definition we do not want than to produce one
that we would accept.
Introduction 3
Let me give an example of what such a definition cannot look like. It may
be tempting to hold that iconic representation can be ascribed to the format
of ‘showing’ and the propositional ones the format of ‘saying’. However, it is a
dead end for two reasons. On the one hand, one can overvalue the dissimilarities
and claim that thinking with images expresses the so-called iconic difference,
that is, it is governed by a ‘logic of showing’ which follows different rules (if
any) than propositional logic – a ‘logic of saying’ (e.g. Belting, 2001; Boehm,
2007; Mersch, 2003, 2011; Mitchell, 1994; Müller, 1997). On the other, one may
overestimate the analogies and claim that there is only one format of thinking –
a propositional one (e.g. Fodor, 1975, 1987; Pylyshyn, 2003a). All other formats
are only epiphenomena of the propositional one.
Both approaches are unsatisfactory. The first one is doomed to vague
metaphors and cannot spell out the similarities between propositions and icons.
Images and propositions can indicate a subject and attribute a property to it. If I
take a picture of a man committing a crime, the picture can express a proposition
that can be put into the sentence ‘this man has committed a crime’ (e.g. Kulvicki,
2020; Novitz, 1977).
The second approach is doomed to reductionism and cannot spell out the
differences between propositional and iconic representations. Unlike images,
propositions are governed by the rules of logic. Propositions can be negated and
inferred. Images cannot. Thus, it is essential not to exaggerate the similarities
and not to underestimate the differences.
Notice that the question ‘What is thinking with images?’ is not to ask ‘Do
we think with images?’ One of the assumptions of the book is that we do. The
question is, what does that mean, and what follows from it,
Although the nature and role of (mental) images have been investigated in
many particular branches of philosophy, for example, in logic (e.g. diagrammatic
reasoning), philosophy of mind (e.g. the imagery debate) and aesthetics (e.g.
theory of depiction), what we certainly lack is a general account of what images
are and what role they play in thinking.
The general form of the main question follows from our overall attitude
towards what the philosophical problem is to begin with. As Sellars famously
noted (2007, 369), the aim of philosophy is ‘to understand how things in the
broadest possible sense of the term hang together in the broadest possible
sense of the term’. Thus, although the nature and the role of images are studied
in many specific branches of philosophy, the general aim of philosophy is to
deliver as broad an understanding of the nature and function of images as
possible.
4 Thinking in Images
view. And this book is an attempt to do just that: to look at images and thinking
from above.
There are two contrasting and firm intuitions standing behind the main
question. First of all, there are instantiations of some mental processes, based,
loosely speaking, on using images that we would rightly like to call ‘thoughtful’
but that do not fit into our general view of the nature of thinking. They do not
have a propositional or language-like form, a form expressed with propositions
or sentences.
For instance, we would certainly like to interpret architects’ drawings as part of
their thought process. And similarly, we may want to interpret how engineers,
biologists and chemists use diagrams as an essential part of their scientific
practices (e.g. Rheingold, 1991; Earnshaw and Watson, 1993; Nersessian, 2008;
Tufte, 1997). Scientists think with diagrams (e.g. Ferguson, 1992; Gooding,
1990, 2010; Mößner, 2018; Sheredos et al., 2013; Shin, 2015, 2016; Sloman,
2002). Finally, we would like to say that the result of an artist’s work, such as a
painting or a piece of music, is a result of thoughtful action, even if such results
are put exclusively in pictorial or auditory representational form (e.g. Carvalho,
2019). As Ryle (1979, 85–6) points out:
All the mentioned examples of using images in science and art are hardly
translatable into sentential forms, and it appears that they cannot be put into a
logical form that would determine which step is necessarily next. Therefore, some
acts of thought are hard to reconcile with our general view of what thinking is if
we limit our understanding of thinking to propositional or language-based acts.
Second of all, there is a widespread assumption that images, particularly
visualization, play an essential role in cognition and facilitate solving cognitive
tasks. Most commonly, it is claimed that images are more informative and
basic than sentential representations, epitomized by the maxim ‘a picture is
worth a thousand words’, and that visualization fosters learning and reasoning
processes. However, such generalizations about the benefits of images over
good old-fashioned sentential representations beg the question of what
6 Thinking in Images
is cognitively gained from them. What, for example, is the status of the
commonly held claim that pictures convey more information than any formal
representation or symbol? It is obviously false, for there are symbols, such as
the symbols (1, . . ., ∞), that are infinitely more informative than any picture. It
is not even the case that pictures necessarily convey more visual information.
Some phenomena can be represented symbolically but not pictorially, such as
the concept of an infinite set.
Moreover, it is far from clear what follows from the claim that visualizing
helps in information acquisition (Salis and Frigg, 2020). For instance, how
would visualizing the definition of an integral help us learn how to integrate a
linear function? Visualization is neither necessary nor valuable in that context.
This tension is evident from the perspective of conflicting results in cognitive
science in studies on the effects of visualization in reasoning. Some studies
report that performance on some problem-solving tasks, such as those involving
transitive inference, depends on the ease of imagining the premises of the
reasoning (e.g. Clement and Falmagne, 1986; Shaver et al., 1975). Problem-
solving task participants who build a physical model of a problem are more
likely to solve it. That includes visualizing with the help of graphs and the help
of gestures (e.g. Bocanegra et al., 2019; Goldin-Meadow, 2003; McNeill, 1992;
Vallée-Tourangeau et al., 2016). However, numerous studies have failed to
identify visualization as having any effect on reasoning. For example, Sternberg
(1980) found no difference between the accuracy of solving difficult and easy-
to-imagine problems. There is no correlation between the ability to visualize and
scores on IQ tests and subjects’ ability to reason. Richardson (1987) reported
that individuals reasoning with concrete and easy-to-visualize problems were
no better than individuals reasoning with abstract problems. According to some
scholars (e.g. Byrne and Johnson-Laird, 1989; Knauff, 2013; Ragni and Knauff,
2013), images are irrelevant for reasoning. Human thinking is based not on
abstract symbols or the manipulation of visual images but on spatial cognition,
particularly on multimodal spatial layout models, which are more abstract than
images and more concrete than symbols. It has even been demonstrated (e.g.
Knauff, 2013; Knauff and Johnson-Laird, 2002) that visualization may, in fact,
impede the process of thinking, which is known as the visual impedance effect.
Knauff and Johnson-Laird (2002) show that visual imagery can slow down the
reasoning process. In other words, engaging imagery skills can interfere with
thinking.
The debate, however, is obviously unsolvable if we do not provide a clear-cut
definition of what images and thinking with images are. For example, Knauff
Introduction 7
(2013) rightly argues that mental imagery may actually hinder reasoning.
However, the claim is only justified if mental images are understood as modality-
specific, visual, conscious, picture-like representations that are supposed to play
the same role as propositions. There is, however, no logical or nomological
necessity for understanding them in this way. In fact, mental images are not
necessary to be visual, conscious, picture-like representations or to play a
proposition-like role.
We encounter the same problem when we assess the value of diagrammatic
representations in the logic and philosophy of mathematics. Although it is rightly
claimed that diagrams play a crucial role in mathematical practice, there is no
consensus on the credibility of reasonings conducted with the help of imagistic
means (e.g. Barwise and Etchemendy, 1996; Giaquinto, 2007, 2008, 2011;
Giardino, 2017; Macbeth, 2012; Mancosu, 2005; Mumma, 2010; Nelsen, 1993).
For example, it is hard to imagine Euclidean geometry without diagrams (e.g.
Macbeth, 2010; Manders, 2008); simultaneously, diagrammatic representations
have been suspected to be unreliable and heterogeneous (e.g. Brown, 1999;
Giaquinto, 2011; Kulpa, 2009).
Moreover, the lack of a general understanding of the nature of images and
their role in thinking may have a negative impact on our understanding of
the role that imagistic representations play within cognitive systems. We risk
making theoretical choices based on tacit knowledge and implicit assumptions
about what images are and what role they can play in thinking without a clear
account of what imagistic representations are. In other words, our theories of
imagistic representations may very well be undercut by conceptual prejudices
rooted in our vague intuitions. For example, it is argued (e.g. Pylyshyn, 2003b)
that what hinders the imagery debate is the lack of agreement concerning what
mental images can be and how we can explicate the phenomenon.
Consequently, we do not know what the empirical findings indicate. Without
a comprehensive account of what ‘mental image’ means, we are condemned to
be immersed in an abyss of vague metaphors and confusing concepts such as
‘functional images’ and ‘quasi-pictures’ that both are and are not images or a
‘mind’s eye’ that is neither in the mind nor the eye. Additionally, it is argued (e.g.
Berman, 2008; Reisberg et al., 2003) that the conflicting intuitions regarding
the nature of mental images may be one of the causes of incompatible data in
the imagery debate. It is plausible that preconceived theories about the nature
of mental imagery influence what subjects say about their experiences, for
theoretical views can and sometimes do influence introspection-based claims
about imagery. Several studies (e.g. Intons-Peterson, 1983; Intons-Peterson and
8 Thinking in Images
apply the results of studies on the pictorial experience of mental images to motor
mental images, and it is not clear how investigations of the nature of graphs
might apply to pictures. It is difficult to make predictions about the cognitive
value of mental images based on findings concerning the cognitive value of
diagrammatic reasoning or to make predictions about pictures’ cognitive
function based on metaphors’ cognitive function. What is needed is a systematic
approach to interpreting the nature and merits of imagistic representations.
Without such an approach, we have no principled way of making sense of the
vast empirical literature on the cognitive function of (mental) images. Therefore,
what is needed is both an answer to the general question ‘What is thinking with
images?’ and a view that accounts for various uses of imagistic representations
and the interplay between internal (mental) and external representations.
Image definition
Some remarks concerning basic terms are required. When I use the terms ‘image’
and ‘imagistic representation’, I interpret them as synonyms of the Peircean
terms ‘iconic sign’ and ‘iconic representation’.
According to Peirce (CP 1.545-567), representations can be divided into
symbols, icons and indices. Symbols refer to their objects via conventions; icons
share some qualities with their objects; indices are causally interconnected with
the represented objects. This trichotomy is based on the form of the relation
between the representation and its object. The distinction denotes different ways
in which this relation can be founded, referred to by Peirce as the ‘ground’ of the
sign (Short, 2007). We can speak of symbols, icons and indices, depending on
the identified ground of the relation – an arbitrary convention, a similarity or a
causal interconnection.
However, a decent interpretation of Peirce’s theory of icons shows that it is
disputable whether Peirce holds that the concept of resemblance is sufficient to
explain how icons represent (Ambrosio, 2014; Chevalier, 2015; Hookway, 2000,
2007; Pietarinen, 2006, 2014; Stjernfelt, 2000). For the reasons mentioned later,
it is better to understand the notion of ‘image’ as a representation that bears a
natural and direct relation to the represented object (e.g. Burge, 2018; Giardino
and Greenberg, 2015). This description is motivated by the observation that
there is, without specifying it now, a natural relation between images and
depicted objects which is believed to be based on exemplifying certain features
of said objects. In comparison to the language, it is not accidental that a dog’s
10 Thinking in Images
picture is like a dog, where it is largely irrelevant whether we call a dog ‘dog’ or
‘Hund’. The words ‘dog’ and ‘Hund’ are not dog-like; the image of a dog is. In
comparison to natural signs, such as animal tracks in the snow, images do not
have to be causally mutually connected with the represented object. An image
of Santa Claus does not have to be caused by Santa Claus, while animal tracks
have to be caused by animals. If there were no animals, there would be no animal
tracks. We cannot say the same about Santa Claus.
Importantly, we cannot determine the sign’s ground outside of the context
of how the sign is taken. The letters p and b can be considered icons if we
pick out the spatial similarities between them. They can be considered different
symbols if we isolate the conventional relation by which they represent. Thus,
depending on the interpretation of the relation between representations and
represented objects, we can take these letters as instantiations of icons or
symbols. Consider the diagram of a triangle ∆. Depending on the way it is
interpreted, it can be an icon; a symbol, for example, ∆ABC; or an index, for
example, a symptom of the mental illness manifested by compulsive drawing
of triangles.
Following the Peircean tradition, I do not restrict the definition of images
to visual representations. One can talk about tactile (e.g. Lopes, 1997; Kulvicki,
2006b; Yoo et al., 2003), haptic (e.g. Klatzky et al., 1991), olfactory (e.g. Bensafi
et al., 2003, 2005; Djordjevic et al., 2004, 2005; Gilbert et al., 1998), gustatory
(e.g. Croijmans et al., 2019; Kobayashi et al., 2004) and auditory images (e.g.
Halpern et al., 2004; Hubbard, 2010; Jakubowski, 2020; Zatorre et al., 2010).
They are just as common and just as psychologically important as visual ones
(e.g. Sebeok, 1979; Newton, 1982). When addressing questions like ‘How do
you clean a window?’ or ‘How would you feel if you won a Nobel Prize?’, one
can form a motor (e.g. Guillot, 2020) and an emotional image (Blackwell, 2020;
Nail, 2019; Thagard, 2005). We can use mental images to think about temporal
relations (e.g. Viera and Nanay, 2020). Maps are imagistic representations of
spatial relations. And diagrams are images spatially representing non-spatial
relations. In a broad sense, even language is describable in pictorial terms. For
instance, visual metaphors or graphic descriptions may be interpreted as literary
devices that form a literary image or contribute to a literary image (e.g. Collins,
1991; Scarry, 1999; Troscianko, 2013).
Such a description is far from clear, and it comes as no surprise that there is
no accepted definition of an image among psychologists and philosophers. That
being said, an explication of the concept of an image is one of the aims of this
book.
Introduction 11
Imagistic thinking
The main problem with describing what thinking with images is stems from
the fact that the general way in which images are described, including visual,
auditory, olfactory, literary images, and so on, makes it difficult to read off
the properties of imagistic thinking directly from the properties of images. In
contrast, we can read off the properties of thoughts from the logical properties
of language.
It is easier, however, to say what thinking with images is not. Imagistic
thinking is contrasted here with propositional and language-like thinking. The
term ‘propositional thinking’ refers to how we think about thinking and how
we interpret the nature of thoughts. Thoughts are interpreted as contents of
propositions. The thought that snow is white is the content of the proposition
snow is white. Thinking is, first, the way we stand in relation to certain
propositions, such as snow is white, which is most often interpreted as holding
a propositional attitude, such as ‘I believe that snow is white’ or ‘I doubt that
snow is white’ (e.g. Bermúdez, 2003; Peacocke, 1986),5 and, second, as the way
thoughts stand in relation to each other, which helps us to explain the nature of
logical inferences between thoughts (e.g. Braine and O’Brien, 1998).
Introduction 13
Paradigmatic examples of thinking with images are the way musicians use
auditory images to think about music; the way painters use pictures to express
thoughts about perceptual qualities; the way architects employ drawings to
reason about spatial relations; the way scientists and engineers use diagrams and
sketches in scientific reasoning; and the way people employ imagery skills.
According to the imagistic theory of thought, some thought processes are
necessarily based on rule-governed mental or extramental image sequences.
Images are the building blocks of some thoughts, and they determine the content
of those thoughts.
I will argue that the imagistic theory of thinking can be defended within
the framework of measurement theory. The basic idea is that we can apply
measurement-theoretic concepts to the analysis of imagistic thinking in the
same way as we apply them to study propositional attitudes (see Dennett, 1987,
1991b; Dresner, 2010; Marcus, 1990; Matthews, 2007, 2011).
14 Thinking in Images
Content overview
Dennett has once said that philosophers are better at asking questions than
answering them. I am not convinced that this is entirely true. However, I am
certain that a clear formulation of a problem is a crucial part of a philosopher’s
toolkit.
Following this line, the book consists of two parts. In the first part
(Chapters 1–3), I explain the nature of the problem of thinking with images.
In the first chapter, I present the problems related to the idea of thinking with
images. I argue that explaining the nature of thinking with images requires
proving two theses. According to the Irreducibility Thesis, images are necessary
for (some) thinking processes. This involves showing that images can be bearers
of thoughts and express a non-propositional kind of content and knowledge.
According to the Translatability Thesis, some imagistic thoughts and contents
can be expressed by propositions. I hold that both theses can be explained and
reconciled by adopting the measurement-theoretic account of images, according
to which images are measurement devices, just like rulers and balances.
In the second chapter, I show how thinking about the nature of thoughts
and images can lead to conceptual problems of thinking with images. Images
cannot provide any theory of knowledge and content. They cannot explain the
systematic and decomposable nature of thoughts. I introduce the so-called
Received View, which is a set of beliefs and intuitions regarding the nature of
thinking and which can be framed in a phrase that thinking is propositional
and action-based. I demonstrate that the Received View is incompatible with
the so-called Traditional View regarding the nature of images, which holds that
images resemble represented objects.
In the third chapter, I explore the kind of answers we can expect by asking
questions about the nature of thinking with images. I present two strategies for
answering these questions. According to the neo-Lockean approaches, imagistic
thoughts are abstract entities that mediate between perceptual and discursive
representations. According to Wittgenstein–Ryle’s line of argument, the question
about the imagistic vehicle we think with is senseless. I show that both strategies
fail. I introduce the so-called operational approach, according to which we can
think of imagistic thoughts as the kinds of operations that require images to be
expressed. The question about the nature of imagistic thoughts is the question of
the nature of these operations.
In the second part (Chapters 4–7), I present a positive proposal for addressing
the problem of thinking with images. I do this by analysing two case studies. In
16 Thinking in Images
the fourth chapter, I study the content of knot diagrams. To explain how they
represent, I introduce the concept of construction, which refers to the procedures
of arriving at a target by determining the parameters of some logical or physical
space. I show that the concept of construction is crucial for our understanding
of how images represent.
In the fifth chapter, I analyse the black hole picture. In order to explain its
semantics, I introduce the concept of recognition-based identification. I hold
that recognition is distinct from having beliefs. It is a kind of reference that is
also different from demonstratives and descriptions. Recognition consists in
identifying construction invariants.
In the sixth chapter, based on the concepts of construction and recognition,
I present the so-called two-dimensional (2-D) model of iconic reference.
According to this model, images denote their targets and exemplify the rules of
construction that identify the referent. I show that the best way to explain the
semantical properties of images is to think of them as measurement devices.
In the seventh chapter, I show how the measurement-theoretic account of
images can address the challenges described in the second chapter. I argue that
the 2-D model provides us with a theory of knowledge and content. It allows
us to explain the systematic and decomposable nature of thoughts. Last but not
least, I show the metaphysical consequences the measurement-theoretic account
of images has on our thinking about the nature of the mind. I apply the 2-D
model to explain the nature of the representational format and mental imagery.
This book is long. Although I highly recommend reading it in the order
presented in the consecutive chapters, it can be read in at least two ways depending
on the subject of interest. Those who are intrigued by the philosophy of mind
can go to Chapters 2–3 and 7. Those interested in the theory of depiction can
focus on Chapters 4–6, where I present a model of iconic reference. However,
this book should not be taken as presenting a full-fledged theory of depiction.
That would require giving a detailed context of a discussion, explicating the
positions I am arguing against and reasons why my model is better than others.
The reader will not find it here since it would make the book unreadable. My goal
is more moderate. I take the expression that we think with images for granted
and investigate its consequences for our conceptions of thinking. I believe that
the best way to make this expression sensical is by accepting the measurement-
theoretic model of images.
1
Now, however, it has become so thoroughly absorbed that it has disappeared from
philosophy’s surface and an imagistic account of thinking such as is outlined in
Russell’s Analysis of Mind (Lecture X) or elaborated in H. H. Price’s Thinking
and Experience is usually no more felt to deserve critical attention than is, say, a
geocentric account of the universe.
Let me underline this point. The imagistic theory of thought has been replaced
with the propositional one not because a competing theory with a more
explanatory power has been introduced. The propositional theory of thought is,
in many aspects, no better than the imagistic one. Instead, the imagistic theory of
thought appeared to be internally inconsistent, and it was accordingly rejected.
As Sellars (1963, 15) puts it: ‘all attempts to construe thoughts as complex
patterns of images have failed, and, as we know, were bound to fail.’
However, regardless of the staunch rejection of the imagistic theory of
thought, even its harshest critics agree that even if not all thought processes are
conducted in images, some of them are or at least that some thought processes
do involve (mental) images. For instance, it is believed that images are crucial
to explaining the nature of animal thinking (e.g. Burge, 2010; Gauker, 2011;
Mullarkey, 2011) and the nature of concept formation (e.g. Barsalou, 1999;
Carey, 2009). Whatever may be meant by the claim that mental images do or do
not exist, mental imagery is a real phenomenon and quite open to quantitative
scientific investigation (e.g. Shepard, 1990). It is claimed (e.g. Block, 1983a;
Gauker, 2011; Kaufmann, 1996; Rollins, 1989) that the debate on the possibility
of thinking with images challenges the dominant view of the nature of thinking.
However, we still lack a clear understanding of what we mean by the imagistic
theory of thought (e.g. Abel, 2012; Bechtel, 2008).
The Semantical Thesis: Images possess a kind of content distinct from that of
non-imagistic representations;
There are two ways of describing the irreducible role of images. First, we can
understand the role of images weakly in terms of the facilitating role of images.
What is the Problem of Thinking with Images? 21
That means that the basic role of images is to prompt some thoughts and facilitate
intellectual processes. Images can be interpreted as a necessary condition of
some thought processes, where necessity is understood in epistemological terms:
images are necessary to grasp some thoughts and acquire information. In the
same way, computers are necessary to conduct some complicated calculations.
Second, we can understand the role of images in strong terms, where images
are interpreted as constitutive elements of cognitive systems. According to the
Strong Interpretation, the existence of some thoughts is necessarily bound with
the existence of images. Necessity is understood in metaphysical terms, which
means that it is impossible to have some thoughts without an image. As an
example, numbers are necessary for counting; without numbers there would be
no elements we could count with.
We can distinguish between two interpretations of the imagistic theory of
thought with respect to each of the above. According to the Weak Interpretation,
some cognitive processes, such as learning, reasoning or problem-solving, are
based on iconic representations. For example, it is held that the use of mental
images is crucial to solving mental rotation tasks or that there is a well-
established relationship between the ability to use diagrams and success in
mathematical problem-solving tasks (e.g. Casati, 2018; Grandin, 2006; Hegarty
and Kozhevnikov, 1999; Thagard, 2005; Tversky, 2011, 2015).
That being said, the Weak Interpretation is insufficient to explicate the imagistic
theory of thought. Let me demonstrate that using Larkin and Simon’s theory of
diagrammatic reasoning.2
According to Larkin and Simon (1987), diagrams can be distinguished
from sentential representations by the way they display information. Diagrams
provide a one-to-one mapping of information stored in a spatial form at the
particular locus of a diagram, including information about relations with the
adjacent loci. We can infer the features of the represented objects by inspection
of the spatial features of the representational vehicle.
In comparison to sentential representations, diagrams are cognitively less-
loaded. They represent more pieces of information at once. They help in solving
deductive and abductive tasks (e.g. Coopmans, 2014; Kirsh, 2010; Kitcher and
Varzi, 2000; Latour, 1990; Zhang, 1997), for the process of acquiring information
is perceptually enhanced by the way the information is organized (e.g. Bauer
22 Thinking in Images
reasoning, can efficiently make the content explicit. Images can draw attention
to unnoticed properties of the content. They can prompt different ways of
perceiving objects. Yet, they do not determine the content of thoughts. They can
make some thoughts more precise and point out some features of the content, but
they cannot change the content. In the same way, a microscope is a valuable tool
to make evident the content of a sample, but it does not determine the sample’s
content. A microscope does not create microbes, but it may be necessary to
reveal them. Larkin and Simon’s argument demonstrates that images can play a
crucial role in grasping the content of some thoughts but does not prove that we
think with images.
Let us turn to the Strong Interpretation. According to the Strong Interpretation,
imagistic thoughts can be distinguished by the nature of the operation. The
nature of imagistic thought is interpreted in terms of a family of rule-governed
processes. To ask about the nature of the operation is to ask what the operation
does. For example, if one is interested in the nature of a quadratic function, then
one is interested in how one operates with some variables. If one understands
what a quadratic function is, one understands what it does. In the same way, if
we are interested in the nature of imagistic thoughts, we are interested in what
they do. In a nutshell, some operations could not exist if there were no images,
in the same way as some mathematical operations would not exist if there were
no mathematical objects. The question is what these operations are.
Accordingly, we can define the Strong Interpretation of the imagistic theory
of thought.
The Strong Interpretation holds that images are irreplaceable in thinking. The
irreplaceability of images means that some thoughts do not exist, if there are
no images expressing them. In other words, some thinking processes can be
identified with genuinely imagistic operations. The main task here will be to
establish this irreplaceable role of images.
One of the main problems with the Irreducibility Thesis is that it has to be
reconciled with the Translatability Thesis. According to the Translatability
24 Thinking in Images
The reason for accepting the Translatability Thesis is that we want to maintain
correctness conditions for imagistic thinking, which means that we can be
wrong when thinking imagistically (e.g. Dilworth, 2008; Langland-Hassan, 2015,
2020). First, we want to maintain the idea that it is possible to make a mistake
in describing the content of an image. For example, it would be a mistake to
say that a photograph represents a blond girl if it represents a dark-haired man.
If propositions and images were untranslatable, then it would be impossible to
say that they fit each other, in the same way as it would be nonsensical to ask
whether a random set of letters describes a girl.
Second, we want to maintain the idea that imagistic representation may
correctly or incorrectly represent the world. If I depict a wanted thief as a blond
What is the Problem of Thinking with Images? 25
girl and the thief turns out to be a dark-haired man, I can say that I was wrong
because the proposition the thief is a dark-haired man is true. If imagistic
and propositional thoughts were untranslatable, it would be difficult to say in
what sense I could be wrong when I depict the thief as a blond girl.
Most theories of imagistic thoughts are in tension either with the Irreducibility
Thesis or the Translatability Thesis. There is a tendency to swing between the
two. On the one hand, imagistic thoughts are sometimes believed to be a kind
of propositional form of representation (e.g. Pylyshyn, 2003a), which makes it
unclear how to express the genuine value of imagistic representation. On the
other, imagistic thoughts are believed to be non-propositional (e.g. Gauker, 2011;
Mößner, 2018), but it is unclear how we can translate the content of imagistic
thoughts into propositional content. Hybrid views are often formulated (e.g.
Camp, 2007, 2015, 2018; Denis, 1991; Fodor, 1975; Kulvicki, 2020; Langland-
Hassan, 2015), which state that a proper part of the content of imagistic
representation is propositional. However, it is not easy to see what, according to
hybrid views, the contribution of the iconic content of thoughts is to cognition
and what functions it plays in the broader cognitive economy.
Let me illustrate the problem with hybrid views using Fodor’s theory of mental
images. Fodor (1975, 2008) holds that images have to be put under description
to fix their meaning which then allows them to be implemented in the mental
mechanism. A sentence in the language of thought (LOT) has to be attributed to
imagistic content. In a nutshell, if we introduce a system of mental symbols, such
as LOT, and assign these symbols to images, we can determine the meaning of
images and incorporate images into the machinery of thought.
However, Fodor’s hybrid strategy renders imagistic representations redundant.
If a discursive representation determines the content of images, then in principle,
imagistic representations have no genuine role in cognitive architecture and can
be reduced to propositions. That is, if we assign each image a sentence in LOT
and define thinking operations as operations on LOT’s symbols, then images do
nothing in the mind’s machinery. Images can play a role in grasping the content
of some thoughts, but they are not necessary for having any thoughts.
In this book, I argue that any attempt to defend a non-trivial imagistic
theory of thought has to take into account both the Irreducibility Thesis and the
Translatability Thesis. It means that imagistic thoughts have to be expressible
in propositions and have to carry such information that propositions cannot
express. Most of this book will be devoted to showing how the Irreducibility and
Translatability Theses can be comprehensively explicated in the context of the
imagistic theory of thought. I shall argue that the former can be defended based
26 Thinking in Images
Summary
What is thinking?
The Epistemological Challenge does not hold that images cannot be a source
of knowledge. They obviously can. They can prompt some knowledge; they can
justify modal (e.g. Williamson, 2016) and factual knowledge (e.g. Dorsch, 2016;
Kind, 2018). Images can be direct objects of research. Images can be efficient
tools for thinking (e.g. Larkin and Simon, 1987; Sloman, 1978, 2002). They can
present procedural steps of reasoning in a more comprehensive manner and can
be part of argumentation processes. Thus, images can be a source of knowledge,
mainly because the human mind cannot grasp all information discursively.
However, knowledge does not consist of images. In the same fashion, perception
is a source of knowledge; and yet it is not an element building the structure
of knowledge. According to the Epistemological Challenge, images have an
instrumental nature – they help us acquire knowledge but do not constitute
knowledge.
It is held that knowledge consists in providing information that something is
the case. It is truth-evaluable and consists of elements that have truth-conditions.
To have truth-conditions means to be a truth-bearer. Propositions are believed
to be primary truth-bearers, which means that their content can be evaluated
according to whether they are true or false. Knowledge consists of truth-
evaluable and justifiable elements, such as propositions and logically structured
relations between these elements. Thus, the knowledge that something is the
case is traditionally dubbed propositional knowledge.
What Is Thinking? 29
intends that this resemblance have this effect because the viewer recognizes this
intention.
However, this strategy is a dead end. We can appeal to the communicative
context in the case of pictures. This strategy fails in the case of mental images.
Internal representations are not part of any communicative practice.
We cannot determine the content of images based on the author’s intentions,
either. It would be circular to appeal to the author’s intentions to determine the
content of mental image. To know our own intentions, we need to know the
content of mental images, but to know the content of mental images, we would
need to know the content of our intentions. It would not be a problem if we
could easily distinguish between the content of mental images and pictures. Yet,
that would not provide a unified account of imagistic content.
Moreover, the intention-based accounts of depiction cannot explain how
pictures represent. X-ray pictures are correct or incorrect regardless of the
intentions of the author. Automatically taken pictures have content but not
authors. There are unfulfilled intentions too. If I intend to draw a horse, but
due to a lack of skill, the drawing is more like a cow, we would not say that the
drawing represents a horse only because I intended to depict one. Therefore,
the communicative context and author’s intentions cannot solve the problem of
content indeterminacy.
Second, we can claim that although pictorial content is indeterminate, it is not
indeterminate all the way down. At least part of the content is determined. We
can claim that there is some primitive content that is fixed and constrains possible
interpretations of pictures. Kulvicki (2006a, 2014, 2020), following Haugeland
(1998), calls it bare bones content. Bare bones content is defined in terms of
projective invariants, that is, these representational features that are common for
different interpretations of a picture, such as certain patterns of colours or spatial
relations.3 Bare bones content goes beyond the context of interpretation and
captures basic semantic information displayed in pictures. It can be compared
to the concept of character in the philosophy of language (Kaplan, 1989). In
contrast, fleshed-out content is content that we can describe as a picture of John
or a map of London and depends on our ability to recognize kinds of objects by
the features of bare bones content. It can be compared to Kaplan’s concept of
content. According to Kulvicki (2006a), although pictures admit of alternative
interpretations of the fleshed-out content, all these interpretations have a
common bare bones content that constrains possible interpretations of the
picture. For example, a square-like picture can represent a square or a face of the
cube seen from a certain angle. These different interpretations mark the fleshed-
34 Thinking in Images
For logical relations to hold, the elements of the relation have to possess
a logical form, that is, a syntactically fixed structure, such as a set of logical
constants and variables, with determined transformational rules that preserve the
logical values of its components. If A implies B, then based on transformational
rules, it is possible to transform the truth of the first into the truth of the second.
Propositional logic shows how the truth and falsehood of complex propositions
depend on the truth and falsehood of simple ones. Truth-functions operate on
propositions that can be negated, disjointed or conjoined; they can imply one
another or be equivalent. One of the main reasons for talking about propositions
at all is that they explain how things can stand in these logical relations.
Images lack logical form. They can illustrate logical relations, just like
Venn diagrams can illustrate relations between sets, and apples can illustrate
calculations. Images can help us to grasp logical transitions, just like apples can
help us to grasp calculations. Yet they do not have a logical structure, just like
apples are not logical entities. There are no truth-preserving transformation
rules for imagistic representation. There is no pictorial negation (Crane, 2009;
Sainsbury, 2005), conjunction or disjunction (Heck, 2007); images cannot
express implications, logical modalities, quantifications (Frege, 1984) and so on.
For instance, there is no image of all people being bald or such that it is necessary
that they are bald.
Language is essential here. Grammatical constructions indicate relations
between concepts and point to contradictions, consequences and correspondence
between pieces of information (Loewenstein and Gentner, 2005; Lupyan et al.,
2007). Images do not provide such opportunities.
Thus, images do not do any logical work. The best explanation of these facts
is that images are not truth-bearers the transformational rules can be applied to.
According to Frege, the logical structure of thoughts mirrors the logical
structure of language. Thus, the only route to an analysis of the logical structure
of thought goes through an analysis of language. Dummett (1993, 128) calls it
‘the fundamental axiom of analytical philosophy’.
The implication of Frege’s argument is that it draws a clear line between images
and beliefs. Beliefs are interpretable in logical terms; images are not. Beliefs are
‘inferentially promiscuous’ (Stich, 1978), which means that they can figure as
premises in inferential transitions. It helps us to explain why images behave
differently than beliefs. Images are not committed to truth; beliefs are. I cannot
believe that 2 > 3 if I know that it is the case that 3 > 2. In contrast, I know that it
is the case that there can be no flying horses, and yet that is not a reason to refrain
from depicting one. Deprived of logical form, images are independent of beliefs.
36 Thinking in Images
At first glance, it may seem that the Frege-Davidson argument can be refuted by
pointing out straightforward counterexamples. For instance, if I want to negate
that John has red hair, I can depict him with blond hair. If I depict a green and
a red apple, I express an alternative of a green and red apple. If one places two
pictures next to each other, much like in a comic book, then one can say that
their content is conjoined or that one implies the other (e.g. Malinas, 1991;
Westerhoff, 2005).
These examples are misleading. The role of logical form is to determine the
truth-conditions of its elements. In the case of images, truth-conditions cannot
be determined. Having a picture of John with blond hair may be a negation of
What Is Thinking? 37
him having red hair, or black hair and so on; for the content of not having red hair
is not simply being blond but an infinite alternative of the form ‘having blond
hair, or having black hair, or having green hair, etc.’. No image can represent
infinitely many properties.
By the same token, conjunction, disjunction and implication do not simply
represent a sequence of its elements; they set up a logical link between their
components. In the case of two pictures, there is no way to determine the nature
of this link – whether it is a temporal sequence, causal link, spatial transformation
or simply a set of two unrelated pictures. In all these cases, the pictorial form is
the same.
Moreover, images lack the generality and precision required by logical
operations. For instance, it is impossible to depict the difference between the
claim that ∃xP(x) and ∀xP(x) (Hintikka, 1987).
However, there is a more sophisticated line of argument available. It is based
on two points. First, opponents of the propositional view of iconic content, such
as Crane, usually agree that we can negate pictorial content. For instance, we
can point at a picture representing John as bald and claim ‘it is not the case that
John is bald’. According to Grzankowski (2015, 2018), negation is a good way of
testing for propositional content. If something can be negated, then it is truth-
evaluable. Second, when seeing a picture, we can describe the picture’s content in
terms of modal propositions. For instance, when pointing at a picture depicting
John being bald, we can assert that it indicates that in some possible world, it
is true that John is bald (e.g. Kulvicki, 2020; Malinas, 1991; Matthen, 2014).
Consequently, if the content can be negated and if we can say of the picture’s
content that it represents logical modalities, then it has a logical form.
This argument is invalid. Yet, it is invalid in an instructive way. Logical
operators apply to propositions. They do not apply to the way pictures look.
Thus, the picture’s content must be put into a propositional form to which we
attribute a logical operator. This is clearly possible. As Kaplan (1968) observed,
many of our beliefs are of a form such as ‘the colour of her eyes is [_]’, where the
blank ‘[_]’ is filled with a pictorial representation. Hence, images can figure in
what we believe and can play a role in forming propositional contexts.
Yet, the fact that something can be put into a propositional context does not
imply that it has propositional content. Pointing at a knife can be used to express
the proposition that ‘it is (not) possible that it is the weapon Hamlet killed
Polonius with’, where ‘it’ is a deictic word, the referent of which is determined by
indication. It does not mean that the knife is propositional. The proposition has
been expressed deictically by indicating the knife. Still, the propositional content
38 Thinking in Images
characterizes the deictic use of the word ‘it’ and an ostensive act of indicating the
knife – not the knife as such.
This context can be multiplied, as in the case of deferred reference (Nunberg,
1993; Quine, 1968). Deferred reference is the use of an expression to refer to an
entity that is not denoted directly by this expression. For instance, I can point at
Quine’s photograph and say that ‘that is most probably the greatest philosopher
of the twentieth century’. The word ‘that’ does not refer to the photograph but
to Quine. The visual properties of the photograph are used to recognize Quine
or, putting it differently, to transfer the reference from the photograph to Quine.
Similarly, I can point at John’s portrait and express the proposition that it is
possible that he is bald. Yet, the propositional content characterizes the deictic
use of the picture, not the picture itself. The picture is used here in a deferred
context. We represent John as bald to transfer the property of baldness to John.
Properties of representation are used to transfer reference to the represented
object. In our example, the first attribution – representing John as bald – concerns
the properties of representation and the second – attributing baldness to John
– the properties of the represented object. Properties of representation describe
the way the object is represented. Properties of represented objects describe the
object itself. Only when we attribute properties to the represented objects can we
speak of truth-conditions. That is why I can depict John as bald without being
committed to holding the belief that John is bald.
Thus, we have to distinguish between the properties of representation and
the properties of the represented objects. Properties of representation single out
the way something can be represented. Properties of represented objects single
out the way an object is or can be. The inability to distinguish these two kinds of
properties is only another instantiation of the phenomenological fallacy. Let us
dub it the ‘pictorial fallacy’.
Let me illustrate the pictorial fallacy with the help of a drawing of a flying
horse. If I draw a flying horse, I can attribute the ability of flight to the depiction
of the horse. Therefore, I can say that it is possible to represent horses as flying,
which is true. Yet it is something different than saying that it is possible that
horses can fly, which is necessarily false in the same way as saying that it is
possible that water is not H2O.4
The argument that images can express modal and negative propositions is an
instantiation of the pictorial fallacy. Note that the force of this argument depends
on how we understand the concept of ‘expression’. On the one hand, we can say
that numerals express numbers, which means that numerals are necessary means
to represent numbers. On the other, we can say that a book cover expresses the
What Is Thinking? 39
content of a book, which means that the book cover is an auxiliary means to
understand the content of the book. The properties of the book cover represent
the book’s content and are used to recognize the properties of the book’s content.
In other words, the book cover is an illustration of the book’s content. However,
the book does not need illustrations to have content.
Pictures can illustrate modal and negative propositions. They can represent
something as possible and negated, but they do not represent that something
can be or is not the case; propositions do. Thus, indicating the fact that pictures
can express modal and negative propositions does not imply that pictures have
propositional content and can stand in logical relations.
itself. Although we have the conceptual capacities to individuate the steel balls –
for instance, with the concepts this ball and not-this ball – we may lack the
skill to apply these concepts successfully. We may fail in concept-application, but
that is not particularly unexpected. I may think that the girl I saw yesterday was
Jane, but it was Mary – her twin sister. It may happen.
In contrast, images do not provide us with tools to individuate objects we
predicate of and it is debatable whether they can express predication. First, as
Wittgenstein’s argument shows, the content of images cannot be independently
determined. The problem is not that we cannot be wrong about the interpretation
of the content of an imagistic thought. The difficulty is that if the content of
the images is indeterminable, then there can be no correct interpretation of the
content. Thus, images cannot satisfy Russell’s Principle. Propositions can.
Second, as Frege-Davidson’s argument indicates, images lack logical form. As
a consequence, we face the so-called binding problem of iconic representations.
It is analogous to the unity of propositions problem in the philosophy of
language (e.g. Gaskin, 2008). In the philosophy of language, the question is how
to distinguish a proposition, such as this square is red, from a list of names
‘this square’, ‘is’ and ‘red’. The answer comes easier if we assume that propositions
have a logical structure that puts the names together. In the case of an image of a
red square, it is either an image of a square, where the colour is irrelevant, such
as in the case of black-and-white photographs, or an image of redness, where
the shape is irrelevant, such as in the case of colour samples or, lastly, an image
of a red square. The problem is that without a logical structure, it cannot be
determined whether the content is combined or not.
Consequently, images are not capable of expressing predication, for
predication requires a logical apparatus they lack. In particular, it requires a
separation between arguments and predications. Propositional representations
can do that. In the sentence ‘it is a red square’, we can point out the part that
represents the argument and the part that corresponds to the predicate. In an
image of a red square, the same part of the picture corresponds to the argument
and the predicate. Consequently, a proposition that a square is red can predicate
the redness of the square, for the square takes the place of a named argument
we predicate redness of. In contrast, an image of a red square is in line with the
interpretation that the square is red and that the colour red is square-like.
Let me clarify this point. Images surely bind contents together. An image of a
red square combines red and square and differs semantically from an image of
a red triangle or a green square. However, the Semantical Challenge shows that
we cannot determine the content of imagistic thoughts.
What Is Thinking? 43
What I am suggesting here is that the Traditional View makes them vulnerable
to the Semantical Challenge. That may be, however, an argument for changing
our traditional way of thinking about images and not against an imagistic theory
of thought.
It is believed (e.g., Davidson, 1997; Laurence and Margolis, 2012; Solomon et al.,
1999) that concepts are the building blocks of thoughts. Why is that so?
As Frege-Davidson’s argument shows, one of the distinctive features of
thoughts is that they are systematically structured. Entertaining a thought
of one kind entails a capacity to entertain a thought of another kind. For
instance, entertaining the thought that John is happy and that Mary is sad is
systematically connected with the cognitive ability to entertain the thought that
Mary is happy and that John is sad. Having the thought that John is happy
entails a capacity to think that someone is happy. In Evans’s words (1982, 104),
‘if a subject can be credited with the thought that a is F, then he must have the
conceptual resources for entertaining the thought that a is G, for every property
of being G of which he has a conception.’
The same rule applies to inferences (e.g. Fodor and Pylyshyn, 1988). If I think
it is dark and cold and raining, I can infer that it is cold and raining; for
from P & Q I can infer that P (or Q). By the same token, I have to be capable of
inferring from it is cold and raining that it is raining. If I cannot do so, I
do not know what inference is.
Thus, thoughts must be systematically co-related (e.g. Heck, 2000; Peacocke,
1992), which means that they are systematic in nature. Evans (1982) calls this
requirement the Generality Constraint.
To meet this requirement, thoughts have to consist of recombinable
constituents that can build more complex structures. It means that thoughts
are compositional in nature. The compositionality of thoughts means, first,
that the meaning of a complex thought is determined by the meaning of
its constituents. The constituents of thoughts are parts of thoughts that are
canonically distinguishable, for not every partition of thought makes sense. The
idea is that canonically decomposed parts are syntactically and semantically
meaningful units. For instance, the thought that John loves Mary can be
decomposed into John loves and Mary, but not into John . . . Mary (e.g.
Fodor, 2008).
44 Thinking in Images
Second, the meaning of complex thoughts must come from the meaning
of its canonically distinguishable parts together with the rules of composition,
for not all combinations are allowed. The recombination of the parts must be
meaningful. John loves Mary can be recombined into Mary loves John
but not into John Mary loves.6 These rules are recursive. If I have a thought
that John loves his mother, I have to be capable of having a thought that
John loves his mother’s mother and so on. Putting it together, it means
that thoughts have a recursive syntax that combines canonically distinguishable
parts according to the combinatorial rules (e.g. Pagin and Westerståhl, 2010).
Systematicity and compositionality allow us to explain how thoughts can
be productive. The productivity of thoughts means that we can entertain a
potentially infinite number of different thoughts. Moreover, we can understand
an indeterminate number of thoughts we have never entertained before. It all
makes sense if we assume that thoughts are made of systematically structured
and combinable elements. These elements are concepts, and the complex
conceptual structures are propositions (Peacocke, 1992).
Language seems to be systematic and compositional. It has syntax and
distinguishable syntactic and semantic parts. Thus, one can hold that thoughts
consist of language-like representations (e.g. Devitt, 2006). In contrast, images
lack systematicity and compositionality; therefore, they do not have metaphysical
properties we are willing to ascribe to thoughts.
Images are neither systematic nor compositional. As Fodor shows, they lack
syntactic structure and canonical decomposition.7 Images lack grammar capable
of generating infinite sequences of sentences (Eco, 1995; Wollheim, 1993).
Therefore, iconic representations do not meet the Generality Constraint.
Fodor’s argument (2007, 2008) takes the form of the so-called Picture
Principle. According to the Picture Principle, iconic representations can be
distinguished topologically: although images have interpretable parts, they lack
canonical decomposition. It means, loosely, that we can cut up an image however
we like, and each image-part will represent a relevant part of the represented
object. Thus, every part of the representation represents some part of the scene
represented by the whole representation (e.g. Green and Quilty-Dunn, 2021;
Quilty-Dunn, 2016, 2020; Sober, 1976). In contrast, discursive representations
have canonical decompositions, which means that they cannot be cut into
pieces however we like. Discursive representations have constituent parts. For
What Is Thinking? 45
concept red square or the proposition some squares are red? Language-like
representations have syntactical structure and can distinguish between concepts
and propositional structures made of concepts. Images do not.
To sum up, images lack syntactic structure, and, therefore, they do not meet
the requirements of the Generality Constraint. They are neither systematic nor
compositional. Any theory of imagistic thinking has to address the problem of
systematicity and compositionality of thoughts.
each other because, in their causal history, certain facts have causally linked the
mental states into pairs, ensuring that if one pair member is activated, the next
one is also activated. For instance, the frequency with which a certain organism
came into contact with events X and Y determines the frequency with which that
organism will have related thoughts about X and Y.
Hume’s theory of ideas illustrates this well. It is primarily a theory of how
perception (impressions) determines strings of thoughts (ideas). According to
Hume (1975), ideas are copies of impressions in the following fashion: if the
impressions IM1 and IM2 are related in perception, then the corresponding ideas
ID1 and ID2 are also related. We do not need to refer to any intermediary entities
like implicit rules. The causal order of perception thus determines the order of
thought.
Associationism has many advantages. It explains the psychological
mechanism of thought acquisition well. It is metaphysically unencumbered and
does not postulate any hidden logical mechanisms. It explains many mental
phenomena too. It is used in learning theory (e.g. behaviourism), theories of
reasoning (e.g. dual-process theories) and theories of thought implementation
(e.g. connectionism). It is in line with nowadays theories of mind, such as
Enactivism and Dynamic Systems theories. All these theories are independent
but share an empiricist core.
Associative structures are usually contrasted with propositional structures, in
which individual elements are logically correlated. Propositional structures are
not just a causal sequence of associations. Associative structures express only
causal relations between representations, for example, the associative structure
green-tree says that we associate green with trees; it does not express the
proposition that (some) trees are green. The thought some trees are green
does not tell us that there is a causal relationship between the green-tree
representation. Instead, it predicates of some trees that are green.
Accordingly, associative inferences are transitions between thoughts that do
not follow from logical relations between elements of thoughts. In this sense,
associative inferences are contrasted with logical inferences, such as those
made use of in the computational-representational theory of thought, in which
inferences are truth-value-preserving transitions between thoughts that are
determined by their syntactic properties. Associative inferences are based not
on syntactic properties of thoughts but on associative relations between the
contents of thoughts. For example, we can associate the thought London and it
is raining because we once got wet in London. We cannot infer that it is raining
in London based on the formal properties of London and it is raining.
48 Thinking in Images
How does one find a place for imagistic thinking within this division? There
are two dominant strategies. First, we can try to interpret images in propositional
terms, trying to adjust images to the requirements set by propositional theory
(e.g. Blumson, 2012; Camp, 2007). Second, we can bite the bullet and claim
that imagistic thinking is more appropriate to associationist theories of thought
(e.g. Quilty-Dunn and Mandelbaum, 2020).9 However, both strategies are
unsatisfactory. On the one hand, propositionalism cannot provide a theory of
imagistic thinking, for images are non-propositional in nature. On the other,
adopting an associationist theory of imagistic thinking is running away from
the problem.
Granted, associationism is not a false theory. After all, thinking can take
different forms. When I think that trees are green, I can infer that some
trees are green, but I can also think of summer. However, one wants to choose
propositionalism over associationism because the first can provide a theory of
rational thinking while the second cannot.
We want to ensure that rationality may be expressible in imagistic thinking since
images are crucial for science and can work as arguments in reasoning. Moreover,
according to the Translatability Thesis, we should be able to transform images into
propositional forms and model relations between images according to the rules
of logic. We certainly do that. Operations on sets can be performed on both Venn
diagrams and propositions. Any theory of imagistic thinking should be able to
include inferential transitions between images. Therefore, it has to respond to the
challenges set by propositionalism and not be left at the mercy of associationism.
What is the lesson? It seems that neither propositionalism nor associationism
offers a theoretically respectable way of incorporating images into a coherent
theory of thought. Thus thinking with images cannot be explicated within the
propositionalism–associationism distinction. However, this problem is deeper.
It stems from the impossibility of freeing oneself from the traps of metaphors
that determine our thinking about thinking.
Metaphors of thinking
There are two dominant metaphors for what thinking is that are at the heart
of different theories of thinking (Schooler et al., 1995). According to the moving-
through-space metaphor, thinking involves moving through a logical or physical
space. Thinking is something we do and for which we can determine certain
methods and rules in the same way as we can determine pathways in space. We
use the spatial metaphor when we use such phrases as ‘searching one’s mind’,
‘approaching the problem from a different angle’ or ‘changing the direction of
thought’. We are also willing to answer the question ‘What are you doing?’ with
‘I am thinking’. According to the spatial metaphor, thinking is a kind of activity
at which we can be better or worse. We can be ‘too tired to think’ or we can be
‘deep’ or ‘slow thinkers’.
According to the perceptual metaphor, thinking is an act that happens to us,
often unconsciously and automatically. Thinking is a kind of seeing, not doing.
We use the perceptual metaphor when we use such phrases as ‘seeing a solution’,
‘gaining insight into the problem’ or ‘casting light on something’.
These two metaphors do not overlap with the distinction between propositional
and associationist theories of thinking. Frege, computational theories and
behaviourism adopt the spatial metaphor. The perceptual metaphor has been
developed within theories of divine illumination, the Cartesian theory of clear and
distinct ideas, Locke’s theory of ideas and Gestalt theory (Figure 2.1).10
The moving-through-space metaphor lies at computationalism’s heart (Fodor,
1975; Newell and Simon, 1972). Solving a problem can be seen as moving through
a problem space from an initial state to the goal state. Movement through the
problem space requires the use of operators. These are actions that fulfil certain
subgoals. Moreover, holding propositional attitudes can be interpreted as
occupying a place in logical and physical space and as rule-governed transitions
from one state to another. For instance, the belief that grass is green can be
considered a state in a logical space isomorphic with the corresponding physical
exemplar of the sentence ‘grass is green’ at a particular location in the human brain.
This sentence may be the basis for further computational processes comparable
to movements through the state-space, such as the belief that my lawn is green,
which, in turn, can be the basis for action, for example, to water the lawn.
The same moving-through-space metaphor forms an underpinning of the
behaviourist theory of thinking. Behaviourists, such as Skinner (1957), held
that mental states could be interpreted as abbreviations of certain actions. For
instance, the feeling of sadness is an abbreviation of certain external behaviours,
such as crying and facial expressions, and internal behaviours, such as neural
activations, muscle tension and so on. In this vein, they assume that thinking
is a ‘silent speaking’, manifesting in subtle behaviours such as laryngeal muscle
movements.
However, the spatial metaphor does not seem to fit theories of imagistic
thinking. Entertaining an imagistic thought of a sunny day in summer does not
seem to be a movement in a logical space. Instead, it is a form of perception.
In particular, the theory of thinking as silent speaking does not seem to apply
to musical or visual thinking acts. When a painter thinks of a landscape, his
thinking process takes the form of perceiving an image rather than speaking to
himself.
The perceptual metaphor is storied. It has its roots in the Aristotelian theory
of intellectual intuition. It has been used, among others, in medieval theories
of divine illumination, the Cartesian clear and distinct ideas, Locke’s theory of
ideas and Gestalt Psychology.
However, the perceptual metaphor is at loggerhead with what we know
about perception and thinking. On the one hand, philosophical theories cannot
explicate the difference between perception and thinking (more on that in
Chapter 3). On the other, even though it is promising, Gestalt Psychology is far
from providing a comprehensive theory of thought.
Gestaltists analyse thinking in problem-solving contexts and emphasize
insight’s role in understanding a problem’s structure. Within Gestalt Psychology,
understanding is not an incremental and continuous process but is sudden and
spontaneous, similar to an act of perception. However, Gestalt Psychology is
fairly accused of being uninformative (e.g. Jäkel et al., 2016). It does not explain
52 Thinking in Images
phenomena but describes them. It does not indicate the mechanisms governing
the thinking process but describes the qualitative structure of the thinking
process. It does not explain, for example, what a sudden process of insight
consists in. Thus, it cannot be a candidate for a theory of imagistic thinking.
Where do we stand? On the one hand, the spatial metaphor is inconsistent
with how we use images in thinking. On the other, the perceptual metaphor
does not explain what a theory of imagistic thinking could be. However, it is
forgotten that these two metaphors are not mutually exclusive. Although they
are frequently discussed independently, they are complementary.11 When
moving through a physical space, we have to see where to go next. In problem-
solving, movement from an initial state to the goal state requires recognition of
the correct operators. Thus, these two metaphors can be readily combined. With
respect to thinking, such a combination can be beneficial because it suggests that
multiple processes may contribute to what we call thinking. It also suggests that
numerous factors may contribute to an impasse in thinking. On the one hand,
one can know how to move through a logical space but fail to see the goal. On
the other, one can see where to go but not know how.
A more adequate metaphor for thinking would be that of measurement.
Thinking is like applying measures, and thoughts are the products of those
applications. The reasoning is to see what results from applying different measures.
The measurement metaphor contains the perceptual and spatial metaphor.
Measurement is the result of a certain action. For instance, it may be the action
of measuring the time between thunder and a flash of lightning. It is moving
through logical space by determining spatial and temporal relations between
measured objects. Yet, to conduct a measurement, one has to recognize what is
to be measured – in this case, distance from the storm. With this metaphor in
hand, we can consider alternative ways of building theories of imagistic thinking.
Summary
The Received View is a loose collection of beliefs and intuitions about the nature
of thinking, according to which thinking is propositional and action-based.
Thinking is a matter of operations carried out on propositional structures made
of concepts.
The Received View gives a clear idea of what knowledge and the relation
between knowledge and thinking are. It provides a theory of the content of
thoughts. It is in line with metaphysical assumptions concerning the nature of
What Is Thinking? 53
Bertrand Russell noted (1919, 11) that ‘If you try to persuade an ordinary
uneducated person that she cannot call up a visual picture of a friend sitting
in a chair, but can only use words describing what such an occurrence would
be like, she will conclude that you are mad’. However, the claim that we think
with images has always been subject to suspicion, for our prescientific intuitions,
which supposedly support imagistic theories of thought, have always been vague
and fallible. That is a simple consequence of our lacking a clear understanding
of what these intuitions are thought to support. The imagistic account of human
thought is a source of misunderstandings and disputes mostly because it is not
clear what we really mean when we say that we think with images. As Pylyshyn
(2003b, 113) observed, despite claims that thoughts have a picture-like format
having persisted for such a long time, the problem of stating clearly what it
means for thought to be imagistic has rarely been explicitly tackled. Yet the real
difficulty lies deeper: we do not even know what kind of answers we expect when
we ask, ‘What is thinking with images?’ To know the content of a question is to
know what counts as valid and invalid answers to the question. To know the
content of the question ‘What is thinking with images?’ is to know what kind of
answers we should expect. In other words, we should know what counts as an
explanation of what thinking with images is.
We have to be careful not to trivialize the answer. We face the problem of
triviality when we try to describe a phenomenon without explaining it. The
problem of triviality stems from the intuitive character of imagistic theories
of thought. At first glance, it seems like there is nothing wrong with imagistic
thoughts and that they do not need any clarification: the claim that we can
think with images sounds intuitive, and we have to accept it as a brute fact.
This prescientific intuitiveness is the cause of our continued attachment to the
imagistic theory of thought, but it is also a significant source of conceptual
confusion.
56 Thinking in Images
The leading cause of the revival of imagistic theories of thought in the twenty-
first century is a belief that a full-blooded theory of thought should explain
What Answers Should We Expect? 57
Let us suppose then the mind to be, as we say, white paper, void of all characters,
without any ideas: How comes it to be furnished? [. . .] Whence has it all the
materials of reason and knowledge? [. . .] To this I answer, in one word, from
experience. [. . .] First, our Senses, conversant about particular sensible objects,
do convey into the mind several distinct perceptions of things, according to
those various ways wherein those objects do affect them: and thus we come by
those ideas we have of yellow, white, heat, cold, soft, hard, bitter, sweet.
Thoughts–perception border
for perception and thoughts since that very commonality is logically excluded.
Images cannot put perception and thought together like some other kind of
representation cannot put colours and numbers together.
The difficult task at hand is to understand how to reconcile the claim that
thoughts are perceptually grounded with the thesis that there is a border
between thoughts and perception. Imagistic thoughts are believed to cover both
cases. Therefore, they are supposed to possess characteristics that would enable
us to interpret them as perceptual but not as instantiations of the genera of
perception. Thus, any neo-Lockean explanation of what thinking with images is
would have to be general enough to spell out what connects imagistic thoughts
with perception and specific enough to point out what differentiates them. To
make it clear, it is not logically excluded that it can be done.
Moreover, it is the case that every theory of thinking has to face this dilemma.
The point is that it cannot be done simply by invoking a third term – an image –
that mediates between perception and thoughts. Let me illustrate this problem
with two contrasting neo-Lockean approaches: Jesse Prinz’s (2002) theory
of perceptual concepts and Christopher Gauker’s (2011) theory of imagistic
thinking.
The Lockean way of thinking about the perceptual grounding of thoughts
was based on the idea that thoughts are abstract objects with an image-like
nature. It may mean that the meaning of mental representations is grounded
in relation to a corresponding mental image. However, such an interpretation
is nonsensical, and it seems to have the same scientific status as the geocentric
theory in astrophysics.
A more sophisticated way to spell out the neo-Lockean theory is based on
the idea that mental representations do not require an imagistic counterpart to
ground their meanings but have an image-like nature. Let me illustrate this
concept with Prinz’s proxytype theory.
According to Prinz (2002, 149), proxytypes are ‘mental representations of
categories that are or can be activated in working memory’. The central idea
is that thinking is a simulation of perception – it is an idea borrowed from
Larry Barsalou’s seminal paper on perceptual symbol systems (1999) – and to
think about something is to put oneself into a state that resembles the state of
perceiving it. For example, the concept dog is a set of perceptual representations
acquired when one encounters, imagines or hears stories about dogs. All the
experiences of dogs one has met, all of the images of dogs one has seen, all
the stories one has heard, plus mental links that bind together those states build
the concept dog. On a given occasion, specific subsets of the set can be recruited
What Answers Should We Expect? 61
to recognize or classify items in the world and form parts of thoughts as parts of
larger imagistic or perceptual scenarios – these are proxytypes. They are context-
sensitive. For example, if I think about a watchdog, I will invoke a representation
of a German Shepard Dog and not a Poodle, but if I think about a cute small
dog, I will invoke an image of a Poodle and not a German Shepard Dog. The
‘proxytype’ we construct on a given occasion depends on our cognitive needs in
the moment.
The main advantage of the proxytype theory is that it helps us make sense of
concept empiricism – the idea that all concepts are grounded in perception. It
also explains some phenomena in cognitive psychology. For example, it explains
the retrospective nature of representations that underlie many of the violation-
of-expectance looking-time experiments or lags in understanding that manifest
themselves in infant looking-time studies and tasks requiring explicit imagistic
representations (e.g. Carey, 2009).
However, it is doubtful whether the theory of proxytypes solves the binding
thoughts and perception problem, for the theory does not meet the Semantical
Challenge. Notice that when one invokes two perceptual representations – a
representation of a German Shepard Dog and a Poodle – we have two different
representations. They do not represent what they have in common. The concept
dog does that. If concepts were perceptual representations of some general
kind, such as a perceptual representation of a canine, we would need another
perceptual representation that would determine the similarity between the
representation of a canine, the German Shepard Dog, and the Poodle, and it
would not stop there. We would need another representation, ad infinitum.
Moreover, a concept like the concept dog has a place in propositional
structure. By substituting a representation of an object into an argument,
we can form a proposition, such as all dogs are mammals. If the concept
dog were a perceptual representation of the same kind as other perceptual
representations, such as a representation of a mammal, there would be no way to
determine whether the representation dog plays the role of a named argument
or a predicate in a proposition. In other words, if mammals and dog were both
perceptual representations, it would be impossible to distinguish between two
different propositions like All dogs are mammals and All mammals are
dogs. Prinz recognizes the problem and tries to develop the idea of mental links
that bind together perceptual representations and determine the functional roles
of perceptual representations. But what he achieves is introducing concepts
under a different name, implicitly restating that perceptual representations alone
cannot be concepts.
62 Thinking in Images
The main philosophical problem with the question ‘What is thinking with
images?’ is connected to the overwhelming temptation of explaining away the
problem, which is based on the assumption that the very question is ill-posed.
As one may argue, it is not the case that we think with images, just as it is not the
case that we think with or in words.
This claim is most frequently expressed by the view that we think neither in
images nor in words but in thoughts. Although such an answer to the question
may seem tempting, it is nonsensical. To say that we think in thoughts is no
more informative than saying that we do not speak in French or English but
use language. The misleading character of the answer follows from the fact that
to study language, we study particular instantiations of language – for instance,
French or English. The same is true in the case of words and images. We can only
study the nature of thoughts by studying particular instantiations of thought
processes expressed in language and images.
A more sophisticated way to render the question absurd is to argue that the
very expression ‘thinking with’ is senseless. Expressions such as ‘thinking with
words’ or ‘thinking with images’ lack meaning. Wittgenstein (1953, 1958) and
Ryle (1979) undertook this sceptical line of argument (Bennett and Hacker,
2003; Slezak, 2002b).
Let me illustrate this strategy with Ryle’s (inspired by Wittgenstein) argument
against the belief that thoughts consist of word-like objects that are expressed
in language. According to Ryle (1979), it would be senseless to say that one
composes their speech expressing some English-like thoughts. Considering
English words and phrases, modifying and rejecting others, stringing together
words into sentences and paragraphs, one is not thinking in some English-like
medium. Their speech will be, of course, in English, but we cannot say that the
mental process of composing the speech was in these words. They did not have
these words while composing the speech. They were searching for these words
in the process of speech composition. Thus, the expression ‘thinking with words’
is senseless.
The general objective of a Wittgenstein–Ryle style argument is to undermine
a dominant view on the nature of thinking, which could be put in terms of a
What Answers Should We Expect? 65
to say that we can think in words and in images, as well as of words and
images. In contrast, we can think of cabbages, but we cannot think in cabbages.
Analogously, we can speak in words without speaking of words, and we can
speak of cabbages but not in cabbages. What does it mean to think in words or
in images?
First of all, it cannot be reduced to thinking of words or images, just as worship
with images is not reducible to the worship of images. Thinking of cabbage is not
reducible to thinking of a word or an image representing a cabbage since the
thought of a word or an image would need another word or image, and so on,
ad infinitum – that was Ryle’s primary point. Words and images are somehow in
mind, while the cabbages, words and images we think of are not.
Second of all, thinking of words and images and thinking of cabbages do
not require producing them. The phrases ‘thinking in words’ and ‘thinking in
images’ do not mean that thinking is a three-term operation between thinking,
the object of thought and some entity, a word or an image, that moderates
between the act and the object of thought.
In contrast to the Lockean approach, images do not have to be postulated
mental objects. I can think of a hexagon and draw a hexagon without forming an
image of a hexagon in my mind. Such a proposal does not require presupposing
objects such as words and images we think of to substantiate thinking in words
and images. Instead, thinking like speaking and manipulating images may be
interpreted, in Kantian and Fregean fashion, as an operation conducted in the
mind, where mental and extramental images may be interpreted as the final
results of those operations and necessary means to represent these operations.
The term ‘operation’ is polysemic. In general, it means a rule-governed process
of taking input values into the output values. Notably, operations are not limited
to computations. Instantiations of operations are subtraction, construction,
functions mapping propositional contents onto a set of truth values, a partial
ordering function and so on. Two points have to be stressed. First, the
distinction between operations and the results of operations is not the same as
the distinction between actions and the products of actions (Twardowski, 1999).
To be an operation is to be a rule-governed process, for example, a function,
regardless of whether the operation is carried out in a relevant action.
Second, operations and the results of operations are different sides of the same
coin. Every operation can be reformulated into a relevant result of an operation,
and vice versa. For example, mathematical axioms can be reformulated into
inference rules and the other way round. From a logical point of view, their
status is equivalent (e.g. Stegmüller, 1969).
What Answers Should We Expect? 67
The point is that just as there is no counting without numbers and without
numerals that represent them, images can be interpreted as necessary vehicles of
thoughts. In the same sense, some operations would be impossible to carry out
if no images instantiated and represented the operations. The nature of counting
and thinking with images is determined by the nature of the operations, not by
the nature of the vehicle, such as the features of numerals and pictures. Numbers
can be represented by digits, words, such as ‘one’ and ‘two’, and apples. What
determines that an expression is a numeral is not that it consists of words or
digits but the fact of what the expression represents. Every representation of a
number is a numeral, whether formed in words or apples. To understand the
nature of numbers, one must understand what kind of operation it is a part of.
For example, to understand the number 2, one must understand that 2 + 2 = 4 or
that a set contains two and only two elements.
By the same token, to determine what it is to count and what it is to think
with images is to determine what we do, not what numerals or pictures hanging
on a wall are. At the same time, numbers and images are not tools to think with,
where the nature of the operation of using a tool is separable from the features
of the tool. Numbers are not like calculators, for it is not the case that when
learning how to count, one first learns how to perform calculations and then
what numbers are. Numbers are necessary if operations like counting are to
be carried out. Similarly, images are necessary for thinking in the sense that
some operations would not be possible without them. For instance, it would be
nonsense to say that we first learn how to form an image, such as a drawing, then
try to find tools to draw. Learning how to draw is inseparable from drawing.
According to the operational approach, imagistic thoughts are like numbers.
They are defined by the operations they are a necessary part of. Just like numbers,
imagistic thoughts require a representable medium, that is, images, to express
these operations. Consequently, the question ‘What is thinking with images?’ is
interpreted as equivalent to the question ‘What kind of operation is expressed
by images?’ The question may be confusing, just as it may be confusing to ask
‘What kind of operation is expressed by numerals?’, but that does not mean that
it is senseless. And even if the operations we are looking for are not the same
operations as the ones expressed in language, no one has to assume that they
have to be the same. Not all thoughts can be expressed in language; some can
only be expressed in images. The point is to find out what these thoughts are.
In contrast to the neo-Lockean approach, the operational view is non-
committal on the point of any mental objects and episodes displayed before the
mind’s eye, such as words or images we think of. The intuitive idea is that when
What Answers Should We Expect? 69
we think of, for instance, what the result of an operation like 2 + 2 is, we do
not have to write down a formula on a piece of paper or construct an explicit
mental representation consisting of the symbols ‘2’ and ‘+’. We think in numbers
without having to think of them. However, any operation has to be conducted in
a representable medium, for example, in words and images.
Thinking about images in terms of operations may seem counter-intuitive.
A comparison to the propositional theory of meaning in the philosophy of
language offers a better picture of the point I am presenting.
According to the propositional view in the philosophy of language, the
meaning of the sentence S is the proposition P expressed by S. Much ink has
been spilled to argue that the propositional view is incomprehensible. For
example, we know neither how propositions exist nor whether they exist in
time and space. As a result, it could be argued in a Quinean (e.g. Quine, 1960)
fashion that we have to reject the propositional view as postulating ontologically
unnecessary entities.
However, the propositional view is non-committal on the point of the
existence of abstract entities such as propositions. For instance, intensional
semantics interprets propositions in terms of functions from possible worlds
to truth values. It means that propositions are operations that determine the
conditions according to which a sentence S is true or false in a possible world.
The indispensable role of S is to make this function representable. Therefore,
we can accept propositions as bearers of meaning without positing unnecessary
abstract objects.
The main point is that we can accept images and words similarly as bearers
of thoughts without being committed to the existence of image- or word-like
objects in mind. Images and words express some meaningful operations and
are indispensable for those operations to exist. The concept of a bearer of
thought is no more mysterious than the concept of a proposition, and therefore
Wittgenstein–Ryle’s argument misses the mark. The question of the nature and
function of imagistic representations in thought processes is the question of the
nature and function of bearers of thought in the same way as the question of
the nature of the meaning of words can be interpreted in terms of the question
of the nature of propositions. And although that does not mean that the question
of the nature of propositions is easy, it is not senseless.
One caveat is in order, the operational view is not reducible to the dispositional
view, and thinking with images is not reducible to a disposition to produce a
mental image (Price, 1953), to a certain skill (Bennett and Hacker, 2003; Ryle,
1949) or the state of readiness for producing images and for judging presented
70 Thinking in Images
images as agreeing with one in mind (Goodman, 1984). To put it more precisely,
interpreting the nature of an imagistic bearer of thought in terms of a disposition
does not suffice to explain the operations of using images in thinking.
Granted, possessing a disposition to produce and interpret an image is
necessary for thinking with images. In the same way, to understand the meaning
of a knight in chess is to have a disposition or a skill – interpreted loosely as a
regular disposition – to move it in a certain way.
However, it is not a sufficient condition. One can be disposed to think and
speak like Kant without any understanding of Kantian thought. One can have
the disposition to recognize similar pictures of a triangle without understanding
that the pictures represent a triangle. To master an operation, one has to know
what the correctness conditions of the operation are. In other words, one has to
know what counts as a correct instantiation of an operation.
In the same way, mastering the counting operation is to master a certain skill
concerning counting, but explaining the nature of counting is not the same as
explaining the nature of the corresponding disposition. Explaining the nature of
counting is to explain the rules governing a family of functions. An explanation,
such as a psychological explanation of the nature of the disposition, may be
part of an explanation of a function, yet it is not reducible to the latter.
To sum up, to make sense of the question ‘What is thinking with images?’,
it shall be interpreted here as a question concerning the nature of operations
expressed by images. It shall not be interpreted as a question concerning the
nature of some mental objects, such as imagistic thoughts, picture-like ideas
or mental images, or a question about the nature of dispositions. In short, the
images we think with are primarily something we do and not something we
possess.
Consequently, we can infer the nature of the given operation from studying
external instantiations of the operation. In the same way, we investigate the
nature of abstract objects, such as functions, by examining their instantiations,
for example, equations, and the nature of mental phenomena, such as linguistic
meaning, by studying linguistic representations.
Explaining operations
As I have argued, any attempt to answer the question ‘What is thinking with
images?’ has to begin with careful isolation of what is to be explained and an
understanding of what kind of answers may be considered satisfactory. For in
What Answers Should We Expect? 71
the case of such operations as thinking, the very concept of explanation may lead
to confusion. The issue is particularly prominent in explanations of the nature of
such objects as gold and water.
When we try to explain the nature of gold, what is to be explained is the
nature of an object – a sort of yellow metal – and the explanans is the structure
of gold and the relations of gold to other elements. Let us say that gold is the
chemical element with the atomic number 79. We communicate the number of
protons in the element and the element’s position in the periodic table. If we say
that water is H2O, we are trying to explain the nature of some wet substance. Our
explanation is a description of the chemical structure of water and the relation of
water to other elements, for example, to oxygen.
In the case of thinking, the situation is different, for there is no thinking-like
object that is to be explained. As Ryle noted (2009, 271):
images in thinking. In other words, the first objective is to establish what images
do in the operation of thinking.
Second, after isolating an explanandum, the required explanation needs
to describe the conditions that must be met if images can perform certain
functional roles.7 Therefore, the first step is to investigate the functions of
images in cognitive processes to determine which functions are in accord with
the imagistic theory of thought. The point is to distinguish such operations
that cannot be carried out without images and distinguish them from non-
imagistic ones, such as operations requiring numerals and words to be
expressed. The second step is to establish conditions that have to be met if
images are to cover the requirements of the latter. It has to be established
how the concept of an image needs to be interpreted if images can play the
designated role. In short, to answer the question ‘What is thinking with
images?’, we must first ask what thinking with images does and then we must
try to find something that does that.8
The Traditional View holds that the answer to both questions is that images
represent by resembling objects. I will show that such an answer is not completely
wrong but not completely right either. In the following chapters, I show that
images are essential for the operations of constructing and recognizing objects.
Following that, I show that an image can represent the ways of constructing and
recognizing objects by exemplifying these ways.
Summary
To answer the question ‘What is thinking with images’, one has to determine
what kind of answers one expects. There are two strategies available. Neo-
Lockean theories hold that imagistic thoughts are theoretical objects introduced
to explain the relation between thoughts and perception. Imagistic thoughts are
abstract entities mediating between perceptual and discursive representations.
However, the problem of the relation between thoughts and perception
cannot be solved by invoking the concept of imagistic representation. There is
a thoughts–perception border, and no intermediary object can bring together
representations that possess mutually exclusive characteristics.
In contrast to Locke, Wittgenstein–Ryle’s sceptical argument holds that
thoughts cannot be kinds of objects. In particular, there is no vehicle we think
with; thus, the question is senseless. If there are no objects we think with, the
question of what characterizes thinking with images is deprived of content.
74 Thinking in Images
Drawing on the case of knot diagrams, I demonstrate that one of the roles of
images is to represent the ways objects and events can be constructed. Let me
start by describing the knot theory.
Knot theory is a branch of topology dealing with knots and knot diagrams. A
knot is a smooth closed curve in a three-dimensional Euclidean space. For every
76 Thinking in Images
Figure 4.1 The unknot (on the left) and an equivalent knot (on the right). © Piotr
Kozak.
What Do Images Do? 77
Figure 4.3 The left-trefoil knot and the right-trefoil knot. Source: Wikimedia
Commons.
Let us reconsider the knot diagram represented in Figure 4.1. What distinguishes
this diagram from a scribble is that it has content, that is, it can be correct and
What Do Images Do? 79
Figure 4.4 A square pattern matched with a tangram set. © Piotr Kozak.
moves to localize the represented object. In the same way, interpreting a sentence
and producing the sentence are governed by the same rules of grammar. Even
though these rules are applied differently, they can be easily brought together.
In the case of applying the rules of interpretation, it is essential to identify
in a representation the information to localize the searched object. In the case
of the rules of production, it is essential to identify the matching elements to
create this object. Let us consider a ruler. It shows you how to find the searched
distance and how to create a line segment. By the same token, a triangle diagram
can show you how to form a triangle (according to the rules of production) and
identify it (according to the rules of interpretation). In both cases, the diagram
represents the rules of construction of a triangle.
What is construction? It is usually interpreted as a procedure (or an outcome
of such procedures) of employing instruments, such as a ruler and a compass,
to draw a geometrical figure. This is called classical construction. However, in
the most general sense, employed by Euclid and Proclus, the concept refers to
a procedure of adding something that is ‘lacking in the given for finding what
is sought’ (Proclus, 1992, 159).2 Let me elucidate it with the help of a Euclidean
example.
Take three line segments a, b and c, which satisfy the condition that any two
line segments together are longer than the third. This is ‘the given’ in Proclus’s
description. Next, we construct a line segment DE, and we place on it the line
segment DF that corresponds to the length of a. Then we construct the line
segment FG that corresponds to b. We do the same with the line segment GH
that corresponds to c. Then we construct two circles. The first circle has a centre
F and radius FD. The second circle has centre G and radius GH. The two circles
intersect at K. It is now enough to join K, F and G to arrive at the construction
of ∆FGK (Figure 4.5).
Knowing the length of a, b and c and the definition of a triangle is insufficient
to solve the task of constructing a triangle. We need to know the rules of
construction for triangles. Even if we seek a polygon with three edges and three
vertices, we need to know the correct moves leading from the line segments a,
b and c to a triangle. Knowing the construction rules allows you to arrive at a
triangle if you are given three line segments and know what a triangle is. The
diagram represents these construction rules.
The concept of construction is reducible to neither the concept of the
construction elements nor the definition of the constructed object. It is possible
to get different triangles from the same definition and the same line segments.
In the case of Figure 4.5, you will get triangles with different properties, for
82 Thinking in Images
example, different heights, depending on which line segment is used as the base.
Construction is a skilful application of these elements in order to get to the
defined object.
According to Tichy (1986),3 the concept of construction can be generalized
and applied to non-geometrical objects. Following Tichy, I characterize
constructions as denoting the procedures of arriving at a goal (an object, event
or scenario). How does this apply to non-geometrical objects?
Let me use an algebraic example.4 Note that the term ‘7 + 5’ denotes the same
mathematical object as the term ‘20–8’, namely the number 12. However, these
terms denote 12 in different ways. Apart from naming 12, they identify different
procedures for arriving at 12.5
The construction ‘20–8’ consists of ‘20’, ‘8’ and the subtraction function. In
the same way, the construction of a triangle consists of the line segments and
the procedures of arranging them. The construction ‘20–8’ consists of the same
elements as ‘8–20’, including the same subtraction function. And yet, these
are different constructions, for the construction rules set a different order of
arguments. In the same way, by applying different construction rules to the same
elements, we construct different geometrical objects.
How to understand the concept of construction as a procedure for arriving at
a goal? Let us consider the case of projective geometry. It consists of a set of rules
localizing objects in space. However, projective geometry does not describe some
prefixed properties of some absolute space. It constructs a space. Geometric
projection does not represent localizations of some points, as if there were some
absolute space with fixed localizations. It represents the ways of localizing these
points by determining the parameters of the space. In Riemannian terms, to
determine the spatial properties of some manifold, we have to apply a geometric
measure. One of such measures is projective geometry.
What Do Images Do? 83
We can localize the states of two different spaces. The logical space is the set
of all possible states. Determining the parameters of the logical space is about
searching for possible distinctions among ways for the world to be as well as the
relations between these possible states. It is about searching for such pairs of
distinctions which coincide and such which are in contradiction.
The physical space is a set of states of affairs. Determining the parameters of
the physical space means searching for what the world is. Moreover, it is a search
for the relations between these states of affairs.
The logical space is closely related to the physical space, for determining the
parameters of the physical space is to single out one side of the distinction made
in the logical space (Rayo, 2013). For instance, to construct the proposition
snow is white, we isolate two possible states of the world – ‘white-snow’ and
‘non-white snow’ – and hold that the actual world overlaps with one side of this
distinction.
The simplest case of construction is localizing a point in the physical space.
Let us take a plane and try to locate a point on it. To do that, we have to determine
the values of the physical space and the relation between these values that will
establish the coordinate parameters. To find the localization of a point is to
determine these parameters. A Cartesian diagram exemplifies this construction.
It shows you how to find this localization. Interpretation of the diagram is based
on the reading of the construction parameters.
Notably, a point represents spatial locations and is determined by spatial
coordinates, but it is not something that is in space being projected onto the
84 Thinking in Images
plane. Space is made of points, but it does not imply that there is some space with
points in it. Space is a set of points. Something can be in some location in space,
but localizations are not in space in the same way as chairs are. Localization does
not have the property of being localized, as if there were some other localization
containing it. Localizations are in relation to other localizations, in the same way
as being a 5 is being in relation to other numbers, nothing less or more. To set
a point in some space is to set relations of this point to others. In this sense, to
determine the localizations of the points is to determine the properties of space.
The Cartesian diagram does not depict some prefixed space. It determines the
properties of the space to localize the point. Compare it to the way rulers work.
They do not represent some prefixed distances; they localize these distances by
determining the properties of the space.
Interpreting construction (in terms of the result of operation) is based on
localizing the construction parameters, that is, relevant sets of information, their
order and relation to other sets, to identify the searched object. Additionally,
constructions consist in abstracting away from irrelevant information. For
instance, the construction of a triangle identifies spatial properties, such as an
orientation and the length of line segments, and sets the geometrical relation
between these line segments. It determines the parameters of space to localize
some spatial structure, that is, a triangle. At the same time, it abstracts away
from irrelevant information, such as the colour and thickness of line segments.
Interpreting the triangle diagram is based on identifying the ordered triple of the
construction parameters.
By the same token, constructing 12 consists in identifying the values of the
logical space, arranging them in order, and applying a relevant function, such
as the subtraction function, to localize the value 12. Moreover, it abstracts away
from irrelevant information, such as whether apples or fingers are subtracted.
By constructing an object, we set the parameters of some space that determine
what is being represented. However, constructing objects has to be distinguished
from evaluating the truth values of representation. The construction ‘7 + 5’
localizes ‘12’, which, in turn, can be the basis of a proposition that 12 is a sum of
7 and 5. This proposition has true values. However, the very construction is not
truth-evaluable.
Compare this proposition to the operation of dividing by 0. It is not a false
operation. It is an operation that fails to determine the parameters of content.
Here, we do not ask whether the proposition identified by the operation of
dividing by 0 is true or false. We ask what value, if any, can be identified by such
an operation. Depending on whether the goal has been achieved, a construction
What Do Images Do? 85
can be successful or abortive but not true or false. We can be wrong trying to
divide by 0, but we are not expressing a false proposition. We are applying an
operation incorrectly.
Constructions may seem to be truth-evaluable, for they can be put into
descriptive contexts. For instance, I can hold that it is true that you can construct
a triangle from three straight lines. However, knowing construction procedures
has to be sharply distinguished from knowing a description of a procedure.
Constructing an object is a matter of exercising a skill rather than knowing a
description of the object construction. For instance, applying Reidemeister
moves is a skill that requires practice rather than descriptive knowledge of what
Reidemeister moves are.
To stress this difference, let us compare constructions with algorithms.
Informally speaking, an algorithm is a description of a procedure for finding
a solution in a finite number of steps. Some constructions are algorithmizable,
for algorithms can represent constructions. However, constructions cannot be
reduced to algorithms.
An algorithm can be characterized as a finite set of well-defined instructions
on how to move from an input to an output effectively. Moreover, according to
the Church-Turing thesis, every algorithm is computable and implementable
in a Turing Machine. The most common way to implement an algorithm is to
express it with a recursive function. For instance, subtraction is based on the
algorithm made of some values, representing the algorithm’s input and the
recursive subtraction function. The subtraction function is applied to calculate
the output of the algorithm.
How does the concept of construction fit into this picture? Constructions
cannot be functions. Functions are elements of constructions. A function is a
procedure of associating the elements of a domain with the single elements of
the range of the function. However, a function can operate only if the elements
of the function’s universe are predefined. In other words, we must determine the
sets it operates on to apply a function. A construction localizes the function in a
certain informational space that we consider in a given situation.
Consider the operation of dividing by 0. The condition not to divide by 0
does not follow from the content of the divide function. Dividing by 0 is not a
wrong function, either. It is the same function as dividing by any other number,
but the function is applied incorrectly here. Dividing is a function that calculates
the number of times one number is contained within another. To know that we
cannot divide by 0, we have to notice that the application of the divide function
for the divider 0 leads to contradictory results. The condition not to divide by
86 Thinking in Images
0 was introduced to our understanding of the divide function since, without it,
we would get nonsensical mathematical constructions, such as 1 = 2. In other
words, the application of the divide function for the divider 0 is an abortive
construction, but it is not a wrong function.
Similar observations apply to algorithms. To use an algorithm, one must
construct it. For instance, it has to be decided what kind of a function has to
be used and what the sets representing the input and output are. Although the
construction of an algorithm is algorithmizable, it cannot be an algorithm since
then we would need an algorithm for the algorithm constructing algorithms and
so on.
In contrast, let us think of constructions in terms of procedural knowledge.
Learning constructions is a matter of acquiring the skill of applying a procedure
to localize a space state rather than knowing a description of this procedure. It
is a matter of exercising procedural knowledge based on recognizing parameters
of some space and not exercising propositional knowledge based on the
description of this space. That is why constructions have to be exemplified rather
than described.
is to identify the construction rule, which specifies that you can arrive at the
unknot if you apply the first type of Reidemeister move to the knot’s crossings
without cutting any of them. To understand a triangle diagram is to identify the
construction rule that specifies the construction parameters of the triangle. In
sum, the content of diagrams consists of the construction rules used to localize
a constructed object.
The content of a diagram identifies the construction rules of the object rather
than denoting the object it stands for.6 Thus, two diagrams have the same content
iff they express the same construction rules instead of denoting the same objects.
The terms ‘7 + 5’ and ‘20–8’ represent different constructions, although both
denote 12. The number 12 is individuated by different constructions.
The most obvious illustrations of this definition of content are architectural
drawings, manuals and maps. The standards of correctness for the architectural
drawing of a building are determined by such construction rules that allow us to
erect it (according to the rules of production) and identify it (according to the
rules of interpretation). An incorrect architectural drawing is such that it does
not allow us to construct or identify a building. Interpretation of the drawing
is based on identifying the construction parameters and the construction rules
that determine the permissible ways of transforming these parameters. Manuals
teach us how to build something step-by-step and how to identify the relations
between the elements of the mechanism. A correct manual is the one that takes
us from an initial state to the goal state and helps us to identify the represented
mechanism. A map is correct if it aptly represents the ways we can localize some
points in some space. A correct interpretation of the map is based on identifying
these ways.
To get a broader picture, consider two less-intuitive examples. The line graph
displayed in Figure 4.6 represents inflation growth. It identifies the inflation rate,
mapping it on time. These are the construction parameters.7 Inflation growth is
an abstract object localized by construction procedures setting the partial order
between the parameters of the inflation rate and time. The graph represents
inflation growth by identifying the target, which is an increasing function
associating the inflation rate with time. Understanding this graph is identifying
the construction parameters and the rule that if you map the inflation rate on
time, the inflation rate will eventually rise. The diagram localizes the inflation
88 Thinking in Images
growth in relation to the inflation rate and time. Vertical and horizontal axes
indicate relevant information. The irrelevant information is the colour of the
line.
Next, consider the pulley system diagram represented in Figure 4.7. It
shows how the mechanism behaves if the string is pulled down. The correct
interpretation of the diagram is based on identifying the construction
parameters: the information on the orientation and direction of the string as
well as the relation between them. The visual shape of the weights is irrelevant
information. Understanding this diagram consists in grasping the rule that if the
string is pulled down, the weight goes up. The movement of the string can be
visualized. Yet, the visualization is only a means to grasp the diagram’s content.
Understanding the pulley system diagram is about knowing how the system
behaves if you pull the string.
Thus, the inflation and pulley system diagrams represent constructions.
They show that if you identify the basic construction units and connect them
What Do Images Do? 89
according to the rules, you arrive at the target. The correct interpretation of these
diagrams is based on identifying these rules.
One caveat is in order. There is a difference between Figures 4.6 and 4.7. The
inflation diagram represents specific data and the relation between them. It
represents a construction rule for particular values. It is equivalent to a diagram
of a particular token, such as a diagram of a particular triangle. Let us call it a
‘token-diagram’.
In contrast, the pulley system diagram represents a general rule of how a
system behaves if the string is pulled down. It abstracts away from particular
values of weight or shape. It represents a general construction rule for building a
type of mechanism. It is equivalent to a diagram of a triangle as such. Let us call
it a ‘type-diagram’.
Every diagram can be interpreted as either a type-diagram or token-
diagram.8 For instance, an architectural drawing can show how to build
a particular building, but it can be taken as showing how to build a type of
building. The inflation diagram can be taken as a type-diagram if it represents
the general rule of how inflation rises over time. The pulley system diagram
can be taken as a token-diagram if interpreted as representing a particular
mechanism.
Knowing how to construct a type-diagram does not imply constructing a
token-diagram. I can know a general rule for constructing triangles without
constructing any particular one in reality. Moreover, a token-diagram can never
express the generality of type-diagram content.
However, every type-diagram requires the existence of a token-diagram to
be represented. For example, to show how to construct triangles as such, I have
to use a triangle token-diagram. Having a type-diagram implies that I have the
disposition to construct and recognize the token-diagram of a relevant kind.9
Properties of constructions
Figure 4.8 Line graphs (a) and bar charts (b) convey the same information but in a
different way, affecting the accessibility of the information. © Piotr Kozak.
number of calories burned because the bars comparing the data for calories and
the amount of exercise are closer to one another. The line and bar graphs employ
the same data but construct it differently. The graphs highlight different aspects
of information, depending on the applied construction.
Explaining resemblance
of objects. The diagram representing inflation growth does not resemble inflation
growth. Inflation growth is an abstract object that cannot be a relatum in the
relation of resemblance.
At first glance, we can try to explicate the concept of resemblance in terms
of structural similarity. The concept of structural similarity denotes sharing
the same structure in the following sense. Two sets, A and B, are structurally
similar iff there is a function mapping (some) elements of A and (some) relations
defined over the members of A onto (some) elements of B and (some) relations
defined over the members of B such that preserves the second-order properties
of the relations in A. Thus, if some relation holds for (some) elements of A,
the corresponding relation holds for (some) elements of B (e.g. Kulvicki, 2014;
Swoyer, 1991).12 Consequently, two objects, a and b, are structurally similar if
and only if various relationships among the parts of a correspond to important
relationships among the parts of b.
Structural similarity is not a sufficient condition to determine the content of a
diagram. A trace left by an ant can be structurally similar to Winston Churchill,
but it does not refer to Winston Churchill (Putnam, 1981). Yet, structural
similarity seems to be a necessary condition of being a diagram and having
content.
To illustrate the idea of structural similarity, let us consider Euler diagrams.
The core idea is that we can represent subset, intersection and disjoint relations
between sets in terms of inclusion, overlapping and disjointness of circles
(Figure 4.9). It may seem that the concept of resemblance is crucial to explain
why we can represent abstract relations by Euler diagrams. According to this
view, Euler diagrams represent logical relations because they mirror the logical
structure of these relations.
However, the concept of structural similarity is not necessary to explain how
Euler diagrams work. We can explain it by pointing out that they represent
the same construction rules. For instance, we can represent the subset relation
in terms of the inclusion of circles as they represent the same constructions.
Figure 4.9 Euler diagrams representing the subset, intersection and disjoint relations,
respectively. © Piotr Kozak.
94 Thinking in Images
The subset relation is constructed according to the same rules as the inclusion
of circles in the following sense. You can arrive at subset A if you include all
elements of subset A into set B. In the same way, you can arrive at the inclusion
of circles if all elements of one circle are contained within another.
Explaining how Euler diagrams work in terms of sharing the same
construction rules has a significant advantage over explaining diagrams in terms
of resemblance. It allows us to avoid the following categorical fallacy. Note that if
one uses the resemblance relation to explain the fact that the subset relation can
be represented in Euler diagrams by the inclusion of circles, one holds that the
subset relation and the inclusion of circles are identical in some respect.
However, that is nonsense. Subsets do not share any non-trivial properties
with the circles represented in Euler diagrams. Subsets and drawings belong to
different ontological categories. Subsets are logical, while drawings are physical
objects. Claiming that they are alike is as much nonsense as claiming that
numbers share some properties with apples. Apples can be red, numbers can be
even, but not the other way round. Numbers can represent the quantity of apples,
but it does not imply that they are alike. By drawing circles, we can represent
subsets, but it does not imply that the representation and the represented object
share some common properties.
Seemingly, the categorical fallacy objection does not apply to the structural
similarity theory. After all, it does not hold that logical objects, like relations
and numbers, and physical objects, like drawings and apples, share some first-
order properties, like being red or tasty. The idea is that these objects share some
second-order properties. Thus, a subset is structurally similar to a drawing of two
circles as some second-order properties, such as being an asymmetric relation,
are preserved. Second-order properties do not characterize the objects of first-
order properties. An apple can be red (first-order property), red is a colour
(second-order property), but an apple is not a colour. Therefore, the ontological
fallacy argument is misplaced.
Still, this gives us an unattractive ontology of mathematical objects. Compare
two sets, A and B, where B has the first-order property F of being a subset of A,
and two objects, a and b, where b has the first-order property G of being inside
a. The structural similarity theory holds that a and b can represent these two
sets in the following way. First, b has the property G, and G has the property H
of being a relation such that all elements of b are included in a. The object b is
not a relation, hence, it does not have the second-order property H. Second, set
B has the property F, and F has the property H of being a relation such that all
elements of B are included in A.
What Do Images Do? 95
This story suggests that there is some logical object, ‘a subset’, that can be
described by the relation of being included. This does not seem right. The term
‘being included’ is only another way of saying that something is a subset. The
inclusion relation is not the property of being a subset as if there were some
logical objects described by this property. The inclusion relation is a way of
isolating some elements of a certain kind in a set. The effect of such a procedure
is a subset. But a subset is nothing more than ‘being included’. Something cannot
be a subset without being included.
In contrast, b is an object that can be identified by the property of being
inside a. However, b is not equal to the property G of being inside a. We use the
property G to identify b. Yet, the object and its property are distinct. The object
b could have no property G.
What is the moral of that? Note that the story told by the structural similarity
theory is plausible only if we adopt some sort of mathematical Platonism
regarding the nature of mathematical objects. It does not have to be wrong. It is
not necessarily correct either. However, it would be unfair to insist on adopting
Platonism only for the purposes of explaining how Euler diagrams work.
These ontological problems are avoided if we explicate Euler diagrams
in terms of representing the same construction rules, for the properties of
constructions are not the properties of the constructed objects. Euler diagrams
and logical relations can share the properties of constructions without sharing
the properties of objects. We do not have to be Platonists, which is fair.
Moreover, resemblance-based theories give us the wrong idea of the
epistemology of diagrams. First, it is not clear how resemblance-based accounts
can distinguish between misrepresentation and non-representation (Suárez,
2003). To misrepresent something is to assign the represented object with
properties that it does not have. A non-representation does not assign any
property. According to resemblance-based accounts, representing something is
sharing first- or second-order properties. However, if something does not share
any property with the represented object, it is not a representation. This seems to
be clearly wrong since misrepresenting a circle with the diagram of a square does
not imply that a diagram of a square is not a representation.
We cannot simply bypass this problem by invoking the idea of preserving
a partial structure (e.g. Bueno and French, 2011), for this idea is too inclusive.
For one thing, there is a partial structure-preserving relation between a
representation of a dot and a diagram of a circle and a square. Yet, it does not mean
that the single-dot diagram represents circles and squares. For another, there is
no way to distinguish between partial representation and non-representation.
96 Thinking in Images
A scribble does not represent anything, yet it preserves some partial structure
with any object.
Second, the concepts of resemblance and structural similarity are too weak
to identify the represented objects. In order to know what is being resembled,
we have to identify relevant aspects of resemblance. Yet, according to the
well-known Wollheim’s argument (1998, 2003), to identify relevant aspects of
resemblance, one has to identify what is being resembled. In Wollheimian terms,
recognizing an object depicted on a picture’s surface presupposes that we have
a seeing-in experience of this object. Provided that we have such experience,
we can recognize the depicted object in the marks on the picture’s surface. If
seeing-in were not prior to the experienced resemblance, there would be no way
to discern accidental and non-accidental marks on the surface.
Consider an ellipse-shaped diagram. It is structurally similar to a circle seen
from a perspective. Without knowing what this diagram represents, we cannot
know which parts of the structure are relevant. However, it was by comparing
structures that we were supposed to identify the object of reference.
At first glance, this should not be particularly disturbing. Most of the
proponents of resemblance-based theories acknowledge that sharing
properties is not a sufficient condition of representation. It is held (e.g. Abell,
2009; Blumson, 2014; Bueno and French, 2011; van Fraassen, 2008) that the
epistemological problems can be resolved by supplementing the condition of
sharing properties with the pragmatical condition that the representation has
to be used by an agent in a proper way. The general idea is that to recognize
the relevant aspect of resemblance, one has to recognize the intention of the
agent.
However, such a move only replaces one problem with another. Note that we
shift the burden of the argument from resemblance-based accounts to intention-
based accounts, which have their own problems, and, as Wittgenstein’s argument
from content indeterminacy shows, cannot explain depiction.
Moreover, such a shift renders the resemblance relation functionally
irrelevant. The resemblance does nothing in the process of recognizing the sign’s
content, for all we have to do is to recognize the agent’s intentions. Without any
supplementation, the intention-condition is insufficient.
Explaining the content of diagrams in terms of the rules of construction
not only gives us a better picture of the content. It avoids the epistemological
problems of resemblance-based accounts and provides us with an explanation
of what resemblance is. Most of all, it allows us to make sense of the concept of
resemblance without sharing first-order or structural properties.
What Do Images Do? 97
Let me illustrate this with the following example. Take two diagrams
represented in Figure 4.10. The dashed line represents unfinished lines. Note
that they lack similarity in shape and structure since some lines are missing.
However, they represent the same construction rules, thus having the same
content. For it is possible to construct a triangle if the straight lines are drawn
further in thought. The fact that we run out of space on paper plays no role in
constructing triangles.
In contrast, two identical ellipse-shaped diagrams can denote two different
objects: either an ellipse or a circle seen from a perspective. These diagrams have
the same spatial properties but different content, for different construction rules
are applied.
Next, depending on the kind of identified constructions, we can determine
the relevant aspects of resemblance. Consider two triangles ∆ABD and ∆ACE
in Figure 4.11. Do they resemble each other? It depends on the rules of
construction we recognize as relevant. Suppose we want to know whether they
are geometrically similar. In that case, we are interested in the construction rule,
which identifies as relevant the information on the ratio of the sides’ lengths
and the measures of the angles. If we want to know whether they are congruent,
Figure 4.10 Two diagrams of a triangle with the same construction rules. © Piotr
Kozak.
Figure 4.11 Depending on the identified rules of construction, ∆ABD and ∆ACE are
similar or congruent to each other. © Piotr Kozak.
98 Thinking in Images
the construction rules single out the information on shape and size. ∆ABD and
∆ACE are similar or congruent depending on the identified construction rules.
Consequently, the diagrams ∆ABD and ∆ACE represent the same or different
objects. In the most general sense, any two token-diagrams of triangles are
similar if they depict the construction rules for any polygon with three edges
and three vertices.
Finally, the idea of construction gives us an insight into Gombrich’s so-called
riddle of style. According to Gombrich (1960), adopting the resemblance-based
view of depiction does not allow us to address the question of why people at different
times and in different cultures tend to depict objects in different ways. Gombrich
argues that this riddle is based on a false assumption that depiction is based on
capturing a visual likeness of the depicted objects. Based on this assumption, pictorial
representational systems should have developed in one direction, perfecting the
ability to express the resemblance so that eventually, there should be one style that
unifies all others and represents the world most adequately.
To address the riddle of style, Gombrich acknowledges that depiction is based
on recognizing familiar objects in patterns of lines and colours, which elicits an
illusion of seeing these objects. To recognize objects in these patterns, we have
to employ schemata, that is, stereotypic ways of depicting objects. Schemata are
a means to represent the world in the same way as projection in cartography
serves to represent certain regions in the world. Different schemata are used to
represent different aspects of the world, just like different methods of projection
are used to represent different aspects of space.
It is rarely noticed that the same riddle of style applies to diagrams.13 If
diagrams were based on structural similarity, then we might expect a tendency
to create a unified diagram form that would preserve structures most adequately.
That contradicts the common observation that there is a reverse trend in creating
many different forms of diagrams.
The concept of construction covers Gombrich’s intuitions without applying
the vague illusion-metaphor. Let us compare the line graph and the bar chart
displayed in Figure 4.8. They both represent the same structure of information.
Yet, depending on their construction, the aspects of information they represent
are different. We expect that different styles of diagrammatic representations
will coincide with the different constructions they represent, which seems to
agree with our observations. Moreover, the illusion-metaphor can be explicated
in terms of sharing the properties of constructions but not the properties of
objects. That seems to align with Wittgenstein’s and Frege-Davidson’s argument
of lack of truth values.
What Do Images Do? 99
Summary
Based on the analyses of the content of diagrams, I demonstrate that diagrams are
necessary for representing the construction operation. I explicate the concept of
construction in terms of procedures of arriving at a goal by means of determining
the parameters of some informational space. Accordingly, I distinguish between
the properties of constructions and the properties of objects.
I describe the relationship between the concept of construction and
resemblance. I claim that the presented description of content is able to explain
the concept of resemblance without taking this concept as an explanans of the
content of diagrams.
100
5
Recognition-based identification
In the previous chapter, based on the analysis of diagrams, I pointed out that
images exemplify the ways of constructing objects. In this chapter, I discuss the
second role of images: triggering the ability to recognize objects and events.
Drawing on Evans, I introduce the concept of recognition-based identification.
I argue that images can convey a reference relation distinct from demonstratives
and descriptions. It can be explicated based on the construction concept. I
hold that recognition is based on identifying the construction invariants which
determine the properties of some space in order to localize the depicted object.
When the Event Horizon Telescope (EHT) caught its first glimpse of the M87
black hole, located over fifty-four million light years away from Earth, one could
almost hear a moan of disappointment. After all, did we not have more detailed
pictures before? Simulations of black holes had looked more stunning. Is the
blurry picture of a hazy cosmic flame loop all that we can get?
The picture represented in Figure 5.1 may not seem like much, but it is a
breakthrough in physics. Why does having this picture matter?
The black hole picture helps us validate theories and measurement methods.
Up-to-date simulations of black holes based on Einstein’s theory of general
relativity have been commonly used in science to predict how a black hole
behaves. As Einstein’s theory predicted, the shape of a black hole would be
almost a perfect circle, and its size should be directly related to its mass. As it
turned out, it is. The picture has allowed us, for the first time, to test Einstein’s
theory of relativity for black holes and other regions containing dense matter.
Furthermore, the picture is an object of investigation in that we now have a
way to study black holes as physical objects rather than mathematical constructs.
102 Thinking in Images
Figure 5.1 The first picture of a black hole. Credit: Event Horizon Telescope
collaboration et al. National Science Foundation.Source: European Southern
Observatory (ESO) via Wikimedia Commons.
Thanks to the picture, we are able to calculate the black hole’s mass more precisely.
Using the EHT, we are able to measure the radius of M87’s event horizon directly
and calculate the hole’s mass. Consequently, the calculation helps us validate
available methods of mass estimation.
Additionally, the picture shows us the features that equations have not
yet captured. For example, there is the open question of why the black hole’s
silhouette is irregular in shape. It is a question that we hope to answer in the
future. It is also believed that the data gathered by the EHT will offer some
insight into the formation and behaviour of black holes. The structure revealed
in the image can help to reveal how material gains the energy needed to fall into
a black hole or spiral away in jets. Lastly, it is believed that information provided
by the EHT will help us resolve fundamental physics problems, such as the black
hole information paradox.
Thus, the black hole picture is much more than a blurry illustration of a night
sky. Let us think of it as the final result of a measurement operation. The picture
represents measured features of the object, where the measurement values can
be projected onto the measured object. We study the features of the picture to
learn something about the features of the black hole. Before the picture was
taken, some facts were unknown and could not be deduced. If they could be
deduced, physics would be reducible to metaphysics (e.g. Elgin, 1996a, 2017).
However, the fact that the picture may seem underwhelming to some teaches
us something important about our understanding of what images are. Let me
illustrate it with Dennett’s (1990, 1991a) argument from computer reasoning.
Recognition-based Identification 103
The black hole had been first measured and described in the numerical system
of representation, which, in turn, was presented as the black hole picture.
Displaying the information on the picture does not change the content of the
information. Before we displayed the information on the properties of the black
hole, we had had the information.
Moreover, based on the syntactic properties of numerical data, we can compute
the non-measured properties, such as the mass of the black hole. Displaying the
information via the picture does not affect the content. Therefore, having the
black hole picture is inessential for knowledge and reasoning. To show what
is wrong with this argument, let us turn to the idea of the naturalness of icons.
Naturalness of icons
The backbone of Dennett’s argument is the belief that depictions and numerical
representations carry the same kind of content since both code the same
information. The only difference is the way the information is displayed.
This belief is based on Goodman’s idea (1976) that images and language-like
representations are different kinds of a conventional representational system.
However, this does not seem right, for it fails to explain the properties of
the content of images. In contrast to language-like representations, images seem
to bear some natural relation to the represented objects, which explains why
a depiction of a horse looks like a horse. The language-like representation of
a horse does not have to be like a horse.1 What does it mean to bear a natural
relation?
The notion of ‘natural relation’ is deeply ambiguous. In the sense that
interests us here, it refers to a nonconventional relation. There are, however,
two understandings of the convention concept. The first holds that a sign
is conventional iff it has arbitrary semantics. The second holds that a sign is
conventional if it belongs to some representational system. It is noncontroversial
that images are conventional in the second sense, for they can be easily classified
according to different representational styles (Willats, 1997). It is highly
controversial whether they are conventional in the first sense. The concept of a
natural relation will be understood here as bearing a non-arbitrary relation to
the represented object.
The nonconventional relation between a sign and the represented object
can be explicated in two ways. First, it can be put in psychological terms (e.g.
Burge, 2018; Giardino and Greenberg, 2015). According to the psychological
104 Thinking in Images
the relata. In the case of the black hole picture, such a comparison is impossible.
Pictures are commonly used to depict unobservable phenomena (Schöttler, 2017).
Resemblance-based theories cannot explain how this is possible. Thus, we need a
better (and non-psychological) explanation of recognition.
What is recognition?
depicted object and the features of the picture’s surface or between the object
and the background.
Perceptual recognition describes the ability to associate perceptual objects
with each other and reidentify an object after some changes. It is often described
as recollecting information and having a sense of familiarity with the recognized
object (e.g. Dokic, 2010; Hope, 2009; Hummel, 2013). Here it is understood as a
perceiving that perceived objects belong to the same kind.3
Misrecognition can be easily confused with errors in that-perception. For
instance, looking at the black hole picture, I can misinterpret it as a picture of
a donut. Yet, errors in perceptual recognition are distinct from errors in that-
perception. The inability to recognize two objects as belonging to the same kind
is different from the inability to identify the kind they belong to.
Perceptual recognition is a non-propositional, non-conceptual and non-
linguistic form of perception. A child can recognize triangles without knowing
that they are polygons with three edges and three vertices or that they are called
‘triangles’. In order to do that, the child must learn that some spatial properties
remain the same across changes in a different context.
This ability is non-propositional. Children can be wrong in confusing
triangles with squares, but they are not expressing any false proposition. It does
not require concepts (e.g. Siewert, 1998). It is, rather, a prerequisite for having
concepts (Peacocke, 1992; Giovannelli, 2001). Before one can subsume an object
under a concept, one has to be able to recognize this object. Recognition is a
gateway from object perception to cognitive processes such as categorization
and reasoning, but it is not based on these processes.
Only that-perception requires the presence of language-like, conceptual and
propositional structures (e.g. Armstrong, 1997). If I see that it is a black picture,
it must be possible to express it in the language. It is conceptual since it involves
subsuming a representation under a concept. It is propositional since perceiving
states of affairs is truth-evaluable. If I am wrong when I see that it is a black hole,
then it is not true that it is a black hole.
Object perception, perceptual recognition and that-perception of the black hole
use the same kind of perception. Seeing-that is based on sensory perception, but
it is not another kind of perception. It is not ‘perceiving the truth of propositions’
as if there were a perception of propositions. Propositions are ways of identifying
how the world can be. These are not the objects we can perceive. Seeing-that is,
rather, a form of propositional attitude based on perception. Accordingly, I cannot
see that it is a black hole without having a belief that it is a black hole. Seeing that
it is a black hole and believing that it is not a black hole is self-contradictory.
108 Thinking in Images
Varieties of reference
Such files would have to contain information on the features of the target, which,
in turn, would make them unable to explain why the ability to track objects is
disassociated from the information on the object’s features. Moreover, it does
not solve the combinatory explosion problem without any explanation of how
looking for similarities works. Interpreting the idea of similarity as primitive and
unanalysable does not offer an explanation but excuses.
Second, the proponents of the object-file framework can interpret recognition
in terms of identifying probabilistically determined causal links in the following
way. If we systematically encounter target T producing demonstrative file D(t),
there is an increasing probability that it would result in producing associated
memory file M(t). Recognition is a matter of causally driven activation of D(t)
and associated M(t).
However, that won’t do. For one thing, the causal story is vulnerable to
counterexamples. I can recognize objects I did not encounter in the past,
for example, I can recognize my mother in her childhood photograph. I can
recognize objects I have a wrong idea of, too. If it turned out that previous
visualizations of a black hole were wrong, and the depicted black hole picture
resembled a square rather than a circle, scientists would still identify it as a black
hole picture.
Moreover, the story is empirically implausible, for recognition abilities seem
to be functionally dissociated from the ability to retrieve information from
memory causally. I can remember perceptual objects without recognizing
them. For instance, patients with associative visual agnosia can successfully
copy drawings but cannot recognize the copied objects (e.g. McCarthy and
Warrington, 1986). Next, I can recognize a familiar scenario connected to a false
memory of myself as a child being lost in a supermarket. False memories are not
causally connected with any events.4
Thus, although the object-files framework can work well to explain the
mechanisms of demonstrative reference, it does not seem to explain the nature
of recognition-based identification. This suggests that recognition is not a kind
of demonstrative reference.
It seems natural to think that recognition involves perception. Consequently,
it may seem that recognition-based identification is a kind of demonstrative
reference (e.g. Strawson, 1963). Demonstratives are based on a causal
information link between a representation and its target and the ability to locate
and track this target in space. This implies that to understand the content of
a demonstrative, one has to identify a relevant causal link and find the target
within some egocentric frame of reference, determining the here-now relation.
110 Thinking in Images
instance, I can know what my aunt looks like but cannot recognize her in a
photograph. The recognition-based identification seems to refer to the target
directly. Recognizing my aunt in a photograph differs from knowing a description
‘the woman represented in the photograph is my aunt’ (Lopes, 2010; Terrone,
2021a). The question is what such recognition-based identification involves.
Construction invariants
see a cat, I perceive an invariant of the cat. When I encounter a picture of this cat,
I am prepared to pick out the relevant invariant. This is not to say that I see an
abstract cat, I have that-perception of a cat, or I perceive common features of the
class of cats. What I get is information on the persistence of structure.
A picture is a record of perception. It enables the observed invariants to be
stored. These invariants can be retrieved to convey knowledge. Thus, pictures are
an efficient method of teaching and learning. However, the knowledge pictures
convey is not explicit and language-like. According to Gibson (1978, 1979), the
formless invariants cannot be put into words. They can be captured but not
described.
However, the invariant concept is unclear (Fodor and Pylyshyn, 1981; Gombrich,
1971). Particularly, it is not clear what it means that some invariants can be perceived
when the perspective is arrested. Let us consider the projective geometry case. In
his early account of depiction, Gibson (1954) defends the projection-based theory,
according to which a picture conveys optical information through the geometrical
projecting of three-dimensional scenes onto two-dimensional surfaces. In short,
picture P depicts object O iff there is a systematic mapping between P and O
such that it preserves some properties determined by the projection invariants.
Based on the principles of projective geometry, interpretation of P is a matter of
identifying the scene that was projected onto the picture.
The projection-based theory seems to fit well with satellite maps, photographs
and Alberti’s paintings. However, Gibson (1971) explicitly rejects this theory.
First, it cannot explain distorted pictures like Cezanne’s paintings. Second, it is
at odds with depiction practice. Contrary to popular belief, artists have never
paid much attention to projective geometry clarity and abstract elegance. For
instance, they commonly confuse the habit of putting the vanishing point in
the centre of the picture, which is a matter of composition, with the perspective
projection system as such (Gombrich, 1972). Third, the ability to recognize
depicted objects in the flesh survives significant changes in the properties of
the representation and the represented object (Lopes, 1996). For instance, we
can reidentify depicted objects, even if the depiction does not represent the
perspective correctly and the depicted object has changed over time. This implies
that the idea of projective geometry is too restrictive to explain recognition
(Inkpin, 2016).
However, projective geometry gives us an insight into what invariants are. Let
us consider the cross-ratio invariant represented in Figure 5.2.
The cross-ratio remains unchanged across projective transformations. It is
preserved regardless of the shape of the projection plane. It is the invariant and
Recognition-based Identification 113
Figure 5.2 The cross-ratio invariant. The relation between points is preserved in
projective transformations in such a way that the cross-ratios AC / BC: AD / BD and
A′C′ / B′C′:A′D′ / B′D′ are equal. © Piotr Kozak.
the property of the projection system. However, it would be wrong to say that it
does not describe the properties of the depicted object. It certainly does. Based
on the lengths of A′C′, B′C′, A′D′, B′D′ we can deduce the lengths of AC, BC, AD,
BD.
The main idea is that images are representations of invariants. We can observe
the invariants in the projection plane and apply them to the represented reality
(e.g. Elgin, 2010a). In this sense, we see invariants in images. How should we
understand this?
The concept of construction is useful here. The content of diagrams has been
provisionally defined in terms of representing construction rules. A successful
interpretation of a diagram is based on identifying construction parameters
determined by these rules. The same definition is extendable to all image genera.
Constructions have been characterized in terms of determining the parameters
of some space in order to arrive at the constructed object. By determining
these parameters, we determine the properties of some informational space.
The general idea is that some properties remain unchanged across different
constructions. These properties are the construction invariants. They determine
what is being preserved during different acts of determining parameters of a
space.
that whatever the construction parameters, such as the vanishing point or the
lengths of the line segments, are, the cross-ratio remains the same. We can
identify the represented object length based on the lengths of the projected one.
The idea, however, is that projective geometry is just a special case of the more
general construction operation. As Riemann might say, projective geometry is
only one of the many possible constructions. In other words, projective geometry
is only one way of determining space parameters.
To explain the concept of construction invariants, let us consider the terms
‘7 + 5’ and ‘20–8’. They represent different constructions that identify different
parameters of the logical space. However, these constructions pick out the same
target. The number 12 is no less than the invariant of different constructions
(while not being a projective invariant). We identify 12 by finding that this value
remains unchanged in various constructions.
Next, think about triangles different in shape and size, represented by
two diagrams. The diagrams exemplify different constructions. However,
these constructions preserve some properties, such as the number of angles.
Interpretation of these diagrams is based on identifying these construction
invariants. Based on that, we can identify the constructed objects as belonging
to the same kind.
Depending on the identified construction invariants, we can recognize
different targets. For instance, if we construct a triangle whose invariant is the
sum of its angles, then we identify different shapes of triangles as representing
one kind. If the construction invariant is the angle measure, then we identify two
triangles with the same angles as similar.
Cezanne’s paintings present a more demanding case. To explain it, let me go
back to the Gibsonian concept of perceptual invariance. It describes a familiar
phenomenon of perceiving objects as stable throughout environmental changes.
For instance, perceived colours seem relatively constant under changing
illumination conditions. A red apple looks red at midday and at sunset. This
subjective constancy of colours helps us to identify objects and is indispensable
in object recognition.
According to Gibson (1979), the (approximate) invariance of the colour
experience cannot be a matter of identifying the perceiver-independent
properties of a perceptual object, such as the light wavelength. These properties
differ throughout the changes in illumination and perspective. Instead, colour
constancy is a matter of tracking relational features of the perceptual object. It
is a skill of identifying the perceived colour in relation to other colours across
contexts (Buccella, 2021; Green, 2019).
Recognition-based Identification 115
Figure 5.3 Munker-White’s Illusion. Although the grey bars A and B depict the same
hue, B appears to be brighter. Source: Wikimedia Commons.
members of A onto (some) elements of B and (some) relations defined over the
members of B such that preserves the second-order properties of the relations
in A. For instance, sets A and B are structurally similar if they are isomorphic
without necessarily sharing any first-order property.
Now, structural theories of depiction are not wrong. Moreover, the theory
of construction invariants can be seen as a version of a structural theory. A
structural correspondence between the representation and represented
object is another way of saying that some object properties are preserved
by the representation. For instance, a map identifies a territory only if there
is a structural correspondence between the map and territory. However, the
structural theory heavily depends on how we understand the condition ‘some’.
If we want to find these elements and relations, we need a defined domain, range
and mapping function. It is no problem in the case of mathematical operations.
It is a problem, however, if we want to map a defined mathematical structure
onto an undefined set of states of affairs. Structures are not some Platonic
objects waiting to be discovered in reality and compared. It is not sufficient to
say that semantics is fixed or that some classes or mappings are more natural.
It is nothing more than a metaphysical postulate (Isaac, 2019; van Fraassen,
2008).
The image goal is to mark the elements and relations to impose order on some
manifold and identify the structure. Images determine the properties of the
informational space to localize the searched object. Successful interpretation of
an image is based on finding the construction parameters (together with knowing
construction rules) that help us to localize this object. For instance, the triangle
diagram highlights the construction parameters, like the number of angles, in
order to identify some objects as belonging to the same kind. Understanding the
diagram implies that you know that you can change the diagram’s size and shape
(but you cannot disconnect the elements of the structure), and some properties
are preserved. It does not involve comparing it to some ideal Platonic object. It
involves knowing how to identify triangles by learning what remains unchanged
across different constructions.
Let me illustrate it with the following example (Figure 5.4). Suppose you have
some manifold represented by a set of points (Figure 5.4a). Images impose order
on this manifold by determining the construction parameters: a relevant set of
points and order relations (Figure 5.4b). Interpretation of the image is based
on identifying these construction parameters. Knowing that this construction
preserves the number of angles, we can identify it as representing a triangle
(Figure 5.4c).
118 Thinking in Images
Figure 5.4 Images impose order on some manifold to localize the elements of the
structure. © Piotr Kozak.
how you know that it preserves shape properties and not the colours is like
asking how you know that a ruler represents a distance and not a ruler’s colour.
Thus, knowing how to localize objects is not based on feature comparison
or looking for the similarities between some objects. It involves knowing what
properties remain unchanged during different constructions. Consequently, we
can track and reidentify objects with different physical properties, for the object
description can change across different ways of localizing searched objects. The
invariant can be the mere property of being individuated by some construction.
Let me illustrate this with three cases.
First, consider a simple case of setting a localization of some point in space.
It can be done by determining the coordinates in relation to some frame of
reference represented by a Cartesian diagram. However, the frame of reference
and, respectively, the coordinates can be altered. Yet, when changing the frame
of reference, we do not change the localization of the point. We change the
description of the localization. This does not imply that the point is localized in
some absolute space. Here, localization is the invariant of the way we localize the
point. The Cartesian diagram determines the properties of this space.
Second, in the tunnel effect experiments, subjects are not asked to identify
the properties of perceived objects. They judge that the object that disappeared
when moving beyond the occluding shape and the one that reappeared on the
other side are one and the same even if all object properties have changed. Thus,
identifying likenesses is not necessary for recognition-based identification. In
the case of the tunnel effect, identifying the relation to the vanishing point and
the motion vector as a transformation invariant is sufficient to recognize these
objects as belonging to the same kind.
Third, the represented properties do not have to be ‘localizable’ in the image
(Wollheim, 2003). I can see that a depicted person is happy even though no single
property of the image can be identified as representing happiness. Happiness
is the depicted object’s property localizable by identifying construction
invariants. I can see-in the depicted person’s happiness by identifying patterns of
representation, for example, the composition of colours and lines in a painting.
These patterns are the construction invariants represented in a picture through
which we identify the represented object.
Let us go back to the black hole picture. Its interpretation involves determining
the construction parameters of the black hole. It is based on isolating the object
from the background by determining the parameters of some space. Recognition-
based identification of the black hole means identifying the pattern of the black
hole. It is not based on comparing the picture’s properties to the black hole, which
120 Thinking in Images
Iconic convention
The fact that there are many iconic conventions is not accidental. If
constructions are characterized in terms of determining the parameters of some
informational space, then different constructions can highlight different states of
the space. Iconic conventions can be compared to different systems of reference
that identify different invariants. If there is no absolute point of view, then there
can be no one iconic convention.
Interpreting iconic convention in terms of construction invariants types helps
us to address the problem of natural generativity. The problem is how to explain
the ability to interpret a whole system of representation based on the meaning
of its part. Applying the concept of the iconic convention makes the concept of
natural generativity explainable and avoids the problems of recognition-based
accounts of depiction. Consider a case of a mathematician who learns how to
interpret a graph of the function f(x) = x + 1. He learns the graph construction
rule. Knowing this, he can recognize other functions, such as f(x) = x + 2,
displayed on a graph, too. This seems to be the case of natural generativity.
However, there is no need to be perceptually acquainted with any function. It is
sufficient to know what properties it preserves.
Consequently, we can address the problem of the naturalness of icons.
According to the metaphysical understanding, the representational system
is natural only if the properties of the system depend on the properties of the
represented object. In this sense, iconic representations are natural. For instance,
if the distances between represented objects were different, then the distances
displayed on the projection plane would have to be different, too, for the cross-
ratio is preserved in projections. If Cezanne had a different visual experience,
then the visual properties of his paintings would have to change, too, for the
experience properties are preserved in impressionist paintings. In contrast,
symbolic representations are not natural. Whether the symbol A represents the
left or right direction is an arbitrary choice.
In contrast to recognition-based theories, knowing an iconic convention
does not involve perceptual acquaintance with the represented object. It is
sufficient to know which properties are preserved by the iconic convention. This
seems justified since we commonly depict objects we have never seen before.
For instance, we have no problems depicting fictional objects like dragons.
We do not have to see dragons in the flesh. Recognition-based identification
does not involve comparing two relata: the depiction and the depicted object.
It is sufficient to know how a dragon would look if it existed. If we know what
properties are preserved by the iconic convention, we can infer what kind of
properties dragons would have.
122 Thinking in Images
Let us go back to the black hole picture. Recognizing the black hole in the
picture does not require that we stand in some perceptual or resemblance-like
relation to the black hole. If we know the iconic convention of the picture, we
know which properties of the black hole it preserves. That is why we can infer
the properties of the black hole from the picture.
Consequently, the skill of recognition can be dissociated from having
descriptive knowledge. Let us suppose that the picture of a black hole resembles
a square rather than a circle. Still, knowing the iconic convention that optical
information is preserved in a photograph, we could recognize it as a black hole
picture. The square-like black hole would not correspond to our common way
of representing black holes, but no one says that science should align with how
we commonly represent things.
Summary
This chapter discusses the recognition concept and its relation to images.
Recognition is a natural and direct relation to the represented object. I hold that
it is a distinct kind of reference relation that is irreducible to demonstratives and
descriptions. Images are necessary to represent this reference relation.
I explain the recognition in terms of identifying the construction invariants,
that is, the properties that remain preserved in different constructions. I hold
that recognition is based on identifying these invariants. In the last section, I
explicate the concept of iconic convention, characterized as a set of construction
rules that preserve the same invariants.
6
What is an image?
they depict them in different ways, having different content. There can be cases
when pictures have the same content but different targets, too. The same picture
of John can represent John, as well as a man as such if it figures in a biology
textbook.
Seeing Marilyn Monroe on black canvas (Schier, 1986) is a special case when
we identify a target without matching content. We take Marilyn Monroe to be a
target, but the content of the black canvas is not Marylin Monroe. The Putnam’s
ants case (1981) is the reverse of that. A Churchill-like trace left by an ant does
not refer to Winston Churchill. He is not a target of this representation. However,
the content of the trace identifies the properties of Winston Churchill. There can
be targetless representations that have content, too. Impossible images aim at
no target. Still, the impossible fork and the Penrose stairs have different content.
In depiction theories, these dimensions are often conflated. Wollheim’s (1987,
1998) seeing-in theory fails to distinguish between the content and target.
Consequently, it cannot identify correctness standards that determine whether
the viewer has correctly perceived what a picture represents. Wollheim suggests
that the author’s intentions can do the job (Wollheim, 1980, 1993). However, the
concept of intention can be applied to identify the target but not to the content.
My seven-year-old son and Alberti can have the same intentions, but they
produce different content (Hyman, 2012, 2015).
The resemblance-based theories are the theories of content, not target.
Resemblance can determine correctness conditions for content but is too liberal
to specify a target. The reason is that, in some respect, everything resembles
everything else. Goodman’s famous attack on resemblance (1972, 1976) is an
attack on resemblance as a theory of target (e.g. Greenberg, 2013; Walton, 1990).
However, Goodman’s theory is a non-starter, too, for it rejects the distinction
between content and target, reducing the theory of content to the theory of target.
The inability to distinguish between content and target is a source of confusion
in the theory of depiction. It suggests that the main problem is not that there is
no unified theory of depiction; maybe no such theory is possible. The problem is
that we have many theories that pick out different topics.
We are now in a position to characterize the iconic reference. The relationship
between an image, content and target is two-dimensional. The general idea is
that an image (in respect of the circumstances of evaluation) denotes a target
and the properties of the carrier exemplify the properties of content. In turn, the
properties of content (in respect of the iconic convention) are used to reidentify
(in terms of recognition-based identification) the referent. The content indicates
the target by means of reidentifying the referent. Depending on whether the
What is an Image? 125
referent matches the target or not, the image is either accurate or inaccurate. Let
me call it the two-dimensional model of iconic reference (2-D for short). It is
represented in Figure 6.1.
Let us start with the concept of an image. In the 2-D model, it is the carrier of
the representational function. The carrier of a representation can be a physical
object, for example, a picture or a diagram, and a psychical one, for example,
a mental image. For instance, the carrier of the portrait of Duke Wellington is
the canvas covered with pigments, and the carrier of the mental image of Duke
Wellington is the mental representation that underlies the experience of the
Duke. The main question of the theory of iconic reference is how the properties
of the carrier, such as pigments on the picture’s surface, can represent something
outside of the carrier. In other words, what is it that turns an image into a
representation of something else?
Drawing on Goodman (1976), I distinguish between denotation and
exemplification as two kinds of reference relations. According to the 2-D model,
images denote the target and exemplify the content. Denotation is a dyadic relation
between a representation and something it stands for.3 Exemplification runs in
the opposite direction. It refers back from the selected properties of the object
to its representation. Additionally, it requires instantiation. A representation has
to have the exemplified property. For instance, a water sample exemplifies the
water quality, for it instantiates it. Exemplification is instantiation plus reference.
However, not every instantiated property is exemplified. A water sample
instantiates the property of being taken on a certain date. Yet, the sample does
Figure 6.1 The 2-D model of iconic reference representing the relationship between
an image, its content, the referent and the target. An image denotes a target in respect
of the circumstances of evaluation and exemplifies the content, which reidentifies the
referent in respect of iconic convention. The content indicates the target by identifying
the referent. © Piotr Kozak.
126 Thinking in Images
not exemplify the date. Depending on the context, only selected properties are
exemplified.
The role of a sample is to highlight the selected properties of the object. To
do so, the highlighted properties of the sample have to be salient. A bald man
wearing a hat is not a good sample of baldness. A sample has to be filtered (Elgin,
1983, 1996b; Goodman, 1984; Goodman and Elgin, 1988).
Let me explain the nature of iconic reference, starting with iconic denotation.
There are two general approaches to define it (Abell and Bantinaki, 2010). On
causal models, images denote their objects in virtue of being part of a causal chain
between the representation and the represented object. In intention-based models,
images denote in virtue of the author’s intentions which aim at specific objects.
The intention-based approach works well for paintings and drawings. Yet,
it hardly applies to photographs, for they are based on causal rather than
intentional relations. Causal models do not apply to fictional representations.
There is no causal relation between a unicorn and its picture, but there are
pictures of unicorns.
To find a way out, we need to make two remarks. First, these two approaches
are often conflated. Let us recall Kripke’s causal theory of proper names (1980). In
order to identify the denotation of a proper name, you have to fix the denotation
in the act of the initial baptism. However, the initial baptism is not part of the
causal chain, which is fixed intentionally. The causal chain is the way we identify
the intention-based act of baptism.
The general idea of the intention-based approach is that intention carries
a teleological function. A intends B in the sense that B is a goal of A. Causal
relations can be easily wired into such teleological functions. For instance, some
causal chains can be picked out by carrying some teleological function in a
biological system (Cummins, 1996).
In the case of photographs, certain causal chains can be picked out by
distinguishing the teleological function of the photographs. If a picture is
distorted due to a lens defect, the lens defect is part of the causal chain, but it is
not the target. We exclude this causal chain from the reference relation because
the photograph has not been designed to depict lens defects.
Second, intentions are not transparent. It is not the case that, when producing
an image, we have a clear view of our intentions, as if some transparent mental
events guide our actions. Rather, we infer our intentions by judging the final
product of our intentional act. Architectural sketches well illustrate this. If I start
sketching a building, I do not have to know what I want to draw. I am searching
for it.
What is an Image? 127
The source of confusion is that the concept of intention is nested within our
cognitive practice of explaining representational content. Ascribing intentions
can be seen as taking an intentional stance (Dennett, 1987) to explain the object
of representation. It can be useful to describe an image in terms of mental
events, such as intentions, to identify a targeted state. Yet, intentions are only a
method of identifying these states. The concept of intention is like the concept
of a meridian. It is used to identify some world properties but is not the same
object as a chair.
Does this mean that intentions are not real? This is a misplaced question. It
is like asking whether a meridian is real. These are certain ways of describing
the world, but they are not a description of the world. Depending on the level of
explanation, the intentional stance can be replaced by the design stance or the
physical stance. However, taking the design or physical stance does not falsify
the intentional stance. It is only a matter of the way we describe the world.
The measurement-based account can be reconciled with causal theories in
the following way. Identifying the target enables us to pick out relevant causal
factors. The black hole picture denotes the black hole, for it is part of a causal
chain whose initial element is the black hole. At the same time, the picture has
been designed to depict the black hole. Depicting it was a goal of the picture.
Therefore, irrelevant causal links, such as distortions caused by the defects of
measurement devices, are excluded from the denotation.
Yet, the existence of the causal relation is neither necessary nor sufficient for
denoting the target. The unicorn picture identifies a possible world inhabited by
unicorns and picks them as its target.4 Yet, there is no causal link between them.
However, this does not imply that causality plays no role in determining
iconic denotation. There are images, such as unicorn pictures, that denote
objects in some possible worlds, and images, such as black hole pictures, that
denote objects in the actual world. The key is to identify what possible world we
are in. The condition of identifying isomorphism between the properties of a
possible world and an image is too weak. A map of London can be isomorphic
with Mordor, but Mordor is not part of the actual world. Looking for causal
correlations seems to be a good criterion for such identification. Picking out
the causal relations between an image and the target indicates that the possible
world overlaps with the actual one.
Following Kaplan (1989), identifying the state of space and the possible
world we are in establishes the so-called circumstances of evaluation. It can
be defined as a function from the image to the possible world. By identifying
the possible world, an image can locate a target in a possible world w1 and be
What is an Image? 129
targetless in w2. Compare a documentary and a feature movie. The target of the
first is located in the actual world; the target of the second is in a fictional world
(Terrone, 2021b).
To sum up, we arrive at the following definition of iconic denotation.
The general idea is that iconic content exemplifies the construction rule, which
determines the parameters of some informational space necessary to identify
the referent. Iconic content shows the ways to localize some state of physical
or logical space. We can unpack this definition based on the concepts of
recognition-based identification and iconic convention.
Primarily, an image does not exemplify the properties of the target. This
restriction is necessary, for, without it, the 2-D model would be vulnerable
to objections of the following kind. For one thing, exemplification could not
ground the content, for not every property can be instantiated. A picture of my
mother represents her as happy, but the picture is not happy. Similarly, there is
no way to exemplify the properties of unobservable phenomena. An image of
the Higgs boson does not instantiate any property of the boson (e.g. Frigg and
Nguyen, 2020; Mößner, 2018). For another, the symbol ‘word’ refers to a word
and exemplifies the property of being a word. Yet, it is not a picture of a word
(Goodman, 1988).
We can easily avoid these objections, for an image exemplifies the properties
of constructions, not the constructed objects. The properties of construction
and the constructed objects should not be confused. Consequently, images can
exemplify unobservable phenomena, as they do not show them. They show how
to localize them in some physical or logical space.
130 Thinking in Images
However, not every carrier property is a property of the content. Only these
properties of the carrier that are highlighted by the construction parameters
are relevant. For instance, I can draw a square imprecisely because of my
lack of skills. Still, I recognize it as a picture of a square, for it represents the
construction rules of squares. The content highlights relevant properties of the
representational carrier.
Notably, iconic content does not represent objects. It represents the ways
of identifying objects. Interpretation of iconic content is based on identifying
construction parameters determined by the construction rules. These
parameters are the basic units of construction, their order and the relation
between them. Construction rules determine the permissible transformations of
these parameters. Based on the iconic convention, we can identify construction
invariants, that is, the properties that remain unchanged across different
constructions. Finding these invariants brings about the recognition-based
identification of the referent. The referent is a property, object, event or scenario
identified by the content.
The construction parameters of iconic content are representational units,
such as colours and lines, and relations between them. They have to be
distinguished from the registration of the representational content. For instance,
brush strokes on the picture’s surface register the colour, but the representational
unit is a colour, not a brush stroke. Finding how to register content is a matter
of representational technique. In the same way, it is a matter of measuring
technique to use mercury (instead of, e.g., alcohol) to represent the temperature
in a thermometer. However, the representational unit is the height of the mercury
column, not mercury.
Consider a picture of John. The target of representation is John. However, the
way it is represented, the composition of lines and colours, is the content of the
picture. The referent is the object identified by these lines and colours. Brush
strokes are the ways colours and lines are registered.
The referent of iconic content can be either a particular or general object. Let
us recall the distinction between token- and type-diagrams. A token-diagram
represents a particular token, such as a specific triangle. A type-diagram
represents a general object, such as a triangle as a such. Depending on the
identified construction invariants, every image can be taken to represent either
a particular or general object. However, every type-representation involves
representing a token (but not vice versa).5
Importantly, recognition-based identification is directly referential. It implies
that iconic content is identified directly in the picture. It is not inferred from the
What is an Image? 131
picture. However, this does not mean that representations have a mysterious
feature of being intentional in terms of seeing a nonphysical Bildobjekt (Husserl,
2005) or that we are seeing-in the represented objects (Wollheim, 1987). These
objects are artefacts of the measurement operations. How should we understand
this?
Consider the measurement operation. Saying that the term 20°C indicates the
particular temperature is not saying that we are intentionally directed to some
abstract number. This number is a construction property by which we localize
some state of the physical space determining its relations to other states, such as
19°C and 21°C (Armstrong, 1989; Swoyer, 1987). Saying that the temperature
is 20°C means that this number localizes some particular state of the physical
space. In this sense, we can directly identify this state by this number.
Let us compare it with iconic denotation. It has been characterized in terms
of dyadic relation to the target. In contrast, iconic content is not a relation to
some abstract referent. It is a way of localizing the referent. Localizing a referent
means finding its location in some physical or logical space, in the same way as
the number 20°C directly localizes some particular temperature value.
Images appear intentional, for we use intentional vocabulary to describe them.
For example, I can say that I see my mother in the picture. However, saying that I
can see my mother in a picture is not saying that I can see an abstract idea of my
mother. In the same way, saying that the temperature is 20°C is not saying that
the temperature has some numerical content. Iconic content, just as 20°C, is the
way to localize a certain state of the world.
Exemplifying content is insufficient to recognize the referent, for the
representational carrier can instantiate many different properties. Content
requires interpretation following the iconic convention. The goal of
interpretation is to select the properties that contribute to the recognition of
the target. Interpretation highlights these properties by identifying construction
parameters.
Interpretation of content is ruled by the Davidsonian-like principle of charity.
The principle holds that selected construction properties are exemplified to bring
us closer to the target. In other words, we presuppose that the goal of content is
to identify the designed target.
However, the nature of the principle of charity is heuristic. We can take
different interpretational stances and shift the target. For instance, we can
abstract away from the author’s intentions and study the social factors that are
causally manifested in a picture. In this sense, the picture’s interpretation is free
of the author.
132 Thinking in Images
Thus, we arrive at the following definition of the 2-D model of iconic reference.
2-D model of iconic reference: image I refers to target T, iff it (i) I iconically
denotes T; and (ii) I exemplifies iconic content C; and (iii) C identifies referent
R; and (iv) C indicates T through identifying R.
convention of the black hole picture, we can identify the referent. Recognizing
that the target matches the referent is knowing that they both localize the same
state in the physical space.
Maps illustrate this point clearly. The target of a map is a state of some
physical space, for example, London streets. The content of a map, for example,
the composition of directions and points, identifies the ways of localizing these
streets on the map. Knowing that the map preserves spatial relations, we localize
a street layout. This is a referent. However, to use the map to get about London,
we must locate ourselves on the map, that is, we must recognize that the map’s
street layout localization indicates the same localization in the physical space,
that is, London. We use the referent to identify the properties of the target. A
map is correct if the referent and target mark the same localizations.
Importantly, in contrast to the identification of a target, identification of the
referent is not relativized to possible worlds. Consider a cross-ratio invariant. A
geometric projection preserves cross-ratios in every possible world. No matter
what kind of space is projected onto the plane, whether we are in the Euclidean
3-D space or in the flat world, the cross-ratio is preserved in the projection. By
the same token, the triangle invariants, such as having three sides, are identical
in all possible worlds. The idea is that a referent localizes the same objects across
all possible worlds.
Consequently, depending on the kind of reference we pick out, an image
refers to the represented object rigidly or non-rigidly. From the perspective
of content, an image behaves like a rigid designator. From the perspective of
denotation, an image is a non-rigid representation. For instance, in a possible
world w1 a map of London denotes London, while in a possible world w2 the
same map denotes Mordor. However, in both worlds, the content of the map
is the same. Iconic content rigidly identifies the referent in the same way as the
standard metre identifies the same distance in all possible worlds.
Correctness conditions
The distinction between the iconic content and target expresses two kinds of
image correctness standards. In general, these standards are accuracy conditions.
Accuracy is a matter of degree. An image can be more or less accurate. However,
accuracy is a combination of two different kinds of standards. For one thing, an
image can be a true or untrue representation of the target. For another, iconic
content can be precise or imprecise. An image can be true but imprecise, and
134 Thinking in Images
Figure 6.2 The relation between trueness, precision and accuracy conditions. A
representation is accurate iff it is both precise and true. © Piotr Kozak.
precise but untrue. It is accurate iff it is both true and precise. The relation
between the concepts of trueness, precision and accuracy is represented in
Figure 6.2.
The concepts of trueness, precision and accuracy come from the
measurement theory. Trueness represents how close a measurement is to a true
value. Precision is the closeness of the measurements to each other (ISO, 1994).
We can apply these concepts to the analysis of iconic reference. First, the
general idea of the precision condition is that measurement results should be
unambiguous to identify the measured quantity. By the same token, the properties
of iconic content should be unambiguous to identify the referent. It implies that
the construction invariants should identify an unambiguous set of properties
that enables us to recognize the represented object. For instance, a photograph is
sharp if it identifies the properties that make us able to recognize the represented
object. Otherwise, it is unsharp. The inflation diagram is precise if it displays the
data that allows localizing inflation growth. Otherwise, it is imprecise.
The precision conditions are relativized to the goal of the representation
and the cognitive skills of the interpreter, which both build the context of
understanding an image. An image can be too little or too precise, depending on
What is an Image? 135
the goal. For instance, a map has to be precise enough to identify localizations.
It cannot be too precise, for then it would overlap with the territory. Depending
on the cognitive skills of the interpreter, images can be understandable or not. A
mathematical graph is understandable only if its interpreter has the mathematical
competence to understand it.6
Second, a measurement is true if it matches the value of the quantity it was
designed to measure. By the same token, an image is true if the referent matches
the target. For instance, a picture of a square is true if it was designed to represent
squares. It is false if otherwise. I am not imprecise if I intend to draw a square but
draw a circle instead. I am wrong.
Importantly, we should not confuse the trueness of images with the truth
of propositions. The concept of trueness should be taken in metaphysically
neutral terms. We do not have to interpret it as presupposing the existence of
some true values (Giere, 2006; Swoyer, 1987). Instead, we can think of it in
terms of robustness conditions, that is, the concepts that introduce coherence
and consistency to measurement outcomes (Chang, 2004; Tal, 2013, 2017; Teller,
2013, 2018). For instance, the standard metre is not true in terms of representing
the true nature of what one metre is. It determines what one metre is. By the
same token, a painting can set standards of what beauty is. It does not imply,
however, that some abstract object exists called ideal beauty.
Yet, although the measurement-theoretic concept of trueness is distinct from
the concept of the truth of propositions, they are linked. Let me illustrate this
with the concept of a sample. A sample is true if it exemplifies the measured
value. For instance, the water sample is true if its quality is close to the water
quality. However, it is not guaranteed that the sample will represent it. The
sample can be badly taken, or the distribution of water quality values can be
abnormal.
In contrast, the truth of propositions requires that we can distinguish the
atomic formula of a semantical system and provide its translation into a
metalanguage that satisfies a T-schema. For instance, the proposition snow is
white is true for it satisfies the T-schema such that ‘snow is white’ is true iff
snow is white. By using the T-schema, we can determine the truth-conditions of
compound formulas. For instance, a compound proposition of the form A and
B is true iff both A and B are true. This implies that propositions have a logical
form that provides correctness conditions for such translation.
Samples cannot satisfy these conditions. They can be badly taken, but they are
not true or false in terms of the truth of propositions. No atomic formulas can be
distinguished, for any sample represents many properties at once. Samples have
136 Thinking in Images
no logical form. A sample of water quality does not logically imply water quality.
It instantiates water quality.
However, a sample can be a basis of a proposition in the following sense. The
water sample localizes the water quality property. It provides us with information
about the quality of water and can be a source of the proposition that water
is contaminated. The sample works as a justification for this proposition, too.
If someone asks why I hold that the water is contaminated, I can point to the
sample. Yet, this does not imply that I cannot be wrong. The sample can be badly
taken, and the proposition based on the sample can be wrong.
By the same token, an image is not true or false in terms of being a true or false
description of a target. Images can be properly or badly taken. At the same time,
images can be true in terms of the measurement theory if the referent identified by
the properties of the content is close to a target. An inflation diagram is true if it is
close to the true value of inflation growth. A conventional picture of a dragon is true
if it is close to the stereotypical depictions of dragons. An image of a square is true
if its properties are the same as those identified by the square definition. An untrue
picture is such that the referent localized by the content is far from the target.
To conclude, we get different correctness conditions depending on the kind
of iconic reference.
Precision condition: image I is precise iff the I’s content has such properties that
allow one to identify the referent unambiguously.
Trueness condition: image I is true iff the I’s referent is close to its target.
Two remarks are in order. First, the iconic reference and correctness conditions
are defined in non-mentalistic terms. They do not refer to the mental states of
a perceiver or a phenomenology of pictorial experience. However, it does not
imply that mental states are irrelevant to understanding the meaning of images.
No one holds that. It means that a theory of how we grasp meaning does not
overlap with the theory of meaning.
Second, the 2-D model heavily depends on our understanding of reference,
exemplification and indexicality concepts. It cannot be otherwise, for this model
is intended to be part of a general theory of meaning. The implication is that
the 2-D model can be incorporated into the general framework of the theory of
meaning. That being said, not all model concepts can be sufficiently explicated
since this would assume that we have to clarify other concepts of the theory of
meaning. No book is long enough to do that.
What is an Image? 137
Image: a kind of measurement device that shows how a measured object would
look in some measurement set-up by representing the ways of identifying this
object.
Just like rulers and balances, images represent the ways of localizing the
referent. They represent the ways how to localize some states in a rule-governed
manner. They are not only applications of some measures. Images exemplify
these measures. Images represent the outcomes of measurements together with
the rules of construction, showing how to arrive at these outcomes.
Let us consider the inflation diagram. It does not tell us only about the
inflation growth value. It represents how to localize the inflation growth value
by mapping time onto the inflation rate. In the same way, a world map does not
tell us that London is localized at some geographic longitude and latitude only.
It also informs us how to get to London.
In contrast to rulers and balances, images can represent non-numerical and
qualitative properties. A colour sample shows you how to find the matching
colour. A depiction of John shows how to identify John. Images can represent
incomparable and unscalable phenomena. Think about the feeling you had the
day your child was born. I cannot describe it or ascribe it any number. Yet, I can
show you a way of how to arrive at it, for example, I can express it in music.
Thus, from a measurement-theoretic perspective, images play a twofold role.
On the one hand, they are the outcomes of some measuring procedures. For
instance, a portrait of John is an outcome of a measuring procedure performed
on the depicted situation, in the same way as 20°C is an outcome of measuring
temperature. An image localizes a depicted situation, just like 20°C identifies
a temperature value. On the other, images represent measures that determine
the ways we localize the measured values. Just as a ruler shows how to identify
a measured length, the portrait of John shows how to identify John. Images are
the tools by which we measure the world.8
What are the consequences of the measurement-theoretic approach for
understanding the nature of images? First, the question about the nature of
images is misleading. If we hold that images are a kind of measurement devices,
then we can characterize images only in functional terms. Let us think about
what connects mental, auditory and olfactory images. One can argue that the
use of the term ‘image’ in all these contexts is different. That suggests, however,
that there are essential properties of being an image, just like the atomic
number is the essential property of being a chemical element. According to the
measurement-theoretic approach, this assumption is false. Asking about the
common nature of mental, auditory and olfactory images is like asking about
the shared nature of rulers and balances. This is a badly posed question, for
there is no essential property of being a measurement device besides bearing a
measurement device function. We distinguish between different measurement
140 Thinking in Images
instruments depending on what and how is being measured. By the same token,
images can be characterized only functionally. They are the objects that are
necessary for some measurement operations. Mental, auditory and olfactory
images are different kinds of measurement instruments.
Second, according to the Traditional View, images are copies of the world.
In the measurement-theoretic framework, they are not copies, just as the ruler
is not a copy of some distance. Images may appear as if they were copies of
the world, just like a ruler may appear as if it were a copy of some distance.
However, just like rulers, images are measurement devices that represent the
measures to localize some state of the world and help us find our way in the
world.
Let us consider the portrait of John. According to the Traditional View,
the portrait is a copy of John. However, we can think about the portrait not in
terms of copying John but in terms of the ways of localizing John. According to
the measurement-theoretic approach, the portrait is a measure that we use to
identify John.
Perceiving the portrait of John does not imply that, to understand it, we need
to know what John looks like; in the same way, to understand what the standard
metre represents, we do not have to know what the length of one metre is.
Perceiving the portrait does not imply that when we meet John, we will see him
from exactly the same perspective. The portrait gives you a measure to localize
John, in the same way as the standard metre is a measure to localize the length
of one metre. This measure can be applied in different contexts, for it is not
characterized by copying reality but searching procedures.
Compare this to a map. On the one hand, it is an outcome of some
measurement procedures. On the other hand, it exemplifies a measure that helps
you find your bearings in space. We do not have to know what London looks like
to understand the map of London. Conversely, we use the map to learn where we
are. Moreover, to find out where we are in London, we do not have to pick out
the same perspective as that represented on the map. The ability to recognize its
spatial invariants and apply them in different contexts is crucial for using a map.
Does that imply that the portrait of John and a map of London cannot
misrepresent John and London? It does not. The portrait and the map, just
like any measurement, can be inaccurate. Is this not a vicious circle, however,
one could ask? To know whether a map is accurate, you need to know what it
represents. To know what is represented, you need an accurate map.
This problem can be generalized and applied to the 2-D model of iconic
reference. The ability to identify construction rules presupposes that we recognize
What is an Image? 141
the final effect of the construction. Knowing how to reach a target presupposes
knowing where we want to get. At the same time, to recognize an object, we
have to identify the construction parameters. For instance, to recognize the
mountain seen in a painting, we must identify marks on the picture’s surface
that are used to represent it. However, to identify these marks, we must know
that they represent the mountain. Thus, the ability to construct an object has to
be taken together with the ability to recognize it.
However, the circle is not vicious. Consider the so-called coordination
problem in the measurement theory. It concerns the problem of coordinating
theoretical quantity terms with measuring procedures. The empirical adequacy
of the theory and the reliability of the measuring procedures presuppose each
other. To establish a theory, we have to test its predictions. However, this requires
a reliable method of measuring. This, in turn, involves having background
theoretical knowledge.
The traditional approach to the coordination problem holds that coordination
is accomplished by specifying definitions for some quantity terms. These
definitions are taken to be analytical and require no empirical testing. This solves
the problem of circularity but forces us to accept many metaphysically loaded
assumptions. The solution is particularly disturbing if we are not fans of the
analytic-synthetic distinction.
According to the coherentist approach to the problem (e.g. Chang, 2004; Tal,
2013; van Fraassen, 2008), we do not have to cut the circularity of coordination.
We can try to show that the circle is not vicious, for constructing a quantity
concept and standardizing its measurement are co-dependent and iterative.
Each iteration in the history of standardization respects and corrects existing
traditions. As van Fraassen (2008) argues, the coordination problem arises
when one adopts a foundationalist view and attempts to find a starting point for
coordination.
Let us consider the calibration function of measurement. It is inseparable
from any measurement activity, for every measurement is underdetermined
by instrument indications. The same indication may be taken as evidence for
multiple knowledge claims about the measured quantity, depending on which
background assumptions are used to interpret the indications. Calibration is
an activity of modelling different processes and testing their consequences for
mutual compatibility. However, comparing some standards is neither necessary
nor sufficient for successful calibration (Tal, 2017).
Let us apply the coherentist approach to the 2-D model of iconic reference.
An image is an effect of a continuing and co-dependent process of searching for
142 Thinking in Images
the best way to represent the target and trying to localize the target. It involves
ongoing attempts to match a referent to a target by searching for the most precise
way to identify the referent. Identifying the referent involves correcting our
background assumptions regarding the target.
This process is iterative and involves modifications of image construction
and representation target. It respects existing iconic conventions but is not
determined by them. Most importantly, it describes both the production and
interpretation of images.
Let us consider an architectural sketch. Sketching is a continuing process of
searching for the most precise expression of some architectural idea. At the same
time, it is a process of searching for the idea. When sketching, an architect has
only a rough measure of the target. He is localizing it by producing the content.
Next, think about the process of image interpretation. It is an ongoing activity
of trying to localize the target based on testing our assumptions in analysing
iconic content. At the same time, it is an activity of analysing iconic content based
on our background knowledge of the target. Broadly speaking, an architectural
sketch and image interpretation are a kind of calibration process of modelling
the idea and testing the consequences of the model.
To sum up, it seems productive to think of images as a kind of measurement
devices. This solves some old problems of depiction, such as the problem of
resemblance. On the other hand, it introduces some new problems, such as the
one of coordination. The advantage of the measurement-theoretic approach is
that it can be hoped that these problems can be solved within a more general
theoretical framework.
exist, then how is it possible to represent them? That is the paradox of impossible
images.
Some philosophers and psychologists try to find a way out of this paradox
by adopting three argumentation strategies. First, it is common to interpret
impossible images as instantiations of perceptual illusions (Ernst, 1986; Gregory,
1966; Kulvicki, 2006a). For example, it is possible to build an object that looks
like the Reutersvaard-Penrose triangle from a particular perspective.
However, the concept of an illusion implies that something merely looks like
something that it is not. No such thing needs to be said about the Reutersvaard-
Penrose triangle. Even if there can be a consistent object that resembles the
Reutersvaard-Penrose triangle, this does not imply that the triangle represents a
consistent object. If I build a scale model that looks like the Eiffel Tower, it does
not imply that photographs of the Eiffel Tower represent its model.
Second, it can be held that the Reutersvaard-Penrose triangle is an example
of a figure whose parts are consistent when taken separately but inconsistent
when put together (e.g. Blumson, 2014; Budd, 2008; Kennedy, 1974; Voltolini,
2015). According to this argument, just as we cannot have inconsistent beliefs,
we cannot have inconsistent images. Yet, we can have an inconsistent set of
beliefs that can be divided into consistent parts. Priest’s (1997) short story
‘Sylvan’s Box’ is an example of such a case. According to the plot of the story, at
one point, the main character believes that the box is open; at another, just the
opposite. The story is an example of a more general phenomenon. Any fiction,
just as experience, is full of more or less noticeable contradictions. Similarly,
the Reutersvaard-Penrose triangle is inconsistent as a whole, but each part is
consistent. Covering any two angles of the triangle reveals that the remaining
one is consistent.
However, this analogy leaves something out. In Priest’s story, we cannot
simply take out parts of Sylvan’s beliefs and hold that they are consistent. Such
a move would turn any contradictory system into a non-contradictory one.
We hold that the system is inconsistent if the relations between the parts of
the system are inconsistent. Thus, we need to supplement the story by holding
that Sylvan can be unaware of the inconsistencies of his beliefs. Alternatively,
he can have some implicit beliefs that render the inconsistent story consistent.
No such thing applies to the Reutersvaard-Penrose triangle. It is neither a
depiction of three disassembled parts nor is there anything implicitly assumed
in the picture. We see three assembled parts that are together inconsistent.
Nothing is hidden or implicitly assumed. The impossible object is seen in the
picture.
What is an Image? 145
The impossible fork depicts an object that must have two and three prongs
simultaneously to exist. This cannot be the case. Therefore, it is an impossible
image.
No consistent mathematical theory can describe the impossible fork
(Mortensen, 2010, 135 ff.). It is not a type of an occlusion paradox, either. Changing
the occlusion of any part(s) of the figure does not make it consistent. According
to Mortensen, the only way out is to acknowledge the existence of paraconsistent
geometrical systems that can include objects such as impossible forks.
However, the price is that we have to acknowledge the existence of
contradictory objects in our ontological landscapes. Consequently, we have to
adopt some sort of dialetheism, which is a belief that there are true contradictions
(Priest, 1999), which is not a conclusion everyone can happily accept.
Dialetheism might or might not be true, but we can do without it. According
to the measurement-theoretic account, images refer to their objects by
exemplifying how these objects can be constructed. The general idea is that there
can be no impossible objects, but there can be ‘indeterminable’ constructions.
Constructions have been characterized as procedures of arriving at a target by
means of determining the parameters of some space. Arriving at a target means
localizing an object in this space. However, there are such constructions whose
parameters cannot determine the properties of some space to localize the target.
These are indeterminable constructions.10
There are two classes of indeterminable constructions. We can compare
them to attempts to measure unmeasurable properties and attempts to measure
with the wrong tool, such as measuring length with gas. The first class refers to
the constructions that aim at the spaces whose properties are indeterminable.
Consider the Reutersvaard-Penrose triangle. It represents a construction that
does not determine the properties of the triangle, for the bottom bar of the
triangle is located to the front and the back of the topmost point of the triangle.
However, this does not imply that it represents an impossible object. The
Reutersvaard-Penrose triangle can be interpreted as representing a construction
that cannot determine the kind of space it is in. Alternatively, it can represent
a construction in some non-orientable space, for example, a four-loop Möbius
strip. Non-orientable spaces do not determine such properties as orientation.
They may appear inconsistent if we confuse the concepts of non-orientability
and inconsistency. In this perspective, the Reutersvaard-Penrose triangle only
appears as if it represented impossibilities.
The second class of indeterminable constructions refers to the constructions
that fail to determine the properties of some space. These are abortive
What is an Image? 147
constructions. They cannot reach a target since the construction parameters are
such that they localize an inconsistent object. Recall the operation of dividing
by 0. It localizes a set of inconsistent values. Constructing a triangle from the
line segments such that each side of the triangle is bigger than the sum of
the other two sides is abortive, for the operation cannot construct a triangle.
However, the fact that some constructions are abortive does not imply that they
do not exist.
The proper class of impossible images refers to such images that exemplify
abortive constructions. The impossible fork represents the rules of construction
that cannot reach the target. It is only one of the many abortive constructions.11
Moreover, since we distinguish between the properties of objects and the
properties of construction, we can keep the intuition that impossible objects
cannot exist, but there can be representations of impossible objects. Impossible
images represent constructions without targets. Consider a round square. Can it
exist? Obviously not. But there can be an image of a round square. Why is this so?
Metaphysical impossibilities are logical contradictions. The concept of a
round square is clearly contradictory. If so, according to the Duns Scotus Law, if
p & not-p, then q, everything can be an instantiation of this concept. However,
this does not seem right, for it contradicts the intuition that there can be better
or worse representations of non-existent objects. Compare an image of a shape
representing a cross between a square and a circle (the so-called squircle) and an
image of a horse. Granted, they are both inaccurate depictions of a round square.
However, the squircle appears less inaccurate than a horse picture. The squircle
represents an inconsistent measure to localize the round square, but it is still
better than the horse measure. By the same token, a ruler made of elastic material
is a bad measure, but it is still better than a ruler made of gas. In both cases, we
will get inconsistent results; yet in the first case, they will be less inconsistent.
To sum up, in the measurement-theoretic framework, we do not have to
introduce inconsistent geometries to explain impossible images. Moreover,
we can deliver a unificatory explanation of two kinds of impossible images.
Impossible images are the kind of images that exemplify indeterminable
constructions.
Summary
In this chapter, I described the 2-D model of iconic reference. According to the
model, images have two-dimensional semantics. They denote their targets and
148 Thinking in Images
exemplify content that identifies the referent. In the 2-D model, images employ
two kinds of correctness conditions characterized in the measurement-theoretic
terms: trueness and precision. Images that are both true and precise are accurate.
I employed the measurement-theoretic framework to explain the properties
of iconic reference. I claimed that images are a kind of measurement devices.
Contrary to the Traditional View, images are not copies of the world. They are
ways to localize the properties of the world, comparable to rulers.
In the last section, I showed how this framework could be applied to explain
the phenomenon of impossible images. According to the measurement-theoretic
view, impossible images represent indeterminable constructions.
7
Imagistic knowledge
The general idea of the Traditional View is that images resemble the world.
From such a perspective, it is always possible to ask whether iconic content fits
the resembled reality. Thinking about imagistic content in terms of matching the
world suggests that images can be true or false. However, these assumptions are
questionable if we accept the conclusions of Wittgenstein’s and Frege-Davidson’s
arguments. There is no room for pictorial truth.
It may appear that the lack of truth values prevents images from being
part of a rational train of thought. However, we can accept the premise without
acknowledging the conclusion. We do not need truth for something to be part of
a rational train of thought. Models and idealizations are not true, yet they are a
vital part of knowledge (Cartwright, 1983; Elgin, 2017). A thermometer is not a
truth-bearer. However, its readings can be part of justified beliefs.
In the same way, images can do epistemic work without being true (in the
propositional sense). A geometric drawing justifies our beliefs that a mathematical
figure is constructible. A map justifies our belief that we can get from London
to New York (e.g. Gauker, 2020; Shepard and Cooper, 1982; Williamson, 2016).
Moreover, images can test our beliefs. If I believe that triangle sides can have
lengths such that each side is longer than the sum of the other two, I can change
my mind having been shown that such triangles cannot be constructed. If I hold
that London is west of New York, I can change my mind by looking at a map.
Does this mean that images provide a reliable source of knowledge? It depends
on our epistemic goals. Using measurement devices is context-sensitive. Rulers
are usually reliable measurement devices, but sometimes they fail to meet our
needs. A ruler can identify the length of line segments, yet it is useless when it
comes to measuring the distance between the Earth and the Sun. Usually, we
rely on images. Sometimes they are unreliable. However, this does not imply that
we can dismiss images as useless in our epistemic practice, just like we do not
dismiss rulers as useless.
Thus, images appear to provide some knowledge. The question is what
such imagistic knowledge can be. If images are not true, they cannot provide
propositional knowledge, for the latter involves a truth-evaluable description of
the world. By the same token, images cannot provide information about possible
states of affairs, for it would require them to be truth-bearers. Neither can they
provide explanations, for the latter are propositional and have logical form. The
explanation is shaped by argument and involves inferential structures. Images
are not ‘inferentially promiscuous’.
It may seem that we can respond to these doubts by biting the bullet and
distinguishing between propositional and non-propositional knowledge. One
Thinking with Images 151
can hold that although images are not propositional, they can constitute a form
of non-propositional knowledge.
However, much depends on how the concept of non-propositional knowledge
is understood. It is most often argued that imagistic knowledge, contrary to
propositional knowledge, is non-conceptual in content and builds phenomenal
knowledge or knowledge-by-acquaintance (e.g. Gauker, 2011; Gregory, 2013;
Mößner, 2018). For instance, a memory image of a red rose gives us knowledge
of what rose-redness looks like (e.g. Tye, 2012).
This strategy, however, will not get us far. First, non-propositional knowledge
can have the same correctness conditions as propositional knowledge. Depicting
what a thief looks like can be put in terms of a set of propositions attributing
properties to the thief. For instance, depicting a thief as bald is correct under the
same conditions as my belief that the thief is bald. If I depict the thief as having
long hair, I depict him incorrectly in the same way as the proposition the thief
has long hair may be incorrect. That implies that if I know what the thief
looks like, I know that he is bald. In other words, if I know what x looks like, I
know that in such and such circumstances, x looks so and so (e.g. Stanley and
Williamson, 2001).
Second, we rightly expect images to inform us about the world. For
instance, if I take a picture of a thief, I expect to learn who the thief is, whether
he is bald or tall. I am not interested in how the thief is presented to my
mind. When scientists study the black hole picture, they do not want to learn
about the phenomenology of black holes, either. They want to know what
black holes are.
Thus, images should tell us how the world is and not only what it looks like.
If images could provide only phenomenal knowledge, then imagistic knowledge
would be useless from the point of view of our epistemic interests. Instead, we
rightly expect images to figure within the web of our beliefs and theories.
The Epistemological Challenge questions these expectations. In other
words, the role we attribute to images cannot be fulfilled. That is because we
hold that beliefs and theories are truth-evaluable; images are not. Images are
not propositional, yet we expect them to figure in propositional knowledge. The
problem is not that these expectations are not right. The question is how they
should be explained.
The lack of propositional knowledge can be compensated by an image’s
contribution to understanding. As it seems, images are a plausible source
of understanding without being a tool of explanation (e.g. de Regt, 2017;
Lipton, 2009, Mößner, 2018; Zagzebski, 2019). The black hole picture gives
152 Thinking in Images
us an insight into what black holes are. It does not provide a theoretical
explanation of why they are so. That being said, we need a clear account of
what understanding is.
Importantly, we need a view of understanding that is not reducible to
explanation. For instance, coherence-based accounts of understanding (e.g.
Elgin, 2017; Kosso, 2007; Kvanvig, 2018) define it as recognizing connections
between facts and as a skill of matching things to form a general schema.
However, recognizing coherence between facts is only another way of knowing
an explanation of these facts since the explanation is based on identifying factual
connections (Khalifa, 2017).
According to de Regt (2017), understanding is distinct from explanation. It
does not require true theories, either. It is a skill of making theories intelligible
but not necessarily true. A theory is intelligible if scientists can recognize some
characteristic consequences of the theory qualitatively. For instance, a kinetic
picture of gas as a collection of molecules in motion enables a qualitative
understanding of how gas behaves. By developing molecular models of gas,
one can arrive at intelligible kinetic theories of gas which are the sources of
constructing explanations of the gas phenomenon.
Although de Regt’s concept of understanding does not directly apply to
imagistic knowledge, it hints at how to think about the latter. According to the
measurement-theoretic account, images are measurement devices representing
the ways of identifying measured objects. The idea is that they enhance
understanding by providing recognition-based identification of the objects of
our beliefs and theories.
Recognizing objects is not a propositional kind of knowledge. It is not a
matter of knowing a description of an object, either. I can know a description of
an object without being able to recognize it. Recognition is the skill of localizing
the represented object in some space. Knowing how to recognize an object is a
non-propositional (procedural) kind of knowledge.
Let us compare this to the understanding of indexicals. According to Perry
(1979), understanding indexicals such as ‘here’ and ‘now’ is irreducible to
knowing a description of the space you are in. Believing that a meeting starts at
noon is irreducible to knowing that it is now. Knowing that to leave a wilderness,
you have to follow the Mt Tallac trail is different from knowing that the Mt Tallac
trail is here. Indexicals locate our beliefs in space and time.
By the same token, imagistic knowledge provides an understanding of the
objects of our beliefs and theories by localizing these objects in some logical or
physical space. What does it mean?
Thinking with Images 153
Imagistic Knowledge: the skill of recognizing objects of our beliefs and theories
within the logical and physical space.
Let me illustrate this with the case of interpreting a map (van Fraassen, 2008). To
understand our position in space, we have to localize ourselves on the map. This
is not the kind of information we can get from the map’s description. Unless we
can find our location on the map, the map is useless. Even if a map indicates the
point ‘we are here’, we still have to determine our position. Based on the map, we
have to localize ourselves by identifying the parameters of the space. Moreover,
we need to know what moves are permissible by the map. For instance, we need
to know that if we turn left, we can get to point A but not B. To interpret the map,
we must identify how we can move.
The operation can be quite simple. It can be based on reading off the
information ‘Diagon Alley’ and finding the same inscription on the street. Next,
based on our knowledge of how streets are plotted on the map, we can find our
way from Diagon Alley to Knockturn Alley. Thus, a successful interpretation of
the map involves our ability to localize and orient ourselves in the space. What
does this teach us about the nature of imagistic knowledge?
According to the 2-D model, a map is a measurement device used to orient
ourselves in the logical or physical space. It exemplifies the rules of construction
and construction invariants. These invariants are the landmarks, such as street
names and the spatial relations between them. They enable us to identify spatial
localizations and directions. A successful interpretation of the map involves
identifying these spatial invariants and recognizing permissible moves so that
one can orient oneself in space. Consequently, a good map helps us to understand
our position in the physical or logical space.
By the same token, imagistic knowledge enhances the understanding of our
beliefs and theories by means of localizing them in the logical or physical space.
Such knowledge is irreducible to the set of propositions in the same way as
understanding a map is irreducible to knowing the map’s description. Imagistic
knowledge is the skill of recognizing the objects of our beliefs and theories
within the experience and the space of possible states. Let me illustrate this with
two examples.
Suppose you want to understand what a harmonic oscillator is. This involves
being able to point at a pendulum and hold ‘this is a harmonic oscillator’. The
pendulum is a model that identifies the invariant properties of the harmonic
oscillator by means of which you localize the harmonic oscillator within the
space set up by classical mechanics. The model does not explain the theory. It
154 Thinking in Images
localizes the states set up by the theory. It helps us to orient ourselves within the
theory, making it intelligible.
In the same fashion, the black hole picture makes the theory of general
relativity intelligible.1 The theory offers explanations of the origin and behaviour
of black holes. It provides a theory of what black holes are. To understand this
theory, you have to be able to localize the object of the theory in the physical
or logical space. You have to be able to point at the black hole picture and hold
‘that is a black hole’. The black hole picture enables recognizing the phenomenon
the theory describes. It helps you to orient yourself within the theory of general
relativity.
Knowing how to orient ourselves within the theory is a non-propositional
kind of knowledge. It consists in knowing how to localize the object within
some logical or physical space set up by the theory. Such information cannot be
deduced from the theory. The role of a theory is to explain a given phenomenon.
In contrast, images localize the object of the theory by enabling recognition-
based identification of this object.
By the same token, images identify the objects of our beliefs. Let us consider
a portrait of John. The picture identifies John, just as indexicals ‘here’ and ‘now’
localize an object in spacetime. The portrait exemplifies the construction rules
by means of which we identify John. Such information is irreducible to knowing
a description of John. I can recognize John without knowing a description of
John; I can know a description of John without any ability to recognize him in
the picture, too.
Images are irreplaceable in imagistic knowledge. Knowing how to recognize
objects is irreducible to knowing the descriptions of these objects and, therefore,
cannot be represented by any set of propositions. The picture of John can
misrepresent John, but it does not express any false proposition. I can recognize
John in the picture without any descriptive knowledge of who he is. The only
known medium that can represent imagistic knowledge is an image, for it
exemplifies the rules of construction by means of which we identify the depicted
object.
To illustrate this irreplaceable role of images, let us consider the mental
rotation tasks (Shepard and Cooper, 1982; Shepard and Metzler, 1971). The task
is to create a mental image of an object and rotate it in order to compare it to
the presented figure. Next, one has to decide whether both objects are the same.
Notice that such a decision cannot be made simply by comparing descriptions
of the object expressed in some language-like representation system (Pylyshyn,
1973). First, you need the skill of rotating the first object in the right way. There
Thinking with Images 155
are infinitely many ways to rotate this object, but only one way leads to solving
the problem. The way of rotation cannot be inferred from the figure’s description.
If it could, it would require checking infinitely many ways of rotation which
leads directly to combinatorial explosion.
Second, we would still need additional information if we had two similar
descriptions of the figure. The task is to identify two objects as identical.
This information does not follow from having two similar descriptions. I can
know these descriptions without any ability to recognize that they refer to the
same object. I can know both the content of a map and a description of my
environment, but it does not follow that I can orient myself in the space. By the
same token, I can know a description of John, but I still need the information
that the depicted person is John (Kaplan, 1989).
Granted, the description-based explanation of how people solve mental
rotation tasks does not have to be wrong. The problem is that it is not sufficiently
informative to work as an explanation.
In the 2-D model, the way we solve mental rotation tasks is easier to
understand. Recognizing two objects as the same involves identifying the rules
of construction. Images exemplify the rules of construction by means of which
we recognize depicted objects. We recognize two figures as the same if we
identify the same construction invariants determined by these rules.
In the mental rotation tasks, subjects hold that they visualize the process of
the figure rotation. Suppose the measurement-theoretic account of images is
correct. In that case, the fact of visualizing this process results directly from the
epistemological properties of images. Only images can convey the information
that is needed in mental rotation tasks. The pictorial character of mental imagery
is not a contingent fact of human psychology but a necessary semantic fact.
Although imagistic knowledge is irreducible to propositional knowledge,
they are both entangled. First, localizing the phenomenon that is the subject of a
theory enables testing the theory. The epistemic role of images is to localize the
states within the space set up by the theory. If these states contradict each other,
it is a good reason for rejecting the theory.
Let us suppose that the black hole picture contradicts our predictions resulting
from the theory of general relativity. If a theory predicted that black holes were
square-shaped, but the black hole picture identified a crescent shape, it would be
a good reason to hold that the theory is wrong rather than poorly illustrated by
the picture.
Second, images can be a basis of propositional knowledge without having a
propositional character. According to the 2-D model, images do not describe
156 Thinking in Images
the world. They represent the ways we identify the states of the world. Images
represent how the world can be perceived. Consequently, images are not true or
false in the way propositions are. Images can accurately or inaccurately localize
the target. They are not expressing any truth. Propositions do that.
However, based on identified objects and properties, we can form beliefs and
theories. Images are the measurement devices we use to inform ourselves about
the world. A measurement for its own sake is a fruitless endeavour.
Let us consider the case of a ruler. The ruler’s length is neither true nor
false. It represents the way we isolate a spatial magnitude. At the same time, the
indication of the ruler can provide a basis for describing distance. Based on that
indication, I can hold that a line segment is one metre long and so on. We use a
ruler to describe its target.
Similarly, we use images to describe the world. Pointing at the black hole
picture, I can hold that the black hole emission ring is crescent-shaped. The
picture is used to describe the black hole by isolating its spatial features.
Images enable recognition-based identification of objects. However, images
cannot tell us what these objects are. To interpret an image, we need propositional
knowledge.
Compare this to the way we interpret measurement indications. There is
a difference between measurement indications, such as the numerals on the
thermometer display, and measurement outcomes. The latter are knowledge
claims that associate the measured values with the object being measured and
are inferred from one or more indications along with relevant background
knowledge (Tal, 2017). The reading of the thermometer indicates some physical
value. However, it is the role of a theory to interpret this value. Without a theory,
we can identify objects but cannot know what they are.
Similarly, using images involves some background knowledge necessary to
interpret their meaning. For instance, the black hole picture provides perceptual
recognition of the black hole. Yet, we need a physical theory to know that the
picture represents a black hole.
With these distinctions in mind, we are now in a position to address
the problem of phenomenological content. Let us recall Gregory’s idea
of distinctively sensory content. According to Gregory (2013), pictures
represent the ways something is like. They represent qualitative aspects of
experience. For instance, Cezanne’s Mont-Saint-Victoire paintings represent
distinctive ways in which the mountain appears to us. They represent the
colour depth and the way the light illuminates the mountain. How should
we explain this?
Thinking with Images 157
Iconic content
relation to other states. It shows how this temperature can be localized in some
measurement set-up.
By the same token, an image of John being bald represents the way of
identifying John. Depicted properties are the information we use to localize
John. When we depict John as bald, the property of being bald works as the
measurement predicate used to identify John. It does not predicate anything of
John, as if there were some relation between these properties and their object
seen in the picture, that is, something we predicate of and a predication. Depicted
properties represent the way we see John, not the way John is.
Let us compare a depiction of John with the proposition John is bald. The
proposition individuates the object of predication, that is, John, and predicates
of him the property of being bald. The property of being bald is the property of
the predicated object.
In contrast, we do not hold that the marks on the picture’s surface are bald or
that John is made of paint. These marks are the properties of the representation,
not the properties of the represented object. According to the 2-D model,
the properties of representation are the properties of the construction. Thus,
representing John as bald identifies the ways of arriving at John. Mistaking
the properties of construction and the constructed object leads directly to the
pictorial fallacy.
In the measurement-theoretic perspective, images do not bear predicative
functions. However, images can be a basis of predication in the following way.
Suppose you measure some object with a ruler. The indication of the ruler,
such as ‘one metre’, does not predicate the property of being one metre of this
object. It identifies this property. By using a ruler you establish the length of the
object. However, once this value is found, we can predicate it of this object by the
proposition this object is one meter long.
By the same token, a portrait of John does not predicate anything of John.
It localizes John by determining the properties that serve to recognize him.
For instance, it represents the property of being bald which is a measurement
predicate used to identify John. Once you recognize John in the picture, you can
predicate of John that he is bald.
Language is essential to express predicative functions (e.g. Davidson,
2001; Devitt, 2006). It distinguishes between the object of predication and the
predicate. It distinguishes between different modal contexts. Images cannot do
that. However, their job is to identify objects, not to predicate of them.
Moreover, the role of language is to determine the kind of predicate we project
onto the predicated object. Consider an image of a red square. It identifies the
160 Thinking in Images
properties of being red and square. However, the image does not predicate of red
that it is square-like or of a square that it is red. Language does that. Only within
the linguistic categories can we distinguish between the meaningful statement
that a square is red and a meaningless expression that red is square-like.
In the measurement-theoretic account, language and images are semantically
linked. On the one hand, recognizing the iconic content is a prerequisite to
describing this content in language. For instance, if pointing at a picture of John,
I hold ‘it’s John’, I have to be able to recognize John in the picture. Recognizing
John is a skill that is not language-based. In contrast, it is necessary to form a
description of John.
On the other hand, language is essential for making images meaningful. I
can recognize John in the picture, but without language, I cannot represent the
thought that the picture represents John. By the same token, we need language
to interpret the measurement’s results. Thermometer readings are meaningless
unless we can express what they represent. Only language can do that.
Does this imply that the iconic content is indeterminable and needs language-
like representations to fix its meaning? It depends on how we understand this
question.
Let us make two preliminary remarks. First, to determine the content of
representation, we need a criterion by means of which we decide what the object
of representation is. Second, in the 2-D model, we distinguish between rigid
and non-rigid kinds of reference. Respectively to the context of evaluation, an
image denotes an object non-rigidly. The iconic content, however, works as a
rigid designator. It identifies the same referent in every possible world.
The general idea is that we can introduce the criterion only in the case of
non-rigid kinds of reference. It is always possible to ask about a criterion that
determines image denotation. For instance, I can hold that the target of the
portrait of John is John since the author intended to depict John. Here, the
intention is a criterion by means of which we identify the target respectively to
the context of evaluation. We use this criterion to determine the target of the
portrait. Moreover, propositional representations are essential here. Applying a
criterion requires that we have a set of beliefs that are a basis of background
knowledge that justifies the use of such a criterion.
The question about criterion, however, is senseless in the case of iconic
content. We cannot ask about the criterion that determines iconic content. This
type of content refers rigidly to the referent. It identifies the same referent in
every possible world regardless of the context of evaluation. Iconic content is
criterion-less and directly referential. Let me explain.
Thinking with Images 161
the measured magnitude; yet, the length of the ruler is not the property of the
measured length. The ruler represents the way of localizing the object’s length.
Let us compare this with a description of a triangle. If I describe what a triangle
is, I predicate some properties of the mathematical object. The description can
be wrong, depending on whether the properties identified by the description
match the object’s properties. You need to find the described object to determine
whether a description is correct.
In contrast, a triangle diagram does not have to be compared to some abstract
mathematical object. The image tells you that if you take such and such lengths
of the line segments and arrange them properly, you will arrive at the object
represented by the image. With a triangle diagram, you do not say that now
you can look for triangles to find out what they are. You do not have to look for
anything that can match the image. The image is the thing you are looking for.
Compare this to the portrait of John. Suppose that you want to know who
John is. In this case, I can point at his portrait. The portrait represents a way of
identifying John. However, if you are shown a portrait of John, you do not say
that now you know how to find out who John is. The portrait is the thing you
are looking for. The portrait exemplifies the standard you use to identify the
referent. It does not require a criterion you use to identify the content, for its
iconic content determines the criterion of referent identification. The portrait of
John exemplifies the criterion by which you identify John, in the same way as a
triangle diagram exemplifies the criterion by which you identify triangles.
By the same token, a ruler exemplifies a standard used to identify spatial
distances. I can point at the ruler’s length if you ask me what one metre is.
However, it is nonsensical to say that now you know how to find out what one
metre is. The ruler’s length is not a description of some spatial magnitude. It sets
a standard to isolate some spatial magnitude. You cannot ask for a criterion to
determine the correctness of this standard, just as it makes no sense to ask how
you know that the standard metre is one metre long.
Importantly, this does not imply that these standards cannot be inaccurate.
The triangle diagram can be badly drawn; the portrait can be imprecise.
Depending on the context of evaluation, the referent can match no target and
so on. However, the question of accuracy conditions of some representational
standards should be strictly distinguished from the question of how we know
what these standards represent.
Let us apply these considerations to Wittgenstein’s example of a picture of
an old man walking up a steep path that looks as if the old man is walking
down. Two questions can be raised here. We can meaningfully ask whether the
Thinking with Images 163
image accurately identifies the target. In the same way, we can meaningfully ask
whether the indication of the ruler represents the distance or the velocity in
some frame of reference in relativistic physics.
However, it is meaningless to ask how we know that the iconic content is the
man walking up, not the man walking down. The iconic content permits both
interpretations, for it exemplifies both construction rules. These rules determine
the standards of interpretation. There can be no question of how I know which
rule it exemplifies, for we directly see the rule in the picture.
Think about how we interpret the content of the Necker cube. Depending
on the applied construction rule, it can be interpreted differently. However,
this does not imply that its content is indeterminable but only that it can be
interpreted differently. Moreover, it is meaningless to ask how we know that
we see one interpretation of the Necker cube and not another if the shape of
the Necker cube remains the same. The only answer is that we see it as such.
The construction rule we identify in picture-perception determines the way we
interpret the picture.
By the same token, Wittgenstein’s picture of an old man can be interpreted
differently, depending on the identified construction rules. However, there can
be no question about the criterion we use to identify the construction rule since
construction rules determine the interpretation standards.
Iconic content is directly referential. We cannot ask how we know that we
recognize an old man as walking up or down. We see it as such. By identifying
the construction rules, we directly see something in the picture. This is the
reason why it may appear that iconic content is primitive and unanalysable.
Thus, in the 2-D model, Wittgenstein’s argument appears to miss the mark.
Iconic content is indeterminable in the sense that it is criterion-less and
directly referential. This does not imply that iconic content is more primitive
or basic than propositional content. The rules of construction must be
identified in the picture to recognize an object. Such identification requires
skill and practice.
Moreover, holding that iconic content determines the standard of object
identification does not imply that the standard cannot be wrong. It can be
inaccurate, depending on what and how it is being represented.
The criterion-less and direct character of iconic content means that the
latter sets up the criterion for recognizing objects, properties and events. Iconic
content does not require any criterion of interpretation, for it determines this
criterion. Images represent how the world can be perceived in the same way as
rulers show how objects can look in some measurement set-up.
164 Thinking in Images
Metaphysical constraints
determining the parameters of some logical or physical space. The idea is that
we can distinguish between representational and nonrepresentational image
parts by identifying construction parameters.
Let us consider the triangle diagram. It represents the way a triangle can be
constructed. The triangle construction highlights the properties such as the
number of lines and the angle measure. These are the properties highlighted by
the construction parameters and content constituents. In contrast, properties
such as the colour of the lines are not representational as they are not singled out
by the construction parameters.
Image parts can be representational or nonrepresentational depending on the
identified construction rules. If I want to construct a congruent triangle, the size
of the lines is representational. If I want to construct a similar triangle, the shape
of the triangle is what matters. The size of the lines is nonrepresentational.
Let us compare this with the black hole picture. The construction rules
identify the crescent shape of the black hole emission ring. The properties of
the shape are representational because they are highlighted by the construction
parameters. In contrast, the photograph’s texture is nonrepresentational, for
there is no explicit construction rule in which the texture is highlighted by any
of its parameters.
Let me underline this point. In the 2-D model, there are no metaphysically
or psychologically predetermined representational primitives, such as lines
and dots (e.g. Burge, 2010, 2018; Camp, 2007; Tversky, 2004). Representational
primitives are the properties highlighted by the construction parameters by
means of which we identify a referent.
Depending on the identified construction rule, we isolate different
representational primitives. They are relative and context-sensitive. Consider
a picture of a red square. If one takes it as exemplifying red, the shape is
nonrepresentational. The colour is irrelevant if one takes it as exemplifying the
square construction.
Being relative and context-sensitive does not imply being arbitrary. The
meaning of indexicals is relative and context-sensitive, yet it is not arbitrary.
With this in mind, we can hold that images are canonically decomposable.
If images are canonically decomposable, then there is no reason to deny that
they are systematic. They clearly are. If I have an image of two objects o1 and
o2 where o1 is left of o2, I can easily imagine that o1 is right of o2. If I have an
auditory image, I can imagine the same music track with different paces and
pitches. Understanding one triangle diagram implies understanding any other.
Moreover, systematicity is a key concept in understanding how images work. Let
166 Thinking in Images
us recall the concept of natural generativity. One of the main tasks of any theory
of depiction is to explain how understanding one member of an iconic system
involves understanding any other member of the system. The concept of natural
generativity mirrors the concept of systematicity.
However, the systematicity of images cannot be explained in the same way as
the systematicity of propositional representations. Why is that so?
According to the classical explanation (e.g. Fodor, 1987; Fodor and Pylyshyn,
1988), propositional representations are systematic, for they have a logical form.
Propositional representations are decomposable into discrete atomic formulas
that contribute to the truth-conditions of the whole representation. For instance,
the meaning of the sentence ‘John is bald’ can be derived from the meaning of
its constituents. The idea is that every part is an atomic unit of a truth-evaluable
representation.
Moreover, propositional representations have combinatorial syntax, which
determines the place of their parts in this structure and the logical combinations
of content constituents. For instance, the sentence ‘John loves Mary’ consists
of the arguments ‘John’ and ‘Mary’, and the relation R of loving. Exhibiting the
thought that John loves Mary involves understanding R, which, in turn, makes it
possible to exhibit the thought that Mary loves John.
In contrast, images are deprived of logical form. They cannot be decomposed
into atomic units of truth-evaluable representations and lack combinatorial
syntax. Thus, we need an alternative explanation of the systematicity of images.
In the 2-D model, understanding iconic content involves knowing the
construction rules by means of which we identify a referent. Recognition-based
identification has been defined in terms of localizing the construction invariants
connected with knowing the construction rules that determine permissible
transformations of the constructed object. The idea is that we can think of the
systematicity of images in terms of preserving construction invariants. Two
representations are systematically co-related with each other if they preserve the
same construction invariants. Let me illustrate this with two cases.
Think about a triangle diagram. If I understand how to construct one kind
of a triangle diagram, I understand how to construct any other kind since I
understand a rule which determines what properties have to be preserved in those
constructions. Moreover, suppose I hold that I know how to construct a right
triangle diagram, but I do not recognize the construction of an obtuse triangle
diagram. In that case, it indicates that I do not understand triangle construction.
Let us consider an image of two objects o1 and o2 being left and right,
respectively. Understanding that o1 is left of o2 implies that I understand that o1
Thinking with Images 167
can be right of o2, since they preserve their locations when the reference point is
switched. Additionally, if I hold that I understand the image where o1 and o2 are
left and right, respectively, but I cannot recognize the content of an image with
the switched reference point, then I do not understand the first image.
Moreover, by manipulating construction parameters, one can indicate
preserved properties. For instance, in the mental rotation tasks, we can spatially
transform a figure, then check whether its shape fits another. By the same token,
we can change a music track’s pace and pitch, preserving the sound sequence.
The shape of the spatial figure and the sequence of sounds are the invariants of
these constructions.
The manipulation of the construction parameters can be cognitively
productive. Think about a freehand drawing. It is based on searching for the best
measure to identify a referent. By determining the parameters of the drawing,
we identify different construction invariants, which, in turn, localize different
properties of the desired object.
What is more, the result of such manipulation cannot be deduced. According
to the measurement-theoretic account, images like rulers and balances identify
the results of the measurement procedures. By systematically manipulating
construction parameters, we test the consequences of some background
assumptions involved in image production and interpretation. These
consequences cannot be inferred. If they could be, any measurement would be
reducible to inferential procedures.
Thinking with images is, using Dennett’s metaphor (2013), ‘turning the knobs’.
It is about localizing objects, properties and events within some measurement
set-up. It involves finding out what would change if the construction parameters
were different. For instance, what would change if we spatially rotated an object in
the mental rotation tasks or how would the world look if we reversed its colours.
To sum up, images are systematic and decomposable. They are decomposable
into parts highlighted by construction parameters. They are systematic by means
of preserving construction invariants. We do not need to introduce propositional
structures to explain the systematicity and compositionality of images. Images
are systematic and decomposable in their own right.
In the previous sections, I explained how the 2-D model addresses the
epistemological, semantical and metaphysical challenges. In this and the
168 Thinking in Images
next section, I describe the metaphysical landscape that can be seen from the
measurement-theoretic account of images. I discuss two problems: the problem
of the mental representation format and the nature of mental imagery.
A mental representation format is the way information is organized in mind.
A discussion of mental representation formats addresses how the information in
our mind is stored and processed. It concerns the structure of representations
interpreted as a set of representational primitives and combinatorial principles.
Thus, to describe a representational format, one has to describe the structure of
representation: the primitive elements and the possible operations that can be
carried out with them.
Following the influential tradition, we distinguish between analog and
digital formats of representation. According to Goodman (1976), to be an
analog representational system means to be a dense representational system.
An example of a dense representational system is an old-fashioned clock
that represents time continuously, unlike a digital clock that represents time
discretely. Moreover, an analog representational system is relatively replete.
A representational system is relatively replete if, in comparison with other
systems, many features of its members are relevant to determining what they
represent. The system of old-fashioned analog clocks is not replete since only
the position of the clock’s hands matters. In comparison, in the case of images,
such features as colour, shape and size are relevant. However, for reasons that
will not be covered here (e.g. Kulvicki, 2006; Maley, 2011), it is doubtful whether
Goodman succeeded in adequately explaining analog and digital formats of
representation.
In the last fifty years, the distinction has been variously interpreted and
explicated (e.g. Fodor and Pylyshyn, 1981; Haugeland, 1998; Lewis, 1971;
McGinn, 1989; Peacocke, 2019). Across those approaches, digital representations
are generally understood to be discrete entities. Numerals provide a good
example. 0 and 1 are discrete because they indicate distinct and separable entities.
For every representational token, it is clear which type it instantiates. In contrast,
analog representations do not admit definite type-identity. For example, the
colour value of a given colour patch is measured on a continuous rather than a
discrete scale (Dretske, 1981). Iconic representations are believed to be analog
structures, which means that they are indiscrete. The structure of propositional
representations is believed to be based on discrete structures.
An alternative way to interpret the analog–digital distinction can be framed
in terms of constraints that the representational system puts on representational
content. Different representational systems put constraints on the content a
Thinking with Images 169
representational system can carry and the range of possible transitions between
different contents. So, for example, an analog representation can represent a
magnitude value but not an integer (Beck, 2015) and so on.
If one wants to explain why some representational systems put constraints on
representational content, one has to describe the features of the representational
structure. For instance, one can explain why the Arabic numeral system is
preferred over the Roman numeral system by pointing out the fact that the Roman
numeral system does not include a zero. Analogously, iconic and propositional
representational systems put constraints on their representational content. A
theory of representational format should explain where these constraints come
from.
More generally, the question is what the concept of the mental representation
format should explain and how it can do that. Here, I claim that the problem
with the analog–digital distinction is not that it is not clear. Even if it were clear,
it would still be doubtful whether it could explain what it should explain, namely,
the difference between iconic and propositional representations.
There are at least two ways of describing the difference between propositional
and iconic representations. The first way refers to the idea of the informative
richness of images (e.g. Kitcher and Varzi, 2000). According to this criterion, iconic
representations display more information than propositional representations,
making images more effective in some reasonings (e.g. Larkin and Simon, 1987).
The analog–digital distinction seems to explain this difference, for the analog
representations consist of fuzzy sets of information, making iconic content vague
and rich. In contrast, digital representations consist of classical sets of information,
which makes propositional content more precise but informatively poorer.
However, a more profound understanding of the difference between
iconic and propositional content can be described in terms of their relation
to truth. Images are not truth-evaluable and lack logical form. In contrast,
propositions can be true or false and express logical relations. A full-fledged
theory of the mental representation format should explain what properties of
the representational structure are responsible for these constraints put on the
content.
Does the analog–digital distinction help us understand the difference between
iconic and propositional representations characterized in terms of their relation
to truth? It seems that it does not, as the iconic-propositional distinction does
not overlap with the analog–digital one (e.g. Peacocke, 2019).
If we characterize content in terms of its relation to truth, then the distinction
between analog and digital format is irrelevant for determining whether we are
170 Thinking in Images
generating mental images. None of these debates has direct implications for
others.
Nowadays, the imagery debate is mostly considered dead (Pearson and
Kosslyn, 2015). In cognitive psychology, there is a consensus that descriptionalists
fail to explain the perceptual nature of mental images. Moreover, it is held that
mental imagery is a form of weak and off-line perception (e.g. Pearson et al.,
2015). According to the weak perception theory of mental imagery, mental
images are a form of less vivid perception not triggered by sensory input.
The claim that descriptionalism is a dead end seems to be noncontroversial.
Moreover, there is no doubt that imagining is a form of perceptual experience
and that the neural correlates of perception and mental imagery are common
to a significant degree (e.g. Kosslyn et al., 2006; Nanay, 2015; Pearson et al.,
2015).6 There are, however, doubts regarding the consequences of these claims.
Primarily, the weak perception theory of mental imagery is philosophically
dubious.
First, although mental imagery and perception share the mechanisms
responsible for generating representations, there is an asymmetry between the
content of mental imagery and perception. It is believed that the content of
mental images is displayed as already interpreted (e.g. Chambers and Reisberg,
1985; Hinton, 1979; Ishai et al., 2000; Ittelson, 1996; McGinn, 2006; Reisberg,
1996; Reisberg and Heuer, 2005; Sartre, 1962; Scaife and Rogers, 1996; Slezak,
1995). In contrast, we can always reinterpret the content of perception.
The most striking illustration of this fact is the problem with reinterpreting
the so-called bistable figures (such as the duck-rabbit picture) in mental imagery.
Although reinterpretation of mental images is possible (e.g. Reisberg and
Chambers, 1991; Finke et al., 1989; Peterson et al., 1992), it is more cognitively
loaded and significantly less efficient than in perception (Mast and Kosslyn, 2002;
Kamermans et al., 2019). For instance, in the studies of Reisberg and Chambers
(1991), subjects were told to remember and imagine a figure with the same
geometry as the state of Texas rotated 90° clockwise. Even when they imagined
rotating the figure mentally so that it assumed its canonical orientation, they
were still unable to discover that it represented Texas. In contrast, the task was
trivial when subjects were told to rotate the image being seen.
The asymmetry between imagery and perception when interpreting bistable
figures cannot be explained by holding that mental images are displayed in the
mind’s eye too briefly, which might suggest that imagery is affected by memory
limitations (see Kosslyn, 1994; Kosslyn et al., 2006). For one thing, reinterpreting
the perceptual content does not have to be time-consuming. Often, we can
Thinking with Images 175
switch between two interpretations instantaneously. For another, the time that
affects the efficiency of reinterpretation tasks is the same time that suffices to
solve mental rotation and mental zooming tasks in Kosslyn’s classic research
on mental imagery (e.g. Kosslyn et al., 1978). Therefore, the time for which
information is available does not seem to be a relevant factor.
Second, suppose that we respond to the first doubt by holding that mental
imagery is less vivid than perception. However, that is a position that we do
not want to keep if we want to defend the imagistic theory of thinking. For
one thing, the weak perception theory of mental imagery appears to blur the
distinction between thoughts and perception and is vulnerable to arguments
from cognitive impenetrability of perception (Cavedon-Taylor, 2021; Firestone
and Scholl, 2016; Raftopoulos, 2009). For another, it seems to take us back to the
neo-Lockean positions we previously rejected (see Slezak, 2002a). In fact, the
weak perception theory of mental imagery is no more philosophically attractive
than the Lockean theory of ideas, which has been a philosophical dead end
for over three hundred years. If we hold that mental imagery entails weak
perception, then either exercising mental imagery is not a thinking operation
or speaking of imagistic theory of thought is senseless. Both alternatives are
unattractive.
Where do we stand? The situation resembles the one described by Goodman
and Elgin (1988) as the dilemma of mental imagery. Either we accept the weak
perception theory of mental imagery and leave out the questions asked by
philosophers and other troublemakers, or we just dismiss the talk of mental
images, holding that they are just the epiphenomena of some more substantive
cognitive mechanism. I think that both horns of the dilemma should be rejected.
The problem we face stems from overusing the photographic metaphor in
thinking about mental imagery. According to this metaphor, mental imagery
is like a camera, and mental images are like photographic snapshots displayed
before the mind’s eye. The force of the photographic metaphor results from
our attachment to the Traditional View. If images are copies of the world, then
mental images are copies of perceptual experiences. That is, however, where we
get stuck in conceptual confusion. There is no Cartesian cinema in the head;
some inner eye does not see mental images.
According to Block (1983b), the photographic model of mental imagery
should be replaced by the one in which mental images are based on mechanisms
involving constructive processes. Mental images are more like drawings rather
than photographs. However, Block has never been specific enough to explain
what such processes can be.
176 Thinking in Images
to arrive at the constructed object and recognize it. Mental images as conscious
experiences are exemplifications of these skills.
Does this mean mental images as conscious experiences are irrelevant to
mental imagery? It depends on how you understand this question. Thinking
of mental imagery in terms of the skills of producing mental images does not
imply that it actually creates them. By the same token, when thinking in words,
you do not have to produce these words. For instance, you can have a thought
and search for the words to express it. What is crucial is the skill to identify the
linguistic representations that correctly express this thought.
Mental images as conscious experiences are products of mental imagery taken
as a procedural knowledge of construction rules. This knowledge manifests itself in
the operations of constructing and interpreting the mental image. However, it can
also be manifested by any intelligent use of pictures, such as skilful use of diagrams.
Taking (mental) images as products of procedural knowledge does not imply
that they are irrelevant for exercising such a skill. If one cannot identify the
construction rules represented by an image, regardless of whether the image is
mental or extramental, then one does not have this procedural knowledge. In
this sense (mental) images are necessary for mental imagery.
To illustrate this, let us once again consider mental rotation tasks. The idea
is that subjects manipulate the parameters of perceptual space by rotating the
figure in order to localize construction invariants. Finding these invariants solves
the task of identifying two similar objects. The task cannot be solved without
mental imagery since it involves procedural knowledge of how to determine
these parameters, for example, setting the direction of rotation.
However, having such procedural knowledge is dissociated from having an
actual experience that exemplifies it. The operation of rotating a figure can be
visualized in the form of a conscious experience, but it does not have to be. We
do not have to represent the skill as a conscious mental image to exercise it. We
have to be able, however, to recognize the identity of figures when we visualize
them or display them in pictures.
Second, if mental imagery does not have to be conscious, then we can speak
meaningfully about unconscious mental images. Let us recall the phenomenon
of aphantasia, which is the lack of the ability to form mental images (e.g.
Keogh and Pearson, 2018; Milton et al., 2020; Zeman et al., 2015, 2020). This
phenomenon appears to be theoretically problematic only to the experiential
accounts of mental imagery. From the measurement-theoretic perspective, the
phenomenon of aphantasia does not seem to be particularly mysterious. What is
lacking is not mental imagery but the awareness of having mental images.
178 Thinking in Images
Being unaware of a mental image does not imply that one lacks the skills
to apply construction rules. For example, aphantastics can recognize identical
figures represented in pictures which indicates that they do not lack mental
imagery.
Third, the relationship between perception and mental imagery seems much
more profound than the weak perception theory suggests. Mental imagery is
not some less vivid perception. It is a skill of perceiving based on procedural
knowledge of construction rules. It involves the skill of identifying the parameters
of perceptual space in order to localize an object or event.
Thinking of mental imagery in terms of perceptual skills allows us to explicate
the relation between perception and mental imagery. For one thing, it allows us
to acknowledge the perceptual nature of mental images. If mental imagery is a
skilful use of perception, then the neural mechanisms of perception and mental
imagery should be shared, which is indeed the case. Moreover, if the format of
the iconic representation is domain-specific, then we should not expect to have
one mechanism responsible for mental imagery, which is also the case. There is
no common mechanism of mental imagery for all modalities.
For another, we can explain the asymmetry in interpreting bistable figures in
imagery and perception. If mental images are representations of measures, then we
expect their meanings to be fixed. They are not open to interpretation, in the same
way as the meaning of a standard metre is not open to interpretation. Images set
the conditions of interpretation by exemplifying the construction rules by means
of which we identify objects. In contrast, perceptions are open to interpretation,
for it is always possible to ask which measure should be applied to interpret them.
Depending on different measures, we highlight different perceptual properties.
Moreover, interpreting mental imagery in terms of a set of skills gives us
a better picture of the relation between perception and thinking than the neo-
Lockean approaches. On the one hand, it offers a hint on how to think about the
role of mental imagery in perception. A good illustration of such a role is the
so-called amodal completion. It is the common phenomenon of perceiving the
parts of perceived objects from which we have no sensory stimulation (e.g. Nanay,
2018).8 For example, when seeing a cat behind a fence, we amodally complete the
parts of the cat that are occluded. Importantly, amodal completion is a perceptual
phenomenon. It happens very early in the sensory cortices (Thielen et al., 2019).
In a sense, we see the occluded parts that are not triggered by the retinal input.
Now, if we think about mental imagery in terms of perceptual skills, then
amodal completion is a form of mental imagery. The procedural knowledge of
Thinking with Images 179
Summary
Conclusion
pointing at the conditions that must be met if such operations are possible. Thus,
I investigate what images do and try to find something that does it.
To find out what images do, I analyse the functions of images. I am doing
so by presenting two case studies. The first one concerns the knot diagrams in
mathematics. With this example, I study the idea of the content of diagrams. To
explain the way diagrams represent, I introduce the concept of construction.
It refers to the procedures of arriving at a goal by determining the parameters
of some logical or physical space. The general idea is that diagrams represent
objects and events by representing the ways these objects and scenarios can be
constructed.
By introducing the concept of construction, I am able to offer an alternative
account of iconic content. In contrast to the Traditional View, I hold that
resemblance works as an explanandum but not as an explanans. I hold that the
concept of construction can explain why images resemble their objects, avoiding
the objections directed at resemblance-based semantics.
In the second case study, I analyse the black hole picture. To explain the way
this picture represents its object, I introduce the concept of recognition-based
identification. I argue that recognition-based identification is different from
demonstrative reference and descriptions. Recognition is based on identifying
the construction invariants.
Based on the concepts of construction and recognition-based identification,
I introduce the so-called two-dimensional model of iconic reference. According
to this model, images denote their targets and exemplify the rules of construction
by means of which they reidentify the referent.
I argue that the best way to explain these properties of iconic reference is to
think of images in terms of measurement devices. Just like rulers and balances,
images represent the ways of localizing objects and events by determining the
parameters of some space. In contrast to the Traditional View, images are not
copies of the world. They are the measurement devices we use to localize the
properties of the world.
Next, I demonstrate that the measurement-theoretic account of images
offers a theory of imagistic knowledge and content. In the 2-D model, imagistic
thoughts are systematic and decomposable. Thus, thinking of images in terms of
measurement devices meets the requirements of a theory of thought. Moreover,
it is a kind of thinking that is irreducible to propositional thinking.
Last but not least, I show the metaphysical consequences of the measurement-
theoretic account of images for our thinking about the format of mental
representations and mental imagery. I hold that we can replace the analog–
Conclusion 183
In the book, I have argued that although the idea that we think with images is
intuitive, it has been rarely thought through. However, this does not mean that
it has never been considered. In general, no idea presented here cannot be found
in Kant, Peirce, Wittgenstein, Evans or Goodman. All I am presenting here is an
attempt to think through the consequences of their ideas. All I am hoping for
is that we can see some ideas clearer. That is the only thing a philosopher can
fairly offer.
Notes
Introduction
1 I use the expressions ‘thinking with images’ and ‘thinking in images’ interchangeably.
2 See, e.g., Irvine (2014), Lakoff (1994), Lakoff and Johnson (1980), Langacker (1987).
For a contemporary defence of an imagery-based theory of semantics, see, e.g., Ellis
(1995), Lowe (1996), Thomas (1997) and Nyíri (2001).
3 For a philosophical and empirical defence of the existence of unconscious mental
images, see, e.g., Church (2008), Nanay (2010, 2015, 2017, 2021) and Zeman et al.
(2010, 2015). The discussion mostly concerns the phenomenon of aphantasia, which
is a condition where one does not possess the ability to visualize images (e.g. Keogh
and Pearson, 2018; Milton et al., 2020; Zeman et al., 2015, 2020) and form non-
visual mental images (e.g. Dawes et al., 2020).
4 Mental and extramental images undoubtedly differ in many respects. The
differences, however, should not obscure the similarities (e.g. Abell and Currie,
1999). Reasoning with the help of mental images is often performed interactively
with external representations (e.g. Kirsh and Maglio, 1994). Forming an extramental
image is not a post-hoc expression of some reasoning done with the help of
mental images but an integral part of this reasoning (Sheredos and Bechtel, 2020).
Moreover, an ability to form an external image assumes an ability to form a mental
image; for example, knowing how to construct a triangle on a piece of paper is
knowing how to construct a triangle in the mind, and vice versa (Maynard, 2011).
5 Most generally, this relation to a proposition can be described in Fregean terms as an
act of ‘grasping’ thoughts, which refers to every act of discovering or understanding
the meaning of a thought and which is equivalent to knowing the truth-conditions
of a proposition. Grasping a thought is distinguished from evaluating the truth value
of a thought, for knowing the conditions according to which a thought is true is not
equivalent to knowing whether it is true.
Chapter 1
1 In fact, it is almost impossible to find a single empirical article without any reference
to a graph or a diagram. According to the studies of Zacks et al. (2002), the number
of graphs in scientific journals doubled between 1984 and 1994. Nothing suggests
that this trend has not continued.
186 Notes
Chapter 2
1 In cognitive psychology, the belief that thinking is propositional and action-based is best
expressed by Holyoak and Morrison’s (2012) definition of thinking as the systematic
manipulation of cognitive representations determining current or possible states of the
world. In accordance with the Received View, thinking is most often operationalized as
an act of reasoning and problem-solving (Evans, 2017). On the history of research on
thinking in cognitive psychology, see, e.g., Dominowski and Bourne (1994).
2 An example of the epistemological argument from the content indeterminacy of
images is Descartes’s well-known chiliagon-argument. According to Descartes, an
image of a chiliagon is indistinguishable from an image of polygon with 999 sides,
and yet we have clear and distinct ideas of a chiliagon and a 999-gon. Thus,
thinking cannot be imagistic. However, this argument is clearly invalid, since it
does not imply that our thoughts cannot be imagistic. It demonstrates only that
our perceptual capacities are limited. If we have enough time to draw a picture of
chiliagon and a 999-gon and count the angles, then it is possible to distinguish a
chiliagon and a 999-gon (see Dennett, 1990). In the imagery debate, contemporary
versions of Descartes’s epistemological argument (sharing the flaws of the original
argument) are Dennett’s argument (1981a) from a striped tiger and Armstrong’s
argument (1968) from a speckled hen.
3 Kulvicki’s argument concerns pictures, but it can be readily extended to all imagistic
genera.
4 The psychological instantiation of the pictorial fallacy is the so-called lack of
pictorial competence. It is an inability to discriminate between objects and
depictions of these objects which characterizes young children up to two years old.
It manifests itself in toddlers’ attempts to manually grasp the depicted objects as if
the real objects were presented (see DeLoache et al., 2003).
5 Gregory (2020) argues that images are non-propositional but predicative. The
Semantical Challenge shows that the predicative function is suspect.
6 According to Evans (1982), systematicity is constrained by semantic conditions
of appropriateness. For instance, thinking that John fell into the lake need
Notes 187
not entail a capacity to think that the lake fell into John. However, even if a
well-formed string is semantically absurd, that does not mean that it cannot express
a thought. For one thing, we can entertain absurd thoughts. For another, absurd
but well-formed strings can serve as a basis of inferences in logic. See, e.g., Camp
(2004).
7 That does not mean that they lack construction rules. They are obviously rule-
governed. That is why if one understands how to interpret one Venn diagram, then
one understands how to understand another (e.g. Tversky, 2004). A representational
system can have construction rules without syntactic structure. A syntax requires
discrete symbols and logical rules that transform strings of symbols into well-
formed formulas of the system.
8 Propositional and associationist theories of thought, just as the Received View, do
not represent actual theories or a set of theories but certain ideal types or research
programmes. Although they share some methods and questions, they are sets of
related yet independent theses. In this general sense, associationist theories of
thought have to be distinguished from historical associationist theories of mind
such as Mill’s and Hume’s.
9 There are also intermediate strategies, such as Rescorla’s (2009).
10 This distinction does not cover all possible accounts of thinking. There are also
atypical positions, such as Ryle’s adverbial theory.
11 To the best of my knowledge, Kant and Peirce were the first to recognize perceptual
and spatial metaphors as complementary in their theories of thinking.
Chapter 3
nature is imagistic. It only means that we need an explanation for why they appear
imagistic. Given that they demonstrate that we think with images, it does not bring
us any closer to answering the question of what thinking with images is.
In cognitive science, we encounter the same problem. It is well established
that there are individual differences in thinking and people can be characterized
in terms of being ‘concrete’ versus ‘abstract’ thinkers (e.g. Paivio, 1963; Paivio
and Harshman, 1983; Kozhevnikov et al., 2005). There is also something to the
observation that visual aids can facilitate the process of thinking. For instance, it
has been frequently indicated that the same sort of ‘visual’ style of thinking fosters
the creativity of scientists (e.g. Holyoak and Thagard, 1996; Johnson-Laird, 1998;
Otis, 2015; Rocke, 2010; Shepard, 1978). However, we must bear in mind that we
lack a clear-cut definition of thinking styles. This concept is most often understood
very broadly. It refers to many different domains, such as cognitive styles, learning
styles, personality types, and so on, which are not integrated within a general
theory of psychological functioning. There is strong evidence that there are no
unified visual and verbal cognitive styles (e.g. Kozhevnikov et al., 2005, 2010).
Moreover, it is crucial to remember that the psychology of individual differences
does not study thinking mechanisms but asks about individual differences in using
different mechanisms depending on preferences and needs. Thinking styles are
studied based on questionnaires and surveys. There is no way to study them using
neuroimaging methods. The fact that there are individual differences in solving
cognitive tasks does not imply that we were born with some inherent thinking
preferences. It can mean that using images is a good cognitive strategy for specific
tasks, and some individuals are more skilled in employing one strategy or another.
Thus, referring to thinking styles does not explain what thinking with images is. It
is only another way of describing the explanandum. A proper explanation requires
analysing the source of individual differences in cognitive tasks. We should find out
what kind of functions images can play and what cognitive purposes they are better
for. Foremost, what has to be explained is why imagistic thinking strategies are
sometimes better than non-imagistic ones.
2 Historically speaking, there is controversy as to who in fact represents the imagistic
theory of thought. Berkeley appears to believe that all ideas are images (for the
opposite view, see, e.g., Kasem, 1989). Hume believes something similar, though
there are some doubts (e.g. Yolton, 1996). Locke’s view is far from clear. It can be
argued that we do not have to refer to images to understand the nature of Locke’s
concept of idea and that we can instead adopt an adverbial interpretation. See, e.g.,
Ayers (1991), Chappell (1994), Lowe (1995), Soles (1999), White (1990) and Yolton
(1996).
3 In the light of grounding cognition in perception, arguing away the innate
elements of the mind seems to be relatively less important. In the case of Locke,
the point was that empiricism would be more convincing if there were no plausible
Notes 189
Chapter 4
1 I hold that knot diagrams are necessary to represent the ways of constructing
knots. This does not mean that every property of a knot is representable by a
knot diagram. For instance, knots have properties represented only by continued
fractions and polynomials.
190 Notes
Thus, the space can be a physical or logical space. For instance, the inflation
diagram localizes the values of time and inflation rate and determines the relation
between them. The term ‘7 + 5 = 12’ determines the relation between some
magnitudes. Geometrical diagrams are only a subset of a more general phenomena.
8 The distinction between type-diagram and token-diagram is Peircean in origin (e.g.
NEM 4.315).
9 Possessing such disposition does not imply that it will be actualized. For instance,
one can have no talent for drawing diagrams. In the same way, if I speak in some
language, I have a disposition to read in this language. However, such disposition
cannot be actualized, if I do not know the alphabet.
10 Note that I claim that diagrams can help discover a proof strategy (e.g. Antonietti,
1991; Giaquinto, 2008; Stenning and Oberlander, 1995). I do not claim that
diagrams can be used as proofs in mathematics. As is well known, we can be easily
misled by accidental properties of diagrams. The concept of mathematical proof is
different from the concept of mathematical construction.
11 As Black once said (1972, 122): ‘My chief objection to the resemblance view, then,
is that when pursued it turns out to be uninformative, offering a trivial verbal
substitution in place of insight.’
12 This structure-preserving mapping does not have to be isomorphism. It can be
either homomorphism (if the mapping is not bijective), monomorphism (if it is
an injective homomorphism), epimorphism (if it is surjective homomorphism),
endomorphism or automorphism (e.g. Pero and Suárez, 2016; Pietarinen, 2014).
Determining the character of these mappings is irrelevant for my argument.
13 Frigg (2006, 50) introduces the so-called problem of style. It concerns the taxonomy
of different styles of scientific representations. The riddle of style is similar to the
problem of the multiple models, which can be expressed by the question of why
scientists use different models of the same targets (Giere, 2006).
Chapter 5
1 Goodman holds (1976) that the idea of naturalness of icons can be explained by the
concept of entrenchment. In a nutshell, icons are natural representations for we are
more familiar with the representational systems that have a longer history of use.
However, that cannot work as an explanation, for we can ask what properties of the
system make it more likely to be entrenched and reproduced in history.
2 Although many philosophers of perception assume that we can perceive states of
affairs (e.g. Armstrong, 1997; Fish, 2009; McDowell, 1996), it is not uncontested (e.g.
Vernazzani, 2021). For our purposes, it is sufficient to claim that if the perception of
facts exists, it is distinct from perceptual recognition.
192 Notes
Chapter 6
later, I hold that images lack predicative functions. Third, I hold that iconic content
cannot be descriptive.
2 This distinction has been already applied to depiction in phenomenological
tradition (e.g. Husserl, 2005; Ingarden, 1989; Wiesing, 2009). Husserl distinguishes
between the picture’s surface, its content (Bildobjekt) and target (Bildsujet).
3 Sometimes denotation is interpreted as restricted to linguistic representations, such
as proper names. However, this restriction is neither necessary nor helpful (Elgin,
2010b).
4 To deal with targetless representations, Goodman (1976) distinguishes between
representations-of and representations-as. A picture of a unicorn is not a
representation of a unicorn but a kind of a picture, namely, a unicorn-picture.
Kinds of pictures are distinguished by different modes of presentation. I do not
endorse this distinction, for it reintroduces the old problem of sense-reference
distinction. In the same way as there can be no mode of presentation without
something that is represented (Evans, 1982), there can be no representation-as
without representation-of. The same objection applies to Hyman’s (2012, 2015)
distinction between the sense and reference of pictorial representations.
5 Notably, we have to remember to keep aside representing a particular object by
iconic content and identifying the possible world we are in. I hold that iconic
content can identify particulars but is not relativized to the context of possible
worlds (target is).
6 It is important not to confuse the understandability of images with their
interpretability. Understandability of images refers to their ability to be used in
some contexts. Interpretability abstracts from the context of use and determines the
conditions that have to be satisfied by an image to bear a representational function.
An image can be interpretable and not understandable. Suppose that in some
population there are no mathematicians, and there is no one who can understand a
mathematical graph. Still, the graph is interpretable. It does not work the other way
round. An image cannot be understandable if it is not interpretable.
7 This description is complementary with the representational theory of measurement
(RTM). According to RTM, measurement is ‘the construction of homomorphisms
(scales) from empirical relational structures of interest to numerical structures that
are useful’ (Krantz et al., 1971, 9). Measurement should satisfy two kinds of theorems.
The representation theorem holds that there a represented domain can be mapped
onto the structure of the representing domain. The uniqueness theorem establishes
the permissible transformations of the represented domain into the numerical
domain. However, our description of measurement does not depend on RTM.
8 Thinking about images in terms of measurement devices seems to imply that
images possess arbitrary semantics. The argument can look as follows. If we can
measure the world in any way we like, then we can depict the world any way we
want, which is false. Images are naturally connected to the world. We cannot
194 Notes
depict horses as cows, whereas we can conduct measurements with many different
measures. We can measure the length in metres and inches. However, this argument
is invalid, for it is not true that measurements are arbitrarily connected to the
world. We have different measurement conventions in the same way as there are
different iconic conventions. Yet, it is not true that there is no difference in what
type of measurement we choose. We can try to measure distances with gas, but it
does not have the properties that enable identifying the distance. Thus, the way we
measure depends on how the world is. In the same way, images are measurement
devices that are naturally connected to the world. We can depict objects differently.
However, the depiction has to possess properties that identify the object of
depiction. Thus, images depend on what they represent.
9 The extensions of the Reutersvaard-Penrose triangle are the well-known Penrose
stairs or Escher’s ‘Waterfall’.
10 Indeterminable constructions have to be distinguished from indetermined
constructions. For instance, a picture of a man in a hat does not determine whether
he is bald. However, it is a determinable property of this class of constructions. We
can have a different picture of the same man without a hat.
11 The impossible images phenomenon is not restricted to spatial representations.
Indeterminable constructions can be instantiated by temporal representations. For
instance, echolocation jamming in bats occurs when non-target sounds interfere
with target echoes and can be caused by the echolocation system itself.
Chapter 7
1 Note that the concept of understating comes in degrees. One does not have to
understand a whole theory to understand its parts. Most people understand basic
mathematical calculus, but it does not imply that they understand the advanced
calculus.
2 This does not imply that division of language into meaningful parts is semantically
non-arbitrary (Johnson, 2004; Quine, 1970; Salje, 2019). Yet, it is believed that
language has a logical structure and a syntax.
3 Burge (2018) argues that we can establish canonical decomposition by discovering
perceptual competences that determine picture constituents. By the same token, we
can establish a picture’s grammar-like structure. However, his argument misses the
point, for Fodor holds that there is no principal way we can decompose a picture
into semantical parts. Fodor’s argument is metaphysical, not psychological.
4 In Kozak (2021), I distinguish between the so-called vehicle-specificity and
modality-specificity of images.
Notes 195
Aasen, S. (2016). ‘Pictures, Presence and Visibility’. Philosophical Studies, 173 (1):
187–203.
Abel, G. (2012). ‘Knowledge Research: Extending and Revising Epistemology’. In A.
Abel and J. Conant (eds), Rethinking Epistemology, Vol. 1, 1–52. Berlin: DeGruyter.
Abell, C. (2009). ‘Canny Resemblance: An Account of Depiction’. Philosophical Review,
118 (2): 183–223.
Abell, C. and K. Bantinaki (2010). ‘Introduction’. In C. Abell and K. Bantinaki (eds),
Philosophical Perspectives on Depiction, 1–24. Oxford; New York: Oxford University
Press.
Abell, C. and G. Currie (1999). ‘Internal and External Pictures’. Philosophical Psychology,
12 (4): 429–45.
Abid, G. (2021). ‘Recognition and the Perception–Cognition Divide’. Mind & Language,
1–20. https://doi.org/10.1111/mila.12362.
Abrahamsen, A. and W. Bechtel (2015). ‘Diagrams as Tools for Scientific Reasoning’.
Review of Philosophy and Psychology, 6: 117–31.
Ambrosio, C. (2014). ‘Iconic Representations and Representative Practices’.
International Studies in the Philosophy of Science, 28 (3): 255–75.
Amit, E., C. Hoeflin, N. Hamzah and E. Fedorenko (2017). ‘An Asymmetrical
Relationship Between Verbal and Visual Thinking: Converging Evidence from
Behavior and fMRI’. Neuroimage, 15, 619–27.
Anderson, J. R. (1978). ‘Arguments Concerning Representations for Mental Imagery’.
Psychological Review, 85 (4): 249–77.
Antonietti, A. (1991). ‘Why Does Mental Visualisation Facilitate Problem-Solving?’
In R. H. Logie and M. Denis (eds), Mental Images in Human Cognition, 211–27.
Amsterdam: Elsevier.
Armstrong, D. (1968). A Materialist Theory of the Mind. London: Routledge & Kegan
Paul.
Armstrong, D. (1989). A Combinatorial Theory of Possibility. Cambridge, MA:
Cambridge University Press.
Armstrong, D. (1997). A World of States of Affairs. Cambridge, MA: Cambridge
University Press.
Arnheim, R. (1969). Visual Thinking. Berkeley; Los Angeles: University of California
Press.
Arnheim, R. (1980). ‘A Plea for Visual Thinking’. Critical Inquiry, 6 (3): 489–97.
Ayers, M. (1991). Locke: Epistemology and Ontology. London: Routledge.
Literature 197
Carey, S. (2011). ‘Précis of “The Origin of Concepts”’. The Behavioral and brain sciences,
34 (3): 113–24.
Cartwright, N. (1983). How the Laws of Physics Lie. Oxford: Oxford University Press.
Cartwright, N. and L. Runhardt (2014). ‘Measurement’. In N. Cartwright and E.
Montuschi (eds), Philosophy of Social Science: A New Introduction, 265–87. Oxford:
Oxford University Press.
Carvalho, J. M. (2019). Thinking with Images: An Enactivist Aesthetics. New York;
London: Routledge.
Casati, R. (2015). ‘Object Perception’. In M. Matthen (ed.), The Oxford Handbook of
Philosophy of Perception, 393–404. Oxford: Oxford University Press.
Casati, R. (2018). ‘Two, then Four Modes of Functioning of the Mind: Towards a
Unification of “Dual” Theories of Reasoning and Theories of Cognitive Artefacts’.
In J. M. Zacks and H. A. Taylor (eds), Representations in Mind and World: Essays
inspired by Barbara Tversky, 7–23. New York: Routledge.
Casati, R. and A. Varzi (1999). Parts and Places: The Structures of Spatial Representation.
Cambridge, MA: MIT Press.
Cavedon-Taylor, D. (2021). ‘Untying the Knot: Imagination, Perception and Their
Neural Substrates’. Synthese, 199: 7203–30.
Chambers, D. and D. Reisberg (1985). ‘Can Mental Images be Ambiguous?’ Journal of
Experimental Psychology: Human Perception and Performance, 11 (3): 317–28.
Chang, H. (2004). Inventing Temperature: Measurement and Scientific Progress. Oxford:
Oxford University Press.
Chang, H. and N. Cartwright (2014). ‘Measurement’. In M. Curd and S. Psillos (eds),
The Routledge Companion to Philosophy of Science, 411–19. London; New York:
Routledge.
Chappell, V. (1994). ‘Locke’s Theory of Ideas’. In V. Chappell (ed.), The Cambridge
Companion to Locke, 26–55. Cambridge: Cambridge University Press.
Chevalier, J.-M. (2015). ‘The Problem of Resemblance in Peirce’s Semiotics and
Philosophy’. Versus, 120: 45–59.
Church, J. (2008). ‘The Hidden Image: A Defense of Unconscious Imagining and Its
Importance’. American Imago, 65: 379–404.
Clark, A. (2013). ‘Whatever Next? Predictive Brains, Situated Agents, and the Future of
Cognitive Science’. Behavioral and Brain Sciences, 36: 181–204.
Clement, C. A. and R. J. Falmagne (1986). ‘Logical Reasoning, World Knowledge, and
Mental Imagery: Interconnections in Cognitive Processes’. Memory and Cognition,
14: 299–307.
Collins, C. (1991). The Poetics of the Mind’s Eye: Literature and the Psychology of
Imagination. Philadelphia, PA: University of Pennsylvania Press.
Coopmans, C. (2014). ‘Visual Analytics as Artful Revelation’. In C. Coopmans, J.
Vertesi, M. Lynch and S. Woolgar (eds), Representation in Scientific Practice
Revisited, 37–60. Cambridge, MA; London: MIT Press.
Crane, T. (1988). ‘The Waterfall Illusion’. Analysis, 48: 142–7.
Literature 201
Dennett, D. C. (2013). Intuition Pumps and Other Tools for Thinking. New York: W. W.
Norton & Company.
Dennett, D. C. (2017). From Bacteria to Bach and Back: The Evolution of Minds. New
York: W. W. Norton & Company.
De Regt, H. W. (2014). ‘Visualization as a Tool for Understanding’. Perspectives on
Science, 22 (3): 377–96.
De Regt, H. W. (2017). Understanding Scientific Understanding. New York: Oxford
University Press.
De Toffoli, S. and V. Giardino (2014). ‘Forms and Roles of Diagrams in Knot Theory’.
Erkenntnis, 79 (4): 829–42.
Devitt, M. (2006). Ignorance of Language. Oxford: Oxford University Press.
Dilworth, J. (2008). ‘The Propositional Challenge to Aesthetics’. British Journal of
Aesthetics, 48 (2): 115–44.
Djordjevic, J., R. J. Zatorre, M. Petrides, J. A. Boyle and M. Jones-Gotman (2005).
‘Functional Neuroimaging of Odor Imagery’. Neuroimage, 24: 791–801.
Djordjevic, J., R. J. Zatorre, M. Petrides and M. Jones-Gotman (2004). ‘The Mind’s Nose:
Effects of Odor and Visual Imagery on Odor Detection’. Psychological Science, 15 (3):
143–8.
Dokic, J. (2010). ‘Perceptual Recognition and the Feeling of Presence’. In B. Nanay (ed.),
Perceiving the World, 33–53. Oxford: Oxford University Press.
Dominowski, R. L. and L. E. Bourne (1994). ‘History of Research on Thinking and
Problem Solving’. In R. J. Sternberg (ed.), Thinking and Problem Solving, 1–36. San
Diego, CA: Academic Press.
Dorsch, F. (2016). ‘Knowledge by Imagination – How Imaginative Experience Can
Ground Factual Knowledge’. Teorema: Revista Internacional De Filosofía, 35 (3):
87–116.
Dresner, E. (2002). ‘Measurement Theoretic Semantics and Semantics of Necessity’.
Synthese, 130: 413–40.
Dresner, E. (2006). ‘A Measurement Theoretic Account of Propositions’. Synthese, 153:
1–22.
Dresner, E. (2010). ‘Language and the Measure of Mind’. Mind & Language, 25 (4):
418–39.
Dresner, E. (2014). ‘Decision Theory, Propositional Measurement, and Unified
Interpretation’. Mind, 123 (491): 707–32.
Dretske, F. (1981). Knowledge and the Flow of Information. Cambridge, MA: MIT
Press.
Dummett, M. (1993). Origins of Analytical Philosophy. London: Duckworth.
Earnshaw, R. A. and D. Watson (eds) (1993). Animation and Scientific Visualization:
Tools and Applications. London: Academic Press.
Eaton, M. (1980). ‘Truth in Pictures’. The Journal of Aesthetics and Art Criticism, 39 (1):
15–26.
Eco, U. (1995). The Search for the Perfect Language. Oxford: Blackwell.
Literature 203
Fodor, J. (2008). LOT 2: The Language of Thought Revisited. Oxford: Oxford University
Press.
Fodor, J. and Z. Pylyshyn (1981). ‘How Direct is Visual Perception?: Some Reflections
on Gibson’s “Ecological Approach”’. Cognition, 9: 207–46.
Fodor, J. and Z. Pylyshyn (1988). ‘Connectionism and Cognitive Architecture: A
Critical Analysis’. Cognition, 28 (1–2): 3–71.
Foster, D. (2003). ‘Does Colour Constancy Exist?’ Trends in Cognitive Sciences, 7: 439–43.
Foster, D. (2011). ‘Colour Constancy’. Vision Research, 51: 674–700.
Francis, G. K. (2007). A Topological Picturebook. New York: Springer Verlag.
Frege, G. (1984). ‘Thoughts’. Translated by P. Geach and R. Stoothoff. In B. McGuinness
(ed.), Collected Papers on Mathematics, Logic, and Philosophy, 351–72. Oxford: Basil
Blackwell.
French, S. R. D. (2003). ‘A Model-Theoretic Account of Representation (Or, I Don't
Know Much About Art But I Know It Involves Isomorphism)’. Philosophy of Science,
70 (5): 1472–83.
Frigg, R. (2006). ‘Scientific Representation and the Semantic View of Theories’. Theoria,
21 (1): 49–65.
Frigg, R and J. Nguyen (2020). Modelling Nature: An Opinionated Introduction to
Scientific Representation. Cham: Springer.
Frixione, M. and A. Lombardii (2015). ‘Street Signs and Ikea Instruction Sheets: Pragmatics
and Pictorial Communication’. Review of Philosophy and Psychology, 6: 133–49.
Galton, F. (1880). ‘Statistics of Mental Imagery’. Mind, 5: 301–18.
Gardner, H. (2004). Frames of Mind: The Theory of Multiple Intelligences. New York:
Basic Books.
Gaskin, R. (2008). The Unity of the Proposition. Oxford: Oxford University Press.
Gauker, C. (2007). ‘A Critique of the Similarity Space Theory of Concepts’. Mind &
Language, 22 (4): 317–45.
Gauker, C. (2011). Words and Images: An Essay on the Origin of Ideas. Oxford: Oxford
university Press.
Gauker, C. (2017). ‘Three Kinds of Nonconceptual Seeing-as’. Review of Philosophy and
Psychology, 8 (4): 763–79.
Gauker, C. (2020). ‘On the Difference Between Realistic and Fantastic Imagining’.
Erkenntnis, 1–20.
Gendler, T. S. (2004). ‘Thought Experiments Rethought – and Reperceived’. Philosophy
of Science, 71: 1154–63.
Gersel, J. P. (2017). ‘Discriminatory Capacities, Russell's Principle, and the Importance
of Losing Sight of Objects’. European Journal of Philosophy, 25: 700–20.
Giaquinto, M. (2007). Visual Thinking in Mathematics: An Epistemological Study.
Oxford: Oxford University Press.
Giaquinto, M. (2008). ‘Visualizing in Mathematics’. In P. Mancosu (ed.), The Philosophy
of Mathematical Practice, 22–42. Oxford: Oxford University Press.
Literature 205
Hookway, C. (2007). ‘Peirce on Icons and Cognition’. In U. Priss, S. Polovina and R. Hill
(eds), Conceptual Structures: Knowledge Architectures for Smart Applications, 59–68.
Berlin; Heidelberg: Springer.
Hope, V. (2009). ‘Object Perception, Perceptual Recognition, and That-Perception
Introduction’. Philosophy, 84 (330): 515–28.
Hopkins, R. (1998). Picture, Image and Experience: A Philosophical Inquiry. Cambridge:
Cambridge University Press.
Hubbard, T. L. (1997). ‘Target Size and Displacement Along the Axis of Implied
Gravitational Attraction: Effects of Implied Weight and Evidence of Representational
Gravity’. Journal of Experimental Psychology: Learning, Memory, and Cognition, 23:
1484–93.
Hubbard, T. L. (2010). ‘Auditory Imagery: Empirical Findings’. Psychological Bulletin,
136 (2): 302–29.
Hume, D. (1975). A Treatise of Human Nature. Ed. L. A. Selby-Bigge, 2nd edn., revised
by P. H. Nidditch. Oxford: Clarendon Press.
Hume, D. (1977). An Enquiry Concerning Human Understanding. Ed. E. Steinberg.
Indianapolis, IN: Hackett.
Hummel, J. E. (2013). ‘Object Recognition’. In D. Reisberg (ed.), Oxford Handbook of
Cognitive Psychology, 32–46. Oxford: Oxford University Press.
Husserl, E. (2005). Phantasy, Image Consciousness, and Memory (1898–1925). Trans. R.
Bernet. Dordrecht: Springer.
Hyman, J. (2006). The Objective Eye: Color, Form, and Reality in the Theory of Art.
Chicago, IL: University of Chicago Press.
Hyman, J. (2012). ‘Depiction’. Royal Institute of Philosophy Supplements, 71: 129–50.
Hyman, J. (2015). ‘Depiction’. In P. F. Bundgaard and F. Stjernfelt (eds), Investigations in
the Phenomenology and the Ontology of the Work of Art: What are Artworks and How
Do We Experience Them?, 191–208. Heidelberg: Springer.
Ingarden, R. (1989). The Ontology of the Work of Art. Trans. R. Meyer and J. T.
Goldthwait. Athens, OH: Ohio University Press.
Inkpin, A. (2016). ‘Projection, Recognition, and Pictorial Diversity’. Theoria, 82:
32–55.
International Organization for Standardization (1994). Accuracy (Trueness and
Precision) of Measurement Methods and Results – Part 1: General Principles and
Definitions. Standard No. 5725-1.
Intons-Peterson, M. J. (1983). ‘Imagery Paradigms: How Vulnerable are They to
Experimenter's Expectations?’ Journal of Experimental Psychology: Human
Perception and Performance, 9: 394–412.
Intons-Peterson, M. J. and A. R. White (1981). ‘Experimenter Naiveté and Imaginal
Judgments’. Journal of Experimental Psychology: Human Perception and Performance,
7 (4): 833–43.
Irvine, E. (2014). ‘What Iconicity Can and Cannot Do for Proto-Language’. In E. A.
Cartmill, S. Roberts, H. Lyn and H. Cornish (eds), The Evolution of Language:
Literature 209
Keogh, R. and J. Pearson (2018). ‘The Blind Mind: No Sensory Visual Imagery in
Aphantasia’. Cortex, 105: 53–60.
Khalifa, K. (2017). Understanding, Explanation, and Scientific Knowledge. New York:
Cambridge University Press.
Kiefer, M. and F. Pulvermüller (2012). ‘Conceptual Representations in Mind and Brain:
Theoretical Developments, Current Evidence and Future Directions’. Cortex, 48: 805–25.
Kind, A. (2018). ‘How Imagination Gives Rise to Knowledge’. In F. Macpherson and F.
Dorsch (eds), Perceptual Memory and Perceptual Imagination, 227–46. New York:
Oxford University Press.
Kirsh, D. (2010). ‘Thinking with External Representations’. Artificial Intelligence and the
Simulation of Behaviour, 25: 441–54.
Kirsh, D. and P. Maglio (1994). ‘On Distinguishing Epistemic from Pragmatic Action’.
Cognitive Science, 18: 513–49.
Kitcher, P. and A. Varzi (2000). ‘Some Pictures are Worth 2[aleph]0 Sentences’.
Philosophy, 75 (3): 377–81.
Kjørup, S. (1974). ‘George Inness and the Battle at Hastings, or Doing Things With
Pictures’. Monist, 58 (2): 216–35.
Klatzky, R. L., S. J. Lederman and D. E. Matula (1991). ‘Imagined Haptic Exploration
in Judgements of Object Properties’. Journal of Experimental Psychology: Learning,
Memory, and Cognition, 17: 314–22.
Knauff, M. (2013). Space to Reason: A Spatial Theory of Human Thought. Cambridge,
MA; London: MIT Press.
Knauff, M. and P. N. Johnson-Laird (2002). ‘Visual Imagery Can Impede Reasoning’.
Memory and Cognition, 30: 363–71.
Kobayashi, M., M. Takeda, N. Hattori, M. Fukunaga, T. Sasabe, N. Inoue, Y. Nagai, T.
Sawada, N. Sadato and Y. Watanabe (2004). ‘Functional Imaging of Gustatory Perception
and Imagery: “Top-Down” Processing of Gustatory Signals’. NeuroImage, 23: 1271–82.
Kosslyn S. M. (1980). Image and Mind. Cambridge, MA: Harvard University Press.
Kosslyn, S. M. (1994). Image and Brain: The Resolution of the Imagery Debate.
Cambridge, MA: MIT Press.
Kosslyn, S. M., T. M. Ball and B. J. Reiser (1978). ‘Visual Images Preserve Metric Spatial
Information: Evidence from Studies of Image Scanning’. Journal of Experimental
Psychology: Human Perception and Performance, 4: 47–60.
Kosslyn, S. M., W. L. Thompson and G. Ganis (2006). The Case for Mental Imagery.
Oxford: Oxford University Press.
Kosso, P. (2007). ‘Scientific Understanding’. Foundations of Science, 12: 173–88.
Kozak, P. (2020) ‘The Diagram Problem’. In A.-V. Pietarinen, P. Chapman, L. Bosveld-de
Smet, V. Giardino, J. Corter and S. Linker (eds), Diagrammatic Representation and
Inference, 216–24. Cham: Springer.
Kozak, P. (2021). ‘The Analog-Digital Distinction Fails to Explain the Perception-
Thought Distinction: An Alternative Account of the Format of Mental
Representation’. Studia Semiotyczne, 35 (1): 73–94.
Literature 211
Kozhevnikov, M., S. Kosslyn and J. Shephard (2005). ‘Spatial vs. Object Visualizers: A New
Characterization of Visual Cognitive Style’. Memory and Cognition, 33 (4): 710–26.
Kozhevnikov, M., O. Blazhenkova and M. Becker (2010). ‘Trade-Off in Object Versus
Spatial Visualization Abilities: Restriction in the Development of Visual-Processing
Resources’. Psychonomic Bulletin & Review, 17 (1): 29–35.
Krantz, D., R. D. Luce, A. Tversky and P. Suppes (1971). Foundations of Measurement
Volume I: Additive and Polynomial Representations. San Diego and London:
Academic Press.
Kripke, S. A. (1980). Naming and Necessity. Cambridge, MA: Harvard University Press.
Kripke, S. A. (1982). Wittgenstein on Rules and Private Language. Oxford: Blackwell.
Kulpa, Z. (1987). ‘Putting Order in the Impossible’. Perception, 16: 201–14.
Kulpa, Z. (2009). ‘Main Problems of Diagrammatic Reasoning. Part I: The
Generalization Problem’. Foundations of Science, 14 (1–2): 75–96.
Kulvicki, J. (2006a). On Images: Their Structure and Content. Oxford: Clarendon Press.
Kulvicki, J. (2006b). ‘Pictorial Representation’. Philosophy Compass, 1 (6): 535–46.
Kulvicki, J. (2014). Images. New York: Routledge.
Kulvicki, J. (2020). Modeling the Meanings of Pictures: Depiction and the Philosophy of
Language. Oxford: Oxford University Press.
Kvanvig, J. L. (2018). ‘Knowledge, Understanding, and Reasons for Belief ’. In D. Starr
(ed.), The Oxford Handbook of Reasons and Normativity, 685–715. New York: Oxford
University Press.
Lakoff, G. (1994). ‘What is Metaphor?’ In J. A. Barnden and K. J. Holyoak (eds),
Advances in Connectionist and Neural Computation Theory: Vol. 3, Analogy,
Metaphor, and Reminding, 203–57. Norwood, NJ: Ablex.
Lakoff, G. and M. Johnson (1980). Metaphors We Live By. Chicago, IL: University of
Chicago Press.
Langacker, R. W. (1987). Foundations of Cognitive Grammar. Stanford, CA: Stanford
University Press.
Langland-Hassan, P. (2015). ‘Imaginative Attitudes’. Philosophy and Phenomenological
Research, 40: 664–86.
Langland-Hassan, P. (2020). Explaining Imagination. Oxford: Oxford University Press.
Larkin, J. H. and H. A. Simon (1987). ‘Why a Diagram is (Sometimes) Worth Ten
Thousand Words’. Cognitive Science, 11: 65–99.
Latour, B. (1990). ‘Visualisation and Cognition: Drawing Things Together’. In M. Lynch
and S. Woolgar (eds), Representation in Scientific Activity, 19–68. Cambridge, MA:
MIT Press.
Laurence, S. and E. Margolis (2012). ‘The Scope of the Conceptual’. In E. Margolis, R.
Samuels and S. Stich (eds), The Oxford Handbook of Philosophy of Cognitive Science,
291–317. New York: Oxford University Press.
Lehrer, K. (2000). ‘Meaning, Exemplarization and Metarepresentation’. In D. Sperber
(ed.), Metarepresentations: A Multidisciplinary Perspective, 299–310. Oxford: Oxford
University Press.
212 Literature
Nanay, B. (2015). ‘Perceptual Content and the Content of Mental Imagery’. Philosophical
Studies, 172: 1723–36.
Nanay, B. (2017). ‘Pain and Mental Imagery’. The Monist, 100: 485–500.
Nanay, B. (2018). ‘The Importance of Amodal Completion in Everyday Perception’.
i-Perception, 9 (4): 1–16.
Nanay, B. (2021). ‘Unconscious Mental Imagery’. Philosophical Transactions of the Royal
Society B, 376 (1817): 20190689.
Narens, L. (1985). Abstract Measurement Theory. Cambridge, MA: MIT Press.
Neander, K. (1987). ‘Pictorial Representation: A Matter of Resemblance’. British Journal
of Aesthetics, 27 (3): 213–26.
Nelsen, R. B. (1993). Proofs Without Words: Exercises in Visual Thinking. Washington:
The Mathematical Association of America.
Nersessian, N. (2008). Creating Scientific Concepts. Cambridge, MA: MIT Press.
Newell, A. and H. A. Simon (1972). Human Problem Solving. Englewood Cliffs, NJ:
Prentice Hall.
Newall, M. (2011). What is a Picture? Depiction, Realism, Abstraction. London: Palgrave
Macmillan.
Newton, N. (1982). ‘Experience and Imagery’. The Southern Journal of Philosophy, 21:
475–87.
Novitz, D. (1977). Pictures and Their Use in communication: A Philosophical Essay. The
Hague: Martinus Nijhoff.
Novitz, D. (1988). ‘Review of Deeper into Pictures: An Essay on Pictorial
Representation’. British Journal of Aesthetics, 28 (1): 87–9.
Nunberg, G. (1993). ‘Indexicality and Deixis’. Linguistics and Philosophy, 16: 1–43.
Nyíri, J. C. (2001). ‘The Picture Theory of Reason’. In B. Brogaard and B. Smith (eds),
Rationality and Irrationality (Schriftenreihe-Wittgenstein Gesellschaft, Vol. 29),
242–66. Vienna: Öbv&hpt.
Nyíri, K. (2013). ‘Wittgenstein’s Philosophy of Pictures’. In A. Pichler and S. Säätelä
(eds), Wittgenstein: The Philosopher and his Works, 322–53. Berlin; Boston: De
Gruyter.
Otis, L. (2015). Rethinking Thought: Inside the Minds of Creative Scientists and Artists.
Oxford; New York: Oxford University Press.
Overgaard, S. (2022). ‘Seeing-as, Seeing-o, and Seeing-that’. Philosophical Studies, 179
(11): 1–20.
Pagin, P. and D. Westerståhl (2010). ‘Compositionality I: Definitions and Variants’.
Philosophy Compass, 5: 250–64.
Paivio, A. (1963). ‘Learning of Adjective-Noun Paired Associates as a Function
of Adjective-Noun Word Order and Noun Abstractness’. Canadian Journal of
Psychology, 17: 370–79.
Paivio, A. and R. Harshman (1983). ‘Factor Analysis of a Questionnaire on Imagery and
Verbal Habits and Skills’. Canadian Journal of Psychology, 37: 461–83.
Peacocke, C. (1986). Thoughts: An Essay on Content. Oxford: Basil Blackwell.
216 Literature
Priest, G. (1997). ‘Sylvan’s Box: A Short Story and Ten Morals’. Notre Dame Journal of
Formal Logic, 38 (4): 573–82.
Priest, G. (1999). ‘Perceiving Contradictions’. Australasian Journal of Philosophy, 77 (4):
439–46.
Prinz, J. J. (2002). Furnishing the Mind: Concepts and Their Perceptual Basis. Cambridge,
MA: MIT Press.
Proclus (1992). A Commentary on the First Book of Euclid’s Elements. Trans. G. Morrow.
Princeton, NJ: Princeton University Press.
Putnam, H. (1981). Reason, Truth, and History. Cambridge: Cambridge University
Press.
Pylyshyn, Z. W. (1973). ‘What the Mind’s Eye Tells the Mind’s Brain: A Critique of
Mental Imagery’. Psychological Bulletin, 80: 1–25.
Pylyshyn, Z. W. (2002). ‘Mental Imagery: In Search of a Theory’. Behavioral and Brain
Sciences, 25: 157–82.
Pylyshyn, Z. W. (2003a). Seeing and Visualizing: It’s Not What You Think. Cambridge,
MA: MIT Press.
Pylyshyn, Z. W. (2003b). ‘Return of the Mental Image: Are There Really Pictures in the
Brain?’ TRENDS in Cognitive Sciences, 7 (3): 113–18.
Pylyshyn, Z. W. (2007). Things and Places: How the Mind Connects with the World.
Cambridge, MA: MIT Press.
Quilty-Dunn, J. (2016). ‘Iconicity and the Format of Perception’. Journal of
Consciousness Studies, 23 (3–4): 255–63.
Quilty-Dunn, J. (2020). ‘Perceptual Pluralism’. Noûs, 54 (4): 807–38.
Quilty-Dunn, J. and E. Mandelbaum (2020). ‘Non-Inferential Transitions: Imagery and
Association’. In T. Chan and A. Nes (eds), Inference and Consciousness, 151–71. New
York: Routledge.
Quine, W. V. O. (1956). ‘Quantifiers and Propositional Attitudes’. Journal of Philosophy,
5: 177–87.
Quine, W. V. O. (1960). Word and Object. Cambridge, MA: MIT Press.
Quine, W. V. O. (1968). ‘Ontological Relativity’. Journal of Philosophy, 65: 185–212.
Quine, W. V. O. (1970). ‘Methodological Reflections on Current Linguistic Theory’.
Synthese, 21 (3–4): 386–98.
Raftopoulos, A. (2009). Cognition and Perception: How do Psychology and Neural
Science Inform Philosophy? Cambridge, MA: MIT Press.
Raftopoulos, A. and J. Zeimbekis (2015). ‘The Cognitive Penetrability of Perception –
An Overview’. In J. Zembeiskis and A. Raftopolous (eds), The Cognitive Penetrability
of Perception: New Philosophical Perspectives, 1–56. Oxford: Oxford University
Press.
Ragni, M. and M. Knauff (2013). ‘A Theory and a Computational Model of Spatial
Reasoning with Preferred Mental Models’. Psychological Review, 120 (3): 561–88.
Rayo, A. (2013). The Construction of Logical Space. Oxford: Oxford University Press.
Recanati, F. (2012). Mental Files. New York: Oxford University Press.
218 Literature
Salis, F. and R. Frigg (2020). Capturing the Scientific Imagination. In A. Levy and P.
Godfrey-Smith (eds), The Scientific Imagination: Philosophical and Psychological
Perspectives, 17–50. New York: Oxford University Press.
Salje, L. (2019). ‘Talking Our Way to Systematicity’. Philosophical Studies, 176: 2563–88.
Sartre, J.-P. (1962). Imagination: A Psychological Critique. Ann Arbor, MI: University of
Michigan Press.
Sartwell, C. (1991). ‘Natural Generativity and Imitation’. British Journal of Aesthetics, 31
(1): 58–67.
Scaife, M. and Y. Rogers (1996). ‘External Cognition: How Do Graphical
Representations Work?’ International Journal of Human-Computer Studies, 45:
185–213.
Scarry, E. (1999). Dreaming by the Book. Princeton, NJ: Princeton University Press.
Schier, F. (1986). Deeper Into Pictures. Cambridge: Cambridge University Press.
Schooler, J. W., M. Fallshore and S. M. Fiore (1995). ‘Epilogue: Putting Insight Into
Perspective’. In R. J. Sternberg and J. E. Davidson (eds), The Nature of Insight,
559–88. Cambridge, MA; London: MIT Press.
Schöttler, T. (2017). ‘Pictorial Evidence: On the Rightness of Pictures’. In N. Mößner
and A. Nordmann (eds), Reasoning in Measurement, 111–29. London; New York:
Routledge.
Sebeok, T. A. (1979). The Sign & Its Masters. Austin: University of Texas Press.
Sellars, W. (1949). ‘Language, Rules and Behavior’. In S. Hook (ed.), John Dewey:
Philosopher of Science and Freedom, 289–315. New York: Dial Press.
Sellars, W. (1963). Science, Perception and Reality. Atascadero: Ridgeview.
Sellars, W. (1969). ‘Language as Thought and as Communication’. Philosophy and
Phenomenological Research, 29 (4): 506–27.
Sellars, W. (2007). In the Space of Reasons: Selected Essays of Wilfrid Sellars. Ed. K.
Scharp, R. B. Brandom. Cambridge, MA: Harvard University Press.
Shabel, L. (2003). Mathematics in Kant’s Critical Philosophy: Reflections on Mathematical
Practice. New York: Routledge.
Shah, P. and A. Miyake (1996). ‘The Separability of Working Memory Resources for
Spatial Thinking and Language Processing’. Journal of Experimental Psychology:
General, 125: 4–27.
Shaver, P., L. Pierson and S. Lang (1975). ‘Converging Evidence for the Functional
Significance of Imagery in Problem Solving’. Cognition, 3: 359–75.
Shepard, R. N. (1978). ‘Externalization of Mental Images and the Act of Creation’.
In B. S. Randhawa and W. E. Coffman (eds), Visual Learning, Thinking, and
Communication, 133–89. New York: Academic Press.
Shepard, R. N. (1990). ‘On Understanding Mental Images’. In H. Barlow, C. Blakemore
and M. Westo-Smith (eds), Images and Understanding: Thoughts about Images: Ideas
about Understanding, 365–70. Cambridge: Cambridge University Press.
Shepard, R. N. and L. A. Cooper (1982). Mental Images and Their Transformations.
Cambridge, MA: MIT Press.
220 Literature
Solt, K. (1989). ‘Pictures and Truth’. British Journal of Aesthetics, 29 (2): 154–9.
Sorensen, R. (2002). ‘The Art of the Impossible’. In T. S. Gendler and J. Hawthorne
(eds), Conceivability and Possibility, 337–68. Oxford: Oxford University Press.
Spelke, E. S. and K. D. Kinzler (2007). ‘Core Knowledge’. Developmental Science, 10:
89–96.
Squires, R. (1969). ‘Depicting’. Philosophy, 44: 193–204.
Stanley, J. and T. Williamson (2001). ‘Knowing How’. The Journal of Philosophy, 98 (8):
411–44.
Stegmüller, W. (1969). Wissenschaftliche Erklärung und Begründung. Berlin: Springer
Verlag.
Stenning, K. (2002). Seeing Reason: Image and Language in Learning to Think. Oxford:
Oxford University Press.
Stenning, K. and O. Lemon (2001). ‘Aligning Logical and Psychological Perspectives on
Diagrammatic Reasoning’. Artificial Intelligence Review, 15 (1–2): 29–62.
Stenning, K. and J. Oberlander (1995). ‘A Cognitive Theory of Graphical and Linguistic
Reasoning: Logic and Implementation’. Cognitive Science, 19: 97–140.
Stenning, K. and P. Yule (1997). ‘Image and Language in Human Reasoning: A
Syllogistic Illustration’. Cognitive Psychology, 34 (2): 109–59.
Sternberg, R. J. (1980). ‘Representation and Process in Linear Syllogistic Reasoning’.
Journal of Experimental Psychology: General, 109 (2): 119–59.
Stich, S. P. (1978). ‘Beliefs and Subdoxastic States’. Philosophy of Science, 45 (4): 499–518.
Stjernfelt, F. (2000). ‘Diagrams as Centerpiece of a Peircean Epistemology’. Transactions
of the Charles S. Peirce Society, 36 (3): 357–84.
Stjernfelt, F. (2007). Diagrammatology: An Investigation on the Borderlines of
Phenomenology, Ontology and Semiotics. Dordrecht: Springer.
Strawson, P. F. (1963). Individuals: An Essay in Descriptive Metaphysics. New York:
Anchor Books.
Suárez, M. (2003). ‘Scientific Representation: Against Similarity and Isomorphism’.
International Studies in the Philosophy of Science, 17 (3): 225–44.
Swoyer, C. (1987). ‘The Metaphysics of Measurement’. In J. Forge (ed.), Measurement,
Realism and Objectivity: Essays on Measurement in the Social and Physical Sciences,
235–90. Dordrecht: Springer.
Swoyer, C. (1991). ‘Structural Representation and Surrogative Reasoning’. Synthese, 87
(3): 449–508.
Tal, E. (2013). ‘Old and New Problems in Philosophy of Measurement’. Philosophy
Compass, 8 (12): 1159–73.
Tal, E. (2017). ‘A Model-Based Epistemology of Measurement’. In N. Mößner and
A. Nordmann (eds), Reasoning in Measurement, 233–53. London; New York:
Routledge.
Tarski, A. (1956). Logic, Semantics, Metamathematics: Papers from 1923 to 1938. Oxford:
Oxford University Press.
Teller, P. (2013). ‘The Concept of Measurement-Precision’. Synthese, 190: 189–202.
222 Literature
Wollheim, R. (1980). Art and its Objects: Second Edition with Six Supplementary Essays.
Cambridge: Cambridge University Press.
Wollheim, R. (1984). The Thread of Life. Cambridge: Cambridge University Press.
Wollheim, R. (1987). Painting as an Art. Princeton, NJ: Princeton University Press.
Wollheim, R. (1993). The Mind and its Depths. Cambridge, MA; London: Harvard
University Press.
Wollheim, R. (1998). ‘On Pictorial Representation’. The Journal of Aesthetics and Art
Criticism, 56 (3): 217–26.
Wollheim, R. (2003). ‘What Makes Representational Painting Truly Visual?’ Aristotelian
Society Supplementary Volume, 77 (1): 131–47.
Yolton, J. W. (1996). Perception and Reality: A History from Descartes to Kant. Ithaca,
NY: Cornell University Press.
Yoo, S.-S., D. K. Freeman, J. J. III McCarthy and F. A. Jolesz (2003). ‘Neural Substrates of
Tactile Imagery: A Functional MRI Study’. Neuroreport, 14 (4): 581–5.
Zacks, J. M., E. Levy, B. Tversky and D. J. Schiano (2002). ‘Graphs in Print’. In M.
Anderson, B. Meyer and P. Olivier (eds), Diagrammatic Representation and
Reasoning, 187–206. London: Springer.
Zagzebski, L. (2019). ‘Toward a Theory of Understanding’. In S. R. Grimm (ed.),
Varieties of Understanding: New Perspectives from Philosophy, Psychology, and
Theology, 123–36. New York: Oxford University Press.
Zatorre, R. J., A. R. Halpern and M. Bouffard (2010). ‘Mental Reversal of Imagined
Melodies: A Role for the Posterior Parietal Cortex’. Journal of Cognitive Neuroscience,
22 (4): 775–89.
Zeimbekis, J. (2010). ‘Pictures and Singular Thought’. The Journal of Aesthetics and Art
Criticism, 68 (1): 11–21.
Zeimbekis, J. (2015). ‘Seeing, Visualizing, and Believing: Pictures and Cognitive
Penetration’. In J. Zeimbekis and A. Raftopoulos (eds), The Cognitive Penetrability of
Perception: New Philosophical Perspectives, 298–327. Oxford: Oxford University Press.
Zeman, A. Z. J., S. Della Sala, L. A. Torrens, V.-E. Gountouna, D. J. McGonigle and R.
H. Logie (2010). ‘Loss of Imagery Phenomenology with Intact Visuo-Spatial Task
Performance: A Case of “Blind Imagination”’. Neuropsychologia, 48: 145–55.
Zeman, A., M. Dewar and S. Della Sala (2015). ‘Lives without Imagery – Congenital
Aphantasia’. Cortex, 73: 378–80.
Zeman, A., F. Milton, S. Della Sala, M. Dewar, T. Frayling, J. Gaddum, A. Hattersley, B.
Heuerman-Williamson, K. Jones, M. MacKisack and C. Winlove (2020). ‘Phantasia
– The Psychological Significance of Lifelong Visual Imagery Vividness Extremes’.
Cortex, 130: 426–40.
Zhang, J. (1997). ‘The Nature of External Representations in Problem Solving’. Cognitive
Science, 21: 179–217.
Zhao, F., W. Schnotz, I. Wagner and R. Gaschler (2020). ‘Texts and Pictures Serve
Different Functions in Conjoint Mental Model Construction and Adaptation’.
Memory and Cognition, 48 (1): 69–82.
Index
Goodman, N. 4, 12, 19, 32, 65, 70, 99, literary (metaphor) 10, 12
103, 124–6, 129, 168, 175–6, 184, measurement-theoretic account
191 n.1, 193 n.4 of 137–42
Goodwin, W. 29 olfactory 10, 12, 139–40
Grandin, T. 21 tactile 10
Green, E. J. 44, 59, 108, 114, 121 imageless thought controversy 18
Greenberg, D. L. 17 image-propositions 32
Greenberg, G. 9, 29, 103, 123–4, 192 n.1 imagistic cognition 62–3
Gregory, D. 11, 151, 156, 186 n.5 imagistic theory of thought 13, 19, 21–3,
Gregory, R. L. 144 25–6, 55, 57–9, 73, 149, 158, 175,
Grzankowski, A. 13, 37 183, 189 n.3
Guillot, A. 10 impossible fork 124, 143, 145–7
Gurr, C. 22 indexicals 132, 136, 152, 154, 165
indices 9–10, 108, 138
Hacker, P. 18, 64–5, 69 informative richness 169
Halpern, A. R. 10 Ingarden, R. 193 n.2
Harshman, R. 188 Inkpin, A. 112
Haugeland, J. 12, 33, 168 instrumental nature of images 17–18,
Hawthorne, J. 41 20, 22, 26, 28, 181
Heck, R. G. 29, 35, 43 intentionality 39–40
Hegarty, M. 21, 79 Intons-Peterson, M. J. 7
Heuer, F. 174 irreducibility thesis 15, 19–20, 23, 25–6
Hintikka, J. 37 Irvine, E. 185 n.2
Hinton, G. 174 Isaac, A. M. C. 117
Holyoak, K. J. 186 n.1, 188 n.1 Ishai, A. 174
homomorphism 191 n.12, 193 n.7 isomorphism 128, 191 n.12
Hookway, C. 9 Ison, M. J. 116
Hope, V. 106–7 Ittelson, W. H. 174
Hopkins, R. 11, 105, 127 Izard, V. 189 n.6
Hubbard, T. L. 8, 10
Hume, D. 47, 58, 187 n.8, 188 n.2 Jackson, F. 45
Hummel, J. E. 107, 192 n.5 Jäkel, F. 51
Husserl, E. 131, 193 n.2 Jakubowski, K. 10
Hyman, J. 80, 99, 105, 124, 193 n.4 James, M. 111
Johnson, K. 194 n.2
iconic convention 120–2, 124–5, Johnson, M. 185 n.2
129–33, 142, 194 n.8 Johnson-Laird, P. N. 6, 22, 188 n.1
iconic denotation 126–9, 131–2
iconic difference 3 Kamermans, K. L. 174
identification by description 106, 110 Kan, I. P. 58
image (icon) Kant, I. 27, 66, 70, 184, 187 n.11, 190 n.4
auditory 5, 10, 12–13, 139–40, 165 Kaplan, D. 33, 37, 40, 110, 128, 155
emotional 10 Kasem, A. 188 n.2
external (pictures) 9, 12, 185 n.4 Kaufmann, G. 19
gustatory 10, 164, 170 Kennedy, J. M. 144
haptic 10 Keogh, R. 177, 185 n.3
iconic representation 3, 8–12, 21, 42, Khalifa, K. 152
44, 99, 121, 164, 168–73, 178, 180, Kiefer, M. 58
183, 192 n.3 Kind, A. 28
impossible 123–4, 142–8, 194 n.11 Kinzler, K. D. 189 n.3
Index 229
semantical challenge 39–43, 149, 158, systematicity 15–16, 31, 43–6, 48, 53, 63,
186 n.5 109, 112, 137–8, 149, 164–7, 180–2,
Shabel, L. 190 n.4 186 n.1, 186 n.6
Shah, P. 56
Shaver, P. 6 Tal, E. 135, 141, 156
Shepard, R. N. 17, 19, 56, 150, 154, 173, target 16, 83, 87, 89, 108–11, 114–16,
188 n.1 123–33, 135–6, 141–2, 146–7, 149,
Sheredos, B. 5, 185 n.4 156, 160–3, 182, 191 n.13, 192 n.1,
Shimojima, A. 22, 186 n.2 193 n.2, 193 nn.4–5, 194 n.11
Short, T. S. 9 Tarski, A. 31
Siewert, C. 107 Teller, P. 135
Simon, H. A. 21–3, 28, 46, 50, 169 Terrone, E. 111, 127, 129
Skinner, B. F. 51 Teufel, C. 189 n.5
Slezak, P. 2, 64, 174–5 Thagard, P. 10, 17, 21, 188 n.1
Sloman, A. 5, 28 Thielen, J. 178
Sober, E. 44 thinking
Soles, D. 188 n.2 animal 19
Solomon, K. O. 43 imagistic 2, 4, 11–15, 18, 20, 23–8,
Solt, K. 29 34, 36, 42–3, 48–9, 51–3, 55–60,
Sorensen, R. 142 62–3, 68, 70, 73, 75, 149–81, 187
space n.1, 188 n.1
informational 83, 85–6, 90, 99, 113, individual differences in 188 n.1
117, 121, 129 language-like 2, 5, 12–13, 18, 44, 46,
logical 14, 51–2, 83, 86, 105, 114, 62, 173, 181
127, 129, 131–2, 152–4, 157, 165, measurement-theoretic account of
176, 180, 182, 191 n.7 13–15, 149, 179–80, 183
perceptual similarity 62–3, 108 metaphors of 49–52
physical 14, 51–2, 83, 86, 105, 127, propositional 12–13, 25, 182
131–3, 152–4, 157, 165, 180, 182, thinking styles 188 n.1
191 n.7 Thomas, N. J. T. 11, 185 n.2
spatial representations 186 n.2, 194 n.11 thought
Spelke, E. S. 189 n.3 bearer of 15, 20, 26–8, 31–2, 45, 53,
Squires, R. 104 56, 65, 67, 69–70, 164
standard metre, 118, 133, 135, 140, 162, imagistic 2, 4, 11, 15, 18, 20, 23–7,
178, see also measurement 42, 51, 56–60, 63, 68, 70, 73, 75,
Stanley, J. 151 149, 181, 187 n.1
Stegmüller, W. 66 thought acquisition 47, 57
Stenning, K. 12, 22, 191 n.10 thoughts-perception border 59–64,
Sternberg, R. J. 6 73, 179
Stich, S. P. 35 Tichy, P. 82, 190 n.3
Stjernfelt, F. 9, 22, 186 n.2 Traditional View 15, 39, 43, 73, 75, 140,
Strawson, P. F. 109 142, 148–50, 157–8, 161, 164, 175,
structural similarity 93–6, 98, 116 181–2
Suárez, M. 32, 95, 191 n.12 translatability thesis 15, 23–6, 49
Swoyer, C. 17, 93, 131, 135, 137, Troscianko, E. T. 10
186 n.2 truth
symbols (symbolic representations) 6, truth-bearers 28–9, 31, 34–5, 150
9–10, 18, 22, 25, 31, 69, 72, truth-conditions 28, 31, 36, 38, 135,
90, 104–5, 121, 129, 187 n.7, 185 n.5
190 n.4 truth-makers 29
Index 233