Neurosci 02 00010
Neurosci 02 00010
Retired from Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA;
[email protected]
Abstract: Cognition is often defined as a dual process of physical and non-physical mechanisms.
This duality originated from past theory on the constituent parts of the natural world. Even though
material causation is not an explanation for all natural processes, phenomena at the cellular level
of life are modeled by physical causes. These phenomena include explanations for the function of
organ systems, including the nervous system and information processing in the cerebrum. This
review restricts the definition of cognition to a mechanistic process and enlists studies that support
an abstract set of proximate mechanisms. Specifically, this process is approached from a large-scale
perspective, the flow of information in a neural system. Study at this scale further constrains the
possible explanations for cognition since the information flow is amenable to theory, unlike a lower-
level approach where the problem becomes intractable. These possible hypotheses include stochastic
processes for explaining the processes of cognition along with principles that support an abstract
format for the encoded information.
1. Introduction
1.1. The Many Definitions of Cognition
Citation: Friedman, R. Cognition as
Common definitions of cognition often include the phrase mental process or acqui-
a Mechanical Process. NeuroSci 2021,
sition of knowledge. Reference to mental processing descends from an assignment of
2, 141–150. https://doi.org/10.3390/
non-material substances to the act of thinking. Philosophers, such as the Cartesians and
neurosci2020010
Platonists, have written on this topic, including the relationship between mind and matter.
This perspective further involves concepts such as consciousness and intentionality. How-
Academic Editor: Lucilla Parnetti
ever, these ideas are based on metaphysical explanations and not on a modern scientific
Received: 29 March 2021
interpretation [1].
Accepted: 21 April 2021
The metaphysical approach is exemplified by the philosopher Plato and his Theory of
Published: 22 April 2021 Forms, a hypothesis of how knowledge is acquired. The idea is that a person is aware of
an object, such as a kitchen table, by comparison with an internal representation of that
Publisher’s Note: MDPI stays neutral object’s true form. The modern equivalent of this hypothesis is that our recognition of an
with regard to jurisdictional claims in object is by the similarity of its measurable properties with its true form. According to this
published maps and institutional affil- theory, these true and perfect forms originate in the non-material world.
iations. However, face recognition in primates shows that an object’s measured attributes
are not compared against a true form, but instead that recognition is from a comparison
between stored memory and a set of linear metrics of the object [2]. These findings agree
with studies of artificial neural networks, an analog of cerebral brain structure, where
Copyright: © 2021 by the author.
objects are recognized as belonging to a category without prior knowledge of the true
Licensee MDPI, Basel, Switzerland. categories [3].
This article is an open access article The theory of true forms originates from a thinking of a perfectly designed world
distributed under the terms and with deterministic processes, while a theory absent of true forms may instead depend
conditions of the Creative Commons on probabilistic processes. The rise of probabilistic thinking in natural science has co-
Attribution (CC BY) license (https:// incided with modern statistical methods and explanations of natural phenomena at the
creativecommons.org/licenses/by/ atomic level [4].
4.0/).
A modern experimental biologist would approach a study of the mind from a material
perspective, such as by the study of the cells and tissue of brain matter. This approach is
dependent on reduction of the complexity of a problem. An example is from economics,
where an individual is generalized as a single type and consequently the broader theories
of population behavior are based on this assumption [5]. There is a similar approach in
Newtonian physics where an object’s spatial extent is simplified as a single point in space.
Since some natural phenomena are not tractable to mechanistic study, concepts exist
that are not solely based on material and physical causes. However, it is necessary to
base science theory of brain function on natural mechanisms while disallowing mental
causation. There are exceptions where the physical world is visually indescribable and
solely dependent on mathematical description, but these occurrences are typically not
applicable to the investigation of life at the cellular level.
system. Most of the examples are in the visual system since that is the better studied of the
sensory systems, and is supported by the theories of optics and information flow. Lastly,
there is a section on general cognition that approaches the problem from an evolutionary
perspective.
sensory data consist of reflected light rays that are absorbed across a two-dimensional
surface, the retinal
surface, the retinalcells
cellsofofthe
theeye.
eye.These
Theselight
lightrays
raysrange
rangeacross
across the
the electromagnetic
electromagnetic spec-
spectra,
tra, but the retinal cells are specific to a small subset of all possible
but the retinal cells are specific to a small subset of all possible light rays. light rays.
From an abstract
abstract perspective,
perspective,thethesurface
surfacethatthatreceives
receivesthe visual
the visualinput
inputis aistwo-di-
a two-
mensional
dimensional sheet of cells
sheet where
of cells eacheach
where cell has
cell an
hasactivation value value
an activation at a point
at ainpoint
time in(Figure
time
1). Over1).
(Figure a length
Over aof time,of
length the distribution
time, of theseofactivations
the distribution is undergoing
these activations change,change,
is undergoing so the
neural system
so the neural is reporting
system fromfrom
is reporting a dynamic
a dynamic state of of
state activations.
activations.This
Thisview
viewatat the
the visual
surface is representative of both the spatial and and temporal
temporal components
components of the the proximate
proximate
cause of vision.
Figure 1. An abstract representation of data that are received by a sensory organ, such as light rays
that
that are
are absorbed
absorbed by
by cells
cells along
alongthe
thesurface
surfaceofofthe
theretina
retinaofofa acamera
cameraeye.
eye.The
Thedrawing
drawingshows thethe
shows
spatial pattern, but there is also a temporal dimension since this sensory input data are changing
spatial pattern, but there is also a temporal dimension since this sensory input data are changing
over time.
over time.
This
This representation
representation of ofsensory
sensorydata datais is
similar
similarto to
that received
that receivedby artificial neural
by artificial net-
neural
work
networksystems. These
systems. artificial
These systems
artificial are capable
systems of identifying
are capable objects
of identifying in a visual
objects scene
in a visual
and
scenelabeling them by
and labeling themtheir
bymembership
their membership to a category of related
to a category objects.objects.
of related This also shows
This also
analogous function
shows analogous between
function the artificial
between processprocess
the artificial and natural cognition
and natural [20]. [20].
cognition
The open problem has been been generalizing
generalizing this knowledge
knowledge (transfer
(transfer learning) that is is
acquired from processing sensory input data. This is the essential problem for for artificial
artificial
systems in emulating cognition in animals. However, there is recent work work that
that employs
employs
artificial models of transfer learning [21,22].
A related problem
problem is is in
in identifying
identifying an anobject
objectwhere
wherethetheviewpoint
viewpointisisvariable.
variable.It It
is
addressed
is addressed bybya model
a model [3][3]
that
thatis designed
is designed forfor
biological
biologicalrealism, along
realism, with
along witha robust
a robustar-
chitecture
architecture forfor
sampling
sampling thethe
parts of an
parts ofobject. This approach
an object. includes
This approach the sampling
includes of vis-
the sampling
of visual
ual data which
data which are thenareencoded
then encoded in an abstract
in an abstract format,format,
a vectora of
vector of number
number values. values.
Specif-
Specifically,
ically, this sampling
this sampling occursoccurs
acrossacross
blocksblocks of columns
of columns in a visual
in a visual scene.
scene. Further,
Further, eacheachcol-
column
umn consists
consists of aof
seta of
setvectors
of vectorswhere where
eacheach vector
vector is assigned
is assigned to a discrete
to a discrete categorycategory
by its
by itsof
level level of representation
representation of thedata
of the input input data (Figure
(Figure 2). These2).processed
These processed
data aredata
thenare then
utilized
utilized
for findingfor columns
finding columns
of similarityof similarity that correspond
that correspond to the partsto of
theanparts of aan
object, object, a
consensus-
consensus-based
based approach
approach toward toward establishing
establishing a robust identification
a robust identification of an object.of an object.
Sci 2021, 2, FOR PEER REVIEW 5
Figure 2. A model for processing of visual objects. The first panel shows a visual scene. The next
panel shows an open circle which represents a region with a potential object. The third panel is an
Figure
enlargement ofFigure 2. A model
this region. The finalfor processing of visual objects. The first panel shows a visual scene. The next
2. A model for panel contains
processing three objects.
of visual open diagonal shapes
The first panelthat areaabstract
shows visual scene. The next
panel
representationspanel shows
of theshows an open
information in circle
the which
image. represents
They are a region
ordered fromwith a
bottompotential
to top object.
by
an open circle which represents a region with a potential object. The low The thirdpanel
to third panelisisan
an
high level of abstraction.
enlargement of this region. The final panel contains three open diagonal shapes that
enlargement of this region. The final panel contains three open diagonal shapes that are abstract are abstract
representations of
representations of the
the information
informationin inthe
theimage.
image.They
Theyare
areordered
orderedfrom
frombottom
bottomtototop
topbybylow
lowtotohigh
Previoushighapproaches
level level to artificial
of abstraction.
of abstraction. systems have often overfit the network model to a
training data set. Overfitting hinders the generalizability of the final model [23]—in this
Previous
case, the model is aPrevious approaches
networkapproaches to artificial
artificial systems
of nodes interconnected
to systems have often
with weight
have often
values.overfit the
the network
network model
The overfitting
overfit model to to aa
training
problem leadstraining data
to loss data set. Overfitting
of transferability
set. Overfitting of thehinders
modelthe
hinders the generalizability
to generalizability
other applications. of the final
Nature
of the model
final solves [23]—in
model [23]—in this this
case,
this problem case,
by theof
a set
the model is a
processes.
model is a network
One is of
network thenodes
of visualinterconnected
nodes processing forwith
interconnected weight
spatial
with and values.
weight temporal
values. The
The overfitting
overfitting
problem
an objectleads
invariance of problem in a to
leads loss
scene
to of
of transferability
loss[24,25]. This leadsof
transferability oftothe model
a more
the to
to other
other applications.
modelgeneralized form of theNature
applications. Nature solves
solves
this problem
problem by
object than otherwise.
this by aa set
set of
of processes.
processes. One One is is the
the visual
visual processing
processing for for spatial
spatial and
and temporal
temporal
invariance
A secondinvariance of an
an object
and complementary
of objectmethod
in
in aa scene
is to[24,25].
scene neurallyThis
[24,25]. This
codeleads to aa more
the object
leads to generalized
by metrics
more that form
generalized form ofof the
the
object
are abstract and than otherwise.
generalizable.
object than otherwise. This reflects the example where a photograph of a cat is
A second
secondtoand
encoded so that it matches
A and
bothcomplementary
another photograph
complementary method
method and isato
is topencil
neurally code
sketch
neurally codeof the object
the cat. by
by metrics
This
object metrics that
that
are
generalizability abstract
areinabstract and
identifying generalizable.
and objects This
is now possible
generalizable. reflects
This reflectsin the the example
case
the where
of artificial
example where a photograph
systems [26].
a photograph of a
of a cat
cat is
is
encoded
Additionally,encoded so that
that itit matches
this generalizability
so matches
leadsto both
totoboth another
another photograph
corrections and
and aa pencil
for the variability
photograph in an sketch
pencil object’s
sketch of
of the
the cat.
cat. This
This
generalizability
form, such as generalizability
change in identifying
identifying
in its orientation,
in objects is
deobfuscation
objects isagainst
now possible
now possible in the
the case
the background,
in caseorofdetection
of artificial systems
artificial systems [26].
[26].
Additionally,
based on a partial this
view (Figure
Additionally, thisgeneralizability
generalizabilityleads
3). leadsto to
corrections
correctionsfor the
for variability in anin
the variability object’s form,
an object’s
such as change in its orientation, deobfuscation against the background, or detection based
form, such as change in its orientation, deobfuscation against the background, or detection
on a partial view (Figure 3).
based on a partial view (Figure 3).
(a) (b)
Figure 3. (a) Figure
The first3.panel
(a) The first panel
shows shows aofphotograph
a photograph of athat
a visual scene visual scene athat
contains contains
table a table
along with along
other with The second
objects.
other objects. The second panel in (a) (a)the same scene but transformed so that it appears
is (b)as a pen-
panel in (a) is the same scene but transformed so that it appears as a pencil sketch drawing; (b) The first panel is a visual
cil sketch drawing; (b)3.The first panel is a visual drawing of the of
digit nine (9), while the next panel
drawing of the digit nine (9),Figure
while the (a)next
Thepanel
first panel
is theshows a photograph
same digit a visual
but transformed scene
by rotationthat contains
of the image.a table along with
is the same digit but objects.
other transformed by rotation
The second panelofinthe
(a)image.
is the same scene but transformed so that it appears as a pen-
2.3.sketch
cil Perception
drawing; as a(b)
Mechanical Process
The first panel is a visual drawing of the digit nine (9), while the next panel
2.3. Perceptionisasthea Mechanical
same digit Process
but transformed by
There is an extensive amount of visual rotation of the image.
processing in the brain since it occupies one-
There is an
halfextensive amount
of the cerebral of visual
brain processing
tissue [15]. Further,inthe
thenumber
brain since it occupies
of neurons one- exponentially
increases
2.3.
fromPerception
half of the cerebral millions as
brain tissueinathe
Mechanical
[15]. Process
Further,
earlier thepathways
visual number oftoneurons
billions increases exponen-
in the higher layers of the cere-
tially from millions
brumTherein the
[15]. earlier
is an
This visualof
extensive
hierarchy pathways
amount
processes to billions
of visual
creates our in
processingthe in
visual higher layers
the brain
perception of the
since
of the it occupies
world, one-
but there
cerebrum [15].half
This
is noof hierarchy
the cerebral
evidence of processes
that abrain creates
tissue of
perception [15]. our visual
Further,
a scene perception
the number
is processed of the
by aofsingle world,
neurons but
increases
cognitive exponen-
path. Studies
there is no evidence
show from
tially thatan
that a perception
object in
millions ofearlier
is the a scene
identified is processed
independent
visual pathwaysofbythe
a single
to visualcognitive
billions scene
in theand path.
its attributes
higher layers of are
the
modified
Studies show cerebrum to
that an object disfavor
[15].isThis variability
identified ofin
independent
hierarchy its appearance,
processes of the so
ourthat
visual
creates sceneany
visualandtransformationof theofworld,
its attributes
perception the object
but
does not
there is nolead to misclassification
evidence error of
that a perception [27].
a scene is processed by a single cognitive path.
Studies show that an object is identified independent of the visual scene and its attributes
NeuroSci 2021, 2 146
Temporally, the advanced sensory processing occurs over a millisecond time scale [24],
so it not expected that perceptions occur in real time. Instead, cognitive processes cre-
ate an internal representation, a facsimile, of the sensory data and that construction
is the perception.
Studies have further divided perception and awareness into multiple types, but in
all cases these cognitive processes are a mechanical construction of the outside world [7].
These internal models that form our representation of the world are material processes,
including the perceived awareness of objects, a scene, and the occurrence of events. The
physical events that occur over time in a scene are also time delayed and the length of
that delay is subject to perception. Therefore, the representation of the time delay is not
calibrated with real time. Artificial neural networks show analogous processes with models
that are capable of predictive coding, such as completing a written sentence or the next
frame of a visual image [28].
Visual perception also includes other processes, such as the transformation of a
scene’s brightness and contrast levels [29]. This may help in identifying objects against a
background. Further, the cerebral processing in vision is more extensive than that of the
early steps along the visual pathway, so it is reasonable to assume that the perceptual image
is weakly correlated with the initial retinal input or the earlier-path internal representations
of the visual data.
Last, the limit on the number of evolutionary and developmental outcomes restricts
the possible hypotheses about cognition. For example, the evolution of the camera eye
expectedly occurred by modifications of small effect, along with the accompanying adapta-
tions in cognition. This predicts that the artificial systems can emulate the visual cognitive
processes by a finite number of steps as represented by an algorithm. This has held true
since deep learning methods are competitive with our cognitive ability to identify objects
and process natural language.
mammals. It has been shown in the cerebral and cerebellar regions of whales that cognitive
capability is not simply described by a change in neuron count or density [38].
To reiterate, the morphological changes of the brain and its regions are not necessarily
a simple correlation with cognitive function. The addition and subtraction of cognitive
capabilities, such as observed by contrasting species of marine and terrestrial mammals,
are complex phenomena that are molded by evolution and development. Therefore, it is
problematic to oversimplify the relationship between molecular or anatomical characters
and a cognitive function.
neuron is the proximate cause of cognition and does not possess a special immeasurable
quality. If a deep learning approach is an effort to emulate a pathway on how we acquire
knowledge, then a valid and realistic model should be established beforehand. The problem
is not whether the artificial systems can emulate the metaphysics of human thinking, a
false proposition, but instead that these systems are emulating a specific and measurable
cognitive process.
References
1. Vlastos, G. Parmenides’ theory of knowledge. In Transactions and Proceedings of the American Philological Association; The Johns
Hopkins University Press: Baltimore, MD, USA, 1946; pp. 66–77.
2. Chang, L.; Tsao, D.Y. The code for facial identity in the primate brain. Cell 2017, 169, 1013–1028. [CrossRef] [PubMed]
3. Hinton, G. How to represent part-whole hierarchies in a neural network. arXiv 2021, arXiv:2102.12627.
4. Jeans, J.H. Physics and Philosophy; Cambridge University Press: Cambridge, UK, 1942.
5. Smith, A. An Inquiry into the Nature and Causes of the Wealth of Nations, 1st ed.; A. Strahan and T. Cadell: London, UK, 1776.
6. Searle, J.R.; Willis, S. Intentionality: An Essay in the Philosophy of Mind; Cambridge University Press: Cambridge, UK, 1983.
7. Haggard, P. Sense of agency in the human brain. Nat. Rev. Neurosci. 2017, 18, 196–207. [CrossRef] [PubMed]
8. Huxley, T.H. Evidence as to Man’s Place in Nature; Williams and Norgate: London, UK, 1863.
9. Ramon, Y.; Cajal, S. Textura del Sistema Nervioso del Hombre y de los Vertebrados; Nicolas Moya: Madrid, Spain, 1904.
10. Kriegeskorte, N.; Kievit, R.A. Representational geometry: Integrating cognition, computation, and the brain. Trends Cognit. Sci.
2013, 17, 401–412. [CrossRef]
11. Hinton, G.E. Connectionist learning procedures. Artif. Intell. 1989, 40, 185–234. [CrossRef]
12. Schmidhuber, J. Deep learning in neural networks: An overview. Neural Netw. 2015, 61, 85–117. [CrossRef]
13. Yang, Z.; Purves, D. The statistical structure of natural light patterns determines perceived light intensity. Proc. Natl. Acad. Sci. USA
2004, 101, 8745–8750. [CrossRef]
14. Cichy, R.M.; Pantazis, D.; Oliva, A. Resolving human object recognition in space and time. Nat. Neurosci. 2014, 17,
455–462. [CrossRef]
15. Prasad, S.; Galetta, S.L. Anatomy and physiology of the afferent visual system. In Handbook of Clinical Neurology; Kennard, C.,
Leigh, R.J., Eds.; Elsevier: Amsterdam, The Netherlands, 2011; pp. 3–19.
16. Paley, W. Natural Theology: Or, Evidences of the Existence and Attributes of the Deity, 1st ed.; R. Faulder: London, UK, 1802.
17. Darwin, C. On the Origin of Species; John Murray: London, UK, 1859.
18. Tardieu, A.; Delaye, M. Eye lens proteins and transparency: From light transmission theory to solution X-ray structural analysis.
Annu. Rev. Biophys. Biophys. Chem. 1988, 17, 47–70. [CrossRef] [PubMed]
19. Borst, A.; Helmstaedter, M. Common circuit design in fly and mammalian motion vision. Nat. Neurosci. 2015, 18, 1067–1076.
[CrossRef] [PubMed]
20. DiCarlo, J.J.; Zoccolan, D.; Rust, N.C. How does the brain solve visual object recognition? Neuron 2012, 73, 415–434. [CrossRef]
21. Goyal, A.; Didolkar, A.; Ke, N.R.; Blundell, C.; Beaudoin, P.; Heess, N.; Mozer, M.; Bengio, Y. Neural Production Systems. arXiv
2021, arXiv:2103.01937.
22. Scholkopf, B.; Locatello, F.; Bauer, S.; Ke, N.R.; Kalchbrenner, N.; Goyal, A.; Bengio, Y. Toward Causal Representation Learning.
Proc. IEEE 2021, 1–22. [CrossRef]
23. Hawkins, D.M. The problem of overfitting. J. Chem. Inf. Comput. Sci. 2004, 44, 1–12. [CrossRef] [PubMed]
24. Yates, A.J. Delayed auditory feedback. Psychol. Bull. 1963, 60, 213–232. [CrossRef]
25. Wallis, G.; Rolls, E.T. Invariant face and object recognition in the visual system. Prog. Neurobiol. 1997, 51, 167–194. [CrossRef]
26. Goh, G.; Cammarata, N.; Voss, C.; Carter, S.; Petrov, M.; Schubert, L.; Radford, A.; Olah, C. Multimodal Neurons in Artificial
Neural Networks. Distill 2021. [CrossRef]
27. Garrigan, P.; Kellman, P.J. Perceptual learning depends on perceptual constancy. Proc. Natl. Acad. Sci. USA 2008, 105,
2248–2253. [CrossRef]
28. Liu, A.; Tucker, R.; Jampani, V.; Makadia, A.; Snavely, N.; Kanazawa, A. Infinite Nature: Perpetual View Generation of Natural
Scenes from a Single Image. arXiv 2020, arXiv:2012.09855.
29. Adelson, E.H. Lightness Perception and Lightness Illusions. In The New Cognitive Neurosciences, 2nd ed.; Gazzaniga, M., Ed.; The
MIT Press: Cambridge, MA, USA, 2000; pp. 339–351.
30. Chase, W.G.; Simon, H.A. Perception in chess. Cogn. Psychol. 1973, 4, 55–81. [CrossRef]
31. Silver, D.; Hubert, T.; Schrittwieser, J.; Antonoglou, I.; Lai, M.; Guez, A.; Lanctot, M.; Sifre, L.; Kumaran, D.; Graepel, T.; et al.
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science 2018, 362, 1140–1144.
[CrossRef] [PubMed]
NeuroSci 2021, 2 150
32. Kiani, R.; Esteky, H.; Mirpour, K.; Tanaka, K. Object category structure in response patterns of neuronal population in monkey
inferior temporal cortex. J. Neurophysiol. 2007, 97, 4296–4309. [CrossRef]
33. Pang, R.; Lansdell, B.J.; Fairhall, A.L. Dimensionality reduction in neuroscience. Curr. Biol. 2016, 26, R656–R660. [CrossRef]
34. Grant, P.R.; Grant, B.R. Adaptive radiation of Darwin’s finches: Recent data help explain how this famous group of Galapagos
birds evolved, although gaps in our understanding remain. Am. Sci. 2002, 90, 130–139. [CrossRef]
35. Bostrom, N. The superintelligent will: Motivation and instrumental rationality in advanced artificial agents. Minds Mach. 2012,
22, 71–85. [CrossRef]
36. Fitch, W.T. The Biology and Evolution of Speech: A Comparative Analysis. Annu. Rev. Linguist. 2018, 4, 255–279. [CrossRef]
37. Wang, X.; Wu, W.; Ling, Z.; Xu, Y.; Fang, Y.; Wang, X.; Binder, J.R.; Men, W.; Gao, J.H.; Bi, Y. Organizational principles of abstract
words in the human brain. Cereb. Cortex 2018, 28, 4305–4318. [CrossRef] [PubMed]
38. Muller, A.S.; Montgomery, S.H. Co-evolution of cerebral and cerebellar expansion in cetaceans. J. Evol. Biol. 2019, 32, 1418–1431.
[CrossRef] [PubMed]
39. Chaabouni, R.; Kharitonov, E.; Dupoux, E.; Baroni, M. Communicating artificial neural networks develop efficient color-naming
systems. Proc. Natl. Acad. Sci. USA 2021, 118, e2016569118. [CrossRef] [PubMed]