0% found this document useful (0 votes)

73 views11 pages

Hierarchical Temporal Memory

Hierarchical temporal memory (HTM) is a machine learning algorithm inspired by the neocortex that is capable of online, continuous learning. It learns spatial and temporal patterns from streaming data to make predictions. HTM forms a hierarchical network of regions that learn increasingly complex patterns at higher levels of abstraction. The algorithm has evolved over time but still utilizes principles of spatial and temporal pooling to learn sequences and make inferences about new input patterns.

Uploaded by

john949

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

73 views11 pages

Hierarchical Temporal Memory

Uploaded by

john949

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Hierarchical temporal memory

Hierarchical temporal memory (HTM) is a biologically constrained machine intelligence technology

developed by Numenta. Originally described in the 2004 book On Intelligence by Jeff Hawkins with
Sandra Blakeslee, HTM is primarily used today for anomaly detection in streaming data. The technology is
based on neuroscience and the physiology and interaction of pyramidal neurons in the neocortex of the
mammalian (in particular, human) brain.

At the core of HTM are learning algorithms that can store, learn, infer, and recall high-order sequences.
Unlike most other machine learning methods, HTM constantly learns (in an unsupervised process) time-
based patterns in unlabeled data. HTM is robust to noise, and has high capacity (it can learn multiple
patterns simultaneously). When applied to computers, HTM is well suited for prediction,[1] anomaly
detection,[2] classification, and ultimately sensorimotor applications.[3]

HTM has been tested and implemented in software through example applications from Numenta and a few
commercial applications from Numenta's partners.

Structure and algorithms

A typical HTM network is a tree-shaped hierarchy of levels (not to be confused with the "layers" of the
neocortex, as described below). These levels are composed of smaller elements called regions (or nodes). A
single level in the hierarchy possibly contains several regions. Higher hierarchy levels often have fewer
regions. Higher hierarchy levels can reuse patterns learned at the lower levels by combining them to
memorize more complex patterns.

Each HTM region has the same basic function. In learning and inference modes, sensory data (e.g. data
from the eyes) comes into bottom-level regions. In generation mode, the bottom level regions output the
generated pattern of a given category. The top level usually has a single region that stores the most general
and most permanent categories (concepts); these determine, or are determined by, smaller concepts at lower
levels—concepts that are more restricted in time and space. When set in inference mode, a region (in each
level) interprets information coming up from its "child" regions as probabilities of the categories it has in
memory.

Each HTM region learns by identifying and memorizing spatial patterns—combinations of input bits that
often occur at the same time. It then identifies temporal sequences of spatial patterns that are likely to occur
one after another.

As an evolving model
HTM is the algorithmic component to Jeff Hawkins’ Thousand Brains Theory of Intelligence. So new
findings on the neocortex are progressively incorporated into the HTM model, which changes over time in
response. The new findings do not necessarily invalidate the previous parts of the model, so ideas from one
generation are not necessarily excluded in its successive one. Because of the evolving nature of the theory,
there have been several generations of HTM algorithms,[4] which are briefly described below.

First generation: zeta 1

The first generation of HTM algorithms is sometimes referred to as zeta 1.

Training

During training, a node (or region) receives a temporal sequence of spatial patterns as its input. The
learning process consists of two stages:

1. The spatial pooling identifies (in the input) frequently observed patterns and memorise
them as "coincidences". Patterns that are significantly similar to each other are treated as
the same coincidence. A large number of possible input patterns are reduced to a
manageable number of known coincidences.
2. The temporal pooling partitions coincidences that are likely to follow each other in the
training sequence into temporal groups. Each group of patterns represents a "cause" of the
input pattern (or "name" in On Intelligence).

The concepts of spatial pooling and temporal pooling are still quite important in the current HTM
algorithms. Temporal pooling is not yet well understood, and its meaning has changed over time (as the
HTM algorithms evolved).

Inference

During inference, the node calculates the set of probabilities that a pattern belongs to each known
coincidence. Then it calculates the probabilities that the input represents each temporal group. The set of
probabilities assigned to the groups is called a node's "belief" about the input pattern. (In a simplified
implementation, node's belief consists of only one winning group). This belief is the result of the inference
that is passed to one or more "parent" nodes in the next higher level of the hierarchy.

"Unexpected" patterns to the node do not have a dominant probability of belonging to any one temporal
group but have nearly equal probabilities of belonging to several of the groups. If sequences of patterns are
similar to the training sequences, then the assigned probabilities to the groups will not change as often as
patterns are received. The output of the node will not change as much, and a resolution in time is lost.

In a more general scheme, the node's belief can be sent to the input of any node(s) at any level(s), but the
connections between the nodes are still fixed. The higher-level node combines this output with the output
from other child nodes thus forming its own input pattern.

Since resolution in space and time is lost in each node as described above, beliefs formed by higher-level
nodes represent an even larger range of space and time. This is meant to reflect the organisation of the
physical world as it is perceived by the human brain. Larger concepts (e.g. causes, actions, and objects) are
perceived to change more slowly and consist of smaller concepts that change more quickly. Jeff Hawkins
postulates that brains evolved this type of hierarchy to match, predict, and affect the organisation of the
external world.

More details about the functioning of Zeta 1 HTM can be found in Numenta's old documentation.[5]

Second generation: cortical learning algorithms

The second generation of HTM learning algorithms, often referred to as cortical learning algorithms
(CLA), was drastically different from zeta 1. It relies on a data structure called sparse distributed
representations (that is, a data structure whose elements are binary, 1 or 0, and whose number of 1 bits is
small compared to the number of 0 bits) to represent the brain activity and a more biologically-realistic
neuron model (often also referred to as cell, in the context of HTM).[6] There are two core components in
this HTM generation: a spatial pooling algorithm,[7] which outputs sparse distributed representations
(SDR), and a sequence memory algorithm,[8] which learns to represent and predict complex sequences.

In this new generation, the layers and minicolumns of the cerebral cortex are addressed and partially
modeled. Each HTM layer (not to be confused with an HTM level of an HTM hierarchy, as described
above) consists of a number of highly interconnected minicolumns. An HTM layer creates a sparse
distributed representation from its input, so that a fixed percentage of minicolumns are active at any one
time. A minicolumn is understood as a group of cells that have the same receptive field. Each minicolumn
has a number of cells that are able to remember several previous states. A cell can be in one of three states:
active, inactive and predictive state.

Spatial pooling

The receptive field of each minicolumn is a fixed number of inputs that are randomly selected from a much
larger number of node inputs. Based on the (specific) input pattern, some minicolumns will be more or less
associated with the active input values. Spatial pooling selects a relatively constant number of the most
active minicolumns and inactivates (inhibits) other minicolumns in the vicinity of the active ones. Similar
input patterns tend to activate a stable set of minicolumns. The amount of memory used by each layer can
be increased to learn more complex spatial patterns or decreased to learn simpler patterns.

Active, inactive and predictive cells

As mentioned above, a cell (or a neuron) of a minicolumn, at any point in time, can be in an active, inactive
or predictive state. Initially, cells are inactive.

How do cells become active?

If one or more cells in the active minicolumn are in the predictive state (see below), they will be the only
cells to become active in the current time step. If none of the cells in the active minicolumn are in the
predictive state (which happens during the initial time step or when the activation of this minicolumn was
not expected), all cells are made active.

How do cells become predictive?

When a cell becomes active, it gradually forms connections to nearby cells that tend to be active during
several previous time steps. Thus a cell learns to recognize a known sequence by checking whether the
connected cells are active. If a large number of connected cells are active, this cell switches to the predictive
state in anticipation of one of the few next inputs of the sequence.

The output of a minicolumn

The output of a layer includes minicolumns in both active and predictive states. Thus minicolumns are
active over long periods of time, which leads to greater temporal stability seen by the parent layer.

Inference and online learning

Cortical learning algorithms are able to learn continuously from each new input pattern, therefore no
separate inference mode is necessary. During inference, HTM tries to match the stream of inputs to
fragments of previously learned sequences. This allows each HTM layer to be constantly predicting the
likely continuation of the recognized sequences. The index of the predicted sequence is the output of the
layer. Since predictions tend to change less frequently than the input patterns, this leads to increasing
temporal stability of the output in higher hierarchy levels. Prediction also helps to fill in missing patterns in
the sequence and to interpret ambiguous data by biasing the system to infer what it predicted.

Applications of the CLAs

Cortical learning algorithms are currently being offered as commercial SaaS by Numenta (such as Grok[9]).

The validity of the CLAs

The following question was posed to Jeff Hawkins in September 2011 with regard to cortical learning
algorithms: "How do you know if the changes you are making to the model are good or not?" To which
Jeff's response was "There are two categories for the answer: one is to look at neuroscience, and the other
is methods for machine intelligence. In the neuroscience realm, there are many predictions that we can
make, and those can be tested. If our theories explain a vast array of neuroscience observations then it tells
us that we’re on the right track. In the machine learning world, they don’t care about that, only how well it
works on practical problems. In our case that remains to be seen. To the extent you can solve a problem that
no one was able to solve before, people will take notice."[10]

Third generation: sensorimotor inference

The third generation builds on the second generation and adds in a theory of sensorimotor inference in the
neocortex.[11][12] This theory proposes that cortical columns at every level of the hierarchy can learn
complete models of objects over time and that features are learned at specific locations on the objects. The
theory was expanded in 2018 and referred to as the Thousand Brains Theory.[13]

Comparison of neuron models

Comparing the artificial neural network (A), the biological neuron (B), and
the HTM neuron (C).
Comparison of Neuron Models
Artificial Neural Neocortical Pyramidal Neuron
Network (ANN) (Biological Neuron) HTM Model Neuron[8]

Few synapses Thousands of synapses on the Inspired by the pyramidal cells

No dendrites dendrites in neocortex layers 2/3 and 5
Sum input × Active dendrites: cell recognizes Thousands of synapses
weights hundreds of unique patterns Active dendrites: cell recognizes
Learns by Co-activation of a set of synapses on a hundreds of unique patterns
modifying dendritic segment causes an NMDA Models dendrites and NMDA
weights of spike and depolarization at the soma[8] spikes with each array of
synapses Sources of input to the cell: coincident detectors having a
set of synapses
1. Feedforward inputs which form Learns by modeling the growth
synapses proximal to the soma and of new synapses
directly lead to action potentials
2. NMDA spikes generated in the more
distal basal
3. Apical dendrites that depolarize the
soma (usually not sufficient enough
to generate a somatic action
potential)
Learns by growing new synapses

Comparing HTM and neocortex

HTM attempts to implement the functionality that is characteristic of a hierarchically related group of
cortical regions in the neocortex. A region of the neocortex corresponds to one or more levels in the HTM
hierarchy, while the hippocampus is remotely similar to the highest HTM level. A single HTM node may
represent a group of cortical columns within a certain region.

Although it is primarily a functional model, several attempts have been made to relate the algorithms of the
HTM with the structure of neuronal connections in the layers of neocortex.[14][15] The neocortex is
organized in vertical columns of 6 horizontal layers. The 6 layers of cells in the neocortex should not be
confused with levels in an HTM hierarchy.

HTM nodes attempt to model a portion of cortical columns (80 to 100 neurons) with approximately 20
HTM "cells" per column. HTMs model only layers 2 and 3 to detect spatial and temporal features of the
input with 1 cell per column in layer 2 for spatial "pooling", and 1 to 2 dozen per column in layer 3 for
temporal pooling. A key to HTMs and the cortex's is their ability to deal with noise and variation in the
input which is a result of using a "sparse distributive representation" where only about 2% of the columns
are active at any given time.

An HTM attempts to model a portion of the cortex's learning and plasticity as described above. Differences
between HTMs and neurons include:[16]

strictly binary signals and synapses

no direct inhibition of synapses or dendrites (but simulated indirectly)
currently only models layers 2/3 and 4 (no 5 or 6)
no "motor" control (layer 5)
no feed-back between regions (layer 6 of high to layer 1 of low)

Sparse distributed representations

Integrating memory component with neural networks has a long history dating back to early research in
distributed representations[17][18] and self-organizing maps. For example, in sparse distributed memory
(SDM), the patterns encoded by neural networks are used as memory addresses for content-addressable
memory, with "neurons" essentially serving as address encoders and decoders.[19][20]

Computers store information in dense representations such as a 32-bit word, where all combinations of 1s
and 0s are possible. By contrast, brains use sparse distributed representations (SDRs).[21] The human
neocortex has roughly 16 billion neurons, but at any given time only a small percent are active. The
activities of neurons are like bits in a computer, and so the representation is sparse. Similar to SDM
developed by NASA in the 80s[19] and vector space models used in Latent semantic analysis, HTM uses
sparse distributed representations.[22]

The SDRs used in HTM are binary representations of data consisting of many bits with a small percentage
of the bits active (1s); a typical implementation might have 2048 columns and 64K artificial neurons where
as few as 40 might be active at once. Although it may seem less efficient for the majority of bits to go
"unused" in any given representation, SDRs have two major advantages over traditional dense
representations. First, SDRs are tolerant of corruption and ambiguity due to the meaning of the
representation being shared (distributed) across a small percentage (sparse) of active bits. In a dense
representation, flipping a single bit completely changes the meaning, while in an SDR a single bit may not
affect the overall meaning much. This leads to the second advantage of SDRs: because the meaning of a
representation is distributed across all active bits, the similarity between two representations can be used as
a measure of semantic similarity in the objects they represent. That is, if two vectors in an SDR have 1s in
the same position, then they are semantically similar in that attribute. The bits in SDRs have semantic
meaning, and that meaning is distributed across the bits.[22]

The semantic folding theory[23] builds on these SDR properties to propose a new model for language
semantics, where words are encoded into word-SDRs and the similarity between terms, sentences, and
texts can be calculated with simple distance measures.

Similarity to other models

Bayesian networks

Likened to a Bayesian network, an HTM comprises a collection of nodes that are arranged in a tree-shaped
hierarchy. Each node in the hierarchy discovers an array of causes in the input patterns and temporal
sequences it receives. A Bayesian belief revision algorithm is used to propagate feed-forward and feedback
beliefs from child to parent nodes and vice versa. However, the analogy to Bayesian networks is limited,
because HTMs can be self-trained (such that each node has an unambiguous family relationship), cope with
time-sensitive data, and grant mechanisms for covert attention.
A theory of hierarchical cortical computation based on Bayesian belief propagation was proposed earlier by
Tai Sing Lee and David Mumford.[24] While HTM is mostly consistent with these ideas, it adds details
about handling invariant representations in the visual cortex.[25]

Neural networks

Like any system that models details of the neocortex, HTM can be viewed as an artificial neural network.
The tree-shaped hierarchy commonly used in HTMs resembles the usual topology of traditional neural
networks. HTMs attempt to model cortical columns (80 to 100 neurons) and their interactions with fewer
HTM "neurons". The goal of current HTMs is to capture as much of the functions of neurons and the
network (as they are currently understood) within the capability of typical computers and in areas that can
be made readily useful such as image processing. For example, feedback from higher levels and motor
control is not attempted because it is not yet understood how to incorporate them and binary instead of
variable synapses are used because they were determined to be sufficient in the current HTM capabilities.

LAMINART and similar neural networks researched by Stephen Grossberg attempt to model both the
infrastructure of the cortex and the behavior of neurons in a temporal framework to explain
neurophysiological and psychophysical data. However, these networks are, at present, too complex for
realistic application.[26]

HTM is also related to work by Tomaso Poggio, including an approach for modeling the ventral stream of
the visual cortex known as HMAX. Similarities of HTM to various AI ideas are described in the December
2005 issue of the Artificial Intelligence journal.[27]

Neocognitron

Neocognitron, a hierarchical multilayered neural network proposed by Professor Kunihiko Fukushima in

1987, is one of the first deep learning neural network models.[28]

See also
Artificial consciousness
Artificial general intelligence
Belief revision
Cognitive architecture
Convolutional neural network
List of artificial intelligence projects
Memory-prediction framework
Multiple trace theory
Neural history compressor
Neural Turing machine

Related models
Hierarchical hidden Markov model

References
1. Cui, Yuwei; Ahmad, Subutai; Hawkins, Jeff (2016). "Continuous Online Sequence Learning
with an Unsupervised Neural Network Model". Neural Computation. 28 (11): 2474–2504.
arXiv:1512.05463 (https://arxiv.org/abs/1512.05463). doi:10.1162/NECO_a_00893 (https://d
oi.org/10.1162%2FNECO_a_00893). PMID 27626963 (https://pubmed.ncbi.nlm.nih.gov/276
26963). S2CID 3937908 (https://api.semanticscholar.org/CorpusID:3937908).
2. Ahmad, Subutai; Lavin, Alexander; Purdy, Scott; Agha, Zuha (2017). "Unsupervised real-
time anomaly detection for streaming data" (https://doi.org/10.1016%2Fj.neucom.2017.04.07
0). Neurocomputing. 262: 134–147. doi:10.1016/j.neucom.2017.04.070 (https://doi.org/10.10
16%2Fj.neucom.2017.04.070).
3. "Preliminary details about new theory work on sensory-motor inference" (https://discourse.nu
menta.org/t/preliminary-details-about-new-theory-work-on-sensory-motor-inference/697).
HTM Forum. 2016-06-03.
4. HTM Retrospective (https://www.youtube.com/watch?v=6_wattbWgiU) on YouTube
5. "Numenta old documentation" (https://web.archive.org/web/20090527174304/http://nument
a.com/for-developers/education/general-overview-htm.php). numenta.com. Archived from
the original (http://numenta.com/for-developers/education/general-overview-htm.php) on
2009-05-27.
6. Jeff Hawkins lecture describing cortical learning algorithms (https://www.youtube.com/watc
h?v=48r-IeYOvG4) on YouTube
7. Cui, Yuwei; Ahmad, Subutai; Hawkins, Jeff (2017). "The HTM Spatial Pooler—A Neocortical
Algorithm for Online Sparse Distributed Coding" (https://www.ncbi.nlm.nih.gov/pmc/articles/
PMC5712570). Frontiers in Computational Neuroscience. 11: 111.
doi:10.3389/fncom.2017.00111 (https://doi.org/10.3389%2Ffncom.2017.00111).
PMC 5712570 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5712570). PMID 29238299
(https://pubmed.ncbi.nlm.nih.gov/29238299).
8. Hawkins, Jeff; Ahmad, Subutai (30 March 2016). "Why Neurons Have Thousands of
Synapses, a Theory of Sequence Memory in Neocortex" (https://www.ncbi.nlm.nih.gov/pmc/
articles/PMC4811948). Front. Neural Circuits. 10: 23. doi:10.3389/fncir.2016.00023 (https://d
oi.org/10.3389%2Ffncir.2016.00023). PMC 4811948 (https://www.ncbi.nlm.nih.gov/pmc/articl
es/PMC4811948). PMID 27065813 (https://pubmed.ncbi.nlm.nih.gov/27065813).
9. "Grok Product Page" (http://grokstream.com/product/). grokstream.com.
10. Laserson, Jonathan (September 2011). "From Neural Networks to Deep Learning: Zeroing
in on the Human Brain" (https://ai.stanford.edu/~joni/papers/LasersonXRDS2011.pdf) (PDF).
XRDS. 18 (1). doi:10.1145/2000775.2000787 (https://doi.org/10.1145%2F2000775.200078
7). S2CID 21496694 (https://api.semanticscholar.org/CorpusID:21496694).
11. Hawkins, Jeff; Ahmad, Subutai; Cui, Yuwei (2017). "A Theory of How Columns in the
Neocortex Enable Learning the Structure of the World" (https://www.ncbi.nlm.nih.gov/pmc/art
icles/PMC5661005). Frontiers in Neural Circuits. 11: 81. doi:10.3389/fncir.2017.00081 (http
s://doi.org/10.3389%2Ffncir.2017.00081). PMC 5661005 (https://www.ncbi.nlm.nih.gov/pmc/
articles/PMC5661005). PMID 29118696 (https://pubmed.ncbi.nlm.nih.gov/29118696).
12. Have We Missed Half of What the Neocortex Does? Allocentric Location as the Basis of
Perception (https://www.youtube.com/watch?v=yVT7dO_Tf4E) on YouTube
13. "Numenta publishes breakthrough theory for intelligence and cortical computation" (https://w
ww.eurekalert.org/pub_releases/2019-01/kta-np011119.php). eurekalert.org. 2019-01-14.
14. Hawkins, Jeff; Blakeslee, Sandra. On Intelligence.
15. George, Dileep; Hawkins, Jeff (2009). "Towards a Mathematical Theory of Cortical Micro-
circuits" (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2749218). PLOS Computational
Biology. 5 (10): e1000532. Bibcode:2009PLSCB...5E0532G (https://ui.adsabs.harvard.edu/a
bs/2009PLSCB...5E0532G). doi:10.1371/journal.pcbi.1000532 (https://doi.org/10.1371%2Fj
ournal.pcbi.1000532). PMC 2749218 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC27492
18). PMID 19816557 (https://pubmed.ncbi.nlm.nih.gov/19816557).
16. "HTM Cortical Learning Algorithms" (https://numenta.org/resources/HTM_CorticalLearningAl
gorithms.pdf) (PDF). numenta.org.
17. Hinton, Geoffrey E. (1984). "Distributed representations" (https://web.archive.org/web/20171
114185128/http://repository.cmu.edu/cgi/viewcontent.cgi?article=2841&context=compsci).
Archived from the original (http://repository.cmu.edu/cgi/viewcontent.cgi?article=2841&conte
xt=compsci) on 2017-11-14.
18. Plate, Tony (1991). "Holographic Reduced Representations: Convolution Algebra for
Compositional Distributed Representations" (https://www.ijcai.org/Proceedings/91-1/Papers/
006.pdf) (PDF). IJCAI.
19. Kanerva, Pentti (1988). Sparse distributed memory (https://mitpress.mit.edu/books/sparse-di
stributed-memory). MIT press. ISBN 9780262111324.
20. Snaider, Javier; Franklin, Stan (2012). Integer sparse distributed memory (https://web.archiv
e.org/web/20171229112344/https://pdfs.semanticscholar.org/9810/4c7fabf6232715ef2bea1f
5b3a3425e1c3af.pdf) (PDF). Twenty-fifth international flairs conference. S2CID 17547390 (h
ttps://api.semanticscholar.org/CorpusID:17547390). Archived from the original (https://pdfs.s
emanticscholar.org/9810/4c7fabf6232715ef2bea1f5b3a3425e1c3af.pdf) (PDF) on 2017-12-
29.
21. Olshausen, Bruno A.; Field, David J. (1997). "Sparse coding with an overcomplete basis set:
A strategy employed by V1?" (https://doi.org/10.1016%2FS0042-6989%2897%2900169-7).
Vision Research. 37 (23): 3311–3325. doi:10.1016/S0042-6989(97)00169-7 (https://doi.org/
10.1016%2FS0042-6989%2897%2900169-7). PMID 9425546 (https://pubmed.ncbi.nlm.nih.
gov/9425546). S2CID 14208692 (https://api.semanticscholar.org/CorpusID:14208692).
22. Ahmad, Subutai; Hawkins, Jeff (2016). "Numenta NUPIC – sparse distributed
representations". arXiv:1601.00720 (https://arxiv.org/abs/1601.00720) [q-bio.NC (https://arxi
v.org/archive/q-bio.NC)].
23. De Sousa Webber, Francisco (2015). "Semantic Folding Theory And its Application in
Semantic Fingerprinting". arXiv:1511.08855 (https://arxiv.org/abs/1511.08855) [cs.AI (https://
arxiv.org/archive/cs.AI)].
24. Lee, Tai Sing; Mumford, David (2002). "Hierarchical Bayesian Inference in the Visual
Cortex". Journal of the Optical Society of America A. 20 (7): 1434–48.
CiteSeerX 10.1.1.12.2565 (https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.12.25
65). doi:10.1364/josaa.20.001434 (https://doi.org/10.1364%2Fjosaa.20.001434).
PMID 12868647 (https://pubmed.ncbi.nlm.nih.gov/12868647).
25. George, Dileep (2010-07-24). "Hierarchical Bayesian inference in the visual cortex" (https://
web.archive.org/web/20190801074057/http://dileepgeorge.com/blog/?p=5).
dileepgeorge.com. Archived from the original (http://dileepgeorge.com/blog/?p=5) on 2019-
08-01.
26. Grossberg, Stephen (2007). Cisek, Paul; Drew, Trevor; Kalaska, John (eds.). Towards a
unified theory of neocortex: Laminar cortical circuits for vision and cognition. Technical
Report CAS/CNS-TR-2006-008. For Computational Neuroscience: From Neurons to Theory
and Back Again (https://web.archive.org/web/20170829064651/http://cns.bu.edu/Profiles/Gr
ossberg/GroCisek2007.pdf) (PDF) (Report). Amsterdam: Elsevier. pp. 79–104. Archived from
the original (http://cns.bu.edu/Profiles/Grossberg/GroCisek2007.pdf) (PDF) on 2017-08-29.
27. "ScienceDirect – Artificial Intelligence" (https://www.sciencedirect.com/journal/artificial-intelli
gence/vol/169/issue/2). 169 (2). December 2005: 103–212.
28. Fukushima, Kunihiko (2007). "Neocognitron" (https://doi.org/10.4249%2Fscholarpedia.171
7). Scholarpedia. 2 (1): 1717. Bibcode:2007SchpJ...2.1717F (https://ui.adsabs.harvard.edu/a
bs/2007SchpJ...2.1717F). doi:10.4249/scholarpedia.1717 (https://doi.org/10.4249%2Fschol
arpedia.1717).

Further reading
Ahmad, Subutai; Hawkins, Jeff (25 March 2015). "Properties of Sparse Distributed
Representations and their Application to Hierarchical Temporal Memory". arXiv:1503.07469
(https://arxiv.org/abs/1503.07469) [q-bio.NC (https://arxiv.org/archive/q-bio.NC)].
Hawkins, Jeff (April 2007). "Learn like a Human" (http://spectrum.ieee.org/computing/hardwa
re/learn-like-a-human). IEEE Spectrum.
Maltoni, Davide (April 13, 2011). "Pattern Recognition by Hierarchical Temporal Memory" (ht
tp://bias.csr.unibo.it/maltoni/HTM_TR_v1.0.pdf) (PDF). DEIS Technical Report. Italy:
University of Bologna.
Ratliff, Evan (March 2007). "The Thinking Machine" (https://www.wired.com/wired/archive/1
5.03/hawkins.html). Wired.

External links
HTM (https://www.numenta.com/resources/htm/) at Numenta
HTM Basics with Rahul (https://www.youtube.com/watch?v=z6r3ekreRzY) (Numenta), talk
about the cortical learning algorithm (CLA) used by the HTM model on YouTube

Retrieved from "https://en.wikipedia.org/w/index.php?title=Hierarchical_temporal_memory&oldid=1161619972"

Learn-C-Sharp-Course-Book
No ratings yet
Learn-C-Sharp-Course-Book
50 pages
Huawei Manager 2023 For Band Locking
No ratings yet
Huawei Manager 2023 For Band Locking
62 pages
arXiv 2303.07925v10
No ratings yet
arXiv 2303.07925v10
56 pages
MODULE6 1 Hierarchical Reinforcement Learning
No ratings yet
MODULE6 1 Hierarchical Reinforcement Learning
26 pages
AI Unit 2 Algorithms
No ratings yet
AI Unit 2 Algorithms
50 pages
Hierarchical Temporal Memory Cortical Learning Algorithm 0.2.1 en
No ratings yet
Hierarchical Temporal Memory Cortical Learning Algorithm 0.2.1 en
68 pages
5_P & S for String & File IO
No ratings yet
5_P & S for String & File IO
29 pages
Curse of Dimensionality
No ratings yet
Curse of Dimensionality
9 pages
List of Datasets For Machine-Learning Research
100% (1)
List of Datasets For Machine-Learning Research
61 pages
TCL Scripting Session
No ratings yet
TCL Scripting Session
31 pages
Document-Oriented Database
No ratings yet
Document-Oriented Database
10 pages
DaylidyonokFrolenkovaPanov-2018-ExtendedHierarchicalTemporalMemoryforMotionAnomalyDetection2
No ratings yet
DaylidyonokFrolenkovaPanov-2018-ExtendedHierarchicalTemporalMemoryforMotionAnomalyDetection2
14 pages
10.1007@978 3 030 38445 6
No ratings yet
10.1007@978 3 030 38445 6
243 pages
Python Unit 4 Part 3
No ratings yet
Python Unit 4 Part 3
18 pages
ETAM
No ratings yet
ETAM
25 pages
C++ Notes
No ratings yet
C++ Notes
99 pages
Real-Time Task Scheduling2
No ratings yet
Real-Time Task Scheduling2
23 pages
a-novel-project-oriented-system-on-chip-soc-design-course-for-computer-and-electrical-engineers
No ratings yet
a-novel-project-oriented-system-on-chip-soc-design-course-for-computer-and-electrical-engineers
9 pages
HTM Cortical LearningAlgorithms
No ratings yet
HTM Cortical LearningAlgorithms
68 pages
Ol-A Model and Simulation of Early-Stage
No ratings yet
Ol-A Model and Simulation of Early-Stage
11 pages
Extract, Transform, Load
No ratings yet
Extract, Transform, Load
9 pages
Computational Intelligence
No ratings yet
Computational Intelligence
6 pages
CHREST
No ratings yet
CHREST
6 pages
Data Science
No ratings yet
Data Science
7 pages
Data Wrangling
0% (1)
Data Wrangling
5 pages
Operators and Expressions in Java
No ratings yet
Operators and Expressions in Java
2 pages
Building Intelligent Machines
No ratings yet
Building Intelligent Machines
26 pages
Towards A Mathematical Theory of Cortical Micro-Circuits
No ratings yet
Towards A Mathematical Theory of Cortical Micro-Circuits
26 pages
Ijdkp 030207
No ratings yet
Ijdkp 030207
12 pages
FORR
No ratings yet
FORR
4 pages
Question 1 (25 Points) : A. What Is The Output of Each of The Following Fragments? Circle The Correct Answer
No ratings yet
Question 1 (25 Points) : A. What Is The Output of Each of The Following Fragments? Circle The Correct Answer
6 pages
IA Test
No ratings yet
IA Test
4 pages
5088
No ratings yet
5088
7 pages
Memory-Prediction Framework
No ratings yet
Memory-Prediction Framework
6 pages
Very Large Database
No ratings yet
Very Large Database
6 pages
10.1108@cw-04-2019-0039
No ratings yet
10.1108@cw-04-2019-0039
11 pages
Lab Assignment 2: There Are Total 5 Test Cases in Weak Normal Equivalence Class Testing
No ratings yet
Lab Assignment 2: There Are Total 5 Test Cases in Weak Normal Equivalence Class Testing
2 pages
Parallel Coordinates
No ratings yet
Parallel Coordinates
5 pages
Sheet 6 Model Answer
No ratings yet
Sheet 6 Model Answer
6 pages
Pic Programming Tutorial
100% (1)
Pic Programming Tutorial
18 pages
P. T. 1 Schedule for Class 6 to 12
No ratings yet
P. T. 1 Schedule for Class 6 to 12
1 page
Forward-Backward Activation Algorithm For Hierarchical Hidden Markov Models
No ratings yet
Forward-Backward Activation Algorithm For Hierarchical Hidden Markov Models
9 pages
Neural Cognitive Architectures For Never-Ending Learning: E.a.platanios@cs - Cmu.edu
No ratings yet
Neural Cognitive Architectures For Never-Ending Learning: E.a.platanios@cs - Cmu.edu
21 pages
Connectionism
No ratings yet
Connectionism
9 pages
Causal Loop Diagram
No ratings yet
Causal Loop Diagram
4 pages
Digital Signal Processing
No ratings yet
Digital Signal Processing
8 pages
Computational Phylogenetics
No ratings yet
Computational Phylogenetics
18 pages
A Comparison of Hmms and Dynamic Bayesian Networks For Recognizing O Ce Activities
No ratings yet
A Comparison of Hmms and Dynamic Bayesian Networks For Recognizing O Ce Activities
10 pages
Answer of SHEET#1
No ratings yet
Answer of SHEET#1
4 pages
Hierarchical Hidden Markov Model
No ratings yet
Hierarchical Hidden Markov Model
3 pages
XLDB
No ratings yet
XLDB
3 pages
Bayesian Programming
No ratings yet
Bayesian Programming
16 pages
Data Integration
No ratings yet
Data Integration
8 pages
Kundan Kumar (21MCA2029 Review Paper)
No ratings yet
Kundan Kumar (21MCA2029 Review Paper)
13 pages
Multidimensional Scaling
No ratings yet
Multidimensional Scaling
6 pages
OBJECT ORIENTED PROG in C++ Saurav Sahay
No ratings yet
OBJECT ORIENTED PROG in C++ Saurav Sahay
485 pages
Verilog Implementation of A Node of Hierarchical Temporal Memory
No ratings yet
Verilog Implementation of A Node of Hierarchical Temporal Memory
6 pages
Gobet Lane 2007 Order
No ratings yet
Gobet Lane 2007 Order
12 pages
Inheritance
No ratings yet
Inheritance
20 pages
Solutions Manual
No ratings yet
Solutions Manual
257 pages
Data Philanthropy
No ratings yet
Data Philanthropy
5 pages
Predictive Coding
No ratings yet
Predictive Coding
8 pages
Wavelet
No ratings yet
Wavelet
19 pages
Predictive Modelling
67% (3)
Predictive Modelling
64 pages
Question 1: Fill in The Blanks
No ratings yet
Question 1: Fill in The Blanks
4 pages
Data Blending
No ratings yet
Data Blending
3 pages
Adaptive Resonance Theory
No ratings yet
Adaptive Resonance Theory
5 pages
Articulation Point - Algorithm
No ratings yet
Articulation Point - Algorithm
3 pages
George Thesis
No ratings yet
George Thesis
191 pages
Ierarchical Emporal Emory: HTM Cortical Learning Algorithms
No ratings yet
Ierarchical Emporal Emory: HTM Cortical Learning Algorithms
57 pages
Physical Design Interview Complete 56
No ratings yet
Physical Design Interview Complete 56
1 page
Principal Component Analysis
No ratings yet
Principal Component Analysis
33 pages
Temporal Data Clustering Using Hidden Markov Model
No ratings yet
Temporal Data Clustering Using Hidden Markov Model
6 pages
Unit-3: Stacks and Queues: 2M Questions
No ratings yet
Unit-3: Stacks and Queues: 2M Questions
33 pages
Structured Data Analysis (Statistics)
No ratings yet
Structured Data Analysis (Statistics)
1 page
SPPU Pattern2019 Fds Unit 6
No ratings yet
SPPU Pattern2019 Fds Unit 6
32 pages
DAA Unit 3
No ratings yet
DAA Unit 3
23 pages
Data Lineage
No ratings yet
Data Lineage
14 pages
STTP Schedule Atal
No ratings yet
STTP Schedule Atal
1 page
Machine Learning Data Mining: Problems
No ratings yet
Machine Learning Data Mining: Problems
4 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
3 pages
Artificial Neural Networks
No ratings yet
Artificial Neural Networks
43 pages
Data Engineering
No ratings yet
Data Engineering
6 pages
Temporal Logic: Understanding Reactive Systems
From Everand
Temporal Logic: Understanding Reactive Systems
Pasquale De Marco
No ratings yet
Troanary Photonic Storage Blueprint - How Light Based Logic can Redefine Computation and Data Storage: Volume 10 Troanary Photonic Storage Blueprint, #1
From Everand
Troanary Photonic Storage Blueprint - How Light Based Logic can Redefine Computation and Data Storage: Volume 10 Troanary Photonic Storage Blueprint, #1
Ylia Callan
No ratings yet
Mind: An Emergent Property
From Everand
Mind: An Emergent Property
Simeon Locke
No ratings yet
Pointers
From Everand
Pointers
Brad G. Berman
No ratings yet
Functionalism: Fundamentals and Applications
From Everand
Functionalism: Fundamentals and Applications
Fouad Sabry
No ratings yet
The Sentient Robot: The Last Two Hurdles in the Race to Build Artificial Superintelligence
From Everand
The Sentient Robot: The Last Two Hurdles in the Race to Build Artificial Superintelligence
Rupert Robson
No ratings yet
Neural Modeling Fields: Fundamentals and Applications
From Everand
Neural Modeling Fields: Fundamentals and Applications
Fouad Sabry
No ratings yet
Ant Colony Optimization Algorithms: Fundamentals and Applications
From Everand
Ant Colony Optimization Algorithms: Fundamentals and Applications
Fouad Sabry
No ratings yet
Mind Uploading: Fundamentals and Applications
From Everand
Mind Uploading: Fundamentals and Applications
Fouad Sabry
No ratings yet
Kinematics of the Brain Activities
From Everand
Kinematics of the Brain Activities
Mostafa M. Dini
No ratings yet
Immortality Is Possible
From Everand
Immortality Is Possible
Carlos Herrero Carcedo
No ratings yet
Multilayer Perceptron: Fundamentals and Applications for Decoding Neural Networks
From Everand
Multilayer Perceptron: Fundamentals and Applications for Decoding Neural Networks
Fouad Sabry
No ratings yet
Affective Computing: Fundamentals and Applications
From Everand
Affective Computing: Fundamentals and Applications
Fouad Sabry
No ratings yet
Evolutionary Computation: Fundamentals and Applications
From Everand
Evolutionary Computation: Fundamentals and Applications
Fouad Sabry
No ratings yet
We Live In A Simulation
From Everand
We Live In A Simulation
Carlos Herrero Carcedo
No ratings yet
Convolutional Neural Networks in Python: Beginner's Guide to Convolutional Neural Networks in Python
From Everand
Convolutional Neural Networks in Python: Beginner's Guide to Convolutional Neural Networks in Python
Frank Millstein
No ratings yet
Brain Functioning and Regeneration: Kinematics of the Brain Activities Volume Iv
From Everand
Brain Functioning and Regeneration: Kinematics of the Brain Activities Volume Iv
Mostafa M. Dini
No ratings yet
Algorithmic Information Theory: Fundamentals and Applications
From Everand
Algorithmic Information Theory: Fundamentals and Applications
Fouad Sabry
No ratings yet
Long Short Term Memory: Fundamentals and Applications for Sequence Prediction
From Everand
Long Short Term Memory: Fundamentals and Applications for Sequence Prediction
Fouad Sabry
No ratings yet
TensorFlow in 1 Day: Make your own Neural Network
From Everand
TensorFlow in 1 Day: Make your own Neural Network
Krishna Rungta
3.5/5 (10)
Neuroevolution: Fundamentals and Applications for Surpassing Human Intelligence with Neuroevolution
From Everand
Neuroevolution: Fundamentals and Applications for Surpassing Human Intelligence with Neuroevolution
Fouad Sabry
No ratings yet
Attractor Networks: Fundamentals and Applications in Computational Neuroscience
From Everand
Attractor Networks: Fundamentals and Applications in Computational Neuroscience
Fouad Sabry
No ratings yet
Convolutional Neural Networks: Fundamentals and Applications for Analyzing Visual Imagery
From Everand
Convolutional Neural Networks: Fundamentals and Applications for Analyzing Visual Imagery
Fouad Sabry
No ratings yet
Artificial Neural Networks: Fundamentals and Applications for Decoding the Mysteries of Neural Computation
From Everand
Artificial Neural Networks: Fundamentals and Applications for Decoding the Mysteries of Neural Computation
Fouad Sabry
No ratings yet
The evolution of Soft Computing - From neural networks to cellular automata
From Everand
The evolution of Soft Computing - From neural networks to cellular automata
Marco Casella
4/5 (1)
Perceptrons: Fundamentals and Applications for The Neural Building Block
From Everand
Perceptrons: Fundamentals and Applications for The Neural Building Block
Fouad Sabry
No ratings yet
Feedforward Neural Networks: Fundamentals and Applications for The Architecture of Thinking Machines and Neural Webs
From Everand
Feedforward Neural Networks: Fundamentals and Applications for The Architecture of Thinking Machines and Neural Webs
Fouad Sabry
No ratings yet
Bio Inspired Computing: Fundamentals and Applications for Biological Inspiration in the Digital World
From Everand
Bio Inspired Computing: Fundamentals and Applications for Biological Inspiration in the Digital World
Fouad Sabry
No ratings yet
Competitive Learning: Fundamentals and Applications for Reinforcement Learning through Competition
From Everand
Competitive Learning: Fundamentals and Applications for Reinforcement Learning through Competition
Fouad Sabry
No ratings yet
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
From Everand
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
Fouad Sabry
No ratings yet

Hierarchical Temporal Memory

Uploaded by

Hierarchical Temporal Memory

Uploaded by

Hierarchical temporal memory

Hierarchical temporal memory (HTM) is a biologically constrained machine intelligence technology

Structure and algorithms

First generation: zeta 1

Second generation: cortical learning algorithms

Active, inactive and predictive cells

How do cells become active?

How do cells become predictive?

The output of a minicolumn

Inference and online learning

Applications of the CLAs

The validity of the CLAs

Third generation: sensorimotor inference

Comparison of neuron models

Few synapses Thousands of synapses on the Inspired by the pyramidal cells

Comparing HTM and neocortex

strictly binary signals and synapses

Sparse distributed representations

Similarity to other models

Neocognitron, a hierarchical multilayered neural network proposed by Professor Kunihiko Fukushima in

Retrieved from "https://en.wikipedia.org/w/index.php?title=Hierarchical_temporal_memory&oldid=1161619972"

You might also like