0% found this document useful (0 votes)

27 views64 pages

Path Signatures On Lie Groups

This document discusses the extension of path signatures, a tool for time series analysis, to Lie group valued time series, demonstrating their universality and characteristic properties. The authors present applications in human action recognition and hypothesis testing for random walks on Lie groups, showing that their methods outperform traditional approaches. Additionally, they provide algorithms and a Julia implementation for computing path signatures on Lie groups, contributing to the field of data analysis in geometric contexts.

Uploaded by

Minhajul Islam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views64 pages

Path Signatures On Lie Groups

Uploaded by

Minhajul Islam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 64

Path Signatures on Lie Groups

Darrick Lee [email protected]

Department of Mathematics
University of Pennsylvania
Robert Ghrist [email protected]
Departments of Mathematics and Electrical & Systems Engineering
University of Pennsylvania

Editor:

Abstract
Path signatures are powerful nonparametric tools for time series analysis, shown to form
a universal and characteristic feature map for Euclidean valued time series data. We lift
the theory of path signatures to the setting of Lie group valued time series, adapting these
tools for time series with underlying geometric constraints. We prove that this generalized
path signature is universal and characteristic. To demonstrate universality, we analyze
the human action recognition problem in computer vision, using SO(3) representations for
the time series, providing comparable performance to other shallow learning approaches,
while offering an easily interpretable feature set. We also provide a two-sample hypothesis
test for Lie group-valued random walks to illustrate its characteristic property. Finally we
provide algorithms and a Julia implementation of these methods.

1. Introduction
Time series data is ubiquitous in modern data science, and may take values in a variety of
forms. Perhaps the most common is a collection of simultaneous multivariate real-valued
time series {γ i }N i
i=1 , where γ : [0, 1] → R. In this case, we may consider the entire collection
γ = (γ 1 , . . . , γ N ) as a path through Euclidean space, γ : [0, 1] → RN . The path signature
is a feature set that completely characterizes such paths, and has recently been applied to
several tasks in machine learning (Chevyrev and Kormilitzin, 2016; Lyons, 2014). Recent
work has provided the path signature with strong theoretical properties; namely that it is
a universal and characteristic kernel for time series in Euclidean space RN (Chevyrev and
Oberhauser, 2018).
However, in many scenarios, the data may have some geometric constraints, and may
be better represented by elements of a (non-Euclidean) manifold. In this case, the time-
varying data can be modelled as path on such a manifold, rather than on Euclidean space.
Lie groups are smooth manifolds equipped with a compatible group structure. Paths (or
time series) valued in Lie groups model a number of natural phenomena, including the
following.

• The special Euclidean group SE(n) is the Lie group of all rigid body motions in Rn .
The group SE(3) is often used to model the position and pose of a rigid body, such as

1
Darrick Lee and Robert Ghrist

a component of a robotic arm or an element of a drone swarm, with k such components

or elements collectively giving rise to a path SE(3)k (Selig, 2004).

• The special orthogonal group SO(n) is the Lie group of all rotations in Rn ; this is a Lie
subgroup of SE(n). The Lie group SO(3)k has recently been used to represent the
pose of a human by recording the relative rotations of k pairs of body parts (Vemula-
palli and Chellappa, 2016). Thus, human movement can be represented as a path in
SO(3)k . This representation has been used in the computer vision problem of human
action recognition, and Lie group methods have achieved state-of-the-art results in
this domain (Huang et al., 2017).

• The state of an oscillator may be described as an element of the circle S 1 , and collective
behavior of a network of oscillators can be describe by an element of the n-torus,
T n = (S 1 )n . The time evolution of oscillator networks can therefore be modelled as a
path on T n (Strogatz, 2000).

• The Euclidean space RN is the simplest example of a Lie group, where the group
operation is addition. The classical path signature for Euclidean space can be viewed
as a special case of path signatures on Lie groups.

In this paper, we extend path signatures to time series valued in Lie groups, and show
that this extension is also a universal and characteristic kernel.

1.1 Contributions
We lift the theory of path signatures for time series valued in Euclidean space to the setting
of time series valued in Lie groups, restricting ourselves to the class of piecewise regular
paths on Lie groups.

Definition 1 Let G be a Lie group. A path γ : [a, b] → G is regular if γt0 is continuous and
nonvanishing on the entire interval [a, b]. Such a path is piecewise regular if there exists a
partition a = t0 < t1 < . . . < tn = b such that γ is regular on each open subinterval (ti , ti+1 )
for all i. The pathspace – the space of all piecewise regular paths on the unit interval,
γ : [0, 1] → G – will be denoted P G.

Let G be a Lie group of dimension N , and let g be its Lie algebra (the tangent space at
the identity). We denote the underlying vector space of g by g ∼ = RN . The path signature
is a function on paths,

S : P G → T ((g)),

valued in a formal power series of tensors, T ((g)), where we may view the coefficients as
descriptors (or features) of the underlying path (or time series). Path signatures for general
manifolds were originally defined by Chen (1958), but not in a manner conducive to data
analysis. This paper gives a computationally clean derivation for path signatures on Lie
groups tuned for use in data analysis.
Our generalization is designed to be analogous to the Euclidean case as much as possible,
for ease of applicability. For example, the definition of the path signature for γ : [0, 1] → G

2
Path Signatures on Lie Groups

depends only on the derivative γ 0 : [0, 1] → g. We exploit one of the key properties of Lie
groups — that tangent vectors at a point correspond to elements of its Lie algebra g, a
vector space. This will permit a signature construction making use of iterated integrals as
per the Euclidean case.
In the Euclidean case G = RN , the Lie group is often conflated with its Lie algebra
r = RN , and the fact that the integration is performed in the Lie algebra is often not
made. By clarifying and emphasizing this point, the generalization to Lie groups illuminates
understanding of the classical Euclidean case.
From a machine learning perspective, the basic properties of the path signature as a
feature map provide several benefits.

• The signature is a feature set for a path as a whole, and can be used to compare time
series with varying numbers of time points.

• Defined as iterated line integrals, the path signature is invariant under reparametriza-
tion, and thus only depends on the order in which events occur.

• The signature is left translation invariant, meaning the signatures of paths that differ
by a constant element g ∈ G will be the same. This implies that the signature only
depends on the dynamics of the time series and is unconcerned with the initial point.

• The antisymmetrization of the second degree signature tensor can be viewed as an

indicator of lead-lag behavior in the time series. In the case of Lie groups, the inter-
pretation will be considered in terms of left-invariant vector fields.

However, the most crucial property is that the path signature fully characterizes paths
up to tree-like equivalence; that is, the map S is injective, up to quotienting P G out by
an equivalence relation. This fact is originally due to Chen (1958) for the case of piecewise
regular paths on Lie groups, and later generalized by Hambly and Lyons (2010) to the case
of bounded variation paths in Rn .
Our main contribution is to apply this injectivity result to prove that a normalized
variant of the signature, S : P G → T ((g)), is a universal and characteristic feature map
for time series in G, when we equip T ((g)) with the structure of a Hilbert space. This is
proved in Section 4.2. This was originally shown for the Euclidean case by Chevyrev and
Oberhauser (2018). Such feature maps can be used to two large classes of machine learning
problems, in the context of kernel methods.

1. (Studying functions on P G) Solving a classification problem on P G can be re-

duced to finding a function f : P G → R such that the level set f = 0 provides the
decision boundary. The universality of the normalized signature map states that any
continuous bounded function f : P G → R can be approximated using a linear func-
tional f (·) ≈ h`, S̃(·)i. This allows us to reduce a nonlinear optimization problem into
a linear one, greatly reducing the complexity.

2. (Studying measures on P G) Two-sample hypothesis testing on P G requires the

computation of a set of statistics that is rich enough to distinguish any two probability

3
Darrick Lee and Robert Ghrist

measures on P G. The characteristicness of the normalized signature map states

that the kernel mean embedding (KME) is injective with respect to the normalized
signature

Φ : M(P G) → T ((g)), Φ(µ) = Eµ [S],

where M(P G) denotes all finite regular Borel measures on P G, and S is appropriately
normalized. This allows us to consider probability measures as elements of a linear
space; furthermore, the norm induced by the Hilbert space structure coincides with
the maximum mean discrepancy (MMD) between measures.

We perform two experiments that demonstrate the efficacy of the path signature for
these two classes of problems. First, we consider the computer vision problem of human
action recognition in Section 5.1. We show that the path signature method is much easier to
use than shallow learning methods previously applied to this problem (Vemulapalli et al.,
2014; Vemulapalli and Chellappa, 2016) while providing comparable results. Second, in
Section 5.2, we consider a hypothesis testing problem for simulated random walks on the
Lie group SO(3). Here, we show that the Lie group valued path signature vastly outperforms
the Euclidean path signature.
Along the way, we will establish extensions of other properties of the path signatures to
Lie groups and discuss several concepts related to path signatures and data analysis on Lie
groups more broadly. A summary of these contributions is given below.

1. We provide a detailed exposition of Lie group valued time series, and discuss a notion
of scaling for such time series in Section 2.2. Scaling of data is sometimes required
when the data needs to be normalized, and we discuss how scaling affects the path
signature in Section 3.1. We also discuss the continuous interpretation of discrete time
series on Lie groups in Section 2.3.

2. For G an N -dimensional Lie group, we give a signature-preserving bijection between

P G and P RN in Section 3.2, which provides a Euclidean representation of Lie group
valued time series. This bijection allows the exportation of Euclidean data analysis
tools to Lie group valued data. With the metric introduced in Section 3.3, this
bijection is an isometry.

3. It is well known that the Euclidean path signature is equivariant with respect to linear
transformations (Friz and Victoir, 2010). We show that path signatures are equivari-
ant under Lie group homomorphisms in general. Namely, given a homomorphism of
Lie groups F : G1 → G2 , where g1 and g2 are the respective Lie algebras, we define
the action of this homomorphism on the tensor algebra F∗ : T ((g1 )) → T ((g2 )), and
show in Section 3.4 that

S(F γ) = F∗ S(γ)

for all γ ∈ P G.

4
Path Signatures on Lie Groups

4. An important feature of the path signature is the interpretability of lower level sig-
nature terms. We discuss the extension of the lead-lag interpretation of second level
signature terms for Euclidean paths, as well as a topological interpretation of the first
level signature terms for abelian Lie groups in Section 3.6.

5. Path transformations, such as appending the time parameter or using a sliding win-
dow, are often used as a preprocessing step for Euclidean path signatures (Chevyrev
and Kormilitzin, 2016). We discuss these transformations in the context of breaking
reparametrization or left-translation invariance in Section 3.8. Empirical studies (Fer-
manian, 2019) have shown that the sliding window transformation (also called the
lead-lag transformation) provides good classification results, despite the lack of a the-
oretical explanation. We propose one explanation, which is that the sliding window
transformation breaks left-translation invariance, and we provide empirical evidence
in the experiments in Section 5.1.

6. We provide both algorithmic details and a Julia package for the computation of
path signatures valued in Lie groups, which can be found at https://github.com/
ldarrick/PathSignatures. For details, see Appendix A.

1.2 Previous and related work

The concept of path signatures is relatively new in data science and machine learning (Lyons,
2014; Chevyrev and Kormilitzin, 2016; Giusti and Lee, 2020), but has deep roots in topology
and geometry. Chen originally defined the path signature for piecewise regular paths on
manifolds and proved several basic properties in a sequence of papers (Chen, 1954, 1957,
1958). He later studied the geometry and topology of path spaces and loop spaces by
constructing a rational cochain model of these spaces, in which path signatures constitute
0-cochains (Chen, 1977).
Lyons (1998) developed the concept of the path signature in a different direction, using
the path signature as a construction to lift bounded variation paths on RN to paths of power
series of tensors T ((RN )). This initiated the study of rough paths, which can be thought of
as a generalization of the path signature to highly irregular paths. This theory was then
used to study stochastic processes and stochastic differential equations (Lyons and Qian,
2007; Lyons et al., 2007; Friz and Victoir, 2010).
Within machine learning, path signatures have been used to study real-valued time se-
ries data in a variety of settings. Examples can be found in the study of financial time
series (Gyurkó et al., 2013; Lyons et al., 2014), handwritten character recognition (Yang
et al., 2016), human action recognition using position data (Yang et al., 2019), identifying
psychological or neurological disorders (Moore et al., 2019; Zimmerman et al., 2018; Ar-
ribas et al., 2017) and featurizing the output of persistent homology in topological data
analysis (Chevyrev et al., 2020).
The theoretical aspects of path signatures in the context of kernel methods were devel-
oped in Kiraly and Oberhauser (2019) and Chevyrev and Oberhauser (2018). The present
paper is largely inspired by these two papers. The concept of using the path signature as a
kernel for time series was first proposed in Kiraly and Oberhauser (2019), and efficient al-
gorithms for computing the kernel were developed. The path signature for Euclidean space

5
Darrick Lee and Robert Ghrist

was shown to be a universal and characteristic feature map in Chevyrev and Oberhauser
(2018). This exploits the recently formalized duality between universal and characteristic
kernels in Simon-Gabriel and Schölkopf (2018).
It is well known that Euclidean path signatures are translation invariant, and we will
show that Lie group path signatures are left translation invariant. Diehl and Reizenstein
(2019) has considered the related problem of determining the Euclidean path signature
terms which are invariant under some matrix Lie group action.

We begin in Section 2 by reviewing basic facts on Lie groups and Lie algebras, and
provide an exposition on continuous and discrete time series on Lie groups. We then define
the path signature for Lie groups in Section 3, and discuss the bijection between P G and
P RN , the equivariance of the path signature, detecting lead-lag behavior in time series,
and path transformations. In Section 4, we provide a brief overview of kernel methods
and prove our main result, which shows that the path signature kernel is universal and
characteristic. Finally, in Section 5, we apply the path signature on Lie groups to a human
action classification problem and a hypothesis testing problem involving random walks on
SO(3).

1.3 Notation
Throughout this paper, we will denote the time parameter for a path γ : [0, 1] → G using
a subscript t, meaning γt := γ(t). Derivatives are shown using the prime notation, as in
γt0 := dγ N
dt (t). If we have a path in Euclidean space α : [0, 1] → R , we will use superscripts
to represent the components, such as α = (α1 , α2 , . . . , αN ). If G is a Lie group, we will use
g to denote its Lie algebra and use ḡ to be the underlying vector space of g (forgetting the
Lie bracket structure).
Continuous paths will often be denoted using the lowercase Greek symbols α, β, γ, and
the space of all piecewise regular paths in G is denoted P G. For T ∈ N, we let [T ] =
{1, . . . , T } denote the finite set of integers up to T . Discrete time series will be distinguished
using the hat notation γ̂ : [T ] → G, and the space of all discrete time series in G will be
denoted P̂ G.
There are also several parameters that will be used consistently throughout the paper.
Unless otherwise specified, we reserve the following symbols for the given meaning.

• N is the dimension of the Lie group G that paths take values in;

• T + 1 is the length of a discrete time series (so that the discrete derivative will be of
length T );

• M is the level of the truncated signature.

2. Lie groups, paths, and time series

We begin this section by recalling several basic facts about Lie groups Alexandrino and
Bettiol (2015), followed by paths on Lie groups and the interpretation of sampled time
series on Lie groups, stressing the differences from sampled time series on Euclidean space.

6
Path Signatures on Lie Groups

2.1 A review of Lie groups

Recall that a Lie group G is a smooth manifold with a group structure such that the
multiplication and inversion maps are both smooth. Let g0 ∈ G. The left translation
map by g0 , written as Lg0 : G → G, is defined to be Lg0 (g) = g0 g. The right translation
map Rg0 : G → G is defined analogously. This induces a mapping on tangent spaces
Lg0 ∗ : Tg G → Tg0 g G. A vector field X on G is called left-invariant if

Lg0 ∗ X(g) = X(Lg0 g) = X(g0 g)

for all g0 , g ∈ G. This implies that all left-invariant vector fields X are defined by their
value at the identity e ∈ G,
X(g) = Lg∗ X(e),
and thus, we obtain a one-to-one correspondence between left-invariant vector fields and
the tangent space at the identity, which we denote by g := Te G. Vector fields act on smooth
functions f : G → R, and we define an operation of left-invariant vector fields X and Y by

[X, Y ](f ) := X(Y (f )) − Y (X(f )),

where [X, Y ] is also left-invariant. This provides g with the structure of a Lie algebra, where
the Lie bracket [·, ·] : g × g → g is a bilinear mapping such that for all X, Y, Z ∈ g

[X, Y ] = −[X, Y ]
[X, [Y, Z]] + [Z, [X, Y ]] + [Y, [Z, X]] = 0.

Similarly, left translation induces a map L∗g0 : Tg∗0 g G → Tg∗ G on cotangent spaces. A
1-form ω ∈ T ∗ G is called left-invariant if

L∗g0 ω(g0 g) = L∗g0 ω(Lg0 g) = ω(g)

for all g, g0 ∈ G. Again, we obtain a correspondence between left-invariant 1-forms and the
cotangent space at the identity via the property

ω(g) = L∗g−1 ω(e).

Thus, we may identify the left-invariant 1-forms by the dual of the Lie algebra, g∗ .

Remark 2 We can think of tangent vectors (derivatives) of a path γ : [0, T ] → G as

elements of the Lie algebra g in two ways. First, for a tangent vector v ∈ Tg G, we can
compute the pushforward the tangent vector along the left multiplication map Lg−1 v ∈ Te G =
g. Second, a basis of the Lie algebra provides a global frame for G, meaning, it provides a
basis for Tg G for all g. By considering v ∈ Tg G in terms of this basis, we may also think
of v as an element of g.
In summary, the structure of the Lie group allows us to consider tangent vectors at any
point on G using a single vector space: a fact repeatedly used throughout this paper.

Given a left-invariant vector field X ∈ g, there exists a unique 1-parameter subgroup

ρX : R → G such that ρX (0) = e and ρ0X (0) = X(e). This is defined by the integral curve
of X which passes through the identity at t = 0.

7
Darrick Lee and Robert Ghrist

Definition 3 The Lie exponential map of G is defined as

exp : g → G, exp(X) := ρX (1),

where ρX is the 1-parameter subgroup defined above.

This exponential map provides a way to move between a Lie group and its Lie algebra.

Proposition 4 The exponential map exp : g → G is smooth and d(exp)0 = id. Thus,
exp is a diffeomorphism between an open neighborhood of the origin 0 ∈ g and an open
neighborhood of the identity e ∈ G.

Thus, if elements are near the origin, we can define an inverse map.

Definition 5 Suppose U ⊂ g is a neighborhood of the origin such that the exponential map
is a diffeomorphism. Let V = exp(U ). The logarithm map on V is defined to be

log : V → g, log(g) := exp−1 (g).

A homomorphism of Lie groups F : G → H is a smooth map which is also a group ho-

momorphism, and a homomorphism of Lie algebras φ : g → h is a linear map that preserves
the Lie bracket F ([X, Y ]) = [F (X), F (Y )] for all X, Y ∈ g. A Lie group homomorphism
F : G → H induces a Lie algebra homomorphism F∗ : g → h between the respective Lie
algebras by the induced map between the tangent spaces at the identity F∗ : Te G → Te H.

Example 1 The special orthogonal group SO(3) — orientation-preserving rotations of R3

— will be the running example used throughout this paper. This is a matrix Lie group and
can be explicitly described as the space of all 3 × 3 orthogonal matrices (AA| = A| A = I)
with determinant +1. The Lie algebra of SO(3) is so(3), which consists of all 3 × 3 skew-
symmetric matrices (B = −B | ). An explicit basis for so(3) is
     
0 −1 0 0 0 1 0 0 0
e1 = 1 0 0 , e2 =  0 0 0 , e3 = 0 0 −1 .
0 0 0 −1 0 0 0 1 0

We will denote the duals of these basis vectors to be ωi = ei ∈ g∗ . For all matrix Lie
groups, the Lie exponential and logarithm are simply the matrix exponential and logarithm.
Suppose θ ∈ R. The exponential map in these three basis directions gives us
 
cos θ − sin θ 0
exp(θe1 ) =  sin θ cos θ 0 ,
0 0 1
 
cos θ 0 sin θ
exp(θe2 ) =  0 1 0 ,
− sin θ 0 cos θ
 
1 0 0
exp(θe3 ) = 0 cos θ − sin θ .
0 sin θ cos θ

8
Path Signatures on Lie Groups

These are exactly the rotation matrices about the z, y, and x axes respectively. Therefore, we
may think of the basis vectors ei of the Lie algebra as infinitesimal rotations in the respective
directions. In particular, given a path γ ∈ P (SO(3)), the value ωi (γt0 ) corresponds to the
infinitesimal rotation of γ at time t in the direction of ei . If we integrate this over the
domain of the path,
Z 1
ωi (γt0 )dt,
0

we obtain the cumulative rotation of γ in the direction of ei over the unit interval. This
interpretation will be important to keep in mind when we define the path signature in Sec-
tion 3.

Finally, we briefly discuss the Riemannian structure of Lie groups. Recall that a Rie-
mannian metric on a smooth manifold M is the assignment of an inner product h·, ·ip to
the tangent space Tp M for every point p ∈ M , which varies smoothly. Specifically, this
means that if X, Y are smooth vector fields defined on a neighborhood of p, then the map
p 7→ hXp , Yp ip is smooth. On a Lie group, we often want a Riemannian metric that is
compatible with the algebraic structure of G. A Riemannian metric is left-invariant if

hX, Y ig = hLh∗ X, Lh∗ Y ihg

for all g, h ∈ G and X, Y ∈ Tg G, and a right-invariant Riemannian metric is defined

similarly. Such left-invariant metrics can simply be defined on the tangent space at the
identity.

Proposition 6 There is a one-to-one correspondence between left-invariant metrics on a

Lie group G and inner products on its Lie algebra g.

Namely, evaluating the inner product h , ig simply corresponds to viewing the tangent
vectors as elements of the identity, and then evaluating the chosen inner product on g. We
will assume that all Riemannian metrics under discussion are left-invariant, and simply call
them Riemannian metrics.

2.2 Paths on Lie groups

A Riemannian metric h·, ·i, where we now omit the subscript g since it is left-invariant,
provides a notion of length for piecewise regular paths on G. Suppose γ ∈ P G. Then the
length of γ is defined to be Z 1q
`(γ) := hγt0 , γt0 idt.
0
This allows us to define a metric on the Lie group. If g1 , g2 ∈ G, then the distance
between g and h is defined to be the infimum length of paths connecting g1 and g2 ,

d(g1 , g2 ) := inf {`(γ) : γ ∈ P G, γ0 = g1 , γ1 = g2 } .

Note that since the Riemannian metric is left invariant, this metric is also left invariant,

d(hg1 , hg2 ) = d(g1 , g2 ),

9
Darrick Lee and Robert Ghrist

for all h ∈ G. The more familiar notion of length in the path signature literature is the
1-variation of a path.

Definition 7 Suppose (X, dX ) is a metric space and let γ ∈ P X. The 1-variation of γ on

[0, 1] is defined as X
|γ|1−var = sup dX (γti , γti+1 ), (1)
(ti ) i

where the sum is taken over all partitions 0 = t1 ≤ . . . ≤ tn = 1 of [0, 1].

Using the metric induced by the Riemannian metric, we may consider the 1-variation
length of paths in G. Under the piecewise regular hypothesis, these two lengths are equiv-
alent.

Lemma 8 (Burtscher (2015)) Let γ ∈ P G. We have `(γ) = |γ|1−var .

At this point, in the case of paths on Euclidean space, we may use the 1-variation to
define a metric on P RN0 , which are the paths which start at the origin. Given a Lie group
G with a left-invariant Riemannian metric, we could follow the same procedure to obtain
a metric space structure on P Ge . However, this is not the metric space structure on P G
that is the most compatible with the path signature. We will defer this discussion until
Section 3.3.
The path space P RN is endowed with a vector space structure since RN itself is a vector
space. Similarly, we can endow P G with a group structure by pointwise multiplication,
where the identity is the constant path at the identity, and the inverse to a path γ ∈ P G
is the pointwise inverse. However, we are missing a notion of scaling for paths in P G, and
such an operation is important to have in machine learning, since algorithms may require
normalization of data. Such a scaling is obtained by proving a correspondence between
paths in G and paths in g, and then transferring the scaling operation from g to G.
This is done by considering paths on G from the point of view of differential equations.
We have the following existence and uniqueness theorem for first order ordinary differential
equations. Let P g denote the space of piecewise continuous paths γ : [0, 1] → g which are
right continuous, meaning limt↓t0 γt = γt0 .

Theorem 9 Let f ∈ P g, so that f : [0, 1] → g is piecewise continuous and right continuous,

where we consider elements of g as left-invariant vector fields. Then, the solution of the
first order ODE
γt0 = ft (γt ), γ0 = g (2)
exists and is unique.

Note that in this theorem, we consider a function γ : [0, 1] → G to be a solution of this

ODE if the differential equation holds at all points except the points of discontinuity of f .
This implies that we can represent piecewise regular paths in G as paths in the Lie algebra
g, along with its initial point. Let P Gg ⊂ P G be defined as

P Gg = {γ ∈ P G : γ0 = g}.

10
Path Signatures on Lie Groups

Corollary 10 Suppose G is a Lie group and g its Lie algebra. The map Ψg : P g → P Gg ,
which takes f ∈ P g to the solution of the ODE in Equation 2 with initial condition γ0 = g,
is a bijection.

Proof Firstly, the map Ψg is well defined by the existence and uniqueness theorem above.
The inverse to Ψg can be defined by taking the derivative at every differentiable point.
Suppose γ ∈ P Gg , and let d(γ) ⊂ [0, 1] denote the set of points such that γ is differentiable.
Note that [0, 1] − d(γ) is a finite set since γ is piecewise regular. Now, define Ψ−1 0
g (γ)(t) = γt
for all t ∈ d(γ), and at the nondifferentiable points by right continuity

Ψ−1 0
g (γ)(t) = lim γs .
s↓t

This map is well defined: Ψ−1

g (γ)(t) is continuous for every t ∈ [0, 1] − d(γ), and right
continuous by definition.

We can view P g as a Lie algebra, with pointwise vector space operations, and pointwise
Lie bracket. Because the group structure of P G and the Lie algebra structure of P g are
defined pointwise, the map Ψg is compatible with Lie algebra morphisms induced by Lie
group morphisms. Namely, if F : G → H is a Lie group morphism, we obtain a group
homomorphism F : P G → P H by applying the map pointwise. Analogously, if F∗ : g → h
is the induced Lie algebra morphism, we obtain a Lie algebra morphism F∗ : P g → P h.
The following lemma is immediate since the group structure on P G and the Lie algebra
structure on P g are defined pointwise.

Lemma 11 Suppose F : G → H is a morphism of Lie groups, and F∗ : g → h is the

induced morphism of Lie algebras. Then the following diagram commutes
F∗
Pg Ph
Ψ−1
g ΨF (g)
F
P Gg P HF (g) .

The map Ψg allows us to view paths on Lie groups as paths in a linear space, while
retaining all first order differential information. We can use the fact that many operations
for paths on RN are defined via operations on the Lie algebra, and thus generalize these
operations to Lie groups.
For a path α ∈ P RN and λ ≥ 0, denote the vector space scaling operation as

(λα)t := λαt .

However, another way of viewing the scaling operation for paths that begin at the origin is
by scaling in the Lie algebra. Suppose λ ≥ 0, and denote the vector space scaling in a Lie
algebra g by cλ : g → g.
Lemma 12 Let α ∈ P RN
0 . Then

λα = Ψ0 ◦ cλ ◦ Ψ−1
0 (α).

11
Darrick Lee and Robert Ghrist

Proof In RN , the map Ψ0 is simply integration in RN , and Ψ−1 0 is differentiation. Thus,

we have
Z t
Ψ0 ◦ cλ ◦ Ψ−1 λαs0 ds

0 (α) t
=
0
Z t
=λ αs0 ds
0
= (λα)t .

We use this fact as motivation to define scaling on Lie groups.

Definition 13 Suppose G is a Lie group and g its Lie algebra. Let γ ∈ P G and λ ≥ 0.
We define the Lie algebra scaling of γ by λ to be

λ · γ := Ψγ0 ◦ cλ ◦ Ψ−1
γ0 (γ). (3)

Remark 14 We highlight three important differences between vector space scaling for paths
in RN and Lie algebra scaling for paths in an arbitrary Lie group G, and provide a reason
for each.

1. Returning to the setting of paths in RN , the two notions of scaling differ slightly when
the path does not start at the origin. If we have α ∈ P RN such that α0 = x, then
(λα)0 = λx, while (λ · α)0 = x. However, if we align the initial points, the paths
coincide,

(λα) − λx = (λ · α) − x.

This difference is due to the fact that arbitrary Lie groups do not have a natural scaling
operation. However, if our Lie group was equipped with a suitable scaling operation,
such as a Carnot group (Le Donne, 2017), then we would be able to do define a scaling
operation that coincides with the vector space scaling in P RN .

2. We have only defined scaling by a nonnegative number. Definition 13 could be extended

to all real numbers λ without any changes, but the interpretation of negative scaling is
more difficult in arbitrary Lie groups. For a path α ∈ P RN 0 , scaling by λ = −1 simply
produces the pointwise inverse of a path. However, this is not the case in a general
Lie group. For example, let X, Y ∈ g and consider the piecewise path

e2tX : t ∈ [0, 21 )

γt =
eX e(2t−1)Y : t ∈ [ 12 , 1].

Here, we have γ1 = eX eY and (−1 · γ)1 = e−X e−Y , which are not inverses in general.
Thus, we see that the obstruction to this interpretation is the noncommutativity of
arbitrary Lie groups. However, in the setting of abelian Lie groups, such an interpre-
tation would hold.

12
Path Signatures on Lie Groups

3. By definition, the vector space scaling in P RN obeys the distributive law: λ(α + β) =
(λα) + (λβ) for α, β ∈ P RN and λ ∈ R. In other words, the vector space scaling is a
pointwise Lie group homomorphism for RN . However, cλ : g → g is not a morphism
of Lie algebras in general since cλ ([X, Y ]) = λ[X, Y ] 6= λ2 [X, Y ] = [cλ X, cλ Y ]. Thus,
it cannot be the induced map of an underlying Lie group homomorphism for G, so
the Lie algebra scaling for G is not distributive, λ · (αβ) 6= (λ · α)(λ · β), in general.
In the case of an abelian Lie group H, the associated Lie algebra h is abelian so that
[X, Y ] = 0 for all X, Y ∈ h, and thus Lie algebra scaling can be viewed as a pointwise
Lie group morphism.

Due to these remarks, we must keep in mind that the scaling operation for paths in Lie
groups is not compatible with the algebraic structure of G.

2.3 Discrete time series on G

In this subsection, we will consider the interpretation of discrete time series on an arbitrary
Lie group G, and also discuss derivative computations for these discrete time series. We
will continue the theme of comparison with the corresponding notions in RN .

Remark 15 Here, we will assume that discrete time series are uniformly sampled at integer
times. This does not result in any loss of generality due to the reparametrization invariance
of the path signature, given in Proposition 22.

Let T ∈ N and x̂ : [T + 1] → RN be a discrete time series in RN of length T + 1. There

is a natural interpretation of x̂ as a continuous time series x : [T + 1] → RN by linear
interpolation between points. Namely, it is the interpolation with a constant derivative
between the discrete points defined in x̂. This is the interpretation that we implicitly take
when we compute derivatives of discrete time series by finite differences x̂0i = x̂i+1 − x̂i to
get the discrete derivative x̂0 : [T ] → RN . Additionally, we can think about the continuous
path x as a geodesic interpolation of the discrete path x̂.
However, the interpretation is more subtle in the case of arbitrary Lie groups. Suppose
we have a discrete time series in G, which we denote by γ̂ : [T + 1] → G. We wish to obtain
an interpolation such that the derivative, when viewed in the Lie algebra g, is constant
between adjacent points. This can be achieved by taking the logarithm of the difference
between adjacent points. We define the discrete derivative γ̂ 0 : [T ] → g of a discrete Lie
group valued path by
γ̂i0 := log γ̂i−1 γ̂i+1 ∈ g.

(4)
Then, we can define the continuous interpolation γ : [0, T + 1] → G using the exponential
map such that for t ∈ [i, i + 1), the interpolation is

γt := γ̂t exp (t − i)γ̂t0 .

We note that this construction reduces to linear interpolation in the case of G = Rn .

This is due to the fact that for the additive Lie group RN , the exponential and logarithm
map are both the identity and are both globally defined. Additionally, the group operation

13
Darrick Lee and Robert Ghrist

is addition, so we should interpret all of the products as sums. However, there are two
essential differences between the case of arbitrary Lie groups and Euclidean space.
Firstly, for an arbitrary Lie group G, the logarithm map is only defined in a neighborhood
of the identity. The two reasons the logarithm may not be defined in a larger neighborhood
are the loss of injectivity and the loss of surjectivity of the exponential map. On any
compact Lie group, the exponential map will not be injective at any point. In this case,
we can define the logarithm to be the value closest to the origin, but non-injectivity may
still occur. For example, the point antipodal to the identity in S 1 has no unique logarithm
since there are two paths of equal distance to the identity. However, if we perturb the
target point in either direction, there exists a unique shortest path. This implies that by
undersampling the underlying time series, we may infer incorrect information. The case of
S 1 is exactly the situation encountered in the Nyquist sampling theorem.
The exponential map is not always surjective, with the simplest examples being non-
connected Lie groups. However, connected Lie groups such as SL(2, R) can still have non-
surjective exponential maps. In these cases, discrete derivatives may not exist, and finer
sampling is required so that the difference between adjacent points γ̃i−1 γ̃i+1 is closer to the
identity and has a well-defined logarithm. However, for compact Lie groups such as SO(3),
the Lie exponential map is surjective.
Secondly, the interpolation defined here may not be a geodesic connecting the two points.
Suppose h is a Riemannian metric on G. In general, geodesics do not coincide with the
one-parameter subgroups of G. In other words, in these cases, the Riemannian exponential
map and the Lie exponential map are not the same. However, for bi-invariant metrics, they
coincide.

Theorem 16 The Lie exponential map and the Riemannian exponential map at the identity
agree on Lie groups with bi-invariant metrics.

Thus, for all Lie groups equipped with bi-invariant metrics, we may continue to interpret
the interpolation as a geodesic interpolation. In fact, this holds for all compact Lie groups.

Proposition 17 Every compact Lie group admits a bi-invariant metric.

From this discussion, we find that for a compact Lie group G, the interpretation of
discrete time series on G is similar to the case of RN , with the main difference being the
non-injectivity of the exponential map.

3. Path signatures on Lie groups

This subsection, based on the exposition of path signatures on Euclidean space given
in Giusti and Lee (2020), begins by defining the path signature for Lie groups. We show
several basic properties which are well-known for path signatures on Euclidean space, culmi-
nating in the definition of tree-like equivalence for paths and the property that the signature
is an injective group homomorphism. This material was originally developed by Chen (1954,
1957, 1958) and is not novel.
We then prove a signature preserving bijection between paths on an N -dimensional
Lie group G and paths on RN , which highlights the extent to which the theory naturally

14
Path Signatures on Lie Groups

extends to the case of Lie groups. This result provides a Euclidean representation of Lie
group valued time series, and can thus be used to apply classical Euclidean data analysis
techniques to Lie group valued time series. To the authors’ knowledge, this result has not
previously appeared in literature.
We then consider the extension of the equivariance property of path signatures. This
is followed by an interpretation of the second-level signature terms as indicators of lead-lag
behavior between the directions corresponding to our choice of basis vectors for the Lie
algebra g. Finally, we close this section by discussing computational aspects of the path
signature for discrete time series, as well as symmetry breaking path transformations which
can be used as a preprocessing step.

In this section, we use (e1 , . . . , eN ) to denote an ordered basis of g and use (ω1 , . . . , ωN )
to denote the dual basis of g∗ such that ωi (ej ) = δi,j , where δi,j is the Kronecker delta.

3.1 Path signature as a group homomorphism

Let G be an N -dimensional Lie group. Recall that P G denotes the space of piecewise
regular paths γ : [0, 1] → G.

Definition 18 Let γ ∈ P G. Suppose ω1 , . . . , ωN ∈ g∗ form a basis of g∗ . For i ∈ [N ],

define a path S i (γ)t : [0, 1] → R as
Z t
S i (γ)t := ωi (γs0 )ds.
0

Next, let I = (i1 , . . . , im ) be a multi-index, where ij ∈ [N ]. Higher order paths S I (γ)t :

[0, 1] → R are inductively defined as
Z t
I
S (γ)t := S (i1 ,...,im−1 ) (γ)s ωim (γs0 )ds. (5)
0

The path signature of γ with respect to I is defined to be S I (γ) := S I (γ)1 .

We can also present the definition in a non-inductive way. Let ∆m be the standard
m-simplex

∆m = {(t1 , . . . , tm ) : 0 ≤ t1 < t2 < . . . < tm ≤ 1}.

By collapsing the inductive definition, we can write the path signature of γ with respect to
I = (i1 , . . . , im ) as
Z
I
S (γ) = ωi1 (γt01 ) . . . ωim (γt0m ) dt1 . . . dtm . (6)
∆m

We can amalgamate the path signatures with respect to every multi-index I into an
element of a tensor algebra.

15
Darrick Lee and Robert Ghrist

Definition 19 Suppose V is a real vector space of dimension N . The tensor algebra with
respect to V is defined to be
Y
T ((V )) = V ⊗m .
m≥0

Suppose (e1 , . . . , eN ) is an ordered basis for V . Suppose s, t ∈ T ((V )). Let tm ∈ V ⊗m be

the degree m part of t and if I = (i1 , . . . , im ) is a multi-index with ij ∈ [N ], then tI is the
coefficient of ei1 ⊗ . . . ⊗ eim in t. Addition and scalar multiplication is defined element-wise:

• (s + t)I = sI + tI ,

• (λt)I = λtI ,

and multiplication is defined by tensor multiplication

• (s ⊗ t)I = m (i1 ,...,ij ) t(ij+1 ,...,im ) .

P
j=0 s

Let ḡ be the underlying vector space of the Lie algebra g. Let e1 , . . . , eN be a basis for
g. We define the path signature of Γ ∈ P G to be
X X
S(γ) := 1 + S I (γ)ei1 ⊗ . . . ⊗ eim ∈ T ((ḡ)). (7)
m≥1 |I|=m

Remark 20 For path signatures defined on Euclidean space RN , we often choose the stan-
dard 1-forms (dx1 , . . . , dxN ) to be the basis of r, the Lie algebra of RN . Suppose α ∈ P RN .
We can also write our path component-wise as α = (α1 , . . . , αN ), where each αi : [0, 1] → R.
Then, evaluation of the standard 1-forms is simply dxi (αt0 ) = (αi )0t . Thus, in the Euclidean
case, the definition of the path signature reduces to
Z
I
S (α) = (αi1 )0t1 . . . (αim )0tm dt1 . . . dtm . (8)
∆m

Let γ ∈ P G and g ∈ G. The left translation of γ by g is defined to be the path

(gγ)t := g(γt ), where we left translate the path γ by g pointwise (one can analogously define
the right translation of a path). Similar to the case of Euclidean space, path signatures are
left translation invariant and reparametrization invariant.

Proposition 21 (Left translation invariance) Let γ ∈ P G and g ∈ G. Then S(gγ) =

S(γ).

Proof It suffices to show that S I (gγ) = S I (γ) for all multi-indices I. Note that we have

(gγ)0t = Lg∗ γt0 .

Specifically, this implies that γt0 and gγt0 are represented by the same element in the Lie
algebra g. Therefore for any ω ∈ g∗ , we have ω(gγt0 ) = ω(γt0 ). Thus, S I (gγ) = S I (γ) for all
I.

16
Path Signatures on Lie Groups

Proposition 22 (Reparametrization invariance) Let γ : [a, b] → G be a piecewise reg-

ular path, and let φ : [c, d] → [a, b] be a strictly increasing function. Then S(γ ◦ φ) = S(γ).

Proof This is the Change of Variables Theorem. Reparametrization invariance of the first
level of the signature is given as
Z d Z d Z b
0 0 0
i
S (γ ◦ φ) = ωi ((γ ◦ φ)t )dt = ωi (γφt )φt dt = ωi (γτ0 )dτ = S i (γ).
c c a

Invariance for higher order terms is shown by induction using the same argument.
In particular this proposition justifies our choice of only considering paths parametrized by
[0, 1], as any other path can be reparametrized into this domain. Next, we would like to
understand how scaling of paths in G given in Definition 13 affects the path signature. Note
that the vector space scaling in ḡ induces a dilation map in T ((ḡ)). Explicitly, we define
the map δλ : T ((ḡ)) → T ((ḡ)) as

δλ t := (t0 , λt1 , λ2 t2 , . . .). (9)

Proposition 23 Let γ ∈ P G and λ ≥ 0. Then S(λ · γ) = δλ S(γ).

Proof Consider the multi-index I = (i1 , . . . , ik ). Then,

Z
I
S (λ · γ) = ωi1 (λγt01 ) . . . ωik (λγt0k ) dt1 . . . dtk
∆kZ

= λk ωi1 (γt01 ) . . . ωik (γt0k ) dt1 . . . dtk

∆k
k I
= λ S (γ).

We have seen that the group structure on G allows us to define a group structure on P G
by pointwise multiplication. The group structure on G allows us to obtain another group
structure on a quotient of P G where the group operation is given by concatenation. Let
α, β ∈ P G. The concatenation of α and β is defined to be
: t ∈ [0, 12 )

α2t
(α ∗ β)t = −1
α1 (β0 ) β2t−1 : t ∈ [ 12 , 1].
The inverse of a path γ is defined to be the same path, but in the reverse direction

(γ −1 )t = γ1−t .

Concatenation or inversion of piecewise regular paths is still piecewise regular. In order

to obtain an identity element, we must quotient out by an equivalence relation.

Definition 24 A path γ ∈ P G is called reducible if there exist paths α, β, ζ ∈ P G such

that γ = α ∗ ζ ∗ ζ −1 ∗ β, up to reparametrization. The path α ∗ β is called a reduction of
γ. We define the reduction of ζ ∗ ζ −1 to be ce , the constant path at the identity e ∈ G. A
path γ is irreducible if no reduction exists. An irreducible path γ̃ obtained by finitely many
iterative reductions of a path γ is called an irreducible reduction of γ.

17
Darrick Lee and Robert Ghrist

ζ −1
β β
ζ

α α

α ∗ ζ ∗ ζ −1 ∗ β α∗β

Figure 1: (Left) An example of a reducible path α ∗ ζ ∗ ζ −1 ∗ β. (Right) The irreducible

reduction of the path on the left.

Theorem 25 (Chen (1958)) Every piecewise regular path γ ∈ P G has a unique irre-
ducible reduction up to reparametrization.

This result allows us to define the notion of tree-like equivalence.

Definition 26 A path γ ∈ P G is a tree-like path if its irreducible reduction is ce , the

constant path at the identity. Two paths α, β are tree-like equivalent, α ∼t β, if α ∗ β −1 is
a tree-like path.

Remark 27 The definition of tree-like equivalence includes translations. Indeed, suppose

γ ∈ P G and g ∈ G. Define gγ and γg to be the left and right translations of the path γ by
g. Then γ ∼t gγ since γ ∗ (gγ)−1 = γ ∗ γ −1 by the definition of the concatenation operator.
The same holds for right translations.
Additionally, tree-like equivalence also includes reparametrization since the definition of
reductions are reparametrization invariant.

Proposition 28 Tree-like equivalence is an equivalence relation.

Proof Let γ, γ1 , γ2 , γ3 ∈ P G. Note that the subscript here denotes distinct paths, and
does not denote the time parameter. By definition the reduction of γ ∗ γ −1 is the constant
path, so γ ∼t γ.
Next, if γ = α ∗ ζ ∗ ζ −1 ∗ β, for paths α, β, ζ ∈ P G, then γ −1 = β −1 ∗ ζ ∗ ζ −1 ∗ α−1 .
Thus, a path is reducible if and only if its inverse is reducible. Additionally, the reduction
β −1 ∗ α−1 of γ −1 is the inverse of the reduction α ∗ β of γ. Now, suppose γ1 ∼t γ2 so that
γ1 ∗ γ2−1 is tree-like. By the above argument, γ2 ∗ γ1−1 is also tree-like, so γ2 ∼t γ1 .
Finally, the concatentation α ∗ β of two tree-like paths is also tree-like, by performing
all the reductions of α and then performing all the reductions on β. Suppose γ1 ∼t γ2 and
γ2 ∼t γ3 . Then, γ1 ∗ γ3−1 is a reduction of (γ1 ∗ γ2−1 ) ∗ (γ2 ∗ γ3−1 ), and the latter path is
tree-like since it is a concatenation of two tree-like paths. By the uniqueness of irreducible
reductions, γ1 ∗ γ3−1 is tree-like. Thus, γ1 ∼t γ3 .

18
Path Signatures on Lie Groups

We can now define P g G := P G/ ∼t to be the space of tree-like equivalence classes of

piecewise regular paths in G. We define the identity element to be [ce ] ∈ P
g G, the equivalence
class of the constant path at the identity. Compatibility of concatenation and inversion are
implicit in the above proof, and the group axioms are easily checked. Thus, we have shown
the following.

Proposition 29 The quotient P

g G is a group.

We can now state Chen’s injectivity theorem.

Theorem 30 (Chen (1958)) Suppose G is a real Lie group. Let α, β ∈ P G. Then S(α) =
S(β) if and only if α and β are tree-like equivalent.

Chen also showed that the signature is a group homomorphism. Namely, suppose α, β ∈
P G. Chen’s identity (Chen, 1954) states that

S(α ∗ β) = S(α) ⊗ S(β). (10)

Putting the previous results together, we obtain the following characterization.

Proposition 31 The path signature map S : P

g G → T ((g)) is an injective group homomor-
phism.

We will also require an internal multiplicative structure on the path signature coefficients
which is an immediate generalization of the Euclidean path signature.

Definition 32 Let k and l be non-negative integers. A (k, l)-shuffle is a permutation of σ

of the set {1, 2, . . . , k + l} such that

σ −1 (1) < σ −1 (2) < . . . < σ −1 (k)

and

σ −1 (k + 1) < σ −1 (k + 2) < . . . < σ −1 (k + l).

We denote by Sh(k, l) the set of (k, l)-shuffles. Given two finite ordered multi-indices I =
(i1 , . . . , ik ) and J = (j1 , . . . , jl ) , let R = (r1 , . . . , rk , rk+1 , . . . rk+1 ) = (i1 , . . . , ik , j1 , . . . , jl )
be the concatenated multi-index. The shuffle product of I and J is defined to be the multiset

I J = rσ(1) , . . . rσ(k+l) : σ ∈ Sh(k, l) .

As an example, suppose I = (1, 2) and J = (2, 3). Then

I J = {(1, 2, 2, 3), (1, 2, 2, 3), (2, 1, 2, 3), (1, 2, 3, 2), (2, 1, 3, 2), (2, 3, 1, 2)} .
Theorem 33 Let I and J be multi-indices in [N ], of lengths k and l respectively, and
suppose γ ∈ P G. Then X
S I (γ)S J (γ) = S K (γ). (11)
K∈I J
19
Darrick Lee and Robert Ghrist

Proof Let R = (r1 , . . . , rk , rk+1 , . . . rk+l ) = (i1 , . . . , ik , j1 , . . . , jl ). Writing out the signature
on the left side of the equation using Equation 6, we get
Z Z
0 0
ωi1 (γt1 ) . . .ωik (γtk )dt1 . . . dtk ωj1 (γt01 ) . . . ωjl (γt0l )dt1 . . . dtl
∆k Z ∆ l

= ωr1 (γt01 ) . . . ωrk+l (γt0k+l ) dt1 . . . dtk+l ,

∆k ×∆l

and the sum on the right side is

X Z
ωσ(r1 ) (γt01 ) . . . ωσ(rk+l ) (γt0k+l )dt1 . . . dtk+l .
k+l
σ∈Sh(k,l) ∆

The equivalence of the two formulas is given by the standard decomposition of ∆k × ∆l

into (k + l)-simplices,

∆k × ∆l = {(t1 , . . . , tk+l ) : 0 < t1 < . . . < tk < 1, 0 < tk+1 < . . . < tk+l < 1}
G
= (tσ(1) , . . . , tσ(k+l) ) : 0 < t1 < . . . < tk+l < 1 .
σ∈Sh(k,l)

3.2 Relationship between paths in G and RN

In this subsection, we define a signature-preserving bijection between piecewise regular paths
in G and piecewise regular paths in RN which start at the identity and origin respectively.
The idea behind the following proposition is that the path signature computation only
requires the first derivative of paths. The Lie bracket is unused in the computation of path
signatures, so we can simply consider the Lie algebras of Lie group as vector spaces. Thus,
we can identify the underlying vector space of the Lie algebra g with the underlying vector
space of the Lie algebra r of RN . We then use the correspondence Ψg : P g → P Gg between
piecewise continuous paths P g and piecewise regular paths P G given in Corollary 10, to
map paths on G to paths on RN .
In the following proposition, we abuse notation and consider elements of the Lie algebras
r of RN and g of G as both the tangent space at the identity, and the vector space of left-
invariant vector fields. Similarly, we consider elements of the dual of the Lie algebra r∗
and g∗ as both the cotangent space at the identity, and the vector space of left-invariant
1-forms.
Because we will be using two different path signature functions, we will denote by
SR : P RN → T ((RN )) the path signature for RN with respect to the ordered basis of
standard 1-forms (dx1 , . . . , dxN ) of r∗ . We denote SG : P G → T ((RN )) to be the path
signature for G with respect to a given ordered basis (ω1 , . . . , ωN ) of g∗ .

Proposition 34 Suppose G is an N -dimensional Lie group with Lie algebra g. Let φ : r̄ →

ḡ be an isomorphism of vector spaces, and φ∗ : g∗ → r∗ be its dual. Let (dx1 , . . . , dxN )

20
Path Signatures on Lie Groups

denote the standard 1-forms of RN , and define ωi = (φ∗ )−1 (dxi ). Let SG : P G → T ((RN ))
be the path signature map for G with respect to the ordered basis (ω1 , . . . , ωN ) of g∗ . Then,
there exists a bijection Φ : P RN
0 → P Ge such that SR (γ) = SG (Φ(γ)) for all γ ∈ P R .
N

Proof The construction of the map Φ is derived from Corollary 10. Let ΨR : P r → P RN0
and ΨG : P g → P Ge be the bijections from Corollary 10 for RN and G respectively. Now,
define Φ by
Ψ−1 φ Ψ
Φ : P RN
0 −
−−R
→ Pr − G
→ P g −−→ P Ge .
The idea is that we start with a path γ ∈ P RN
0 , and apply the following maps:
1. Ψ−1 0
R : take the derivative γ to obtain a path in r

2. φ : identify the underlying vector space of r with g

3. ΨG : solve the differential equation (Equation 2) with the identity initial condition
to obtain a path in G.

5
0.1

2.5 0.05

0 0

-2.5 -0.05

-5 -0.1
-5 -2.5 0 2.5 5 -0.1 -0.05 0 0.05 0.1

Figure 2: An example of the function Φ when we take G = S 1 ×S 1 . (Left) A path γ ∈ P R20 .

(Middle) The derivative of γ as a path in r or g. (Right) The corresponding path
Φ(γ).

Because all three maps are bijective, Φ is also bijective. To show that the signatures
are invariant under this mapping, let I = (i1 , . . . , ik ) and γ ∈ P RN 0 . The path signature of
γ with respect to I is
Z
I
SR (γ) = dxi1 (γt1 ) . . . dxik (γtk )dt1 . . . dtk .
∆k
Note that the derivative of Φ(γ) is given by Φ(γ)0t = φ(γt0 ) and thus, the path signature of
Φ(γ) with respect to I is
Z
I
SG (Φ(γ)) = ωi1 (φ(γt01 )) . . . ωik (φ(γt0k ))dt1 . . . dtk
k
Z∆
= φ∗ (ωi1 )(γt01 ) . . . φ∗ (ωik )(γt0k )dt1 . . . dtk
∆k
= SRI (γ).

21
Darrick Lee and Robert Ghrist

The final equality holds because the dual isomorphism φ∗ takes ωi to dxi . Thus, SRI (γ) =
I (Φ(γ)) for all Γ ∈ P RN and all multi-indices I.
SG e

3.3 Stability of the path signature

In this section, we will discuss the stability of path signatures, which is of crucial importance
in machine learning applications. Because we will only be interested in truncated path sig-
natures in applications, we will consider the truncated signature map SM : P G → T ≤M (ḡ),
where
M
M
≤M
T (ḡ) = (ḡ)⊗m ,
m=0

where SM only retains information about the first M levels of the path signature. In
addition, we define the projection map

πm : T ((ḡ)) → ḡ⊗m

to a particular tensor level. Such a map can also be defined on the truncated tensor algebra
πm : T ≤M (ḡ) → ḡ⊗m , and we denote all such maps in the same manner.
By stability of the path signature, we mean to say that the truncated signature map
SM : P G → T ≤M (ḡ) is Lipschitz continuous. In order to disucss such a notion, we must
provide both P G and T ≤M (ḡ) with metrics. We begin with the metric on T ((ḡ)), which is
required in Section 4.2 and is analogous to the metric on T ≤M (ḡ).
Recall that a basis (e1 , . . . , eN ) of ḡ induces a natural inner product on ḡ by defining
the basis to be orthonormal. This extends to an inner product structure on ḡ⊗m , and given
tk ∈ ḡ⊗m , we will denote the norm by ktkm . In addition, this also extends to an inner
product on T ≤M (ḡ). Let s, t ∈ T ≤M (ḡ). Such an inner product and norm are defined to be
v
M X
u M X
X uX
I I
hs, ti = s t , ktk = t (tI )2 . (12)
m=0 |I|=m m=0 |I|=m

Then, we can use the norm to define a metric on both ḡ⊗m and T ≤M (ḡ). Namely, given
sm , tm ∈ ḡ⊗m and s, t ∈ T ≤M (ḡ), we have

dm (sm , tm ) = ksm − tm km
d(s, t) = ks − tk.

Note that this norm on the tensor algebra extends to T ((ḡ)), where the inner product and
norm for s, t ∈ T ((ḡ)) are defined as in (12) with M → ∞. In this case, the inner product
and norm may be infinite. However, image of the path signature lies in a subalgebra of
T ((ḡ)) where the norm is finite. Namely, we define

T1 ((ḡ)) := {t ∈ T ((ḡ)) : ktk < ∞, t0 = 1} . (13)

We nowshow that the signature of any path γ ∈ P G lies in this subspace.

22
Path Signatures on Lie Groups

Lemma 35 Let γ ∈ P G. Then kS(γ)k < ∞.

Proof Without loss of generality, we suppose that γ is parametrized by length such that
it is defined as γ : [0, L] → G, where L is the length, and kγt0 k = 1 for all differentiable t;
this assumption is valid due to the reparametrization invariance of the signature. We will
inductively bound each signature term. At the first level, we have
Z t
|S i (γ)(t)| ≤ |ωi (γs0 )|ds
0
≤ t,

using the fact that |ωi (γt0 )| ≤ kγt0 k = 1. Assume that for any multi-index I = (i1 , . . . , im−1 )
of length m − 1, we have

tm−1
|S I (γ)(t)| ≤ .
(m − 1)!

Now consider the multi-index I = (i1 , . . . , im ) of length m. Using the induction hypothesis,
and the recursive definition of the signature, we have
Z t
I
|S (γ)(t)| ≤ |S (i1 ,...,im−1 ) (s)||ωim (γs0 )|ds
0
Z t
sm−1 tm
≤ ds = .
0 (m − 1)! m!

Therefore, for any multi-index I of length m, we have

Lm
|S I (γ)| ≤ , (14)
m!
and the norm of S(γ) is bounded by
∞ X
X
2
kS(γ)k = (S I (γ))2
m=0 |I|=m
∞
X N m L2m
≤ <∞
(m!)2
m=0

where the last inequality uses the fact that there are N m multi-indices of length m.

Next, we consider a metric structure on P G. We mentioned in Section 2.2 that a metric

on P RN N
0 can be obtained by using the 1-variation of paths. Namely, given α, β ∈ P R0 , we
may consider the distance between these two paths as |α − β|1−var , where the difference
is performed pointwise. Such an approach could be used for P G in theory. Now suppose
α, β ∈ P Ge , we can use the metric on G to define the distance to be |β −1 α|1−var , where the
inversion and multiplication are both performed pointwise. However, this notion of distance
is not well-suited for the path signature.

23
Darrick Lee and Robert Ghrist

The main reason for this is that the computation of |β −1 α|1−var depends fundamentally
on the adjoint action of the Lie group on the Lie algebra, which is governed by the Lie
bracket. Namely, the adjoint action is trivial if and only if the Lie bracket is zero. However,
the path signature ignores the Lie bracket structure, so the prospect of Lipschitz continuity
of the signature with respect to this metric is problematic.
We therefore consider a different metric. Note that our path signature computations
have consistently been performed on the underlying vector space of the Lie algebra ḡ, so it
seems natural to directly define a metric using the derivatives α0 , β 0 ∈ P̂ g. One such notion
of a distance would be the L1 distance between these derivatives
Z 1
kα0 − β 0 kL1 = kαt0 − βt0 kg dt
0

which in particular does not use the Lie bracket structure. In fact this L1 distance is exactly
the 1-variation of the corresponding paths Φ−1 (α), Φ−1 (β) ∈ P RN 0 , given by the bijection
in Proposition 34. Thus, we can define the metric on P Ge to be

dR (α, β) := |Φ−1 (α) − Φ−1 (β)|1−var .

Note that equipped with this metric, the map Φ is trivially an isometry.

Lemma 36 Suppose Φ : P RN 0 → P Ge is the map defined in Proposition 34. Suppose P R0

is equipped with the 1-variation metric, and P Ge is equipped with the metric dR . Then, Φ
is an isometry.

Using this isometry, stability for Lie group path signatures is a direct corollary of stability
for Euclidean path signatures.

Proposition 37 (Friz and Victoir (2010)) Let α, β ∈ P R0 , and let

L ≥ max{|α|1−var , |β|1−var }.

Then, for all k ≥ 1, there exists some Ck > 0 such that

πk S(α) − S(β) ≤ Ck Lk−1 |α − β|1−var .
k

Corollary 38 Let α, β ∈ P Ge , and let L ≥ max{|α|1−var , |β|1−var }. Then, for all k ≥ 1,

there exists some Ck > 0 such that

πk S(α) − S(β) ≤ Ck Lk−1 dR (α, β).
k

3.4 Equivariance of the path signature

At this point, a natural question to consider is how do path signatures behave under Lie
group morphisms? Let G1 and G2 be Lie groups of dimensions N1 and N2 respectively.
Given a Lie group morphism F : G1 → G2 , we have an induced Lie algebra morphism F∗ :
g1 → g2 between the corresponding Lie algebras. In particular, all Lie algebra morphisms
are linear transformations, so if we forget the Lie bracket, this results in a map F∗ : ḡ1 → ḡ2

24
Path Signatures on Lie Groups

between the underlying vector spaces. Because linear transformations induce maps on tensor
products of the space F∗⊗m : ḡ⊗m
1 → ḡ⊗m
2 , we also get an induced map of algebras between
tensor algebras

F∗ : T ((ḡ1 )) → T ((ḡ2 )).

If (e1 , . . . , eN1 ) is an ordered basis for g1 and (f1 , . . . , fN2 ) is an ordered basis for g2 , then
we can write F∗ : g1 → g2 as an N2 × N1 matrix in terms of these bases, which we call M .
We can describe the action of F∗ in the tensor algebra using this matrix. Let t ∈ T ((ḡ1 )).
In general, the action on the order m elements tm ∈ ḡ⊗m is a tensor-matrix multiplication,
as described in Pfeffer et al. (2019), in which all m sides of the tensor tm are multiplied by
the matrix M . This can be written out as
∞ X
X
F∗ t = tI (M ei1 ) ⊗ (M ei2 ) ⊗ . . . ⊗ (M eik ).
m=0 |I|=m

The low order tensors can be written out in usual matrix notation. Consider t1 as a
column vector. The action on first order elements is matrix multiplication,

(F∗ t)1 = M t1 .

Considering t2 as a matrix, the action on the second order elements is conjugation by M ,

(F∗ t)2 = M t2 M | .

For higher orders, we can no longer use matrix notation, so we explicitly define the
action for a given index. Let J = (j1 , . . . , jn ) be a multi-index where jk ∈ [N2 ]. Then, the
element of F∗ t corresponding to the multi-index J is
N1 X
X N1 N1
X
J
(F∗ t) = ... t(i1 ,...,in ) Mj1 ,i1 Mj2 ,i2 , . . . , Mjn ,in .
i1 =1 i2 =1 in =1

The following is a generalization of the equivariance of the path signature in Euclidean

space, which is discussed in Friz and Victoir (2010) and Pfeffer et al. (2019). Here, sup-
pose (ω1 , . . . , ωN1 ) is the dual basis to (e1 , . . . , eN1 ) and (ν1 , . . . , νN2 ) is the dual basis to
(f1 , . . . , fN2 ).

Proposition 39 Let G1 and G2 be Lie groups, with Lie algebras g1 and g2 respectively.
Suppose F : G1 → G2 is a Lie group morphism and γ ∈ P (G1 ). Then

S(F γ) = F∗ S(γ).

Proof The proof of this claim is simply due to the linearity of integrals and 1-forms.
Consider the multi-index J = (j1 , . . . , jm ). Then,
Z
J
S (F γ) = νj1 (F∗ γt01 ) . . . νjm (F∗ γt0m )dt1 . . . dtm .
∆m

25
Darrick Lee and Robert Ghrist

Consider a single factor in the integrand. Using the basis (e1 , . . . , eN1 ) for g, write the
derivative γ 0 as
N1
X
γt0 = cit ei
i=1

where ci : [0, 1] → R are the component paths. Then, since νj (F∗ γt0 ) denotes the j th
component of F∗ γt0 , we can write this as

N1
X
νj (F∗ γt0 ) = Mj,i ωi (γt0 ).
i=1

Substituting this back into the formula for S J (F γ), we get

N1
X N1
X Z
J
S (F γ) = ... (Mj1 ,i1 . . . Mjm ,im ) ωi1 (γt0 ) . . . ωim (γt0 )dt1 . . . dtm
i1 =1 in =1 ∆m
N1
X N1
X
= ... (Mj1 ,i1 . . . Mjm ,im ) S (i1 ,...,im ) (γ)
i1 =1 in =1
J
= (F∗ S(γ)) .

3.5 Lead-lag relationships

For path signatures defined on Euclidean space, a certain linear combination of second
degree signature terms provides a reparametrization invariant indicator of lead-lag behavior
in cyclic real-valued time series, as initially introduced in Baryshnikov and Schlafly (2016).
In this subsection, we will extend this interpretation to time series valued in Lie groups.
A cyclic time series in G is one which is periodic up to a time-reparametrization. More
precisely, a time series γ is cyclic if it factors through the circle,
φ
→ S 1 → G,
γ : [0, 1] −

with φ monotone and winding around the circle at least twice, the winding condition en-
forcing nontrivial repetition.
Consider the interpretation for Euclidean paths in R2 . Suppose γ = (γ 1 , γ 2 ) ∈ P R2 is
a cyclic time series. We say that the component γ 1 exhibits a cyclic leading behavior with
respect to the component γ 2 if the following two conditions hold:

1. when γ 1 is large (small), then γ 2 is increasing (decreasing), and

2. when γ 2 is large (small), then γ 1 is decreasing (increasing).

26
Path Signatures on Lie Groups

The first condition can be viewed as a reparametrization invariant definition of a time series
γ 1 leading another time series γ 2 . The second condition is used because we are working
with cyclic time series, so we also consider the negative influence of γ 2 on γ 1 . We may think
of this phenomena as a feedback loop in which γ 1 positively influences γ 2 and γ 2 negatively
influences γ 1 . The standard example of such behavior is γt = (sin(t), − cos(t)).
To quantify what we mean by large or small in the two conditions above, we translate
the time series such that γ0 = (0, 0) and interpret large (small) to mean positive (negative).
Then, a measure for these two conditions are given by S 1,2 (γ) and −S 2,1 (γ) respectively,
Z 1 Z 1
S 1,2
(γ) = γt1 (γ 2 )0t dt, S 2,1
= (γ) γt2 (γ 1 )0t dt.
0 0

Thus, a measure for cyclic leading behavior can be defined as

1 1 1 2 0
Z
1 1,2
1,2
A (γ) = 2,1
S (γ) − S (γ) = γ (γ )t − γt2 (γ 1 )0t dt.
2 2 0 t

Because the signature is translation invariant, the translation to the origin described
above does not affect this measure. Moreover, if we consider a time series γ ∈ P RN , then
we can consider all pairwise cyclic leading behavior between components. We can place all
of this information into a matrix called the lead matrix, A(γ), which has entries

1 i,j
Ai,j (γ) = S (γ) − S j,i (γ) .

(15)
2
The entries Ai,j (γ) have a geometric interpretation in terms of the signed area of the
path, as per Baryshnikov and Schlafly (2016).
An example of the second level signatures and the signed area is shown in the figure
below.

S 1;2 S 2;1 1 (S 1;2 − S 2;1)

Figure 3: Second level signature computations S 1,2 (left), S 2,1 (middle), and the signed are
A1,2 (right). Blue represents positive area, while red represents negative area.

Returning to the setting of Lie groups, we can define the lead matrix of a path γ ∈ P G
in the same manner, but the interpretation must be slightly modified. Writing out the

27
Darrick Lee and Robert Ghrist

integral for S i,j (γ) given a basis (ω1 , . . . , ωN ) of g∗ , we get

Z 1 Z t
i,j
S (γ) = ωi (γs0 )ds ωj (γt0 )dt.
0 0

The inner integral is simply S i (γ)t and represents the cumulative variation of the path in
the direction of ei (the dual of ωi ), which is the analogue of the displacement in Euclidean
space. Thus, for a cyclic time series γ ∈ P G, we say that the ei direction exhibits cyclic
leading behavior with respect to the ej direction if the following holds:

1. (positive influence) when S i (γ)t is positive (negative), then ωi (γt0 ) is positive (neg-
ative), and

2. (negative influence) when S j (γ)t is positive (negative), then ωj (γt0 ) is negative

(positive).

Thus the lead matrix, as defined in Equation 15, can be interpreted as a measure of this
cyclic leading behavior for Lie group time series. An example of this interpretation is given
in Section 5.1.
However, the geometric interpretation in terms of signed area is no longer available.
This is because any area computation on Lie groups will require second-order differential
information about the paths, but path signatures are only defined using first order differ-
ential information. This suggests that an interpretation in terms of areas on the Lie group
will not be possible. However, by using Proposition 34, the value Ai,j (γ) can still be inter-
preted as the signed area of the corresponding path Φ−1 (γ), where Φ is the bijection given
in Proposition 34.

3.6 Topological considerations

In this section, we will consider the topological interpretation of the first level signature
terms for some Lie groups. Namely, the first level signature term S i is homotopy invariant
if the differential 1-form corresponding to the basis vector ωi ∈ g∗ is a closed form. For
simplicity, we will assume that all paths are piecewise smooth in this section. Note in
particular that the continuous interpretation of discrete time series is piecewise smooth, so
the discussion in this section holds for analysis of these discrete time series.
Recall the following definition of a homotopy between paths.

Definition 40 Suppose α, β : [0, 1] → G are homotopic relative to endpoints if α0 = β0 ,

α1 = β1 and there exists a continuous function h : [0, 1]2 → G, called a homotopy, such
that

h(0, t) = αt , h(1, t) = βt , h(s, 0) = α0 = β0 , h(s, 1) = α1 = β1 .

We use the notation α ' β if the paths α and β are homotopic relative to endpoints.

Loosely speaking, two paths are homotopic relative to endpoints if their endpoints co-
incide, and there exists a continuous deformation from one path to the other. Namely,

28
Path Signatures on Lie Groups

homotopy relative to endpoints forms an equivalence relation in P G. We say that a map

f : P G → R is homotopy invariant if f (α) = f (β) whenever α ' β.

Recall that a differential form ω is closed if its exterior derivative is trivial, dω = 0. By

Stokes’ theorem, the first level signature terms for closed forms are homotopy invariant.
Indeed, let α, β ∈ P G, and let h : [0, 1]2 → G be a homotopy between α and β. By Stokes’
theorem, we have
Z Z
ω= dω = 0,
∂h h

but the boundary of h is exactly α ∗ β −1 . Thus, we have

Z Z Z
ω− ω= ω = 0.
α β ∂h

For left-invariant forms on Lie groups, there is a simple way to determine whether the
form is closed. We begin with the invariant formula for the exterior derivative (Lee, 2003).
Let ω be a 1-form on G, and X, Y are vector fields on G, then

dω(X, Y ) = X(ω(Y )) − Y (ω(X)) − ω([X, Y ]).

If in particular, if ω ∈ g∗ is a left-invariant 1-form and X, Y ∈ g are left-invariant vector

fields, then this formula reduces to

dω(X, Y ) = ω([X, Y ])

since ω(X) and ω(Y ) are constant functions. Thus, the left invariant form ω is closed if
and only if ω([X, Y ]) = 0 for all X, Y ∈ g. In particular, this implies that all left-invariant
1-forms are closed on abelian Lie groups such as RN and T N , since [X, Y ] = 0 for all
X, Y ∈ g. However, there are no closed left invariant 1-forms on SO(3) since a nontrivial
ω ∈ so(3)∗ must be nonzero for at least some Z ∈ so(3). However, for all Z ∈ so(3), there
exist X, Y ∈ so(3) such that Z = [X, Y ]. In fact, this argument extends to all semisimple
Lie groups, and thus there are no closed left-invariant 1-forms on any semisimple Lie group.

3.7 Discretization of the path signature

We have focused our discussion of the path signature so far on the continuous setting in
order to discuss the theoretical framework. However, applications are studied in the discrete
setting. In this section, we provide the explicit computation of path signatures for discrete
time series, and discuss a useful discrete approximation.

Let v = (v 1 , . . . , v N ) ∈ g, where we have written out the components of v in terms of

some choice of basis on g. Consider the continuous time series γt = exp(vt), where exp is
the Lie exponential. Note that the derivative γt0 = v is constant. In this case, the path

29
Darrick Lee and Robert Ghrist

signature of γ is straightforward to compute. Given I = (i1 , . . . , im ), the path signature is

Z
S I (γ) = ωi1 (γt01 ) . . . ωim (γt0m )dt1 . . . dtm
m
Z∆
= vi1 . . . vim dt1 . . . dtm
∆m
vi . . . vim
= 1 .
m!
The entire path signature can be written concisely as the tensor exponential.

Definition 41 Let V be a real vector space. The tensor exponential exp⊗ : V → T ((V )) is
defined to be

v ⊗m
(exp⊗ (v))m = .
m!
Then, we may write the path signature of γ = exp(vt) to be

S(γ) = exp⊗ (v).

Now, suppose we have a discrete time series γ̂ : [T + 1] → G. Recall from Equation 4

that we may compute the discrete derivative by

γ̂t0 = log γ̂t−1 γ̂t+1

to obtain a discrete time series γ̂ 0 : [T ] → g. As discussed in Section 2.3, we interpret these

discrete time series as continuous paths by interpolating using the exponential. Thus, the
continuous interpretation of the discrete time series can be thought of as a concatenation
of several exponential paths

γ = exp(γ̂10 t) ∗ exp(γ̂20 t) ∗ . . . ∗ exp(γ̂T0 t).

Therefore, by the above computation of the path signature of an exponential path and
Chen’s identity, we define the continuous path signature of the discrete time series to be

S(γ̂) = exp⊗ (γ̂10 ) ⊗ exp⊗ (γ̂20 ) ⊗ . . . ⊗ exp⊗ (γ̂T0 ). (16)

By using tensor operations, this formula provides an effective implementation for the com-
putation of the path signature.
An alternative approach is to compute an approximation of the path signature for dis-
crete time series.

Definition 42 Let γ̂ : [T + 1] → G. Suppose ω1 , . . . , ωN ∈ g∗ form a basis of g∗ . Let

I = (i1 , . . . , im ) be a multi-index, where ij ∈ [N ]. Let γ̂ 0 : [T ] → g be the discrete derivative
of γ̂. We define the discrete m-simplex with length T to be

ˆ m = {(t1 , . . . , tm ) ∈ [T ]m
∆ : 0 ≤ t1 < t2 < . . . < tm ≤ 1}.
T

30
Path Signatures on Lie Groups

The discrete path signature of γ̂ with respect to I is defined to be

X
Ŝ I (γ̂) := ωi1 (γ̂t01 ) . . . ωim (γ̂t0m ),
ˆm
(t1 ,...,tm )∈∆ T

where t1 , . . . , tm ∈ [T ]. The discrete path signature can be viewed as a map

Ŝ : P̂ G → T ((g)).

The discrete path signature can be viewed as an approximation to the continuous path
signature. Let γ ∈ P G be a continuous path. Given a partition π = (0 = t1 < t2 < . . . <
tT +1 = 1), the discretization of γ with respect to π, denoted γ̂ (π) : [T + 1] → G, is defined
to be

(π)
γ̂i := γti .

The following proposition in Kiraly and Oberhauser (2019) shows that the discrete signature
indeed approximates the continuous path signature.

Proposition 43 (Corollary 4.3, Kiraly and Oberhauser (2019)) Let γ ∈ P RN , and

define a partition π = (0 = t1 < t2 < . . . < tT +1 = 1). Then,

kŜ(γ̂ (π) ) − S(γ)k ≤ |γ|1−var e|γ|1−var max |γ[ti ,ti+1 ] |1−var .

i=1,...,T

By applying the map Φ : P RN → P G from Proposition 34, and using the fact that it is
an isometry, we immediately get the following corollary for Lie group valued paths.

Corollary 44 Let γ ∈ P G, and define a partition π = (0 = t1 < t2 < . . . < tT +1 = 1).

Then,

kŜ(γ̂ (π) ) − S(γ)k ≤ |γ|1−var e|γ|1−var max |γ[ti ,ti+1 ] |1−var .

i=1,...,T

We consider the discrete path signature since it is more amenable to computation. In

particular, Kiraly and Oberhauser (2019) derived efficient algorithms to compute the dis-
crete path signature kernel, and we extend these algorithms to Lie groups in Section 4.3.
In addition, the discrete path signature is of independent interest and the algebraic prop-
erties of the discrete path signature are studied in Diehl et al. (2020), where it is called the
iterated sums signature.

3.8 Path transformations

We turn to discussion of several transformations which add information to paths.

31
Darrick Lee and Robert Ghrist

3.8.1 Time transformation

The time transformation is a simple method to remove reparametrization invariance (com-
mon in the path signatures literature such as in Chevyrev and Kormilitzin (2016)), defined
by appending the time parameter to the path:

TTime : P Ge → Pe(G × R)
γt 7→ (γt , t). (17)

Lemma 45 The map TTime : P Ge → Pe(G × R) is injective.

Proof Consider α, β ∈ P Ge . The time parameter of TTime (α) is monotone increasing,

implying the path TTime (α) is irreducible. Thus, since we also assume that α0 = β0 = e,
the paths TTime (α) and TTime (β) are tree-like equivalent if and only if they differ by a
reparametrization. However, all paths in the image of TTime have the same parametrization
in the time coordinate. Thus, TTime (α) is tree-like equivalent to TTime (β) if and only if
α = β.
Because the path signature is injective with respect to tree-like equivalence classes of paths,
this lemma implies that by first embedding a path γ ∈ P Ge into Pe(G × R), we obtain a
parametrization-dependent feature map for P Ge using the path signature. More generally,
this also removes tree-like invariance, so in fact we obtain an injective feature map for P Ge .

3.8.2 Identity initialized transformation

The identity initialized (IdInit) transformation is a simple method to remove translation
invariance: to the authors’ knowledge, this has not been explicitly discussed in the context
of path signatures. Suppose g ∈ G and we define `g ∈ P G to be the exponential path from
the identity e ∈ G to the point g ∈ G. Then the IdInit transformation is defined to be

TIdInit : P G → P Ge
γ 7→ `γ0 ∗ γ. (18)

For discrete time series, this simply amounts to appending the identity element to the
beginning of the time series. The following lemma is clear by definition.

Lemma 46 The map TIdInit : P G → P Ge is injective.

The implication of this lemma is that by embedding P G into P Ge , we can obtain a

translation-dependent feature map for P G using the path signature. Moreover, by combin-
ing the time and identity start transformations, the path signature provides an injective
feature map for P G

Corollary 47 The composition TTime ◦ TIdInit : P G → Pe(G × R) is injective.

32
Path Signatures on Lie Groups

3.8.3 Sliding window transformation

The sliding window transformation (often called the lead-lag transformation in the path
signature literature) has been used for several applications of the path signatures, such
as Gyurkó et al. (2013) and Yang et al. (2019), and has been found to produce good results.
An empirical study by Fermanian (2019) has shown that the sliding window transformation
performs well on classification tasks. The exact definition may vary slightly between these
papers, and we give the definition from Yang et al. (2019) and Fermanian (2019).
Let τ ∈ R+ , and m ∈ N. Given a path γ ∈ P G, which we recall is defined on the unit
interval γ : [0, 1] → G, we can extend its definition to all of R as

 γ0 : t < 0
γt = γt : t ∈ [0, 1]
γ1 : t > 1.


Then, we define the sliding window transformation with m lags to be

TSWin,m : P G → P Gm+1
γt 7→ (γt , γt−τ , γt−2τ , . . . , γt−mτ ). (19)

For discrete time series, we assume that the data is temporally uniformly sampled, and
we choose τ to be the time in between samples. Then, if we consider a discrete time series
to be γ̃ : {0, . . . , n} → G, then the sliding window transformation will be

TSWin,m (γ̃)i := (γ̃i , γ̃i−1 , . . . , γ̃i−m ).

In the context of Euclidean path signatures, Fermanian (2019) empirically shows that
the sliding window embedding often performs well on classification tasks, though there is no
theoretical explanation. We note that due to the choice of padding the start of the delayed
time series with the identity, this transformation breaks translation invariance. We suggest
that breaking the translation symmetry is one reason the sliding window transformation
performs well in practice. This is discussed in Remark 64 in Section 5.1.

4. The path signature kernel for Lie groups

In this section, we show that a normalized variant of the path signature can be used to
define a universal and characteristic kernel for Lie group valued time series. We begin by
giving an overview of kernel methods and the types of problems that kernel methods can
solve. Next, we extend the results of Chevyrev and Oberhauser (2018) to show that path
signatures for Lie group valued paths are universal and characteristic. Finally, we show
that the algorithms introduced in Kiraly and Oberhauser (2019) for efficient computation
of the signature kernel can also be used for Lie group valued paths.

4.1 Background on universal and characteristic kernels

Suppose X is a topological space which represents the space of data we would like to
consider; we will call this the input space. Many tasks in machine learning can be separated
into two broad classes of problems.

33
Darrick Lee and Robert Ghrist

1. Those which involve making inferences about functions f ∈ F, where F ⊂ RX is

the function class which we are considering. Performing binary classification reduces
to learning a function f ∈ F such that the level set f = 0 represents the decision
boundary between the two classes of data.

2. Those which involve making inferences about probability measures µ ∈ P(X ), where
P(X ) denotes the space of Borel probability measures on X . For example, in two
sample hypothesis testing, we begin with samples {x1 , . . . , xn } and {y1 , . . . , ym } taken
from probability distributions p and q on X respectively. Testing the null hypothesis
that p = q then corresponds to learning about the underlying measures of p and q.
The general philosophy behind kernel methods is to map the input space X into a
reproducing kernel Hilbert space (RKHS) H using a feature map

Φ : X → Hκ ,

where the corresponding kernel is

κ : X × X → R, κ(x, x0 ) = hΦ(x), Φ(x0 )i.

Problems involving learning nonlinear functions f ∈ RX given some input data {xi },
where xi ∈ X , can be reformulated as problems involving learning an element g ∈ H (which
can be thought of as a function g ∈ RX ) given the data {Φ(xi )}. Additionally, the norm
induced by the Hilbert space provides a metric between points x, y ∈ X as kx − yk. In
essence, this translates a nonlinear learning problem into a linear learning problem. This
allows the application of linear methods, which are much simpler and better developed in
many cases.
Measures µ on X can be mapped into the RKHS via the kernel mean embedding (KME),
Z
Φ : M(X ) → Hκ , Φ(µ) := Φ(x)dµ(x) = Eµ [Φ]. (20)
X
A priori, this map is not necessarily well-defined, so we will usually require restrictions on
the feature map or kernel such that the integral exists.

Lemma 48 (Sriperumbudur et al. R(2010))

p Let µ ∈ M(X ). If the kernel κ : X →
X → R is measurable and the integral X κ(x, x)dµ(x) < ∞, then Φ(µ) ∈ Hκ .

Specifically, the bounded integral condition is satisfied if we know that κ(x, x) = kΦ(x)k2 <
C for all x ∈ X for some fixed constant C. In other words, if the image of the feature map
Φ is contained in a bounded subset of Hκ , then the KME is well defined.
Similar to the previous case, we can use the norm on Hκ to define a notion of distance
on P(X ). Although this only provides a pseudometric since Φ may not be injective, it
coincides with a well known measure of discrepancy between probability measures.

Definition 49 Let F ⊂ RX be a class of functions and µ, ν ∈ P(X ). The maximum mean

discrepancy (MMD) of µ and ν with respect to F is

MMD[F, µ, ν] := sup (Eµ [f ] − Eν [f ]) . (21)

f ∈F

34
Path Signatures on Lie Groups

When we take the function class F to be the unit ball in the RKHS Hκ , the MMD can
be written as the distance between the mean embeddings, with respect to the norm on H.

Lemma 50 (Borgwardt et al. (2006)) Suppose the KME map Φ is well-defined and
suppose µ, ν ∈ M(X ). Let F = {f ∈ H ⊂ RX : kf k ≤ 1}. Then,

MMD2 [F, µ, ν] = kΦ(µ) − Φ(ν)k2 . (22)

This simplifies the study of probability measures by considering them as elements of a linear
space, and also provides a straightforward method to compute an unbiased finite sample
estimate of the MMD in terms of the kernel.

Lemma 51 (Gretton et al. (2012)) Suppose the KME map Φ is well-defined and sup-
pose µ, ν ∈ M(X ). Let F = {f ∈ H ⊂ RX : kf k ≤ 1}. Let X = (x1 , . . . , xn ) and
Y = (y1 , . . . , ym ) be i.i.d. samples from µ and ν respectively. An unbiased estimate of
MMD2 [F, µ, ν] is given as the MMD of the empirical distributions of X and Y ,
n n m m
1 XX 1 XX
MMD2u [F, X, Y ] = κ(xi , xj ) + κ(yi , yj )
n(n − 1) m(m − 1)
i=1 j6=i i=1 j6=i
n m
2 XX
− κ(xi , yj ). (23)
nm
i=1 j=1

From this discussion, kernels provide a unified way to study both nonlinear functions
and probability measures using the linear space H. However, there are deficiencies in both
scenarios.

1. In the case of nonlinear functions, we usually begin by choosing our function class
F ⊂ RX . How do we know that any function f : X → R can be represented arbitrarily
closely by an element in ` ∈ H such that f (x) ≈ h`, Φ(x)i for all x ∈ X ?

2. In the case of probability measures, it is often crucial that the MMD is in fact a
metric instead of just a pseudometric. How do we know that the feature map Φ is
rich enough to distinguish all probability measures µ ∈ M(X )?

The answer is given by the definitions of universal and characteristic kernels. This will
require us to extend the definition of the KME to Schwarz distributions rather than just
measures. We provide a quick exposition of the definitions here, and refer the reader to a
more thorough treatment in Simon-Gabriel and Schölkopf (2018). As usual, let F ⊂ RX be
a function class, and let F 0 denote its topological dual of all continuous linear functionals.
The definition of the KME for distributions is analogous to the case of measures
Z
0
Φ : F → Hκ , Φ(D) := Φ(x)dD(x), (24)
X

where the integral here is the weak- or Pettis- integral (Simon-Gabriel and Schölkopf, 2018).
Similar to the case of measures, this map is a priori not well-defined. However, we have a
simple criterion for the existence of these weak integrals.

35
Darrick Lee and Robert Ghrist

Lemma 52 (Simon-Gabriel and Schölkopf (2018)) Let F ⊂ RX . If the map ι : Hκ →

RX defined by ι(`) := h`, Φ(·)i has image contained in F, then the integral Φ(D) exists for
all D ∈ F 0 . Thus, the KME map Φ : F 0 → Hκ is well-defined.

We can now state the definition of a universal and characteristic feature map.

Definition 53 Fix an input space X , and a function space F ⊂ RX . Consider a feature

map

Φ : X → Hκ

into an RKHS Hκ with respect to a kernel k. Suppose that h`, Φ(·)i ∈ F for all ` ∈ Hκ . We
say that Φ is

1. universal to F if the map

ι : Hk → F, ` 7→ h`, Φ(·)i

has a dense image in F; and

2. characteristic to a subset P ⊆ F 0 if the KME map

Z
Φ : P → Hk , D 7→ Φ(x)dD(x)
X

is injective.

Note that we have assumed that the image ι(Hκ ) ⊂ F so by Lemma 52, the KME map
is well defined. The property of universality allows us to approximate any function f ∈ F
using linear functionals h`, Φ(·)i for ` ∈ Hκ . The dual of a class of functions F is generally
much larger than the set of probability measures on X . If M(X ) ⊂ F 0 , then a characteristic
feature map is able to represent probability measures on X with elements of H. Moreover
the MMD becomes a metric due to the injectivity of the KME.
We have the following equivalence between universality and characteristicness, as shown
in Simon-Gabriel and Schölkopf (2018) and Chevyrev and Oberhauser (2018).

Theorem 54 Suppose that F is a locally convex topological vector space. A feature map Φ
is universal to F if and only if Φ is characteristic to F 0 .

4.2 The path signature kernel

In this subsection, we will define the path signature kernel, and show that it is universal and
characteristic. This was shown for the Euclidean case in Chevyrev and Oberhauser (2018),
which states that these properties hold when studying paths evolving in a Hilbert space.
Through our definition of the path signature for Lie groups, we provide a clarification: the
space itself need not be a Hilbert space, but rather the space of tangent vectors must be a
Hilbert space. In our case of Lie groups, this is the Lie algebra with a Riemannian metric.
With this setup, the path signature kernel for Lie groups is universal and characteristic.

36
Path Signatures on Lie Groups

Recall that we have defined T1 ((ḡ)) in Equation 13 to be the subspace of T ((ḡ)) with
constant value 1 and finite norm. We will view the path signature
S:P
g G → T1 ((ḡ))
as a feature map, and recall that T1 ((ḡ)) is equipped with an inner product, and is in
particular a Hilbert space. However, as discussed in the previous subsection, we will need
to ensure that the signature map sends paths to a bounded subset of T1 ((ḡ)) in order for
the KME to be defined. This will be done by using a tensor normalization, which was first
discussed in Chevyrev and Oberhauser (2018).

Definition 55 A tensor normalization is a continuous injective map of the form

Λ : T1 ((ḡ)) → {t ∈ T1 ((ḡ)) : ktk ≤ K}
t 7→ δλ(t) t
where K > 0 is a constant, λ : T1 ((ḡ)) → (0, ∞) is a function, and δλ is the tensor dilation
from Definition 9.

We will discuss the construction and computational aspects of tensor normalization

in Appendix B. For now, we will assume that a tensor normalization exists, and set Λ :
T1 ((ḡ)) → T1 ((ḡ)) to be a fixed tensor normalization. Now, we have the normalized signature

ΦS : P
g G → T1 ((ḡ)), ΦS = Λ ◦ S,

which is a continuous injective map from P gG into a bounded subset of T1 ((ḡ)). Note that
due to the scaling property of the path signature from Proposition 23, this is equivalent to
ΦS (γ) = S(λ(S(γ)) · γ),
where we first scale the path in G by λ(S(γ)), using the Lie algebra scaling from Defini-
tion 13.
Following Chevyrev and Oberhauser (2018), we will show universality and then use
the duality in Theorem 54 to show characteristicness with respect to probability measures.
Using the theory discussed in the previous section, the objective is to find a function class
F ⊂ RP G and a topology on P G such that
1. the function class F can be approximated by linear functionals h`, ΦS (·)i, and
2. the dual F 0 contains probability measures on P G.
The difficulty with such a result is due to the fact that P G is not locally compact. How-
ever, the class of continuous bounded functions Cb (P G, R) has such properties when P G is
endowed with the strict topology, originally defined in Giles (1971).
Definition 56 Let X be a topological space. We say that a function ψ : X → R vanishes at
infinity if for each > 0, there exists a compact set K ⊂ X such that supx∈X−K |ψ(x)| < .
Denote by B0 (X, R) the set of functions that vanish at infinity. The strict topology on
Cb (X, R) is the topology generated by the seminorms
pψ (f ) = sup |f (x)ψ(x)|, ψ ∈ B0 (X, R).
x∈X

37
Darrick Lee and Robert Ghrist

Theorem 57 Let X be a metrizable topological space.

1. The strict topology on Cb (X, R) is weaker than the uniform topology and stronger than
the topology of uniform convergence on compact sets.

2. If F0 is a subalgebra of Cb (X, R) such that for all x, y ∈ X, there exists some f ∈ F0

such that f (x) 6= f (y) (F0 separates points), and for all x ∈ X, there exists some
f ∈ F0 such that f (x) 6= 0, then F0 is dense in Cb (X, R) under the strict topology.

3. The topological dual of Cb (X, R) equipped with the strict topology is the space of finite
regular Borel measures on X.

Specifically, note that the space of finite regular Borel measures on X includes all probability
measures on X. Finally, we are ready to state the universality and characteristicness result.

Theorem 58 Let Λ : T 1 (Rn ) → T 1 (Rn ) be a tensor normalization. The normalized signa-

ture

Φ:P
g G → T1 ((ḡ)), Φ = Λ ◦ S,

1. is a continuous injection from P

g G into a bounded subset of T 1 (Rn ),

2. is universal to F := Cb (P
g G, R) equipped with the strict topology, and

3. is characteristic to the space of finite regular Borel measures on P

g G.

Proof The fact that Φ is an injection follows from the injectivity of the path signature
from Theorem 30 and the definition of the tensor normalization. Continuity follows from
the stability property of Corollary 38. Next, we move on to universality. Define
∞
M
L=1+ (ḡ)⊗m
m=1

to be a dense subspace of T1 ((ḡ)) (note that L only contains finite linear combinations of
tensors, whereas T1 ((ḡ)) contains all power series of tensors) and define

F0 = {h`, Φ(·)i : P
g G → R : ` ∈ L}.

We aim to show that F0 satisfies the hypotheses of the second point of Theorem 57. By the
injectivity of Φ, the class of functions F0 separates points, and because the path signature
is defined with constant term 1, the path signature is nonzero for all paths γ ∈ P
g G. Finally,
by the shuffle product identity from Theorem 33, the class of functions F0 is closed under
shuffle multiplication and is therefore a subalgebra of Cb (Pg G, R). Namely, let I and J be
multi-indices, and eI and eJ be the corresponding basis vectors in L. Then, we may define
multiplication in F0 by
* +
X
heI , Φ(·)iheJ , Φ(·)i = eK , Φ(·) ,

K∈I J

38
Path Signatures on Lie Groups

which is closed. Thus, Φ is universal with respect to F. Finally, by the duality in Theo-
rem 54 and the third point of Theorem 57, the function class F is characteristic with respect
to finite regular Borel measures on P
g G.

Remark 59 Although this theorem is stated for tree-like equivalence classes of paths in G,
by precomposing with the time transformation or identity start transformation discussed
in Section 3.8, we can also obtain universal and characteristic feature maps that are not
reparametrization or translation invariant.

4.3 The kernel trick

The kernel trick refers to an efficient method to compute kernels without having to compute
explicit representations of elements in the feature space. Several efficient algorithms to
compute the Euclidean path signature kernel are provided in Kiraly and Oberhauser (2019),
who state that their algorithms hold for path signatures computed for Hilbert space-valued
data (their methods hold in more generality). The algorithms depend only on an inner
product structure in the space where the integrals are being computed, namely in ḡ, and
thus also hold in our present context of Lie group-valued data. In this section, we will
provide an explicit generalization of their main algorithm.

As in Section 3.3, we will restrict ourselves to the truncated signature

SM : P G → T ≤M (ḡ).

Here, the inner product for T ≤M (ḡ) was given in Equation 12. The signature kernel trun-
cated at level M is defined to be

KM : P G × P G → R, (α, β) 7→ hSM (α), SM (β)i (25)

We will begin by simplifying the computation of the kernel for continuous paths.

Proposition 60 The signature kernel can be computed as

M Z
X m
Y
KM (α, β) = hαs0 i , βt0i ig ds dt, (26)
m m
m=0 (s,t)∈∆ ×∆ i=1

where we view αt0 and βt0 as elements of the Lie algebra g, and the inner product in the
integrand h·, ·ig is computed in the Lie algebra. Also, we denote s = (s1 , . . . , sm ) and
t = (t1 , . . . , tm ) as elements of ∆m , and write ds := ds1 . . . dsm and dt := dt1 . . . dtm .

Proof Let’s consider the inner product at a single level m. Recall that πm : T ((ḡ)) → ḡ⊗m
is the projection on to the level m tensors, and h·, ·im refers to the inner product on ḡ⊗m .

39
Darrick Lee and Robert Ghrist

Then,

hπm S(α), πm S(β)im

X
= S I (α) · S I (β)
I:|I|=m
X Z Z
= ωi1 (αs0 1 ) . . . ωim (αs0 m )ds ωi1 (βt01 ) . . . ωim (βt0m )dt
m ∆m
I=(i1 ,...,im ) ∆
Z X
ωi1 (αs0 1 )ωi1 (βt01 ) . . . ωim (αs0 m )ωim (βt0m ) ds dt

=
(s,t)∈∆m ×∆m I=(i ,...,i )
1 m
Z Ym
= hαs0 i , βt0i ig ds dt.
(s,t)∈∆m ×∆m i=1

Then, adding up all of the levels, we get our desired result.

As noted by Kiraly and Oberhauser (2019), the expression in Equation 26 can be effi-
ciently computed by a method that is similar to Horner’s scheme forPcomputing polynomial
expressions. Suppose we wish to compute the expression p(x) = M i
i=0 x . By expanding
this polynomial as

p(x) = 1 + x(1 + x(1 + . . . + x(1 + x)))

where the recursion occurs M times and computing the brackets from the inside to the out-
side, we can evaluate the expression using M additions and M multiplications. In contrast,
2
the naive computation of p(x) would require M additions and M 2+M multiplications. Note
that we may write out this recursion explicitly as follows. Let

q1 = 1 + x, qm = 1 + xqm−1 .

Then, we may write p(x) = qM . We can significantly reduce the number of operations
required to compute the integrals in Equation 26 by adapting this procedure.

Corollary 61 Let
Z
Q1 (s, t) = 1 + s0 ∈[0,s]
hαs0 0 , βt00 ig ds0 dt0 . (27)
t0 ∈[0,t]

Then, we recursively define

Z
Qm (s, t) = 1 + s0 ∈[0,s]
Qm−1 (s0 , t0 )hαs0 0 , βt00 ig ds0 dt0 . (28)
t0 ∈[0,t]

The signature kernel can be computed as

KM (α, β) = QM (1, 1). (29)

40
Path Signatures on Lie Groups

Proof We will consider the case of M = 2. Here, we have

Z
Q2 (1, 1) = 1 + s ∈[0,1] Q1 (s1 , t1 )hαs0 1 , βt01 ig ds1 dt1
1
t1 ∈[0,1]
 
Z Z
=1+ s1 ∈[0,1]
1 +
s2 ∈[0,s1 ]
hαs0 2 , βt02 ig ds2 dt2  hαs0 1 , βt01 ig ds1 dt1
t1 ∈[0,1] t2 ∈[0,t1 ]
Z Z Z 2
Y
=1+ hα0 , βt01 ig ds1 dt1
s1 ∈[0,1] s1
+ s1 ∈[0,1] s2 ∈[0,s1 ]
hαs0 i , βt0i ig ds1 ds2 dt1 dt2
t1 ∈[0,1] t1 ∈[0,1] t2 ∈[0,t1 ] i=1
= K2 (α, β).

The general proof proceeds in the same manner. By each successive unfolding of the defi-
nition of Qm , we recover an additional summand in Equation 26.

Next, we will consider the discrete formulation of this expression. For simplicity, we
will consider discrete time series of the same length, though the following results also hold
when the two discrete time series are of different lengths. Suppose α̂, β̂ : [T + 1] → G, and
let α̂0 , β̂ 0 : [T ] → g be their corresponding discrete derivatives. Following the notation in
Section 3.7 for the discrete signature, we define the discrete signature kernel truncated at
level M to be
K̂M (α̂, β̂) := hŜM (α̂), ŜM (β̂)iM . (30)

Notice we are using the discrete signature given in Definition 42. Then, the discrete ana-
logues of Proposition 60 and Corollary 61 are as follows.

Proposition 62 The discrete signature kernel can be computed as

M
X X m
Y
K̂M (α̂, β̂) = hαs0 i , βt0i ig . (31)
m=0 (s,t)∈∆ ˆ m i=1
ˆ m ×∆
T T

Corollary 63 Let
X
Q̂1 (s, t) = 1 + hαs0 0 , βt00 ig (32)
s0 ∈[s], t0 ∈[t]

and recursively define

X
Q̂m (s, t) = 1 + Q̂m−1 (s0 , t0 )hαs0 0 , βt00 ig . (33)
s0 ∈[s], t0 ∈[t]

The discrete signature kernel can be computed as

K̂M (α̂, β̂) = Q̂M (T, T ). (34)

41
Darrick Lee and Robert Ghrist

The proofs for these discrete formulas proceed in exactly the same manner as their contin-
uous counterparts. This final recursive formula for the kernel provides an efficient compu-
tation of the discrete signature. We set some notation before writing down the algorithm.
Suppose A, B are T × T arrays. We will use the notation A[i, j] to denote elements of the
array, and we suppose that the arrays are 1-indexed. The notation for the pseudocode is
explained in Appendix A.
For the algorithm, suppose we have discrete time series α, β : [T + 1] → G, and the
corresponding derivatives α0 , β 0 : [T ] → g are already computed, as per Appendix A. We
assume that the Lie group G is N dimensional. In the pseudocode, we let a, b be the discrete
derivatives α0 , β 0 respectively.
Algorithm 1: Discretized Signature Kernel
Input : a,b (T × N arrays): Two paths as discrete derivatives
M (Int): The truncation level
Output: R (Float): The kernel value KM (α, β).
1 Compute the Gram matrix of the derivatives
K ← ab| ;
2 Initialize (T × T ) arrays A and Q ;
3 Initialize the first step of the recursion
A←K;
4 for m=2..M do
5 Q ← 1+ A[, ] ;
6 A←K·Q ;
7 end
8 R ← 1 + A[Σ, Σ];
9 Return R

As discussed by Kiraly and Oberhauser (2019), the runtime of this algorithm is O(T 2 ·
M ). Now, consider the naive computation of the kernel where we first compute the trun-
cated signatures of α̂ and β̂, and then compute the inner product. From the analysis in
Appendix A, a signature computation requires O(T N M ) operations. The leading-order
term in the computation of the inner product is computing the inner product of the re-
spective level M tensors. This requires N M operations. Thus, the complexity of the naive
computation is O(T N M ).
This suggests that if the length T of the time series is large and the truncation level is
small, then a naive computation may be more efficient. However, if we wish to compute the
kernel at a high truncation level, the recursive algorithm provided here scales significantly
better. Furthermore, several variants of this algorithm for Euclidean path signatures are
considered in Kiraly and Oberhauser (2019) such as by incorporating low-rank approxima-
tions. These algorithms can similarly be extended to the setting of Lie groups by applying
them to the discrete derivatives.

5. Experiments
In this section, we provide two detailed experiments to demonstrate the universal and char-
acteristic properties of the path signature. First, we consider the human action recognition

42
Path Signatures on Lie Groups

problem from computer vision, using a Lie group representation of the data. We find that
the path signature method is simple to implement, achieves comparable classification per-
formance to shallow learning methods, and provides an interpretable feature set. Second,
we perform a kernel two-sample hypothesis test aiming to distinguish between two different
random walks on SO(3). Here, we find that the path signature for SO(3) significantly
outperforms the same hypothesis test done using the Euclidean representation of SO(3).

5.1 Human action recognition

In this subsection, we aim to utilize the universal property of the signature kernel to study
a classification problem. Namely, we will be working in the domain of human action recog-
nition, in which several recent works, such as Huang et al. (2017), Rhif et al. (2018) and Li
et al. (2019), have used Lie group representations and deep learning to achieve state of the
art results. We also note that Euclidean path signatures have also been used to study the
Euclidean valued and joint-based representation of this problem in Yang et al. (2019).
Our goal in this section is to demonstrate the utility, simplicity and interpretability
of the Lie group path signature. Thus, our focus will be on establishing baseline results
using support vector machines and random forests, and our main comparison will be with
the “shallow” learning methods described in Vemulapalli et al. (2014) and Vemulapalli
and Chellappa (2016). We achieve comparable classification results using a random forest
algorithm and the second level signature as a feature set. A major advantage of the second
level signature is that we can interpret this feature set in terms of the underlying movement,
and we provide a brief discussion of this analysis.

5.1.1 Lie group representation

In this experiment, we aim to classify actions based on human skeletal motion data, which
we represent as a SO(3)k valued time series. This Lie group representation of human motion
was introduced in Vemulapalli et al. (2014) and Vemulapalli and Chellappa (2016), which
describes a pose as an element of SO(3)k . The relative rotation of two body parts is an
element of SO(3), and by recording the relative rotation of k pairs of body parts, we can
describe the full pose of a human.
A skeleton S = (V, E) can be described using a set of joints V = {v i }ni=1
V
, where v i ∈ R3 ,
nE
i
and a set of body parts E = {e }i=1 , where each body part is a function ei : {0, 1} → V .
We may consider ei (0) and ei (1) to respectively be the start and end point of the body part
ei . Additionally, we will denote the unit vector describing the direction from the start to
the end of the body part by

ei (1) − ei (0)
êi = .
kei (1) − ei (0)k

We will consider all pairs of body parts (ei , ej ) that share a joint, such that ei (c1 ) = ej (c2 )
for some c1 , c2 ∈ {0, 1}, so k is the number of adjacent pairs of body parts. To obtain
the rotation matrix for a chosen pair (ei , ej ), we rotate the global coordinate system (with
minimum rotation) such that êi is the x-axis. Then, the rotation matrix Ri,j ∈ SO(3) is the
minimum rotation from êi to êj in this coordinate system. By repeating this for all adjacent

43
Darrick Lee and Robert Ghrist

pairs, we obtain an element of SO(3)k , and further repeating this for all time steps, we can
represent this motion as a time series in SO(3)k .

5.1.2 Data and preprocessing

The dataset we use is the G3D-Gaming dataset from Bloom et al. (2012). This contains
663 sequences of 20 different gaming motions performed by 10 subjects, and each subject
performed each action at least two times. The 3D locations of 20 joints are provided for
every frame. However, the number of frames for each recorded action varies widely from 3
frames to 330 frames. We use the code provided in Vemulapalli and Chellappa (2016) to
generate the SO(3) representation.

Figure 4: Histogram of number of frames per recorded action in G3D dataset.

Note that in Vemulapalli et al. (2014) and Vemulapalli and Chellappa (2016), all pairs
of body parts are used in the Lie group representation, which results in k = 342 pairs for
this dataset. In contrast, we only use all adjacent pairs of body parts, which we call the
primary pairs, resulting in k = 18 pairs for this dataset. We use significantly less data
because the path signatures take into account the relationships between all input pairs, so
we have information regarding non-primary pairs through the higher order signature terms.
The numbering of the primary pairs are given in the following figure.

44
Path Signatures on Lie Groups

1
8 4
7 3
2
9 5
10 15 11 6

12
16

17 13

18 14

Figure 5: Numbering of the primary pairs of body parts.

We categorize these 18 primary pairs into 5 classes.

Body 1-2
L Arm 3-6
R Arm 7-10
L Leg 11-14
R Leg 15-18

Table 1: The body regions corresponding to groups of primary pairs.

Extensive preprocessing of this data is performed in Vemulapalli et al. (2014) and Vem-
ulapalli and Chellappa (2016) in order to deal with several difficulties. During the training
stage, the following steps were taken.

1. Each time series is resampled via interpolation so that all time series have a fixed
length.

2. A nominal curve in SO(3)k is generated for each action class.

3. To handle issues of rate variation and temporal misalignment, dynamic time warping
(DTW) is used to warp each time series to its corresponding nominal curve.

4. A rolling and unwrapping procedure is computed with respect to its nominal curve to
obtain a curve in the Lie algebra so(3)k .

5. (Optional) A Fourier temporal pyramid (FTP) representation of the Lie algebra curve
may be computed to further deal with temporal misalignment.

45
Darrick Lee and Robert Ghrist

A classifier such as a support vector machine (SVM) is then trained for every action class
(one vs. rest). In the testing stage, the data is preprocessed with respect to all nominal
curves and the corresponding SVM is used for prediction. This amounts to a large pre-
processing cost, especially for test samples, which must be preprocessed with respect to all
action classes.
In contrast, we perform minimal preprocessing since we can compare time series with
varying numbers of frames using path signatures, and the issues of rate variation and tem-
poral misalignment are handled by reparametrization invariance.

5.1.3 Results
Following Vemulapalli et al. (2014); Vemulapalli and Chellappa (2016), we use a cross-
subject test setting, where we use half of the subjects for training, and the other half for
testing. All of the reported classification results are averaged over ten different combinations
of the train/test split. We perform the classification using a kernel SVM, as well as a random
forest. For the kernel SVM, we report the results using the signature kernel truncated at
level 6. We use the Julia implementation of scikit-learn, with the SVC implementation for
support vector machines, which uses the one-against-one approach (Knerr et al., 1990) for
multi-class classifcation. For the random forest, we compute the level 2 signature of the time
series and treat it as a feature set. We use the random forest implementation in the Julia
DecisionTrees package. We follow the suggested default random forest hyperparameters
√
in Probst et al. (2019), and use 1,000 trees, nf features for each tree, where nf is the total
number of input features, a maximum depth of 100, and use 70% of the data to train each
tree. A tensor renormalization is used for all path signature computations as described in
Proposition 65, and using the function
√
x2

if x ≤ √M
ψ(x) =
M + M 1+a (M −a − x−a )/a if x > M

where M = 4 and a = 1 as in Chevyrev and Oberhauser (2018).

We apply each algorithm to several embeddings of the discrete time series. In addition
to using the raw time series, we also use the identity initialialized (IdInit) embedding to
remove translation invariance, and the sliding window (SWin) embedding with 1 to 6 lags.

Random Forest SVM

Raw 69.99% 65.20%
IdInit 80.48% 77.86%
SW, 1 lag 80.32% 79.24%
SW, 2 lags 82.71% 80.27%
SW, 3 lags 83.77% 80.84%
SW, 4 lags 84.47% 81.08%
SW, 5 lags 84.86% 81.44%
SW, 6 lags 85.56% 81.44%

Table 2: Classification results using a random forest and SVM for different embeddings.

46
Path Signatures on Lie Groups

Based on our results, a random forest trained using level 2 signatures outperforms an
SVM trained using level 6 signatures for all embeddings. This may be due to the fact that
random forests are in general better suited for multi-class classification tasks, since the SVM
approach is to split the multi-class problem into 20

2 binary classification tasks.
We note by using the random forest classifier, and introducing lags, we are able to
achieve results which are comparable to the 87.95% accuracy of Vemulapalli and Chellappa
(2016), given the fact that we are using significantly less input data, minimal preprocessing,
and default hyperparameters on the random forest.

Remark 64 In Section 3.8.3, we mentioned that one possible explanation for the strong
empirical performance of the sliding window embedding is the breaking of translation invari-
ance. In these results, we can isolate the effect of breaking translation invariance by using
the IdInit embedding. We see that for both the random forest and the SVM, the performance
improves significantly after using the IdInit embedding when compared to the raw time se-
ries. The sliding window embedding with 1 lag also significantly improves the performance
when compared to the raw time series, but we see that the increase is comparable to that
of the IdInit embedding. When we increase the number of lags, the path signature is able
to capture more information by integrating with respect to past values, and thus the perfor-
mance continues to improve. Fermanian (2019) suggested the problem of explaining why
the sliding window embedding performs well in practice. This empirical evidence indicates
that one such reason is the breaking of the translation invariance of the path signature.

5.1.4 Interpretation of the second level signature matrix

In addition to comparable classification accuracy, the second level signature matrix affords
an interpretable feature set. This is one of the highlights of this method, as none of the
cited classification methods are easily interpretable. Namely, in both the narrow and deep
learning approaches, the input to the classification algorithm is the stacked (and in some
cases preprocessed) Lie group valued time series, resulting in an extremely high dimensional
vector. In this section, we will discuss the interpretation of the raw second level signature
matrix, which was used for the random forest classification.
The second level path signature of a path γ ∈ P G, where G is N -dimensional, can be
viewed as a (N × N ) matrix where the (i, j) entry is simply S (i,j) (γ). For these applications,
our Lie group is G = SO(3)18 , which has N = 54 dimensions. We will consider the absolute
value of the averaged second level signature matrices for each action class of the G3D data
set. Specifically, because we are focused only on the interpretation in this section, we will
use all of the data when computing averages. Also, we take the absolute value to simplify
the interpretation of these matrices. We begin by considering a single action class (walk)
in order to describe how to read the matrix.
Here we have visualized the (54×54) matrix as a heatmap, where the entries are arranged
in accordance with standard matrix notation. The thin red grid lines separate the matrix
into (3 × 3) blocks, where each triplet of entries corresponds to a single primary pair. The
numerical labels on the axes enumerate the blocks (or primary pairs), which correspond to
the primary pairs as shown in Figure 5. The thicker red grid lines are used to separate
the different categories of primary pairs, as outlined in Table 1, and labelled in the figure.
The diagonal entries of the matrix measure the square of the cumulative rotation of a given

47
Darrick Lee and Robert Ghrist

Walk (Si,j)
j
i L Arm R Arm L Leg R Leg
3 6 9 12 15 18
1

3
L Arm 0.8
6
0.6
R Arm
9
0.4
L Leg 12

15 0.2
R Leg
18 0

Figure 6: Averaged absolute second level signature matrix for the action class “walk.”

basis direction for a given primary pair. The off diagonal entries measure the positive or
negative influence of the basis direction i on the basis direction j, as defined in Section 3.5.
For a closer look at this example, we isolate the blocks corresponding to the left and right
legs, and reduce the color threshold.

Walk (|Si,j|)
j
i L Leg R Leg
11 12 13 14 15 16 17 18
0.3
11
12 0.25
L Leg
13 0.2
14
0.15
15
16 0.1
R Leg
17 0.05
18
0

Figure 7: Averaged absolute second level signature matrix for the action class “walk.”

The blocks with the largest magnitude are the diagonal blocks 14 and 18, which cor-
respond to the joints of the left and right foot (see Figure 5. In addition, many of the
off-diagonal entries are nonzero, which corresponds to the action of walking. While walk-
ing, we alternate moving our left and right legs, and the signature matrix measures this as
an influence of rotations about joints in one leg on the rotations about joints in the other
leg.

In Figure 8, we have plotted the absolute value of the averaged second level signature
matrix for all action classes in the data set. We omit the labelling to simplify the figure,
but all labelling is the same as Figure 6. In particular, the colors range from 0 to 1.

48
Path Signatures on Lie Groups

PunchRight PunchLeft KickRight KickLeft

Defend GolfSwing TennisSwingForehand TennisSwingBackhand

TennisServe ThrowBowlingBall AimAndFireGun Walk

Run Jump Climb Crouch

Steer Wave Flap Clap

Figure 8: Averaged absolute S 2 matrices for all actions in G3D dataset.

49
Darrick Lee and Robert Ghrist

5.2 Random walks on Lie groups

In this experiment, we aim to take advantage of the characteristic property of the signature
kernel and perform a kernel two-sample hypothesis testing problem, as described in Gretton
et al. (2012). We aim to distinguish between two distinct discrete random walks on SO(3), or
in other words, between two distinct probability distribution on P (SO(3)). In addition, by
using the matrix representation of SO(3), we can also view these as probability distributions
on P R9 , and we also perform the hypothesis testing in this context. We find that for
this problem, the path signature defined using left-invariant forms on SO(3) significantly
outperforms the Euclidean signature.

5.2.1 Hypothesis testing with MMD

Suppose µ and ν are two probability measures on P G and we have i.i.d. samples {X i }ni=1
of µ and {Y i }ni=1 of ν, each with n samples. Note that we use the superscript to index
the separate samples, since the subscript is reserved for the time parameter. We wish
to distinguish between the null hypothesis H0 : µ = ν against the alternative hypothesis
H1 : µ 6= ν. This is done by computing the unbiased empirical MMD (Equation 23) between
our two collections of samples, and comparing this against a chosen threshold. A Type I
error occurs when the test falsely rejects the null hypothesis, and a Type II error occurs
when the test falsely accepts the null hypothesis. The level α of a test is a chosen upper
bound on the probability of a Type I error.
The threshold for the MMD is chosen such that the test is consistent, meaning the
test achieves a level α and a Type II error of zero in the asymptotic large sample limit.
One method for choosing the threshold is by using large deviation bounds for the MMD
under the assumption that the null hypothesis is true, which are derived in Gretton et al.
(2012). However, because such bounds must hold for all measures, the threshold chosen in
this way is conservative and may not yield optimal results with a finite number of samples.
An alternative method is to use a data-dependent threshold. Our method here will be to
generate a null distribution for the MMD by performing a permutation test, and computing
the 1 − α quantile for a level α test. Alternative methods for deriving data-dependent
thresholds are explored in Gretton et al. (2009, 2012).

5.2.2 Random walk generation

For this experiment, we will consider discrete random walks on SO(3). Each walk is initial-
ized by sampling a point uniformly at random from SO(3). A method to sample uniformly
from SO(3) is given in Diaconis and Shahshahani (1987). Next, we sample T i.i.d. points
{vi }Ti=1 from the unit sphere in the Lie algebra so(3). We use a unimodal probability distri-
bution on S 2 , called the von Mises-Fisher distribution. This is parametrized by the mean
direction x ∈ S 2 , and the concentration parameter κ ≥ 0, which describes the concentration
of the distribution about the mean.
We may think of this as a random walk with drift in the mean direction. A numerically
stable method of sampling from this distribution is discussed in Jakob (2012). The random
walk γ̂ : [T + 1] → SO(3) is defined recursively as
γ̂i+1 = γ̂i exp(cvi ),

50
Path Signatures on Lie Groups

Figure 9: The von-Mises Fisher density on S 2 with mean direction x = (0, 0, 1) and κ = 0.1.

where c > 0 is the step size and γ̂1 is a randomly sampled point on SO(3).

5.2.3 Results
We perform two classes of tests.

1. H0 is false. Here, we attempt to distinguish between distributions with different mean

directions. We take xµ = (1, 0, 0) and xν = (0, 1, 0).

2. H0 is true. Here we take the mean directions of both distributions to be xµ = xν =

(1, 0, 0).

All distributions will have a concentration parameter of κ = 0.1. For each test, we will
sample n = 50 random walks from each distribution, and each random walk will have
L = 100 steps. To generate the null distribution of the MMD, we perform a permutation
test with 2,000 permutations for a given set of samples. A level of α = 0.05 is chosen, so we
compute the 0.95 quantile of the null distribution as the threshold used for the MMD. The
path signature is truncated at level 4 in the MMD computation. We perform 1,000 tests
for each class, and each test is done using both Lie group and Euclidean path signatures.
The following table provides the error rates (false positive/negative) for the two classes
of tests using the two methods.

Lie Group Euclidean

H0 false 6.1% 83.7%
H0 true 5.7% 4.9%

Table 3: Error rates for hypothesis testing. Each test was run 1,000 times.

Additionally, we provide histograms that summarize the test results. The test distribu-
tions show the distribution of MMDu over the 1,000 independent trials. The null distribution
shown is generated from a permutation test for a single trial. The red line shows the 0.95

51
Darrick Lee and Robert Ghrist

quantile, and represents the threshold for that trial. The histograms for H0 being false are
shown in Figure 10 and the histograms for H0 being true are shown in Figure 11.
We find that the Lie group path signatures significantly outperform Euclidean path
signatures. This is due to the fact that the Euclidean representation of the data is ill-suited
for this problem. We are aiming to detect a slight drift in the direction of the rotation,
which is a translation invariant feature in SO(3). However, this is not a translation invariant
feature in the Euclidean representation of the problem, so the effect of the drift is confounded
in the Euclidean path signature.

52
Path Signatures on Lie Groups

Figure 10: Test (top) and null (bottom) distributions of MMDu when H0 is false.

Figure 11: Test (top) and null (bottom) distributions of MMDu when H0 is true.

53
Darrick Lee and Robert Ghrist

6. Conclusion
We have defined path signature for Lie group valued time series, and studied several of its
properties, the main result being the universal and characteristic properties of the signature
kernel. By defining signature using only the derivative of the path, computational techniques
from Euclidean valued paths to Lie group valued paths can be exported cleanly. Our
theory is validated using two detailed experiments highlighting both the universal and
characteristic properties, showing that the path signature has strong empirical performance,
while providing an interpretable feature set which can be used to better understand the
underlying phenomena.
Lie group valued data is ubiquitous; however, in previous studies analyzing such data,
the analysis pipeline can be complicated due to the ostensible complexities when dealing
with Lie groups (as described in Section 5.1). Our derivations show that Lie group valued
data can be treated in a manner nearly identical to standard Euclidean valued data.
The work in this paper provides the foundations for further studying properties and
applications of path signatures for Lie groups and more. We highlight two directions for
possible future research.

1. Although the path signature is a complete characterization of paths in Euclidean

space, the inversion problem of recovering a path given its path signature is diffi-
cult (Lyons and Xu, 2017; Chang et al., 2017). Pfeffer et al. (2019) studies a restricted
and approximate inversion problem from an algebro-geometric perspective of the path
signatures, which was initiated in Améndola et al. (2019).
One may consider a similar question for Lie groups. Suppose we fix a path γ ∈ P G,
and we are given the path signature of a path α ∈ P H, where α = F γ, where
F : G → H is a Lie group homomorphism. Is it possible to recover the Lie group
homomorphism F ?

2. In Chen (1958), the path signature is defined for a manifold M by choosing a collection
of 1-forms {ωi }N
i=1 (as no natural Lie algebra basis is available). Choose x ∈ M to be
a basepoint, and let S : P Mx → T ((RN )) be the path signature defined with respect
to these 1-forms and let α, β ∈ P Mx . If the 1-forms span the cotangent bundle of M
at every point x ∈ M , Chen’s injectivity result states that S(α) = S(β) if and only if
α and β are tree-like equivalent (in a slightly modified sense). The injectivity theorem
given in Theorem 30 is a customization of Chen’s result to the case M = G since a
basis of g∗ spans the cotangent bundle at every point.
Recently, there has been interest in studying time series evolving on manifolds, and
the development of a path signature kernel for paths on manifolds would provide a
powerful tool for geometric time series analysis. One of the difficulties of this definition
would be the representation of data on manifolds and the choice of 1-forms used. One
could begin with path signatures on parallelizable manifolds, which by definition admit
a smooth basis of vector fields and 1-forms. This would encompass all orientable 3-
dimensional manifolds.

54
Path Signatures on Lie Groups

Acknowledgments

D.L. would like to acknowledge Chad Giusti and Jakob Hansen for several helpful discus-
sions throughout this project. D.L. and R.G. are supported by the Office of the Assistant
Secretary of Defense Research & Engineering through ONR N00014-16-1-2010. D.L. is also
supported by the Natural Sciences and Engineering Research Council of Canada (NSERC)
PGS-D3.

Appendix A. Implementation details of path signature computation

We provide implementation details of the path signature computation for Lie groups, which
includes the case for Euclidean space. Several libraries containing path signature computa-
tions for Euclidean paths exist, listed below.
1. The esig package (Lyons) implemented in C++ and Python was one of the first
libraries for path signature computations. It is a CPU-only and single-threaded im-
plementation.
2. The iisignature package (Reizenstein and Graham, 2020) is also implemented in
Python. It is also a CPU-only and single-threaded implementation.
3. The Signatory library (Kidger and Lyons, 2020) is the most recent and complete
implementation of signature computations. It is written in C++ with a built-in
Python wrapper. It includes functionality such as backpropagation which was not
included in the previous packages. In addition, it provides GPU as well as both single
and multi-threaded CPU support.
Our current package is the first Julia implementation of path signature computations,
and is also the first package to support matrix Lie group valued time series. We provide
functions for both the continuous and discrete path signatures. The package can be found
in https://github.com/ldarrick/PathSignatures.
Suppose P : [T + 1] → G is a discrete time series in an N -dimensional matrix Lie group
G ⊆ GLD (R) (elements are D × D matrices), and let p : [T ] → g be the discrete derivative
of the time series. Note our change in notation for the paths in order to match the code.
The continuous signature computation is performed by using Chen’s identity (Equation 10)
as described in Section 3.7. Specifically, once the discrete derivative is computed, the path
signature is computed by tensor exponentiation and multiplication:
S(P ) = exp⊗ (p1 ) ⊗ exp⊗ (p2 ) ⊗ . . . ⊗ exp⊗ (pT ).

Our goal in this section is to compute the truncated discrete signature ŜM (P ). Recall
that the discrete path signature with respect to the multi-index I = (i1 , . . . , im ) ∈ [N ]m is
computed as
X
Ŝ I (γ̂) := ωi1 (pt1 ) . . . ωim (ptm ).
ˆm
(t1 ,...,tm )∈∆ T

We note that the path signature computation relies only on the derivative p, so we split the
computation into two steps.

55
Darrick Lee and Robert Ghrist

1. Compute the discrete derivative p from a discrete time series P in a Lie group G with
respect to a chosen basis of g. Note that this step is dependent on both the Lie group
G, and the choice of basis of g.

2. Compute the truncated discrete path signature ŜM (γ̂) given the discrete derivative p.
Note that this step is independent of both the Lie group G as well as the choice of
basis of g.

This abstraction allows us to write a single path signature function, though we must write
a new discrete derivative function for each Lie group we wish to consider. Let us first fix
some notation used in the pseudo-code.

1. Matrix Multiplication. For a T1 × T2 array A and a T2 × T3 array B, we define

T2
X
AB[i, j] = A[i, k]B[k, j].
k=1

2. Element-wise Multiplication. For two arrays A and B of the same size, we define

(A · B)[i, j] = A[i, j]B[i, j].

3. Transpose. For an array A, we define

A| [i, j] = A[j, i].

4. Sum. For a T1 × T2 array A, we define

T1 X
X T2
A[Σ, Σ] = A[i, j].
i=1 j=1

5. Cumulative Sum. For a T1 × T2 array A, we define the T1 × T2 array

j
i X
X
A[, ][i, j] = A[k1 , k2 ].
k1 =1 k2 =1

6. End. For a length T array A, we define

A[end] = A[T ].

7. Splat. For a length T array A, we define the splat operator

A... = A[1], A[2], . . . , A[T ],

to use the entries of A as arguments for some function.

56
Path Signatures on Lie Groups

Algorithm 2: Discrete Derivative for Matrix Lie Groups

Input : P (((T + 1) × D × D) array): A path in the matrix Lie group G
Output: p ((T × N ) array): The discrete derivative
1 Initialize (T × N ) array p ;
2 Initialize (D × D) array p ;
3 for t=1..T do
4 Compute the discrete derivative at time t
d ← logm(P[t+1,:,:]-1 P[t,:,:]);
5 Convert the derivative to a representation in terms of a chosen basis of g
p[t,:] ← convert to basis(d);
6 end
7 Return p;

The discrete derivative function for a path P : [T + 1] → RN , which can be represented

by a ((T + 1) × N ) array, is simply a difference operation in the first dimension. Now,
let’s consider a path P : [T + 1] → G, where G is an N -dimensional matrix Lie group
G ⊆ GLD (R). This path can be represented by a ((T + 1) × D × D) array. Suppose that we
have implemented a function convert to basis() which takes in a (D × D) array, which
represents an element of g, and converts it into a length N vector. Let logm() be the matrix
logarithm function.
Next, we will describe the discrete signature computation. Note that the following
algorithm is not novel, and is similar to the implementation provided in Kidger and Lyons
(2020). The idea is to compute the truncated signature using forward recursion, using the
computations for lower levels to also compute higher levels. Thus, this computation is made
up of two functions: the initialization and the recursion. The truncated signature will be an
element of T (≤M ) (RN ). This is represented as a 1D array of multidimensional arrays, which
we label s. For example, s[m] is an m-dimensional array of length N in each dimension,
and we denote access to the inner arrays using separate square brackets. Namely, s[2][3, 4]
is the [3, 4] element of the 2D array s[2].
Here, we may easily compute the complexity of the truncated signature algorithm in
terms of the number of elementary operations. We note that there are N M entries of
the level M signature, and all lower levels are computed along the way. To compute an
entry of the level M signature from the level M − 1 signature, we require O(T ) elementary
operations. Thus, the complexity of this algorithm is O(T N M ).

57
Darrick Lee and Robert Ghrist

Algorithm 3: Discrete Signature Initialization

Input : p ((T × N ) array): The discrete derivative
M (Int): The truncation level
Output: s (Tensor polynomial): The truncated discrete signature
1 Initialize 1D array of multidimensional arrays s;
2 Initialize length M array cur ind;
3 Initialize length T array Q;
4 for n=1..N do
5 Set the first index
cur ind[1] ← n;
6 Compute first level signature path and store in s
Q ← K[ ,n];
s[1][n] ← Q[end];
7 Perform forward recursion
s ← sig forward(p, Q cur ind, 2, M);
8 end
9 Return s

58
Path Signatures on Lie Groups

Algorithm 4: Signature Forward Recursion (sig forward)

Input : s (Tensor polynomial): The current signature
p ((T × N ) array): The discrete derivative
lastQ (Length T array): The previous signature path
cur ind (Length M array): The current index
cur level (Int): The current level
M (Int): The truncation level
Output: s (Tensor polynomial): Partially computed truncated discrete signature
1 Initialize length T array Q;
2 if cur level < M then
3 for n=1..N do
4 Update the current index
cur ind[cur level] ← n;
5 Compute the current signature path
Q ← lastQ · p[:,n] ;
Q ← Q[ ] ;
6 Store the current signature
s[cur level][cur ind[1:cur level]...] = Q[end] ;
7 Perform forward recursion
s ← sig forward(p, Q, cur ind, cur level+1, M) ;
8 end
9 else
10 for n=1..N do
\\ Last level: perform the same operations, but don’t perform recursion
11 cur ind[cur level] ← n
12 Q ← lastQ · p[:,n]
13 s[cur level][cur ind...] ← Q(Σ)
14 end
15 end
16 Return s

59
Darrick Lee and Robert Ghrist

Appendix B. Tensor Normalization

In this appendix, we will discuss the construction of a tensor normalization, as shown
in Appendix A of Chevyrev and Oberhauser (2018), as well as the computation of the
normalization. The following proposition provides the construction that we will follow.
Suppose H is a Hilbert space throughout this appendix.

Proposition 65 Let ψ : [1, ∞) → [1, ∞) with ψ(1) = 1. For t ∈ T1 ((H)) and define
λ : T1 ((H)) → (0, ∞) to be the unique non-negative number such that kδλ(t) tk2 = ψ(ktk).
Define

Λ : T1 ((H)) → T1 ((H))
t 7→ δλ(t) t.

Denote further kψk∞ = supx≥1 ψ(x). Then the following holds.

n p o
1. The function Λ takes values in the set t ∈ T1 ((H)) : ktk ≤ kψk∞ .

2. If ψ is injective, then so is Λ.
3. Suppose that supx≥1 ψ(x)/s2 ≤ 1, kψk∞ < ∞, and that ψ is K-Lipschitz for some
K > 0. Then
√ p p
kΛ(s) − Λ(t)k ≤ (1 + K + 2 kψk∞ )( ks − tk ∨ ks − tk).

Corollary 66 Let ψ : [1, ∞) → [1, ∞) be injective satisfying ψ(1) = 1 and the conditions
of item (3.) in Proposition 65. Then, the function Λ constructed in Proposition 65 is a
tensor normalization.

This method of constructing a tensor normalization is done by defining the normalized

norm kδλ(t) tk of an element t, given its original norm ψ(ktk). In practice, we will be working
(≤M )
with a truncated element of the tensor algebra. Thus, given an element t ∈ T1 ((H)),
we can obtain the value of λ(t) by solving the equation kδλ(t) tk2 = ψ(ktk). This reduces
to finding the zero of the polynomial equation
M
X
λ2k ktk k2 − ψ(ktk) = 0.
k=1

References
Marcos Alexandrino and Renato Bettiol. Lie Groups and Geometric Aspects of Isometric
Actions. Springer International Publishing, 2015.
Carlos Améndola, Peter Friz, and Bernd Sturmfels. Varieties of Signature Tensors. Forum
of Mathematics, Sigma, 7:e10, 2019.
Imanol Perez Arribas, Kate Saunders, Guy Goodwin, and Terry Lyons. A signature-
based machine learning model for bipolar disorder and borderline personality disorder.
arXiv:1707.07124 [stat], July 2017.

60
Path Signatures on Lie Groups

Yuliy Baryshnikov and Emily Schlafly. Cyclicity in multivariate time series and applications
to functional MRI data. In 2016 IEEE 55th Conference on Decision and Control (CDC),
pages 1625–1630, December 2016.

Victoria Bloom, Dimitrios Makris, and Vasileios Argyriou. G3D: A gaming action dataset
and real time action recognition evaluation framework. In 2012 IEEE Computer Society
Conference on Computer Vision and Pattern Recognition Workshops, pages 7–12, June
2012.

Karsten M. Borgwardt, Arthur Gretton, Malte J. Rasch, Hans-Peter Kriegel, Bernhard

Schölkopf, and Alex J. Smola. Integrating structured biological data by Kernel Maximum
Mean Discrepancy. Bioinformatics, 22(14):e49–e57, July 2006.

Annegret Y Burtscher. Length structures on manifolds with continuous Riemannian metrics.

New York Journal of Mathematics, 21:273–296, 2015.

Jiawei Chang, Nick Duffield, Hao Ni, and Weijun Xu. Signature inversion for monotone
paths. Electronic Communications in Probability, 22, 2017.

Kuo-Tsai Chen. Iterated Integrals and Exponential Homomorphisms†. Proceedings of the

London Mathematical Society, s3-4(1):502–512, 1954.

Kuo-Tsai Chen. Integration of Paths, Geometric Invariants and a Generalized Baker- Haus-
dorff Formula. Annals of Mathematics, 65(1):163–178, 1957.

Kuo-Tsai Chen. Integration of Paths–A Faithful Representation of Paths by Noncommu-

tative Formal Power Series. Transactions of the American Mathematical Society, 89(2):
395–407, 1958.

Kuo-Tsai Chen. Iterated path integrals. Bulletin of the American Mathematical Society, 83
(5):831–879, September 1977.

Ilya Chevyrev and Andrey Kormilitzin. A Primer on the Signature Method in Machine
Learning. arXiv:1603.03788 [cs, stat], March 2016.

Ilya Chevyrev and Harald Oberhauser. Signature moments to characterize laws of stochastic
processes. arXiv:1810.10971 [math, stat], October 2018.

Ilya Chevyrev, Vidit Nanda, and Harald Oberhauser. Persistence Paths and Signature
Features in Topological Data Analysis. IEEE Transactions on Pattern Analysis and
Machine Intelligence, 42(1):192–202, January 2020.

Persi Diaconis and Mehrdad Shahshahani. The Subgroup Algorithm for Generating Uniform
Random Variables. Probability in the Engineering and Informational Sciences, 1(1):15–
32, January 1987.

Joscha Diehl and Jeremy Reizenstein. Invariants of Multidimensional Time Series Based
on Their Iterated-Integral Signature. Acta Applicandae Mathematicae, 164(1):83–122,
December 2019.

61
Darrick Lee and Robert Ghrist

Joscha Diehl, Kurusch Ebrahimi-Fard, and Nikolas Tapia. Time-Warping Invariants of

Multidimensional Time Series. Acta Applicandae Mathematicae, May 2020.

Adeline Fermanian. Embedding and learning with signatures. arXiv:1911.13211 [cs, stat],
November 2019.

Peter K. Friz and Nicolas B. Victoir. Multidimensional Stochastic Processes as Rough Paths:
Theory and Applications. Cambridge Studies in Advanced Mathematics. Cambridge Uni-
versity Press, 2010.

Robin Giles. A Generalization of the Strict Topology. Transactions of the American Math-
ematical Society, 161:467–474, 1971.

Chad Giusti and Darrick Lee. Iterated Integrals and Population Time Series Analysis. In
Nils A. Baas, Gunnar E. Carlsson, Gereon Quick, Markus Szymik, and Marius Thaule,
editors, Topological Data Analysis, Abel Symposia, pages 219–246, Cham, 2020. Springer
International Publishing.

Arthur Gretton, Kenji Fukumizu, Zaı̈d Harchaoui, and Bharath K. Sriperumbudur. A Fast,
Consistent Kernel Two-Sample Test. In Y. Bengio, D. Schuurmans, J. D. Lafferty, C. K. I.
Williams, and A. Culotta, editors, Advances in Neural Information Processing Systems
22, pages 673–681. Curran Associates, Inc., 2009.

Arthur Gretton, Karsten M. Borgwardt, Malte J. Rasch, Bernhard Schölkopf, and Alexan-
der Smola. A Kernel Two-Sample Test. Journal of Machine Learning Research, 13:
723–773, March 2012.

Lajos Gergely Gyurkó, Terry Lyons, Mark Kontkowski, and Jonathan Field. Extracting
information from the signature of a financial data stream. arXiv:1307.7244 [q-fin], July
2013.

Ben Hambly and Terry Lyons. Uniqueness for the signature of a path of bounded variation
and the reduced path group. Annals of Mathematics, 171(1):109–167, 2010.

Zhiwu Huang, Chengde Wan, Thomas Probst, and Luc Van Gool. Deep learning on lie
groups for skeleton-based action recognition. In The IEEE Conference on Computer
Vision and Pattern Recognition (CVPR), July 2017.

Wenzel Jakob. Numerically stable sampling of the von Mises Fisher distribution on $S2̂$
(and other tricks). 2012.

Patrick Kidger and Terry Lyons. Signatory: Differentiable computations of the signature
and logsignature transforms, on both CPU and GPU. arXiv:2001.00706 [cs, stat], Jan-
uary 2020.

Franz J. Kiraly and Harald Oberhauser. Kernels for Sequentially Ordered Data. Journal
of Machine Learning Research, 20(31):1–45, 2019.

62
Path Signatures on Lie Groups

Stefan Knerr, Léon Personnaz, and Gérard Dreyfus. Single-layer learning revisited: A
stepwise procedure for building and training a neural network. In Françoise Fogelman
Soulié and Jeanny Hérault, editors, Neurocomputing, NATO ASI Series, pages 41–50,
Berlin, Heidelberg, 1990. Springer.
Enrico Le Donne. A Primer on Carnot Groups: Homogenous Groups, Carnot-Carathéodory
Spaces, and Regularity of Their Isometries. Analysis and Geometry in Metric Spaces, 5
(1):116, 2017.
John M. Lee. Introduction to Smooth Manifolds. Graduate Texts in Mathematics. Springer-
Verlag, New York, 2003.
Yanshan Li, Tianyu Guo, Xing Liu, and Rongiie Xia. Skeleton-based Action Recognition
with Lie Group and Deep Neural Networks. In 2019 IEEE 4th International Conference
on Signal and Image Processing (ICSIP), pages 26–30, July 2019.
Terry Lyons. esig. https://esig.readthedocs.io/en/latest/.
Terry Lyons. Differential equations driven by rough signals. Revista Matemática Iberoamer-
icana, 14(2):215–310, 1998.
Terry Lyons. Rough paths, Signatures and the modelling of functions on streams.
arXiv:1405.4537 [math, q-fin, stat], May 2014.
Terry Lyons and Zhongmin Qian. System Control and Rough Paths. Clarendon, Oxford,
2007.
Terry Lyons and Weijun Xu. Hyperbolic development and inversion of signature. Journal
of Functional Analysis, 272(7):2933–2955, April 2017.
Terry Lyons, Michael Caruana, and Thierry Lévy. Differential Equations Driven by Rough
Paths: Ecole d’Eté de Probabilités de Saint-Flour XXXIV-2004. École d’Été de Proba-
bilités de Saint-Flour. Springer-Verlag, Berlin Heidelberg, 2007.
Terry Lyons, Hao Ni, and Harald Oberhauser. A feature set for streams and an application
to high-frequency financial tick data. In BigDataScience ’14, 2014.
P. J. Moore, T. J. Lyons, J. Gallacher, and Alzheimer’s Disease Neuroimaging Initiative.
Using path signatures to predict a diagnosis of Alzheimer’s disease. PloS One, 14(9):
e0222212, 2019.
Max Pfeffer, Anna Seigal, and Bernd Sturmfels. Learning paths from signature tensors.
SIAM Journal on Matrix Analysis and Applications, 40(2):394–416, 2019.
Philipp Probst, Marvin N. Wright, and Anne-Laure Boulesteix. Hyperparameters and
tuning strategies for random forest. WIREs Data Mining and Knowledge Discovery, 9
(3):e1301, 2019.
Jeremy F. Reizenstein and Benjamin Graham. Algorithm 1004: The Iisignature Library:
Efficient Calculation of Iterated-Integral Signatures and Log Signatures. ACM Transac-
tions on Mathematical Software, 46(1):8:1–8:21, March 2020.

63
Darrick Lee and Robert Ghrist

Manel Rhif, Hazem Wannous, and Imed Riadh Farah. Action Recognition from 3D Skele-
ton Sequences using Deep Networks on Lie Group Features. In 2018 24th International
Conference on Pattern Recognition (ICPR), pages 3427–3432, August 2018.

J. M. Selig. Lie Groups and Lie Algebras in Robotics. In Jim Byrnes, editor, Computa-
tional Noncommutative Algebra and Applications, NATO Science Series II: Mathematics,
Physics and Chemistry, pages 101–125, Dordrecht, 2004. Springer Netherlands.

Carl-Johann Simon-Gabriel and Bernhard Schölkopf. Kernel Distribution Embeddings:

Universal Kernels, Characteristic Kernels and Kernel Metrics on Distributions. Journal
of Machine Learning Research, 19(44):1–29, 2018.

Bharath K. Sriperumbudur, Arthur Gretton, Kenji Fukumizu, Bernhard Schölkopf, and

Gert R. G. Lanckriet. Hilbert Space Embeddings and Metrics on Probability Measures.
Journal of Machine Learning Research, 11(Apr):1517–1561, 2010.

Steven H. Strogatz. From Kuramoto to Crawford: Exploring the onset of synchronization

in populations of coupled oscillators. Physica D: Nonlinear Phenomena, 143(1):1–20,
September 2000.

Raviteja Vemulapalli and Rama Chellappa. Rolling Rotations for Recognizing Human Ac-
tions from 3D Skeletal Data. In 2016 IEEE Conference on Computer Vision and Pattern
Recognition (CVPR), pages 4471–4479, June 2016.

Raviteja Vemulapalli, Felipe Arrate, and Rama Chellappa. Human Action Recognition
by Representing 3D Skeletons as Points in a Lie Group. In 2014 IEEE Conference on
Computer Vision and Pattern Recognition, pages 588–595, June 2014.

Weixin Yang, Lianwen Jin, and Manfei Liu. DeepWriterID: An End-to-End Online Text-
Independent Writer Identification System. IEEE Intelligent Systems, 31(2):45–53, March
2016.

Weixin Yang, Terry Lyons, Hao Ni, Cordelia Schmid, and Lianwen Jin. Developing the Path
Signature Methodology and its Application to Landmark-based Human Action Recogni-
tion. arXiv:1707.03993 [cs], December 2019.

Benjamin J. Zimmerman, Ivan Abraham, Sara A. Schmidt, Yuliy Baryshnikov, and Fa-
tima T. Husain. Dissociating tinnitus patients from healthy controls using resting-state
cyclicity analysis and clustering. Network Neuroscience, pages 1–23, April 2018.

(Universitext) George Bachmann, Lawrence Narici, Edward Beckenstein - Fourier and Wavelet Analysis-Springer (2000) PDF
100% (1)
(Universitext) George Bachmann, Lawrence Narici, Edward Beckenstein - Fourier and Wavelet Analysis-Springer (2000) PDF
514 pages
Physics Practical Examination
No ratings yet
Physics Practical Examination
13 pages
2024 Bayer 1
No ratings yet
2024 Bayer 1
166 pages
A Multiplicative Surface Signature Through Its Magnus Expansion
No ratings yet
A Multiplicative Surface Signature Through Its Magnus Expansion
93 pages
Translation-Invariant Cones of Functions On Semi-Simple Lie Groups
No ratings yet
Translation-Invariant Cones of Functions On Semi-Simple Lie Groups
56 pages
Hannan E.J., Krishnaiah P.R., Rao M.M.-Handbook of Statistics, Vol. 5. Time Series in The Time Domain (1985) PDF
No ratings yet
Hannan E.J., Krishnaiah P.R., Rao M.M.-Handbook of Statistics, Vol. 5. Time Series in The Time Domain (1985) PDF
482 pages
Naphthalene
75% (4)
Naphthalene
3 pages
Primer On The Signature Method in Machine Learning
No ratings yet
Primer On The Signature Method in Machine Learning
47 pages
Invertibility and Singularity for Bounded Linear Operators
From Everand
Invertibility and Singularity for Bounded Linear Operators
Robin Harte
No ratings yet
Lie Group
No ratings yet
Lie Group
59 pages
Topological Groups and Invariant Measures
No ratings yet
Topological Groups and Invariant Measures
49 pages
Ecuaciones Diferenciales Parciales y Kernels-2021
No ratings yet
Ecuaciones Diferenciales Parciales y Kernels-2021
41 pages
Hewitt GibbsWilbrahamPhenomenonEpisode 1979
No ratings yet
Hewitt GibbsWilbrahamPhenomenonEpisode 1979
33 pages
Average Signature of Geodesic Paths in Compact Lie Groups: Chong Liu and Shi Wang November 12, 2024
No ratings yet
Average Signature of Geodesic Paths in Compact Lie Groups: Chong Liu and Shi Wang November 12, 2024
27 pages
Sobolev Spaces Elliptic Equations 2010
No ratings yet
Sobolev Spaces Elliptic Equations 2010
88 pages
On Expected Signatures and Signature Cumulants in
No ratings yet
On Expected Signatures and Signature Cumulants in
30 pages
KKKKKKKK
0% (2)
KKKKKKKK
14 pages
VOLTAGE DROP Testing Overview and Lab Sheet
100% (1)
VOLTAGE DROP Testing Overview and Lab Sheet
6 pages
Top Dynamics Survey
No ratings yet
Top Dynamics Survey
52 pages
On The Solution of Linear Differential Equations in Lie Groups - Iserles - Norsett
No ratings yet
On The Solution of Linear Differential Equations in Lie Groups - Iserles - Norsett
38 pages
E Suppl Cor Bertero
No ratings yet
E Suppl Cor Bertero
68 pages
Session 4 5 - Linear Algebra in Python
No ratings yet
Session 4 5 - Linear Algebra in Python
9 pages
Hua Et Al. - 2023 - A Class of Semilinear Elliptic Equations On Groups
No ratings yet
Hua Et Al. - 2023 - A Class of Semilinear Elliptic Equations On Groups
23 pages
Class 10 Physics Practicals No. 6 To 10
No ratings yet
Class 10 Physics Practicals No. 6 To 10
16 pages
State Space Representation of Gaussian Processes
No ratings yet
State Space Representation of Gaussian Processes
45 pages
Holder Classes Via Semigroups and Riesz Transforms
No ratings yet
Holder Classes Via Semigroups and Riesz Transforms
41 pages
AB SG Unit 2 Progress Check MCQ Part A
100% (1)
AB SG Unit 2 Progress Check MCQ Part A
9 pages
Talk Sotiris Notaris
No ratings yet
Talk Sotiris Notaris
29 pages
S - W GAN T S G: IG Asserstein S For IME Eries Eneration
No ratings yet
S - W GAN T S G: IG Asserstein S For IME Eries Eneration
13 pages
Introduction To Matlab
No ratings yet
Introduction To Matlab
25 pages
Discrete Subgroups of Real Semisimple Lie Groups: Home Search Collections Journals About Contact Us My Iopscience
No ratings yet
Discrete Subgroups of Real Semisimple Lie Groups: Home Search Collections Journals About Contact Us My Iopscience
15 pages
IGCSE Physics Exam
100% (4)
IGCSE Physics Exam
24 pages
1 s2.0 S0022247X14008130 Main
No ratings yet
1 s2.0 S0022247X14008130 Main
29 pages
Marcov Jump Processes
No ratings yet
Marcov Jump Processes
16 pages
Introduction to the Mathematics of Inversion in Remote Sensing and Indirect Measurements
From Everand
Introduction to the Mathematics of Inversion in Remote Sensing and Indirect Measurements
S. Twomey
No ratings yet
03 The Gaussian Kernel
No ratings yet
03 The Gaussian Kernel
15 pages
Geometric Integration Theory
From Everand
Geometric Integration Theory
Hassler Whitney
No ratings yet
The Cabal of The Twelve Houses
No ratings yet
The Cabal of The Twelve Houses
9 pages
Chapter 10
100% (1)
Chapter 10
51 pages
Marwan-2011-How To Avoid Potential Pitfalls in Recurrence Plot Based Data Analysis
No ratings yet
Marwan-2011-How To Avoid Potential Pitfalls in Recurrence Plot Based Data Analysis
16 pages
FINAL Chapter 6 Conical Dryer 1
No ratings yet
FINAL Chapter 6 Conical Dryer 1
16 pages
Fractional Signature: A Generalisation of The Signature Inspired by Fractional Calculus
No ratings yet
Fractional Signature: A Generalisation of The Signature Inspired by Fractional Calculus
9 pages
Semigroup Actions On Homogeneous Spaces
No ratings yet
Semigroup Actions On Homogeneous Spaces
43 pages
GAL Daniel Eceizabarrena - Distribution Theory and Fundamental Solutions of Differential Operators
No ratings yet
GAL Daniel Eceizabarrena - Distribution Theory and Fundamental Solutions of Differential Operators
73 pages
Fundamental Properties of Generalized Functions
No ratings yet
Fundamental Properties of Generalized Functions
32 pages
2018DwyerCatalog PDF
No ratings yet
2018DwyerCatalog PDF
548 pages
University of Waterloo Department of Applied Mathematics Amath 391: From Fourier To Wavelets Fall 2019 Lecture Notes
No ratings yet
University of Waterloo Department of Applied Mathematics Amath 391: From Fourier To Wavelets Fall 2019 Lecture Notes
29 pages
8.continuous Groups
No ratings yet
8.continuous Groups
6 pages
Linear Algebra
From Everand
Linear Algebra
Georgi E. Shilov
2.5/5 (3)
Frewer 2012 Iti School PDF
No ratings yet
Frewer 2012 Iti School PDF
62 pages
Recursive Analysis
From Everand
Recursive Analysis
R. L. Goodstein
No ratings yet
D MatN 102 5-6.17 Final MTN 11 17.dvi
No ratings yet
D MatN 102 5-6.17 Final MTN 11 17.dvi
13 pages
Engineering Journal::On The Rational Dynamic Equation On Discrete Time Scales
No ratings yet
Engineering Journal::On The Rational Dynamic Equation On Discrete Time Scales
33 pages
CEP Dynamics 2018-22
No ratings yet
CEP Dynamics 2018-22
11 pages
Calculus Fundamentals Explained
From Everand
Calculus Fundamentals Explained
Samuel Horelick
3/5 (3)
Automatic Wall Painting Robot Journal
No ratings yet
Automatic Wall Painting Robot Journal
2 pages
Kernel Density Estimation On The Siegel Space With An Application To Radar Processing
No ratings yet
Kernel Density Estimation On The Siegel Space With An Application To Radar Processing
18 pages
Abhandlungen Der Koniglichen Gesellschaft Der Wissenschaften Zu Gottingen, Vol. 13.
No ratings yet
Abhandlungen Der Koniglichen Gesellschaft Der Wissenschaften Zu Gottingen, Vol. 13.
38 pages
Nonlinear Transformations of Random Processes
From Everand
Nonlinear Transformations of Random Processes
Ralph Deutsch
No ratings yet
Complex Integration and Cauchy's Theorem
From Everand
Complex Integration and Cauchy's Theorem
G. N. Watson
No ratings yet
Basic Group Theory
No ratings yet
Basic Group Theory
4 pages
Multi-Resolution Analysis and Wavelets
No ratings yet
Multi-Resolution Analysis and Wavelets
13 pages
Density Estimation On The Rotation Group Using Diffusive Wavelets
No ratings yet
Density Estimation On The Rotation Group Using Diffusive Wavelets
14 pages
Invariant Manifold Theory for Hydrodynamic Transition
From Everand
Invariant Manifold Theory for Hydrodynamic Transition
S.S. Sritharan
No ratings yet
Markov Semigroups: Paolo Guiotto
No ratings yet
Markov Semigroups: Paolo Guiotto
26 pages
Four Probe Experiment
No ratings yet
Four Probe Experiment
2 pages
The CE/SE Compressible Fluid Solver: (Theory)
No ratings yet
The CE/SE Compressible Fluid Solver: (Theory)
5 pages
On A New Multivariate Sampling Paradigm and A Polyspline Shannon Function
No ratings yet
On A New Multivariate Sampling Paradigm and A Polyspline Shannon Function
23 pages
Artigo-Vanderlei-v1.0 (CPG)
No ratings yet
Artigo-Vanderlei-v1.0 (CPG)
8 pages
Diffusion Gaussian Kernel PDF
No ratings yet
Diffusion Gaussian Kernel PDF
13 pages
Ur m60
0% (1)
Ur m60
3 pages
1990 Bertrand Discrete Mellin Transform For Signal Analysis
No ratings yet
1990 Bertrand Discrete Mellin Transform For Signal Analysis
4 pages
Diffusion Gaussian Kernel
No ratings yet
Diffusion Gaussian Kernel
13 pages
Sachin MTH 102
No ratings yet
Sachin MTH 102
13 pages
Rapidly Varying Sequences and Rapid Convergence: D. Djur Ci C, Lj.D.R. Ko Cinac, M.R. Žižovi C
No ratings yet
Rapidly Varying Sequences and Rapid Convergence: D. Djur Ci C, Lj.D.R. Ko Cinac, M.R. Žižovi C
7 pages
C1 L2
No ratings yet
C1 L2
25 pages
Polarimeter Biquartz
No ratings yet
Polarimeter Biquartz
4 pages
A Simplified 3D Model For Tunnel Constru PDF
No ratings yet
A Simplified 3D Model For Tunnel Constru PDF
8 pages
Mechanism Synthesis, Graphical
No ratings yet
Mechanism Synthesis, Graphical
32 pages
131 - Vggray R. - Probability, Random Processes, and Ergodic Properties
No ratings yet
131 - Vggray R. - Probability, Random Processes, and Ergodic Properties
5 pages
FHMM1014 Chapter 1 Number and Set
No ratings yet
FHMM1014 Chapter 1 Number and Set
93 pages
Inflatable - Packers - Inflation - Media and - Associated - Factors
No ratings yet
Inflatable - Packers - Inflation - Media and - Associated - Factors
3 pages
IMC Simulation: June 15, 2009
No ratings yet
IMC Simulation: June 15, 2009
4 pages
Chapter 16 Solids Liquids and Gases
No ratings yet
Chapter 16 Solids Liquids and Gases
30 pages
Semi-Integrated SOG/TEOS Etchback Process For Multimetal Submicron Devices
No ratings yet
Semi-Integrated SOG/TEOS Etchback Process For Multimetal Submicron Devices
4 pages
6.1 Classification of Elements
No ratings yet
6.1 Classification of Elements
16 pages
2022 MGS Physics Paper 1
No ratings yet
2022 MGS Physics Paper 1
19 pages
12 Mathematics Sp05
No ratings yet
12 Mathematics Sp05
27 pages
Firstie Unit Three States of Matter Book
No ratings yet
Firstie Unit Three States of Matter Book
7 pages
Experiment 01
No ratings yet
Experiment 01
2 pages