Approximate Entropy for Testing Randomness
Andrew L. Rukhin
Abstract
Key words: Decomposable Statistics, Entropy, Information Divergence, Multinomial Distribution, Poisson Distribution, \chi^2-distribution.
Andrew L. Rukhin is a Professor in the Department of Mathematics and Statistics at the University of Maryland, Baltimore County, Baltimore, MD 21250. He also has a faculty appointment in the Statistical Engineering Division at the National Institute of Standards and Technology, Gaithersburg, MD 20899-0001. This work has been motivated by a joint project with the Computer Security Division of the National Institute of Standards and Technology.
and

\Phi^{(m)} = \frac{1}{n+1-m} \sum_{i=1}^{n+1-m} \log C_i^m.
Observe that C_i^m is the relative frequency of occurrences of the template Y_i^{(m)} in the sequence, and \Phi^{(m)} is the entropy of the empirical distribution arising on the observed subset of the set of all s^m possible patterns of length m.

The approximate entropy ApEn of order m, m \ge 1, is defined as

ApEn(m) = \Phi^{(m)} - \Phi^{(m+1)}

with ApEn(0) = -\Phi^{(1)}. "ApEn(m) measures the logarithmic frequency with which blocks of length m that are close together remain close together for blocks augmented by one position. Thus, small values of ApEn(m) imply strong regularity, or persistence, in a sequence. Alternatively, large values of ApEn(m) imply substantial fluctuation, or irregularity." (Pincus and Singer, 1996, p. 2083).
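For a binary alphabet (s = 2) the quantities \Phi^{(m)} and ApEn(m) above can be computed directly from the overlapping template frequencies. A minimal sketch follows; the function names are illustrative and not from the paper, and the sums run over the n - m + 1 templates of the non-augmented string.

```python
import math

def phi(seq, m):
    """Phi^(m): average log relative frequency of the length-m template
    starting at each position (overlapping templates, no wrap-around)."""
    n = len(seq)
    templates = [tuple(seq[i:i + m]) for i in range(n - m + 1)]
    counts = {}
    for t in templates:
        counts[t] = counts.get(t, 0) + 1
    total = len(templates)
    # mean of log C_i^m over all template positions i
    return sum(math.log(counts[t] / total) for t in templates) / total

def apen(seq, m):
    """ApEn(m) = Phi^(m) - Phi^(m+1), with ApEn(0) = -Phi^(1)."""
    if m == 0:
        return -phi(seq, 1)
    return phi(seq, m) - phi(seq, m + 1)
```

For the perfectly alternating string 0101... the value apen(seq, 0) equals log 2, the largest possible value for a binary alphabet, while a constant string gives ApEn(1) = 0.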
Pincus and Singer (1996) defined a sequence to be m-irregular (m-random) if its approximate entropy ApEn(m) takes the largest possible value. Pincus and Kalman (1997) evaluated the quantities ApEn(m), m = 0, 1, 2, for the binary and decimal expansions of e, \pi, \sqrt{2} and \sqrt{3}, with the surprising conclusion that the expansion of \sqrt{3} demonstrated much more irregularity than that of \pi.
Since \Phi^{(m)} is the entropy of the empirical distribution, which under the randomness assumption must be almost uniform, one should expect that for fixed m, \Phi^{(m)} \approx -m \log s and ApEn(m) = \Phi^{(m)} - \Phi^{(m+1)} \to \log s; indeed this fact follows from Theorem 2 in Pincus (1991). As far as the limiting behavior of ApEn(m) - \log s is concerned, Pincus and Huang (1992), p. 3072, indicate that "analytic proofs of asymptotic normality and especially explicit variance estimates for ApEn appear to be extremely difficult".
The key step leading to the limiting distribution of approximate entropy is a modification of its definition. Introduce the modified version of the empirical distribution entropy \Phi^{(m)} as

\tilde{\Phi}^{(m)} = \sum_{i_1 \cdots i_m} \pi_{i_1 \cdots i_m} \log \pi_{i_1 \cdots i_m}.   (1)

Here \pi_{i_1 \cdots i_m} = \nu_{i_1 \cdots i_m}/n denotes the relative frequency of the pattern (i_1, \ldots, i_m) in the augmented (or circular) version of the original string, i.e., in the string (\epsilon_1, \ldots, \epsilon_n, \epsilon_1, \ldots, \epsilon_{m-1}). Under this definition \nu_{i_1 \cdots i_m} = \sum_k \nu_{i_1 \cdots i_m k}, so that for any m, \sum_{i_1 \cdots i_m} \nu_{i_1 \cdots i_m} = n.
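The circular frequencies \nu and the modified entropy \tilde{\Phi}^{(m)} can be sketched as follows (a minimal illustration; the helper names are mine, not the paper's):

```python
import math
from collections import Counter

def circular_counts(seq, m):
    """nu_{i1..im}: counts of length-m patterns in the augmented (circular)
    string (eps_1,...,eps_n, eps_1,...,eps_{m-1}); the counts sum to n."""
    n = len(seq)
    ext = seq + seq[:m - 1]  # wrap the first m-1 symbols around
    return Counter(tuple(ext[i:i + m]) for i in range(n))

def phi_tilde(seq, m):
    """Modified entropy sum over observed patterns: sum pi log pi, pi = nu/n."""
    n = len(seq)
    return sum((c / n) * math.log(c / n)
               for c in circular_counts(seq, m).values())
```

By construction every pattern count of length m equals the sum of the counts of its s one-symbol extensions, which is the property used repeatedly below.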
Define the modified approximate entropy as

\widetilde{ApEn}(m) = \tilde{\Phi}^{(m)} - \tilde{\Phi}^{(m+1)}.   (2)

If \nu'_{i_1 \cdots i_m} denotes the frequency of the pattern in the original (non-augmented) string, so that \sum \nu'_{i_1 \cdots i_m} = n - m + 1, and \pi'_{i_1 \cdots i_m} = \nu'_{i_1 \cdots i_m}/(n - m + 1), then

\left| \pi_{i_1 \cdots i_m} - \pi'_{i_1 \cdots i_m} \right| \le \frac{m-1}{n-m+1},   (3)

which suggests that for a fixed m, Pincus' approximate entropy ApEn(m) and \widetilde{ApEn}(m) must be close when n is large.
In the next Section I derive the limiting distribution of 2n[\log s - \widetilde{ApEn}(m)] when n \to \infty and m is fixed. It is also proven that n[ApEn(m) - \widetilde{ApEn}(m)] = O_P(n^{-1}), so that the limiting distributions of Pincus' approximate entropy and of its modified version coincide. It is shown here that the limiting distribution of 2n[\log s - \widetilde{ApEn}(m)], as well as that of 2n[\log s - ApEn(m)], is that of a \chi^2-random variable with (s-1)s^m degrees of freedom.
Proposition 1 For fixed m, as n \to \infty, one has the following convergence in distribution:

2n\left[\log s - \widetilde{ApEn}(m)\right] \to \chi^2(s^{m+1} - s^m).

Also

n\left[ApEn(m) - \widetilde{ApEn}(m)\right] = O_P\left(\frac{1}{n}\right),   (4)

so that

2n\left[\log s - ApEn(m)\right] \to \chi^2(s^{m+1} - s^m).
Proof Let us start with the limit theorem for \widetilde{ApEn}(m). Put

Z_{i_1 \cdots i_m} = \sqrt{n}\left[\pi_{i_1 \cdots i_m} - \frac{1}{s^m}\right].

Then the vector formed by the Z_{i_1 \cdots i_m} has an asymptotic multivariate normal distribution with zero mean and covariance matrix of the form

\Sigma_m = \frac{1}{s^m} I_m - \frac{1}{s^{2m}} e_m e_m^T.

Here I_m denotes the s^m \times s^m identity matrix and e_m^T = (1, \ldots, 1) is an s^m-dimensional vector. Since, with probability one, \sum Z_{i_1 \cdots i_m} = 0, (1) shows that

\tilde{\Phi}^{(m)} = \sum_{i_1 \cdots i_m} \left[\frac{1}{s^m} + \frac{Z_{i_1 \cdots i_m}}{\sqrt{n}}\right] \left[-m \log s + \frac{s^m Z_{i_1 \cdots i_m}}{\sqrt{n}} - \frac{s^{2m} Z_{i_1 \cdots i_m}^2}{2n} + O_P\left(n^{-3/2}\right)\right]
= -m \log s + \frac{s^m}{2n} \sum_{i_1 \cdots i_m} Z_{i_1 \cdots i_m}^2 + O_P\left(n^{-3/2}\right).
Using a similar notation for patterns of length m+1, let \pi_{i_1 \cdots i_m i_{m+1}} be the relative frequencies, and let Z_{i_1 \cdots i_m i_{m+1}} denote the corresponding differences between the empirical and theoretical probabilities. Then

Z_{i_1 \cdots i_m} = \sum_{k=1}^{s} Z_{i_1 \cdots i_m k}

and

\tilde{\Phi}^{(m+1)} = -(m+1) \log s + \frac{s^{m+1}}{2n} \sum_{i_1 \cdots i_m i_{m+1}} Z_{i_1 \cdots i_m i_{m+1}}^2 + O_P\left(n^{-3/2}\right).

Thus

\tilde{\Phi}^{(m)} - \tilde{\Phi}^{(m+1)} = \log s - \frac{s^m}{2n} \left[s \sum_{i_1 \cdots i_m i_{m+1}} Z_{i_1 \cdots i_m i_{m+1}}^2 - \sum_{i_1 \cdots i_m} \Big(\sum_k Z_{i_1 \cdots i_m k}\Big)^2\right] + O_P\left(n^{-3/2}\right)
= \log s - \frac{s^m}{2n} Z^T Q Z + O_P\left(n^{-3/2}\right)
with the s^{m+1} \times s^{m+1} block-diagonal matrix Q formed by s^m blocks Q_0 of size s \times s,

Q_0 = s I_1 - e_1 e_1^T,

and the s^{m+1}-dimensional normal vector Z. The distribution of the quadratic form Z^T Q Z is that of \sum l_{i_1 \cdots i_m i_{m+1}} W_{i_1 \cdots i_m i_{m+1}}^2 with independent standard normal variables W_{i_1 \cdots i_m i_{m+1}} and l_{i_1 \cdots i_m i_{m+1}} denoting the eigenvalues of the matrix \Sigma_{m+1}^{1/2} Q \Sigma_{m+1}^{1/2}.
It is easy to check that

\Sigma_{m+1}^{1/2} = \frac{1}{s^{(m+1)/2}} I_{m+1} - \frac{1}{s^{3(m+1)/2}} e_{m+1} e_{m+1}^T

and \Sigma_{m+1}^{1/2} Q \Sigma_{m+1}^{1/2} = \frac{Q}{s^{m+1}}.

The evaluation of the determinant \det\left[\Sigma_{m+1}^{1/2} Q \Sigma_{m+1}^{1/2} - l I_{m+1}\right] shows that the needed eigenvalues are equal to s^{-m} with multiplicity (s-1)s^m and to 0 with multiplicity s^m. Therefore

\tilde{\Phi}^{(m)} - \tilde{\Phi}^{(m+1)} \approx \log s - \frac{1}{2n} \chi^2\left((s-1)s^m\right)

and

n\left[\log s - \widetilde{ApEn}(m)\right] \to \frac{1}{2} \chi^2(s^{m+1} - s^m).
The estimate (3) shows that if Z'_{i_1 \cdots i_m} = \sqrt{n}\left[\pi'_{i_1 \cdots i_m} - s^{-m}\right], then

\left|Z'_{i_1 \cdots i_m} - Z_{i_1 \cdots i_m}\right| \le \frac{(m-1)\sqrt{n}}{n-m+1}

and

\left|\tilde{\Phi}^{(m)} - \Phi^{(m)} - \frac{s^m}{2n}\Big[\sum_{i_1 \cdots i_m} Z_{i_1 \cdots i_m}^2 - \sum_{i_1 \cdots i_m} Z_{i_1 \cdots i_m}'^2\Big]\right| \le \frac{s^{2m} n (m-1)^2}{2(n-m+1)^2}.

Thus (4) follows, and Proposition 1 is proven. □
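Proposition 1 lends itself to a quick numerical check. The sketch below (my own illustration, not from the paper) averages the statistic 2n[\log 2 - \widetilde{ApEn}(1)] over seeded random binary strings; for s = 2, m = 1 the average should be near the mean of a \chi^2 variable with s^{m+1} - s^m = 2 degrees of freedom.

```python
import math
import random
from collections import Counter

def phi_tilde(seq, m):
    # entropy-type sum over circular (augmented) pattern frequencies
    n = len(seq)
    ext = seq + seq[:m - 1]
    counts = Counter(tuple(ext[i:i + m]) for i in range(n))
    return sum((c / n) * math.log(c / n) for c in counts.values())

def statistic(seq, m=1, s=2):
    # 2n [log s - (phi_tilde(m) - phi_tilde(m+1))]
    n = len(seq)
    apen_t = phi_tilde(seq, m) - phi_tilde(seq, m + 1)
    return 2 * n * (math.log(s) - apen_t)

random.seed(1)
vals = [statistic([random.randint(0, 1) for _ in range(1000)])
        for _ in range(300)]
mean = sum(vals) / len(vals)  # should be close to 2, the chi^2(2) mean
```

With n = 1000 the finite-sample bias is small, and the Monte Carlo average settles near 2 as the proposition predicts.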
For the observed value ApEn(m) one has to define \chi^2(obs) as \chi^2(obs) = 2n\left|\log s - ApEn(m)\right|, whereas, as has been noticed, the difference \log s - \widetilde{ApEn}(m) is always positive. The reported P-value (tail probability) is

P_n(m) = 1 - P\left(2^{m-1}, \chi^2(obs)/2\right)

with P denoting the incomplete gamma-function. The null hypothesis of randomness is rejected for large values of \chi^2(obs).
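For the binary case the first argument of the incomplete gamma-function is the integer 2^{m-1}, so the tail probability reduces to a finite Poisson sum and needs no special-function library. A sketch (the function name is mine):

```python
import math

def apen_pvalue(chi2_obs, m):
    """P-value 1 - P(2^{m-1}, chi2_obs/2) for s = 2, where the chi^2
    degrees of freedom are 2^{m+1} - 2^m = 2^m.  For integer a the
    regularized upper incomplete gamma is Q(a, x) = exp(-x) sum_{j<a} x^j/j!."""
    a = 2 ** (m - 1)            # half the degrees of freedom
    x = chi2_obs / 2.0
    term, total = 1.0, 0.0
    for j in range(a):
        total += term           # term equals x^j / j!
        term *= x / (j + 1)
    return math.exp(-x) * total
```

For m = 1 this is simply e^{-\chi^2(obs)/2}.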
The asymptotic distribution of the statistics 2n\left[\log s - \widetilde{ApEn}(m)\right] and 2n\left[\log s - ApEn(m)\right], evaluated under the alternative of the form \pi_{i_1 \cdots i_m i_{m+1}} = s^{-m-1} + n^{-1/2} \delta_{i_1 \cdots i_m i_{m+1}}, with \delta^T e = 0, is a noncentral \chi^2-distribution with s^{m+1} - s^m degrees of freedom and the noncentrality parameter s^{m+1} \delta^T \delta. This fact allows for an approximate power function of the corresponding test of randomness.
To investigate this case, let us write the formula for the modified approximate entropy in the following form:

\widetilde{ApEn}(m) = \frac{1}{n} \sum_{i_1 \cdots i_m} \left[-\nu_{i_1 \cdots i_m 1} \log \frac{\nu_{i_1 \cdots i_m 1}}{\sum_k \nu_{i_1 \cdots i_m k}} - \cdots - \nu_{i_1 \cdots i_m s} \log \frac{\nu_{i_1 \cdots i_m s}}{\sum_k \nu_{i_1 \cdots i_m k}}\right]
= \frac{1}{n} \sum_{i_1 \cdots i_m} f(\nu_{i_1 \cdots i_m 1}, \ldots, \nu_{i_1 \cdots i_m s})   (6)

with f(u_1, \ldots, u_s) denoting the entropy of the probability distribution defined by the probabilities u_k/\sum_j u_j, k = 1, \ldots, s:

f(u_1, \ldots, u_s) = -u_1 \log \frac{u_1}{\sum_j u_j} - \cdots - u_s \log \frac{u_s}{\sum_j u_j}.
Note that our function f has a special form, namely,

f(u_1, \ldots, u_s) = \sum_j \varphi(u_j) - \varphi\Big(\sum_j u_j\Big)

with \varphi(u) = -u \log u.
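The representation (6) can be verified numerically: with circular counts, (1/n) times the sum of f over the s counts of each pattern's one-symbol extensions reproduces \tilde{\Phi}^{(m)} - \tilde{\Phi}^{(m+1)} exactly. A short illustration (helper names are my own):

```python
import math
from collections import Counter

def circ_counts(seq, m):
    n = len(seq)
    ext = seq + seq[:m - 1]
    return Counter(tuple(ext[i:i + m]) for i in range(n))

def phi_tilde(seq, m):
    n = len(seq)
    return sum((c / n) * math.log(c / n) for c in circ_counts(seq, m).values())

def f(*u):
    # f(u_1,...,u_s) = sum_j phi(u_j) - phi(sum_j u_j) with phi(u) = -u log u
    phi = lambda v: 0.0 if v == 0 else -v * math.log(v)
    return sum(phi(v) for v in u) - phi(sum(u))

seq = [0, 1, 1, 0, 1, 0, 0, 1, 1, 1]
m, n, s = 1, len(seq), 2
c1 = circ_counts(seq, m + 1)
# (1/n) * sum over m-patterns of f applied to the counts of their s extensions
lhs = sum(f(*(c1[p + (k,)] for k in range(s)))
          for p in circ_counts(seq, m)) / n
rhs = phi_tilde(seq, m) - phi_tilde(seq, m + 1)
```

The agreement is exact (up to rounding) because each circular pattern count of length m equals the sum of the counts of its s extensions.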
A similar representation, with n replaced by n - m + 1, also holds for ApEn(m). Indeed, in the notation of Section 2, \left|\nu'_{i_1 \cdots i_m} - \sum_k \nu'_{i_1 \cdots i_m k}\right| \le 1, and there exists no more than one m-tuple i_1, \ldots, i_m for which \nu'_{i_1 \cdots i_m} \ne \sum_k \nu'_{i_1 \cdots i_m k}. Therefore

\left|\sum_{i_1 \cdots i_m} \Big(\sum_k \nu'_{i_1 \cdots i_m k}\Big) \log \frac{\sum_k \nu'_{i_1 \cdots i_m k}}{n-m+1} - \sum_{i_1 \cdots i_m} \nu'_{i_1 \cdots i_m} \log \frac{\nu'_{i_1 \cdots i_m}}{n-m+1}\right| \le \max_{0 \le x \le n-m+1} \left[(x+1)\log(x+1) - x\log x\right] + \log(n-m+1) \le 2 \log n,

so that

ApEn(m) = \frac{1}{n-m+1} \sum_{i_1 \cdots i_m} \left[-\nu'_{i_1 \cdots i_m 1} \log \frac{\nu'_{i_1 \cdots i_m 1}}{\sum_k \nu'_{i_1 \cdots i_m k}} - \cdots - \nu'_{i_1 \cdots i_m s} \log \frac{\nu'_{i_1 \cdots i_m s}}{\sum_k \nu'_{i_1 \cdots i_m k}}\right] + O_P\left(\frac{\log n}{n}\right).
Thus ApEn also admits the representation (6), and the limiting distribution of both \widetilde{ApEn} and ApEn is that of this decomposable statistic. Sums of this form (with functions f of only one argument) have been extensively studied; see Holst (1972), Morris (1975) and Medvedev (1977). Although our situation, with f depending on the s frequencies \nu_{i_1 \cdots i_m 1}, \ldots, \nu_{i_1 \cdots i_m s}, does not follow directly from these results, the special form of this function leads to the following Proposition 2, which can be derived from Holst (1972) after some modifications.
Let \eta_{i_1 \cdots i_m 1}, \ldots, \eta_{i_1 \cdots i_m s} denote s independent Poisson random variables with parameter \lambda = n/s^{m+1}. It is also convenient to write \eta_1, \ldots, \eta_s or \eta_1(\lambda), \ldots, \eta_s(\lambda) for an s-tuple of such random variables. Put

\mu_n = \frac{1}{n} \sum_{i_1 \cdots i_m} E f(\eta_{i_1 \cdots i_m 1}, \ldots, \eta_{i_1 \cdots i_m s}) = \frac{s^m}{n} E f(\eta_1, \ldots, \eta_s) = \frac{1}{\lambda s} E f(\eta_1, \ldots, \eta_s)

and

\rho = \frac{Cov\left(f(\eta_1, \ldots, \eta_s), \eta_1 + \cdots + \eta_s\right)}{Var\left(\eta_1 + \cdots + \eta_s\right)}.
With

U_n = \sum_{i_1 \cdots i_m} \left[f(\eta_{i_1 \cdots i_m 1}, \ldots, \eta_{i_1 \cdots i_m s}) - E f(\eta_{i_1 \cdots i_m 1}, \ldots, \eta_{i_1 \cdots i_m s}) - \rho\left(\eta_{i_1 \cdots i_m 1} + \cdots + \eta_{i_1 \cdots i_m s} - s\lambda\right)\right]

and

V_n = \frac{1}{\sqrt{n}} \sum_{i_1 \cdots i_m} \left[\eta_{i_1 \cdots i_m 1} + \cdots + \eta_{i_1 \cdots i_m s} - s\lambda\right],

and with

\sigma_n^2 = Var(U_n) = s^m \left[Var f(\eta_1, \ldots, \eta_s) - \frac{Cov^2\left(f(\eta_1, \ldots, \eta_s), \eta_1 + \cdots + \eta_s\right)}{Var\left(\eta_1 + \cdots + \eta_s\right)}\right],

then the joint asymptotic distribution of U_n/\sigma_n and V_n is normal with zero mean and the identity covariance matrix. The conditional distribution of U_n/\sigma_n given V_n = 0 coincides with the distribution of n[\widetilde{ApEn}(m) - \mu_n]/\sigma_n, since the conditional distribution of the (\eta_{i_1 \cdots i_m 1}, \ldots, \eta_{i_1 \cdots i_m s}) given that \sum_{i_1 \cdots i_m} [\eta_{i_1 \cdots i_m 1} + \cdots + \eta_{i_1 \cdots i_m s}] = n is multinomial.
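The fact used here, that independent Poisson variables conditioned on their total are multinomial, can be confirmed exactly for a small case (s = 2 cells; the snippet is my own illustration):

```python
import math

def pois(k, lam):
    # Poisson(lam) probability mass at k
    return math.exp(-lam) * lam ** k / math.factorial(k)

lam, tot = 0.7, 4              # the rate is arbitrary; condition on eta_1 + eta_2 = 4
denom = pois(tot, 2 * lam)     # the sum of two Poisson(lam) variables is Poisson(2*lam)
cond = [pois(j, lam) * pois(tot - j, lam) / denom for j in range(tot + 1)]
binom = [math.comb(tot, j) * 0.5 ** tot for j in range(tot + 1)]
# cond and binom agree: the conditional law is Binomial(4, 1/2), free of lam
```

The rate cancels from the conditional probabilities, which is exactly why the Poissonization device of Holst (1972) applies.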
Therefore the following result concerning the convergence of n[\widetilde{ApEn}(m) - \mu_n]/\sigma_n and of (n-m+1)[ApEn(m) - \mu_n]/\sigma_n to a standard normal distribution is not surprising.

Proposition 2 As n \to \infty,

P\left(\frac{n[\widetilde{ApEn}(m) - \mu_n]}{\sigma_n} \le x\right) \to \Phi(x)

and

P\left(\frac{(n-m+1)[ApEn(m) - \mu_n]}{\sigma_n} \le x\right) \to \Phi(x),

with \Phi denoting the standard normal distribution function.
Sketch of the Proof The argument above can be made rigorous by examination of the characteristic function of \widetilde{ApEn}(m), as in Lemmas 2.1, 2.2, A1, A2 and A3 of Holst (1972). With N = s^{m+1}, as in Lemma 2.1, one has

A_N(z) = \sum_{n=0}^{\infty} E_n \left[\prod_{i_1 \cdots i_m} x_{i_1 \cdots i_m}^{f(\nu_{i_1 \cdots i_m 1}, \ldots, \nu_{i_1 \cdots i_m s})}\right] \frac{(Nz)^n e^{-Nz}}{n!}
= \prod_{i_1 \cdots i_m} \sum_{j_1 \cdots j_s} \frac{z^{j_1 + \cdots + j_s} e^{-sz}}{j_1! \cdots j_s!} \, x_{i_1 \cdots i_m}^{f(j_1, \ldots, j_s)},

the last product being the expectation of \prod x_{i_1 \cdots i_m}^{f(\eta_{i_1 \cdots i_m 1}(z), \ldots, \eta_{i_1 \cdots i_m s}(z))} for independent Poisson variables with parameter z. A similar representation for the characteristic function \phi(t) = E \exp\{it \sum_{i_1 \cdots i_m} f(\nu_{i_1 \cdots i_m 1}, \ldots, \nu_{i_1 \cdots i_m s})\} as in Lemma 2.2 follows; the only difference is that the ordinary sum in the right-hand side is replaced by the multiple sum

e^{-se^{i\theta}} \sum_{j_1 \cdots j_s} \frac{(e^{i\theta})^{j_1 + \cdots + j_s}}{j_1! \cdots j_s!} \left[e^{it[\varphi(j_1) + \cdots + \varphi(j_s)] - it\varphi(j_1 + \cdots + j_s)} - 1\right].

The same estimates as in Lemmas A1 and A2 hold for the corresponding function. The convergence result in Lemma A2 also holds by analysis of Taylor's expansion. □
For the binary case s = 2 one has f(u_1, u_2) = \varphi(u_1) + \varphi(u_2) - \varphi(u_1 + u_2). Put

\alpha(\lambda) = Var\left(\eta(\lambda) \log \eta(\lambda)\right), \qquad \beta(\lambda) = Cov\left(\eta(\lambda) \log \eta(\lambda), \eta(\lambda)\right).

Then, with

\gamma(\lambda) = Cov\left([\eta_1(\lambda) + \eta_2(\lambda)] \log[\eta_1(\lambda) + \eta_2(\lambda)], \; \eta_1(\lambda) \log \eta_1(\lambda)\right)
= \lambda e^{-2\lambda} \sum_{k=1}^{\infty} \frac{\lambda^k (k+1) \log(k+1)}{k!} \, \kappa_k - E\left[(\eta_1 + \eta_2) \log(\eta_1 + \eta_2)\right] E\left[\eta_1 \log \eta_1\right],

where

\kappa_k = \sum_{j=1}^{k} \binom{k}{j} \log(j+1),

one has

Var f(\eta_1, \eta_2) = \alpha(2\lambda) + 2\alpha(\lambda) - 4\gamma(\lambda).

Thus

\sigma_n^2 = 2^m \left[\alpha(2\lambda) + 2\alpha(\lambda) - 4\gamma(\lambda) - \frac{[\beta(2\lambda) - 2\beta(\lambda)]^2}{2\lambda}\right].
More generally, the asymptotic distribution of the sum

S = \frac{1}{n} \sum_{i_1 \cdots i_m} f(\nu_{i_1 \cdots i_m 1}, \ldots, \nu_{i_1 \cdots i_m s})

can be studied in the same way for other functions f. The asymptotic power of the test based on the statistic S under the alternative \pi_{i_1 \cdots i_m} is determined by the ratio R = \lim [E S - \mu_n]/\sigma_n, whose absolute value is to be maximized to obtain the optimal Pitman efficiency. Under the alternative of the form \pi_{i_1 \cdots i_m i_{m+1}} = s^{-m-1} + n^{-1/4} \delta_{i_1 \cdots i_m i_{m+1}} with

\delta_{i_1 \cdots i_m i_{m+1}} = \int_{(i_1 + i_2/s + \cdots + i_{m+1} s^{-m})/s}^{(i_1 + i_2/s + \cdots + (i_{m+1}+1) s^{-m})/s} q(u) \, du

for a function q such that \int_0^1 q(u) \, du = 0,

R = \int_0^1 \cdots
4 Examples

Here are two strings of 20 binary bits which have been suggested by Chaitin (1975):

(A) 01010101010101010101
(B) 01101100110111100010

For the non-randomly looking sequence (A), ApEn(0) = -\Phi^{(1)} = -\tilde{\Phi}^{(1)} = \log 2, which is the largest possible value for ApEn. Since there are only two occurring patterns of length 2, namely (0,1) and (1,0), with frequencies 10 and 9 respectively,

\Phi^{(2)} = \frac{1}{19} \left[10 \log \frac{10}{19} + 9 \log \frac{9}{19}\right] = -0.6918\ldots

Thus

ApEn(1) = 0.0014\ldots

with \chi^2(obs) = 40[\log 2 - ApEn(1)] = 27.6699\ldots
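The arithmetic of this example can be reproduced directly (a short sketch; the variable names are mine, and the value 0.0014 for ApEn(1) is taken from the text):

```python
import math

# String (A): 20 alternating bits; its 19 overlapping pairs are
# (0,1) ten times and (1,0) nine times.
phi2 = (10 * math.log(10 / 19) + 9 * math.log(9 / 19)) / 19
chi2_obs = 40 * (math.log(2) - 0.0014)  # 2n|log 2 - ApEn(1)| with n = 20
```

Both values match the ones reported above to the printed precision.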
Figure 1: P-values P_n(1) for the first binary digits of \sqrt{3} (broken line), e and \pi.
In Figure 1 the P-values P_n(1) from Section 2 are plotted against the first digits of the binary expansions of \sqrt{3}, e and \pi. According to these data, the P-values corresponding to \sqrt{3} are much smaller than those of e and \pi. The situation, however, is reversed for m = 7, when the digits of \pi and \sqrt{3} look much more random than those of the expansion of e (Figure 2).
Figure 2: P-values P_n(7) for the first binary digits of \sqrt{3} (broken line), e and \pi.
References

[1] Chaitin, G. (1975), "Randomness and mathematical proof," Scientific American, 232, pp. 47-52.