An Introduction to Probability Theory and Its Applications

WILLIAM FELLER (1906-1970)
Eugene Higgins Professor of Mathematics, Princeton University

VOLUME II, SECOND EDITION

John Wiley & Sons, Inc. New York, London, Sydney, Toronto

Copyright © 1966, 1971 by John Wiley & Sons, Inc. All rights reserved. Published simultaneously in Canada. No part of this book may be reproduced by any means, or transmitted, nor translated into a machine language without the written permission of the publisher.

Library of Congress Catalogue Card Number: 57-10805
ISBN 0 471 25709 5
Printed in the United States of America

To O. E. Neugebauer: o et praesidium et dulce decus meum

Preface to the First Edition

AT THE TIME THE FIRST VOLUME OF THIS BOOK WAS WRITTEN (between 1941 and 1948) the interest in probability was not yet widespread. Teaching was on a very limited scale and topics such as Markov chains, which are now extensively used in several disciplines, were highly specialized chapters of pure mathematics. The first volume may therefore be likened to an all-purpose travel guide to a strange country. To describe the nature of probability it had to stress the mathematical content of the theory as well as the surprising variety of potential applications. It was predicted that the ensuing fluctuations in the level of difficulty would limit the usefulness of the book. In reality it is widely used even today, when its novelty has worn off and its attitude and material are available in newer books written for special purposes. The book seems even to acquire new friends. The fact that laymen are not deterred by passages which proved difficult to students of mathematics shows that the level of difficulty cannot be measured objectively; it depends on the type of information one seeks and the details one is prepared to skip. The traveler often has the choice between climbing a peak or using a cable car.

In view of this success the second volume is written in the same style. It involves harder mathematics, but most of the text can be read on different levels. The handling of measure theory may illustrate this point. Chapter IV contains an informal introduction to the basic ideas of measure theory and the conceptual foundations of probability. The same chapter lists the few facts of measure theory used in the subsequent chapters to formulate analytical theorems in their simplest form and to avoid futile discussions of regularity conditions. The main function of measure theory in this connection is to justify formal operations and passages to the limit that would never be questioned by a non-mathematician. Readers interested primarily in practical results will therefore not feel any need for measure theory.

To facilitate access to the individual topics the chapters are rendered as self-contained as possible, and sometimes special cases are treated separately ahead of the general theory. Various topics (such as stable distributions and renewal theory) are discussed at several places from different angles. To avoid repetitions, the definitions and illustrative examples are collected in chapter VI, which may be described as a collection of introductions to the subsequent chapters. The skeleton of the book consists of chapters V, VIII, and XV. The reader will decide for himself how much of the preparatory chapters to read and which excursions to take. Experts will find new results and proofs, but more important is the attempt to consolidate and unify the general methodology.
Indeed, certain parts of probability suffer from a lack of coherence because the usual grouping and treatment of problems depend largely on accidents of the historical development. In the resulting confusion closely related problems are not recognized as such and simple things are obscured by complicated methods. Considerable simplifications were obtained by a systematic exploitation and development of the best available techniques. This is true in particular for the proverbially messy field of limit theorems (chapters XVI-XVII). At other places simplifications were achieved by treating problems in their natural context. For example, an elementary consideration of a particular random walk led to a generalization of an asymptotic estimate which had been derived by hard and laborious methods in risk theory (and under more restrictive conditions independently in queuing).

I have tried to achieve mathematical rigor without pedantry in style. For example, the statement that 1/(1+ξ²) is the characteristic function of ½e^{-|x|} seems to me a desirable and legitimate abbreviation for the logically correct version that the function which at the point ξ assumes the value 1/(1+ξ²) is the characteristic function of the function which at the point x assumes the value ½e^{-|x|}.

I fear that the brief historical remarks and citations do not render justice to the many authors who contributed to probability, but I have tried to give credit wherever possible. The original work is now in many cases superseded by newer research, and as a rule full references are given only to papers to which the reader may want to turn for additional information. For example, no reference is given to my own work on limit theorems, whereas a paper describing observations or theories underlying an example is cited even if it contains no mathematics.* Under these circumstances the index of authors gives no indication of their importance for probability theory. Another difficulty is to do justice to the pioneer work to which we owe new directions of research, new approaches, and new methods. Some theorems which were considered strikingly original and deep now appear with simple proofs among more refined results. It is difficult to view such a theorem in its historical perspective and to realize that here as elsewhere it is the first step that counts.

* This system was used also in the first volume but was misunderstood by some subsequent writers; they now attribute the methods used in the book to earlier scientists who could not have known them.

ACKNOWLEDGMENTS

Thanks to the support by the U.S. Army Research Office of work in probability at Princeton University I enjoyed the help of J. Goldman, L. Pitt, M. Silverstein, and, in particular, of M. M. Rao. They eliminated many inaccuracies and obscurities. All chapters were rewritten many times and preliminary versions of the early chapters were circulated among friends. In this way I benefited from comments by J. Elliott, R. S. Pinkham, and L. J. Savage. My special thanks are due to J. L. Doob and J. Wolfowitz for advice and criticism. The graph of the Cauchy random walk was supplied by H. Trotter. The printing was supervised by Mrs. H. McDougal, and the appearance of the book owes much to her.

WILLIAM FELLER
October 1965
THE MANUSCRIPT HAD BEEN FINISHED AT THE TIME OF THE AUTHOR'S DEATH but no proofs had been received. I am grateful to the publisher for providing a proofreader to compare the print against the manuscript and for compiling the index. J. Goldman, A. Grunbaum, H. McKean, L. Pitt, and A. Pittenger divided the book among themselves to check on the mathematics. Every mathematician knows what an incredible amount of work that entails. I express my deep gratitude to these men and extend my heartfelt thanks for their labor of love.

CLARA N. FELLER
May 1970

Introduction

THE CHARACTER AND ORGANIZATION OF THE BOOK REMAIN UNCHANGED, but the entire text has undergone a thorough revision. Many parts (Chapter XVII, in particular) have been completely rewritten and a few new sections have been added. At a number of places the exposition was simplified by streamlined (and sometimes new) arguments. Some new material has been incorporated into the text. While writing the first edition I was haunted by the fear of an excessively long volume. Unfortunately, this led me to spend futile months in shortening the original text and economizing on displays. This damage has now been repaired, and a great effort has been spent to make the reading easier. Occasional repetitions will also facilitate a direct access to the individual chapters and make it possible to read certain parts of this book in conjunction with Volume 1. Concerning the organization of the material, see the introduction to the first edition (repeated here), starting with the second paragraph.

I am grateful to many readers for pointing out errors or omissions. I especially thank D. A. Hejhal, of Chicago, for an exhaustive and penetrating list of errata and for suggestions covering the entire book.

WILLIAM FELLER
Princeton, N.J.
January 1970

Abbreviations and Conventions

iff is an abbreviation for "if and only if."

Epoch. This term is used for points on the time axis, while "time" is reserved for intervals and durations. (In discussions of stochastic processes the word "times" carries too heavy a burden. The systematic use of "epoch," introduced by J. Riordan, seems preferable to varying substitutes such as moment, instant, or point.)

Intervals are denoted by bars over the endpoints; the combinations of bars distinguish the open, the closed, and the two half-open intervals with endpoints a and b. This notation is used also in higher dimensions. The pertinent conventions for vector notations and order relations are found in V,1 (and also in IV,2). The symbol (a, b) is reserved for pairs and for points.

R¹, R², Rʳ stand for the line, the plane, and the r-dimensional Cartesian space.

1 refers to volume one; Roman numerals refer to chapters. Thus 1; XI,(3.6) refers to section 3 of chapter XI of volume 1.

▶ indicates the end of a proof or of a collection of examples.

n and N denote, respectively, the normal density and distribution function with zero expectation and unit variance.

O, o, and ~. Let u and v depend on a parameter x which tends, say, to a. Assuming that v is positive we write u = O(v) if u/v remains bounded, u = o(v) if u/v → 0, and u ~ v if u/v → 1.

∫ f(x) U{dx}: for this abbreviation see V,3.

Regarding Borel sets and Baire functions, see the introduction to chapter V.

Contents

CHAPTER I. THE EXPONENTIAL AND THE UNIFORM DENSITIES
1. Introduction
2. Densities. Convolutions
3. The Exponential Density
4. Waiting Time Paradoxes. The Poisson Process
5. The Persistence of Bad Luck
6. Waiting Times and Order Statistics
7. The Uniform Distribution
8. Random Splittings
9. Convolutions and Covering Theorems
10. Random Directions
11. The Use of Lebesgue Measure
12. Empirical Distributions
13. Problems for Solution

CHAPTER II. SPECIAL DENSITIES. RANDOMIZATION
1. Notations and Conventions
2. Gamma Distributions
*3. Related Distributions of Statistics
4. Some Common Densities
5. Randomization and Mixtures
6. Discrete Distributions
7. Bessel Functions and Random Walks
8. Distributions on a Circle
9. Problems for Solution

(* Starred sections are not required for the understanding of the sequel and should be omitted at first reading.)

CHAPTER III. DENSITIES IN HIGHER DIMENSIONS. NORMAL DENSITIES AND PROCESSES
1. Densities
2. Conditional Distributions
3. Return to the Exponential and the Uniform Distributions
*4. A Characterization of the Normal Distribution
5. Matrix Notation. The Covariance Matrix
6. Normal Densities and Distributions
*7. Stationary Normal Processes
8. Markovian Normal Densities
9. Problems for Solution

CHAPTER IV. PROBABILITY MEASURES AND SPACES
1. Baire Functions
2. Interval Functions and Integrals in Rʳ
3. σ-Algebras. Measurability
4. Probability Spaces. Random Variables
5. The Extension Theorem
6. Product Spaces. Sequences of Independent Variables
7. Null Sets. Completion

CHAPTER V. PROBABILITY DISTRIBUTIONS IN Rʳ
1. Distributions and Expectations
2. Preliminaries
3. Densities
4. Convolutions
5. Symmetrization
6. Integration by Parts. Existence of Moments
7. Chebyshev's Inequality
8. Further Inequalities. Convex Functions
9. Simple Conditional Distributions. Mixtures
*10. Conditional Distributions
*11. Conditional Expectations
12. Problems for Solution

CHAPTER VI. A SURVEY OF SOME IMPORTANT DISTRIBUTIONS AND PROCESSES
1. Stable Distributions in R¹
2. Examples
3. Infinitely Divisible Distributions in R¹
4. Processes with Independent Increments
*5. Ruin Problems in Compound Poisson Processes
6. Renewal Processes
7. Examples and Problems
8. Random Walks
9. The Queuing Process
10. Persistent and Transient Random Walks
11. General Markov Chains
*12. Martingales
13. Problems for Solution

CHAPTER VII. LAWS OF LARGE NUMBERS. APPLICATIONS IN ANALYSIS
1. Main Lemma and Notations
2. Bernstein Polynomials. Absolutely Monotone Functions
3. Moment Problems
*4. Application to Exchangeable Variables
*5. Generalized Taylor Formula and Semi-Groups
6. Inversion Formulas for Laplace Transforms
*7. Laws of Large Numbers for Identically Distributed Variables
*8. Strong Laws
*9. Generalization to Martingales
10. Problems for Solution

CHAPTER VIII. THE BASIC LIMIT THEOREMS
1. Convergence of Measures
2. Special Properties
3. Distributions as Operators
4. The Central Limit Theorem
*5. Infinite Convolutions
6. Selection Theorems
7. Ergodic Theorems for Markov Chains
8. Regular Variation
*9. Asymptotic Properties of Regularly Varying Functions
10. Problems for Solution

CHAPTER IX. INFINITELY DIVISIBLE DISTRIBUTIONS AND SEMI-GROUPS
1. Orientation
2. Convolution Semi-Groups
3. Preparatory Lemmas
4. Finite Variances
5. The Main Theorems
6. Example: Stable Semi-Groups
7. Triangular Arrays with Identical Distributions
8. Domains of Attraction
9. Variable Distributions. The Three-Series Theorem
10. Problems for Solution

CHAPTER X. MARKOV PROCESSES AND SEMI-GROUPS
1. The Pseudo-Poisson Type
2. A Variant: Linear Increments
3. Jump Processes
4. Diffusion Processes in R¹
5. The Forward Equation. Boundary Conditions
6. Diffusion in Higher Dimensions
7. Subordinated Processes
8. Markov Processes and Semi-Groups
9. The "Exponential Formula" of Semi-Group Theory
10. Generators. The Backward Equation

CHAPTER XI. RENEWAL THEORY
1. The Renewal Theorem
2. Proof of the Renewal Theorem
3. Refinements
4. Persistent Renewal Processes
5. The Number N_t of Renewal Epochs
6. Terminating (Transient) Processes
7. Diverse Applications
8. Existence of Limits in Stochastic Processes
9. Renewal Theory on the Whole Line
10. Problems for Solution

CHAPTER XII. RANDOM WALKS IN R¹
1. Basic Concepts and Notations
2. Duality. Types of Random Walks
3. Distribution of Ladder Heights. Wiener-Hopf Factorization
3a. The Wiener-Hopf Integral Equation
4. Examples
5. Applications
6. A Combinatorial Lemma
7. Distribution of Ladder Epochs
8. The Arc Sine Laws
9. Miscellaneous Complements
10. Problems for Solution

CHAPTER XIII. LAPLACE TRANSFORMS. TAUBERIAN THEOREMS. RESOLVENTS
1. Definitions. The Continuity Theorem
2. Elementary Properties
3. Examples
4. Completely Monotone Functions. Inversion Formulas
5. Tauberian Theorems
6. Stable Distributions
*7. Infinitely Divisible Distributions
*8. Higher Dimensions
9. Laplace Transforms for Semi-Groups
10. The Hille-Yosida Theorem
11. Problems for Solution

CHAPTER XIV. APPLICATIONS OF LAPLACE TRANSFORMS
1. The Renewal Equation: Theory
2. Renewal-Type Equations: Examples
3. Limit Theorems Involving Arc Sine Distributions
4. Busy Periods and Related Branching Processes
5. Diffusion Processes
6. Birth-and-Death Processes and Random Walks
7. The Kolmogorov Differential Equations
8. Example: The Pure Birth Process
9. Calculation of Ergodic Limits and of First-Passage Times
10. Problems for Solution

CHAPTER XV. CHARACTERISTIC FUNCTIONS
1. Definition. Basic Properties
2. Special Distributions. Mixtures
2a. Some Unexpected Phenomena
3. Uniqueness. Inversion Formulas
4. Regularity Properties
5. The Central Limit Theorem for Equal Components
6. The Lindeberg Conditions
7. Characteristic Functions in Higher Dimensions
*8. Two Characterizations of the Normal Distribution
9. Problems for Solution

CHAPTER XVI*. EXPANSIONS RELATED TO THE CENTRAL LIMIT THEOREM
1. Notations
2. Expansions for Densities
3. Smoothing
4. Expansions for Distributions
5. The Berry-Esséen Theorems
6. Expansions in the Case of Varying Components
7. Large Deviations

CHAPTER XVII. INFINITELY DIVISIBLE DISTRIBUTIONS
1. Infinitely Divisible Distributions
2. Canonical Forms. The Main Limit Theorem
2a. Derivatives of Characteristic Functions
3. Examples and Special Properties
4. Special Properties
5. Stable Distributions and Their Domains of Attraction
*6. Stable Densities
7. Triangular Arrays
*8. The Class L
*9. Partial Attraction. "Universal Laws"
*10. Infinite Convolutions
11. Higher Dimensions
12. Problems for Solution

CHAPTER XVIII. APPLICATIONS OF FOURIER METHODS TO RANDOM WALKS
1. The Basic Identity
*2. Finite Intervals. Wald's Approximation
3. The Wiener-Hopf Factorization
4. Implications and Applications
5. Two Deeper Theorems
6. Criteria for Persistency
7. Problems for Solution

CHAPTER XIX. HARMONIC ANALYSIS
1. The Parseval Relation
2. Positive Definite Functions
3. Stationary Processes
4. Fourier Series
*5. The Poisson Summation Formula
6. Positive Definite Sequences. L² Theory
7. Stochastic Processes and Integrals
8. Problems for Solution

ANSWERS TO PROBLEMS
SOME BOOKS ON COGNATE SUBJECTS
INDEX

An Introduction to Probability Theory and Its Applications

CHAPTER I

The Exponential and the Uniform Densities

1. INTRODUCTION

In the course of volume 1 we had repeatedly to deal with probabilities defined by sums of many small terms, and we used approximations of the form (1.1).¹

Examples. (a) Waiting times.² Consider Bernoulli trials performed at epochs δ, 2δ, 3δ, ..., with probability p_δ of success at any given trial, and denote by T the waiting time for the first success. Then

(1.1)  P\{T > n\delta\} = (1-p_\delta)^n,

and the expected waiting time is E(T) = δ/p_δ. Refinements of this model are obtained by letting δ and p_δ grow smaller in such a way that the expectation δ/p_δ = a remains fixed. To a time interval of duration t there correspond n ≈ t/δ trials, and hence for small δ

(1.2)  P\{T > t\} \approx (1-\delta/a)^{t/\delta} \approx e^{-t/a},

approximately, as can be seen by taking logarithms. This model considers the waiting time as a geometrically distributed discrete random variable, and (1.2) states that "in the limit" one gets an exponential distribution. From the point of view of intuition it would seem more natural to start from the sample space whose points are real numbers and to introduce the exponential distribution directly.

(b) Random choices. To "choose a point at random" in the interval 0,1 is a conceptual experiment with an obvious intuitive meaning. It can be described by discrete approximations, but it is easier to use the whole interval as sample space and to assign to each interval its length as probability. The conceptual experiment of making two independent random choices of points in 0,1 results in a pair of real numbers, and so the natural sample space is a unit square. In this sample space one equates, almost instinctively, "probability" with "area." This is quite satisfactory for some elementary purposes, but sooner or later the question arises as to what the word "area" really means. ▶

As these examples show, a continuous sample space may be conceptually simpler than a discrete model, but the definition of probabilities in it depends on tools such as integration and measure theory. In denumerable sample spaces it was possible to assign probabilities to all imaginable events, whereas in general spaces this naive procedure leads to logical contradictions, and our intuition has to adjust itself to the exigencies of formal logic.

¹ Further examples from volume 1: the arc sine distribution, chapter III, section 4; the distributions for the number of returns to the origin and first-passage times in III,7; the limit theorems for random walks in XIV; the uniform distribution in problem 20 of XI,7.

² Concerning the use of the term epoch, see the list of abbreviations at the front of the book.
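The passage to the limit in (1.2) is easy to check numerically. The following sketch is our addition (plain Python, not part of the original text); the parameter values are arbitrary choices.

```python
import math

a = 1.0   # fixed expectation a = delta / p_delta of the discrete waiting time
t = 2.0   # point at which the two tails are compared

for delta in (0.5, 0.1, 0.01, 0.001):
    p = delta / a                     # success probability per trial of duration delta
    n = int(t / delta)                # number of trials corresponding to duration t
    geometric = (1.0 - p) ** n        # P{T > t} in the quantized model, cf. (1.1)
    exponential = math.exp(-t / a)    # the limit asserted in (1.2)
    print(f"delta={delta:6.3f}   geometric tail={geometric:.6f}   e^(-t/a)={exponential:.6f}")
```

As δ decreases, the printed geometric tail approaches the exponential tail, which is the content of (1.2).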
We shall soon see that the naive approach can lead to trouble even in relatively simple problems, but it is only fair to say that many probabilistically significant problems do not require a clean definition of probabilities. Sometimes they are of an analytic character and the probabilistic background serves primarily as a support for our intuition. More to the point is the fact that complex stochastic processes with intricate sample spaces may lead to significant and comprehensible problems which do not depend on the delicate tools used in the analysis of the whole process. A typical reasoning may run as follows: if the process can be described at all, the random variable Z must have such and such properties, and its distribution must therefore satisfy such and such an integral equation. Although probabilistic arguments can greatly influence the analytical treatment of the equation in question, the latter is in principle independent of the axioms of probability.³

Specialists in various fields are sometimes so familiar with problems of this type that they deny the need for measure theory because they are unacquainted with problems of other types and with situations where vague reasoning did lead to wrong results.⁴

This situation will become clearer in the course of this chapter, which serves as an informal introduction to the whole theory. It describes some analytic properties of two important distributions which will be used throughout this book. Special topics are covered partly because of significant applications, partly to illustrate the new problems confronting us and the need for appropriate tools. It is not necessary to study them systematically or in the order in which they appear.

Throughout this chapter probabilities are defined by elementary integrals, and the limitations of this definition are accepted. The use of a probabilistic jargon, and of terms such as random variable or expectation, may be justified in two ways. They may be interpreted as technical aids to intuition based on the formal analogy with similar situations in volume 1. Alternatively, everything in this chapter may be interpreted in a logically impeccable manner by a passage to the limit from the discrete model described in example 2(a). Although neither necessary nor desirable in principle, the latter procedure has the merit of a good exercise for beginners.

2. DENSITIES. CONVOLUTIONS

A probability density on the line (or R¹) is a function f such that

(2.1)  f(x) \ge 0, \qquad \int_{-\infty}^{+\infty} f(x)\,dx = 1.

For the present we consider only piecewise continuous densities (see V,3 for the general notion). To each density f we let correspond its distribution function⁵ F defined by

(2.2)  F(x) = \int_{-\infty}^{x} f(y)\,dy.

³ Intervals are denoted by bars to preserve the symbol (a, b) for the coordinate notation of points in the plane. See the list of abbreviations at the front of the book.

⁴ The roles of rigor and intuition are subject to misconceptions. As was pointed out in volume 1, natural intuition and natural thinking are a poor affair, but they gain strength with the development of mathematical theory. Today's intuition and applications depend on the most sophisticated theories of yesterday. Furthermore, strict theory represents economy of thought rather than luxury. Indeed, experience shows that in applications most people rely on lengthy calculations rather than simple arguments because these appear risky.
[The nearest illustration is in example 5(a).]

⁵ We recall that by "distribution function" is meant a right continuous non-decreasing function with limits 0 and 1 at -∞ and +∞. Volume 1 was concerned mainly with distributions whose growth is due entirely to jumps. Now we focus our attention on distribution functions defined as integrals. General distribution functions will be studied in chapter V.

It is a monotone continuous function increasing from 0 to 1. We say that f and F are concentrated on the interval a,b if f vanishes outside it. A density f serves to define the probabilities of all intervals, and we speak of a random variable⁶ X with density f to indicate that P{a < X ≤ b} = F(b) - F(a).

Examples. (a) Discrete approximation. Choose δ > 0 and consider the discrete random variable X_δ which for (n-1)δ < x ≤ nδ assumes the constant value nδ. Here n = 0, ±1, ±2, .... In volume 1 we would have used the multiples of δ as sample space, and described the probability distribution of X_δ by saying that

(2.5)  P\{X_\delta = n\delta\} = F(n\delta) - F((n-1)\delta).

⁶ As far as possible we shall denote random variables (that is, functions on the sample space) by capital boldface letters, reserving small letters for numbers or location parameters. This holds in particular for the coordinate variable X, namely the function defined by X(x) = x.
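The discretization (2.5) is easy to watch at work. The sketch below is our own illustration (plain Python), taking the exponential distribution of section 3 as a concrete stand-in for F; the truncation bound is an arbitrary choice.

```python
import math

alpha = 2.0
def F(x):                       # distribution function of the exponential density (3.1)
    return 1.0 - math.exp(-alpha * x) if x > 0 else 0.0

for delta in (1.0, 0.1, 0.01):
    # E(X_delta) = sum over n of n*delta * P{X_delta = n*delta}, with weights (2.5);
    # the series is truncated where the tail is negligible
    terms = int(30 / (alpha * delta))
    mean = sum(n * delta * (F(n * delta) - F((n - 1) * delta))
               for n in range(1, terms + 1))
    print(f"delta={delta:5.2f}   E(X_delta)={mean:.6f}   E(X)={1 / alpha:.6f}")
```

Since X_δ differs from the coordinate variable X by less than δ, the printed expectations approach E(X) = 1/α as δ decreases.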
This implies the multiplication rule for intervals, for example P(X > a, ¥ > 6} = P(X > a)P(Y > b}. The analogy with the discrete case is so obvious that no further explanations are required, Many new random variables may be defined as functions of X and Y, but the most important role is played by the sum S =X + Y. The event A = (S z>0. Note on the notion of random variable. The use of the line or the Cartesian spaces ‘R" as sample spaces sometimes blurs the distinction between random variables and “ordinary” functions of one or more variables. In volume 1 random variable X could assume only denumerably many values and it was then obvious whether we were talking about a function (such as the square or the exponential) defined on the line, or the random variable X* or e* defined in the sample space. Even the outer appearance of these functions was entirely different inasmuch as the “ordinary” exponential assumes all positive values whereas e* had a denumerable range. To see the change in this situation, consider now “two independent random variables X and Y with a common density f.” In other words, the plane ‘R? serves as sample space, and probabilities are defined as integrals of f(z)f(y). Now every function of two variables can be defined in the sample space, and then it becomes a random variable, but it must be borne in mind that a function of ‘two variables can be defined also without reference to our sample space. For example, certain statistical problems compel one to introduce the random variable /(X)f(¥) [see example VI,12(d)]. On the other hand, in introducing ‘our sample space ‘R? we have evidently referred to the “ordinary” function f defined independently of the sample space. This “ordinary” function induces many random variables, namely /(X), f(¥), f(X4¥), ete. Thus the same f may serve either as a random variable or as an ordinary function. 8 THE EXPONENTIAL AND THE UNIFORM DENSITIES 13 ‘As a tule (and in each individual case) it will be clear whether or not we are concerned with a random variable. Nevertheless, in the general theory there arise situations in which functions (such as conditional prob- abilities and expectations) can be considered either as free functions or as random variables, and this is somewhat confusing if the freedom of choice is not properly understood. Note on terminology and notations. To avoid overburdening of sentences itis customary to call E(X), interchangeably, expectation of the variable X, or of the density f, or of the distribution F. Similar liberties wil be taken for other terms. For example, convolution really signifies an operation, but the term is applied also to the result of the operation and the function f'g is referred (0 as “the convolution.” Tn the older literature the terms distribution and frequency function were applied to what we call densities; our distribution functions were described as “cumulative,” and the abbreviation c.df. is still in use, THE EXPONENTIAL DENSITY For arbitrary but fixed « > 0 put (3.1) Siz) = ae, F(a) = for x>0 and F(x) = f(2) = 0 for x <0. Then f isan exponential density, F its distribution function. A trite calculation shows that the expectation equals a, the variance a*. In example 1(a) the exponential distribution was derived as the limit of geometric distributions, and the method of example 2(a) leads to the same result. 
We recall that in stochastic processes the geometric distribution frequently governs waiting times or lifetimes, and that this is due to its “lack of memory,” described in 1; XIII,9: whatever the present age, the residual lifetime is unaffected by the past and has the same distribution as the lifetime itself. It will now be shown that this property carries over to the exponential limit and to no other distribution. Let T be an arbitrary positive variable to be interpreted as life- or waiting time. It is convenient to replace the distribution function of T by its tail G2) U) = PIT > 1}. Intuitively, U(t) is the “probability at birth of a lifetime exceeding 1.” Given an age s, the event that the residual lifetime exceeds 1 is the same as {T > s+1} and the conditional probability of this event (given age s) equals the ratio U(s+2)/U(s). This is the residual lifetime distribution, and it coincides with the total lifetime distribution iff G3) UGH = US) UO, 51> 0. 13 ‘THE EXPONENTIAL DENSITY 9 Inwas shown in 1; XVIL6 that positive solution of this equation is necessarily of the form U(t) = e-*, and hence the lack of aging described above in ltalies holds true if the lifetime distribution is exponential, We shall refer to this lack of memory as the Markov property of the exponential distribution, Analytically it reduces to the statement that only for the exponential distribution F do the tails U = 1—F satisfy (3.3), but this explains the constant occurrence of the exponential dis- tribution in Markov processes. (A stronger version of the Markov property will be described in section 6.) Our description referred to temporal processes, but the argument is general and the Markov property remains meaningful when time is replaced by some other parameter. Examples. (a) Tensile strength. To obtain a continuous analogue to the proverbial finite chain whose strength is that of its weakest link denote by U(t) the probability that a thread of length (of a given material) can sustain a certain fixed load. A thread of length s+f does not snap iff the ‘two segments individually sustain the given load, Assuming that there is no interaction, the two events must be considered independent and U must satisfy (3.3). Here the length of the thread takes over the role of the time parameter, and the length at which the thread will break is an exponentially distributed random variable. (b) Random ensembles of points in space play a role in many connections so that it is important to have an appropriate definition for this concept. Speaking intuitively, the first property that perfect randomness should have is.a lack of interaction between different regions: the observed configuration within region 4, should not permit conclusions concerning the ensemble in a non-overlapping region 4g. Specifically, the probability p that both Ay and A, are empty should equal the product of the probabilities p, and Ps that Ay and Ay be empty. It is plausible that this product rule cannot hold for alf partitions unless the probability p depends only on the volume of the region A but not on its shape. Assuming this to be so, we denote by U(f) the probability that a region of volume t be empty. These prob- abilities then satisfy (3.3) and hence U(r) = e~*'; the constant « depends on the density of the ensemble or, what amounts to the same, on the unit of length. 
It will be shown in the next section that the knowledge of U(t) permits us to calculate the probabilities p,(t) that a region of volume ¢ contains exactly points of the ensemble; they are given by the Poisson dis- tribution p,(t) = e-*"(at)"/n!, We speak accordingly of Poisson ensembles of points, this term being less ambiguous than the term random ensemble which may have other connotations, (©) Ensembles of circles and spheres. Random ensembles of particles present a more intricate problem. For simplicity we assume that the particles, 10 THE EXPONENTIAL AND THE UNIFORM DENSITIES 13 are of a spherical or circular shape, the radius p being fixed. The con- figuration is then completely determined by the centers and it is tempting to assume that these centers form a Poisson ensemble. This, however, is impossible in the strict sense since the mutual distances of centers necessarily exceed 2p. One feels nevertheless that for small radii p the effect of the finite size should be negligible in practice and hence the model of a Poisson ensemble of centers should be usable as an approximation. For a mathematical model we postulate accordingly that the centers form a Poisson ensemble and accept the implied possibility that the circles or spheres intersect. This idealization will have no practical consequences if the dii_p are small, because then the theoretical frequency of intersections be negligible. Thus astronomers treat the stellar system as a Poisson ensemble and the approximation to reality seems excellent. The next two examples show how the model works in practice. (@) Nearest neighbors. We consider a Poisson ensemble of spheres (stars) with density a. The probability that a domain of volume ¢ contains no center equals ¢~*'. Saying that the nearest neighbor to the origin has a distance >r amounts to saying that a sphere of radius r contains no star center in its interior. The volume of such a ball equals 4wr?, and hence in a Poisson ensemble of stars the probability that the nearest neighbor has a distance >r is given by e~**"”, The fact that this expression is independent of the radius p of the stars shows the approximative character of the model and its limitations. In the plane, spheres are replaced by circles and the distribution function for the distance of nearest neighbors is given by 1 — e~**"*, (€) Continuation: free paths. For ease of description we begin with the two-dimensional model. The random ensemble of circular disks may be interpreted as the cross section of a thin forest. I stand at the origin, which is not contained in any disk, and look in the direction of the positive z-axis. ‘The longest interval 0,7 not intersecting any disk represents the visi or free path in the x-direction. It is a random variable and we denote it by L. Denote by A the region formed by the points at a distance

(e) Continuation: free paths. For ease of description we begin with the two-dimensional model. The random ensemble of circular disks may be interpreted as the cross section of a thin forest. I stand at the origin, which is not contained in any disk, and look in the direction of the positive x-axis. The longest interval 0,t not intersecting any disk represents the visibility or free path in the x-direction. It is a random variable and we denote it by L. Denote by A the region formed by the points at a distance ≤ ρ from the segment 0,t of the x-axis. The event {L > t} occurs iff no disk center is contained within A, but it is known in advance that the circle of radius ρ about the origin is empty. The remaining domain has area 2ρt, and we conclude that the distribution of the visibility L is exponential:

P\{L > t\} = e^{-2\alpha\rho t}.

In space the same argument applies, and the relevant region is formed by rotating A about the x-axis. The rectangle 0 < x < t, |y| ≤ ρ generates a cylinder of volume πρ²t, and hence P{L > t} = e^{-απρ²t}. The mean free path is given by E(L) = 1/(απρ²). ▶

The next theorem will be used repeatedly.

Theorem. If X₁, ..., X_n are mutually independent random variables with the exponential distribution (3.1), then the sum X₁ + ⋯ + X_n has a density g_n and distribution function G_n given by

(3.4)  g_n(x) = \alpha\,\frac{(\alpha x)^{n-1}}{(n-1)!}\,e^{-\alpha x}, \qquad x > 0,

(3.5)  G_n(x) = 1 - e^{-\alpha x}\Bigl(1 + \frac{\alpha x}{1!} + \cdots + \frac{(\alpha x)^{n-1}}{(n-1)!}\Bigr), \qquad x > 0.

Proof. For n = 1 the assertion reduces to the definition (3.1). The density g_{n+1} is defined by the convolution

(3.6)  g_{n+1}(x) = \int_0^x g_n(x-y)\,\alpha e^{-\alpha y}\,dy,

and assuming the validity of (3.4) this reduces to

g_{n+1}(x) = \frac{\alpha^{n+1}}{(n-1)!}\,e^{-\alpha x}\int_0^x (x-y)^{n-1}\,dy = \alpha\,\frac{(\alpha x)^n}{n!}\,e^{-\alpha x}.

Thus (3.4) holds by induction for all n. The validity of (3.5) is seen by differentiation. ▶

The densities g_n are among the gamma densities to be introduced in II,2. They represent the continuous analogue of the negative binomial distribution found in 1; VI,8 for the sum of n variables with a common geometric distribution. (See problem 6.)

4. WAITING TIME PARADOXES. THE POISSON PROCESS

Denote by X₁, X₂, ... mutually independent random variables with the common exponential distribution (3.1), and put S₀ = 0,

(4.1)  S_n = X_1 + \cdots + X_n, \qquad n = 1, 2, \ldots.

We introduce a family of new random variables N(t) as follows: N(t) is the number of indices k ≥ 1 such that S_k ≤ t. The event {N(t) = n} occurs iff S_n ≤ t < S_{n+1}. As S_n has the distribution G_n, the probability of this event equals G_n(t) - G_{n+1}(t), or

(4.2)  P\{N(t) = n\} = e^{-\alpha t}\,\frac{(\alpha t)^n}{n!}.

In words, the random variable N(t) has a Poisson distribution with expectation αt. This argument looks like a new derivation of the Poisson distribution, but in reality it merely rephrases the original derivation of 1; VI,6 in terms of random variables. For an intuitive description consider chance occurrences (such as cosmic ray bursts or telephone calls), which we call "arrivals." Suppose that there is no aftereffect in the sense that the past history permits no conclusions as to the future. As we have seen, this condition requires that the waiting time X₁ to the first arrival be exponentially distributed. But at each arrival the process starts from scratch as a probabilistic replica of the whole process: the successive waiting times X_k between arrivals must be independent and must have the same distribution. The sum S_n represents the epoch of the nth arrival and N(t) the number of arrivals within the interval 0,t. In this form the argument differs from the original derivation of the Poisson distribution only by the use of better technical terms. (In the terminology of stochastic processes the sequence {S_n} constitutes a renewal process with exponential interarrival times X_k; for the general notion see VI,6.)
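The correspondence just described between exponential interarrival times and Poisson counts can be checked empirically; the sketch below is our illustration (numpy assumed, arbitrary parameters). It builds the arrival epochs S_n and tabulates N(t) against (4.2).

```python
import numpy as np
from math import exp, factorial

rng = np.random.default_rng(3)
alpha, t = 1.5, 4.0
X = rng.exponential(scale=1 / alpha, size=(100_000, 40))  # interarrival times X_k
S = X.cumsum(axis=1)                                      # arrival epochs S_n
N = (S <= t).sum(axis=1)                                  # N(t), arrivals within 0,t
for n in range(4, 9):
    print(n, np.mean(N == n), exp(-alpha * t) * (alpha * t) ** n / factorial(n))
```

(Forty interarrival times suffice here since the chance that all forty arrivals fall before t = 4 is negligibly small.)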
Two contradictory answers stand to reason: (a) The lack of memory of the Poisson process implies that the distribution of my waiting time should not depend on the epoch of my arrival. In this case E(W,) = EWW,) = & (The epoch of my arrival is “chosen at random” in the interval between two consecutive buses, and for reasons of symmetry my expected waiting time should be half the expected time between two consecutive buses, that is E(W,) = a7, Both arguments appear reasonable and both have been used in practice. What to do about the contradiction? The easiest way out is that of the formalist, who refuses to see a problem if it is not formulated in an impeccable manner. But problems are not solved by ignoring them. 14 WAITING TIME PARADOXES. THE POISSON PROCESS 13 We now show that both arguments are substantially, if not formally, correct. The fallacy lies at an unexpected place and we now proceed to explain it” » We are dealing with interarrival times Xy = $,, X;=8,—S,,.... By assumption the X, have a common exponential distribution with expectation «1, Picking out “any” particular X, yields a random variable, and one has, the intuitive feeling that its expectation should be a? provided the choice is done without knowledge of the sample sequence X,,X.,.... But this is not true. In the example we chose that element X, for which Sur <1 SS where ¢ is fixed. This choice is made without regard to the actual process, but it turns out that the X, so chosen has the double expectation 2a. Given this fact, the argument (8) of the example postulates an expected waiting time a? and the contradiction disappears. This solution of the paradox came as a shock to experienced workers, but it becomes intuitively clear once our mode of thinking is properly adjusted. Roughly speaking, a long interval has a better chance to cover the point 1 than a short one. This vague feeling is supported by the following Proposition. Let X,,Xz,... be mutually independent with a common ‘exponential distribution with expectation a, Let t>0 be fixed, but arbitrary. The element X, satisfying the condition Sy.4t The point is that the density (4.3) is not the common density of the Xy. Its explicit form is of minor interest. [The analogue for arbitrary waiting time distributions is contained in XI,(4.16).] Proof, Let k be the (chance-dependent) index such that S,,<1< 8, and put L, equal to S,—S,... We have to prove that L, has density (4.3). Suppose first «<1 The event {L, t a similar argument applies except that y ranges from 0 to ¢ and we must add to the right side in (4,4) the probability e-* — e* that 0<1 ‘The break in the formula (4.3) at x = 1 is due to the special role of the origin as the starting epoch of the process. Obviously 4.6) limo) = atze, which shows that the special role of the origin wears out, and for an “old” process the distribution of L, is nearly independent of 1. One expresses this conveniently by saying that the “steady state” density of L, is given by the right side in (4.6). With the notations of the proof, the waiting time W, considered in the ‘example is the random variable W, = S,— {The argument of the proof shows also that Gan PO Saat tens & [eatntes-™ = ete dy t-en Thus W, has the same exponential distribution as the X, in accordance with the reasoning (a). (See problem 7.) Finally, a word about the Poisson process. The Poisson variables N(t) were introduced as functions on the sample space of the infinite sequence of random variables X;, X,,-... 
Finally, a word about the Poisson process. The Poisson variables N(t) were introduced as functions on the sample space of the infinite sequence of random variables X₁, X₂, .... This procedure is satisfactory for many purposes, but a different sample space is more natural. The conceptual experiment "observing the number of incoming calls up to epoch t" yields for each positive t an integer, and the result is therefore a step function with unit jumps. The appropriate sample space has these step functions as sample points; the sample space is a function space, the space of all conceivable "paths." In this space N(t) is defined as the value of the ordinate at epoch t and S_n as the coordinate of the nth jump, etc. Events can now be considered that are not easily expressible in terms of the original variables X_k. A typical example of practical interest (see the ruin problem in VI,5) is the event that N(t) > a + bt for some t. The individual path (just as the individual infinite sequence of ±1 in binomial trials) represents the natural and unavoidable object of probabilistic inquiry. Once one gets used to the new phraseology, the space of paths becomes most intuitive.

Unfortunately the introduction of probabilities in spaces of sample paths is far from simple. By comparison, the step from discrete sample spaces to the line, plane, etc., and even to infinite sequences of random variables, is neither conceptually nor technically difficult. Problems of a new type arise in connection with function spaces, and the reader is warned that we shall not deal with them in this volume. We shall be satisfied with an honest treatment of sample spaces of sequences (denumerably many coordinate variables). Reference to stochastic processes in general, and to the Poisson process in particular, will be made freely, but only to provide an intuitive background or to enhance interest in our problems.

Poisson Ensembles of Points

As shown in 1; VI,6, the Poisson law governs not only "points distributed randomly along the time axis," but also ensembles of points (such as flaws in materials or raisins in a cake) distributed randomly in plane or space, provided t is interpreted as area or volume. The basic assumption was that the probability of finding k points in a specified domain depends only on the area or volume of the domain, but not on its shape, and that occurrences in non-overlapping domains are independent. In example 3(b) we used the same assumption to show that the probability that a domain of volume t be empty is given by e^{-αt}. This corresponds to the exponential distribution for the waiting time for the first event, and we see now that the Poisson distribution for the number of events is a simple consequence of it. The same argument applies to random ensembles of points in space, and we have thus a new proof for the fact that the number of points of the ensemble contained in a given domain is a Poisson variable. Easy formal calculations may lead to interesting results concerning such random ensembles of points, but the remarks about the Poisson process apply equally to Poisson ensembles; a complete probabilistic description is complex and beyond the scope of the present volume.

5. THE PERSISTENCE OF BAD LUCK

As everyone knows, he who joins a waiting line is sure to wait for an abnormally long time, and similar bad luck follows us on all occasions. How much can probability theory contribute towards an explanation? For a partial answer we consider three examples typical of a variety of situations. They illustrate unexpected general features of chance fluctuations.

Examples. (a) Record values. Denote by X₀ my waiting time (or financial loss) at some chance event. Suppose that friends of mine expose themselves to the same type of experience, and denote the results by X₁, X₂, .... To exclude bias we assume that X₀, X₁, ... are mutually independent
Denote by Xo my waiting time (or financial loss) at some chance event. Suppose that friends of mine expose themselves to the same type of experience, and denote the results by Xj, Xs... To exclude bias we assume that Xo,X,,... are mutually independent 16 ‘THE EXPONENTIAL AND THE UNIFORM DENSITIES Ls random variables with a common distribution, The nature of the latter really does not matter but, since the exponential distribution serves as a model for randomness, we assume the X, exponentially distributed in accordance with (3.1). For simplicity of description we treat the sequence {X)} as infinite. To find a measure for my ill luck I ask how long it will take before a friend experiences worse luck (we neglect the event of probability zero that X, = X,). More formally, we introduce the waiting time N as the value of the first subscript n such that X,>X.. The event {N>n—I} occurs iff the maximal term of the n-tuple Xo, Xi,-..,Xq-1 appears at the initial place; for reasons of symmetry the probability of this event is , The event {N= n} is the same as (N>n—1}—{N>n}, and hence for n=l, G1) P(N =n} = 1d 1 ntl nati)’ This result fully confirms that I have indeed very bad luck: The random variable N has infinite expectation! It would be bad enough if it took on the average 1000 trials to beat the record of my ill luck, but the actual waiting time has infinite expectation. It will be noted that the argument does not depend on the condition that the X, are exponentially distributed, It follows that whenever the variables X, are independent and have a common continuous distribution function F the first record value has the distribution (5.1). ‘The fact that this distribution is independent of F is used by statisticians for tests of independ- ence. (See also problems 8-11.) The striking and general nature of the result (5.1) combined with the simplicity of the proof are apt to arouse suspicion. The argument is really impeccable (except for the informal presentation), but those who prefer to rely on brute calculation can easily verify the truth of (5.1) from the direct, definition of the probability in question as the (n+1)-tuple integral of atienstzettzn) over the region defined by the inequalities 0 <2) <2, and 0 6. WAITING TIMES AND ORDER STATISTICS ‘An ordered n-tuple (x,..., 24) of real numbers, may be reordered in increasing order of magnitude to obtain the new n-tuple ays Taps s+ sm) Where Ray S Fy S07 * S toy 18 ‘THE EXPONENTIAL AND THE UNIFORM DENSITIES 16 This operation applied to all points of the space ‘R* induces n well-defined functions, which will be denoted by Xqy,..-, Xia» If probabilities are defined in 5" these functions become random variables. We say that (Xay,--++Xiq)_ is obtained by reordering (X,,...,X,) according to increasing magnitude, The variable Xi) is called kth-order statistic® of the given sample X,,...,X,. In particular, Xj) and X,,) are the sample extremes; when n = 2y +1 is odd, X,,41) is the sample median. We apply this notion to the particular case of independent random variables X,...,X, with the common exponential density ae~*. Examples. (a) Parallel waiting lines. Interpret X,,...,X, as the lengths of n service times commencing at epoch 0 at a post office with m counters. The order statistics represent the successive epochs of terminations or, as ‘one might say, the epochs of the successive discharges (the “output process”). In particular, Xq) is the waiting time for the first discharge. 
Now if the assumed lack of aftereffect is meaningful, the waiting time Xj, must have the Markov property, that is, Xq) must be exponentially distributed. As a matter of fact, the event {Xy) > 1} is the simultaneous realization of the n events (X, > 1}, each of which has probability e~*'; because of the assumed independence the probabilities multiply and we have indeed (6.1) P{Xq) > em We can now proceed a step further and consider the situation at epoch Xi). The assumed lack of memory seems to imply that the original situation is restored except that now only m—1 counters are in operation; the continuation of the process should be independent of Xi) and a replica of the whole process. In particular, the waiting time for the next discharge, namely Xi) ~ Xa), should have the distribution (6.2) P{X)—Xay >t} analogous to (6.1). This reasoning leads to the following general proposition concerning the order statistics for independent variables with a common exponential distribution. "Strictly speaking the term “sample statistic" is synonymous with “function of the sample variables,” that is, with random variable, It is used to emphasize linguistically the different role played in a given context by the primary variable (the sample) and some derived variables. For example, the “sample mean” (X,+"+X,)/n is called a statistic. ‘Order statistics occur frequently in the statistical literature, We conform to the standard terminology except that the extremes are usually called extreme “values.” 16 WAITING TIMES AND ORDER STATISTICS 19 Proposition* The variables Xqy),Xy — Xay + independent and the density of Xs) — Xy is given by (n—K)ae™P™, Before verifying this proposition formally let us consider its implications. When n= 2 the difference X,) — Xi is the residual waiting time after the expiration of the shorter of two waiting times. The proposition asserts that this residual waiting time has the same exponential distribution as the original waiting time and is independent of Xq). This is an extension of the Markov property enunciated for fixed epochs t to the chance-dependent stopping time Xq). It is called the strong Markov property. (As we are dealing with only finitely many variables we are in a position to derive the strong Markov property from the weak one, but in more complicated stochastic processes the distinction is essential.) ‘The proof of the proposition serves as an example of formal mai with integrals. For typographical simplicity we let m= 3. As in many similar situations we use a symmetry argument. With probability one, no two among the variables X; are equal, Neglecting an event of probability zero the six possible orderings of X;, Xz, Xz according to magnitude there- fore represent six mutually exclusive events of equal probability. To cal- culate the distribution of the order statistics it suffices therefore to consider the contingency X, < Xz < Xs. Thus 63) PAX) > fy Xer—Xay > fo» Xe Xe > bs} = ” = OP{X, > ty, Xe—Xy > tay Xp—Xp > 15}. (Purely analytically, the space ‘R* is partitioned into six parts congruent to the region defined by 2% <_< ty, each contributing the same amount to the integral. The boundaries where two or more coordinates are equal have probability zero and play no role.) To evaluate the right side in (6.3) we have to integrate ae"**#**9) over the region defined by the inequalities Phy AM >h Bom > hy A simple integration with respect to. 
x leads to emer an sede = (64) = tenn fara, = ementem * This proposition has been discovered repeatedly for purposes of statistical estimation but the usual proofs are computational instead of appealing to the Markov property. See also problem 13, 20 ‘THE EXPONENTIAL AND THE UNIFORM DENSITIES. 16 Thus the joint distribution of the three variables X,), Xi—Xavs Xay—Xewy isa product of three exponential distributions, and this proves the proposition. It follows in particular that E(Xoy—Xis) = (aK). Summing over k=0,1,...,9-1 we obtain (6.5) EX) i +o) Note that this expectation was calculated without knowledge of the distri- bution of X,, and we have here another example of the advantage to be derived from the representation of a random variable as a sum of other variables. (See 1; 1X,3.) (8) Use of the strong Markov property. For picturesque language suppose that at epoch 0 three persons 4, B, and C arrive at a post office and find two counters free, The three service times are independent random variables X, Y, Z with the same exponential distribution. The service times of and B commence immediately, but that of C starts at the epoch Xa) when either A or B is discharged. We show that the Markov property leads to simple answers to various questions. (i) What is the probability that C will not be the last to leave the post office? ‘The answer is }, because epoch Xq) of the first departure establishes symmetry between C and the other person being served. (ii) What is the distribution of the time T spent by C at the post office? Clearly T = Xq) + Z is the sum of two independent variables whose distributions are exponential with parameters 2a and a. The convolution of two exponential distributions is given by (2.14), and itis seen that T has density u(t) = 2a(e*! — e-**) and E(T) = 3/(22) (iii) What is the distribution of the epoch of the Jast departure? Denote the epochs of the successive departures by Xa), Xia), Xie The difference Xia) — Xqy is the sum of the two variables Xia) — Xie) and Xia) — Xo We saw in the preceding example that these variables are independent and have exponential distributions with parameters 2a and a. It follows that Xia) — Xa) has the same density uw as the variable T. Now Xq) is independent of Xj» Xa) and has density 2ae-*'. The convolution formula used in (ji) shows therefore that X,,) has density a Aafe-t!—e-*!—are-tt] and E(Xiq)) = 2/a. The advantage of this method becomes clear on comparison with direct calculations, but the latter apply to arbitrary service time distributions (problem 19). (©) Distribution of order statistics. As a final exercise we derive the distribution of X,). The event {Xq) <1} signifies that at least k among 47 THE UNIFORM. DISTRIBUTION 2 the m variables X, are 0) the probability of the joint event that one among the variables X, lies between ¢ and 1 +h and that k — 1 among the remaining n — 1 variables are 1-+ A. Multiplying the number of choices and the corresponding probabilities leads to (6.7). Beginners are advised to formalize this argument, and also to derive (6.7) from the discrete model. (Continued in problems 13, 17.) > etter tint aiybotg- loiter e ae 7. THE UNIFORM DISTRIBUTION ‘The random variable X is distributed uniformly in the interval a,b if its density is constant = (6—a)* for a)=(-1", O ‘equals the integral of the constant function 1 over the union of the n! congruent regions defined either by the string of inequalities 2% <--> 1 — 4 and by (7.1) the probability for this equals 1°. 
7. THE UNIFORM DISTRIBUTION

The random variable X is distributed uniformly in the interval a,b if its density is constant = (b-a)^{-1} for a < x < b and zero elsewhere. The interval 0,1 may serve as the standard case; for it the distribution function equals x at each point 0 ≤ x ≤ 1. If X₁, ..., X_n are independent and distributed uniformly in 0,1, their joint density is the constant 1, and so for any region A of the n-dimensional unit cube

(7.1)   P{(X₁, ..., X_n) ∈ A} = the integral of the constant function 1 over A.

Examples. (a) Orderings. With probability one no two among the variables X_k are equal, and each prescribed ordering of X₁, ..., X_n according to magnitude corresponds to one of n! congruent regions defined by strings of inequalities of the form x₁ < x₂ < ··· < x_n; each ordering therefore has probability 1/n!. Again, the event that all n variables are ≤ t corresponds to a cube of side t, and by (7.1) the probability for this equals t^n. In particular, for n = 2 the maximum X = X_(2) has the distribution function t². The variable X therefore has density 2t. (Beginners are advised to try a direct computational verification.)

(b) Partitions. The points X₁, ..., X_n partition 0,1 into n + 1 subintervals. By symmetry all these intervals have the same distribution, and the kth order statistic X_(k) is the sum of the first k of them.

(d) Distribution of order statistics. If X₁, ..., X_n are independent and distributed uniformly in 0,1, the number of variables satisfying the inequality 0 < X_j ≤ t has the binomial distribution with "probability of success" t. The event {X_(k) ≤ t} occurs iff at least k among the n variables are ≤ t, and hence

(7.2)   P{X_(k) ≤ t} = Σ_{j=k}^{n} C(n,j) t^j (1-t)^{n-j}.

(e) Limit theorems. For the minimum X_(1) we get from (7.2)

(7.3)   P{nX_(1) > t} = (1 - t/n)^n → e^{-t}.

It is customary to describe this relation by saying that in the limit X_(1) is exponentially distributed with expectation n^{-1}. Similarly

(7.4)   P{nX_(2) > t} = (1 - t/n)^n + t(1 - t/n)^{n-1} → e^{-t}(1 + t),

and on the right one recognizes the tail of the gamma distribution G₂ of (3.5). In like manner it is easily verified that for every fixed k, as n → ∞, the distribution of nX_(k) tends to the gamma distribution G_k (see problem 33). Now G_k is the distribution of the sum of k independent exponentially distributed variables, while X_(k) is the sum of the first k intervals considered in example (b). We can therefore say that the lengths of the successive intervals of our partition behave in the limit as if they were mutually independent exponentially distributed variables. [In view of the obvious relation of (7.2) with the binomial distribution, the central limit theorem may be used to obtain approximations to the distribution of X_(k) when both n and k are large. See problem 34.]

(f) Ratios. Let X be chosen at random in 0,1 and denote by U the length of the shorter of the intervals 0,X and X,1 and by V = 1 - U the length of the longer. The random variable U is uniformly distributed between 0 and ½, because for t < ½ the event {U ≤ t} occurs iff either X ≤ t or 1 - X ≤ t, and therefore has probability 2t. For reasons of symmetry V is uniformly distributed between ½ and 1, and so E(U) = ¼, E(V) = ¾. What can we say about the ratio V/U? It necessarily exceeds 1, and it lies between 1 and t > 1 iff either

(7.5)   1/(t+1) ≤ X ≤ ½  or  ½ ≤ X ≤ t/(t+1).

It follows that

(7.6)   P{V/U ≤ t} = (t-1)/(t+1), t > 1,

and the density of this distribution is given by 2(t+1)^{-2}. It is seen that V/U has infinite expectation. This example shows how little information is contained in the observation that E(V)/E(U) = 3. ▶
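The distribution (7.6) of V/U, and the failure of its expectation to exist, can be observed directly. In the sketch below (NumPy assumed, sample sizes arbitrary) the running sample means keep growing instead of settling down.

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.random(500_000)
    u = np.minimum(x, 1 - x)
    r = (1 - u) / u                          # the ratio V/U

    for t in (2.0, 5.0, 20.0):
        print(f"P{{V/U <= {t}}}: {np.mean(r <= t):.4f}",
              f"  predicted (t-1)/(t+1) = {(t - 1) / (t + 1):.4f}")

    # infinite expectation: sample means grow with the sample size
    for m in (10**3, 10**4, 10**5):
        print(m, r[:m].mean())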
8. RANDOM SPLITTINGS

The problem of this section concludes the preceding parade of examples; it is separated from them partly because of its importance in physics, and partly because it will serve as a prototype for general Markov chains. Formally we are concerned with products of the form Z_n = X₁X₂···X_n, where X₁, ..., X_n are mutually independent variables distributed uniformly in 0,1.

Examples for applications. In certain collision processes a physical particle is split into two and its mass m divided between them. Different laws of partition may fit different processes, but it is frequently assumed that the fraction of the parental mass received by each descendant particle is distributed uniformly in 0,1. If one of the two particles is chosen at random and subjected to a new collision then (assuming that there is no interaction, so that the collisions are independent) the masses of the two second-generation particles are given by products mX₁X₂, and so on. (See problem 21.) With trite verbal changes this model applies also to splittings of mineral grains or pebbles, etc. Instead of masses one considers also energy losses under collisions, and the description simplifies somewhat if one is concerned with changes of energy of the same particle in successive collisions. As a last example consider the changes in the intensity of light when passing through matter. Example 10(a) shows that when a light ray passes through a sphere of radius R "in a random direction" the distance traveled through the sphere is distributed uniformly between 0 and 2R. In the presence of uniform absorption such a passage reduces the intensity of the incident ray by a factor that is distributed uniformly in an interval 0,a (where a < 1 depends on the strength of absorption). The scale factor does not seriously affect our model, and it is seen that n independent passages would reduce the intensity of the light by a factor of the form Z_n. ▶

To find the distribution of Z_n we can proceed in two ways.

(i) Reduction to exponential distributions. Since sums are generally preferable to products we pass to logarithms, putting Y_k = -log X_k. The Y_k are mutually independent, and for t > 0

(8.1)   P{Y_k > t} = P{X_k < e^{-t}} = e^{-t}.

Now the distribution function G_n of the sum S_n = Y₁ + ··· + Y_n of n independent exponentially distributed variables was calculated in (3.5), and the distribution function of Z_n = e^{-S_n} is given by 1 - G_n(log t^{-1}) for 0 < t < 1. The density of this distribution function is t^{-1} g_n(log t^{-1}), or

(8.2)   f_n(t) = (1/(n-1)!) (log 1/t)^{n-1}, 0 < t < 1.

Our problem is solved explicitly. This method reveals the advantages to be derived from an appropriate transformation, but its success depends on the accidental equivalence of our problem with one previously solved.

(ii) A recursive procedure has the advantage that it lends itself also to related problems and generalizations. Let F_n(t) = P{Z_n ≤ t} for 0 < t < 1. By definition F₁(t) = t. Suppose F_{n-1} known, and note that Z_n = Z_{n-1}X_n is the product of two independent variables. Given X_n = x, the event {Z_n ≤ t} occurs with certainty when x ≤ t, and for x > t it occurs iff Z_{n-1} ≤ t/x. Thus

(8.3)   F_n(t) = t + ∫_t^1 F_{n-1}(t/x) dx, 0 < t < 1.

For n > 1 we get by differentiation from (8.3)

(8.4)   f_n(t) = ∫_t^1 f_{n-1}(t/x) dx/x, 0 < t < 1,

and trite calculations show that f_n is indeed given by (8.2).
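A histogram check of (8.2) is straightforward; the following sketch (illustrative n and sample size, NumPy assumed) compares simulated products Z_n against the density just derived.

    import numpy as np
    from math import factorial, log

    rng = np.random.default_rng(0)
    n, trials = 4, 400_000                       # arbitrary illustrative values
    z = rng.random((trials, n)).prod(axis=1)     # Z_n = X_1 X_2 ... X_n

    f = lambda t: log(1 / t) ** (n - 1) / factorial(n - 1)   # density (8.2)

    h = 0.01
    for t in (0.05, 0.2, 0.5):
        est = np.mean((z > t - h / 2) & (z <= t + h / 2)) / h
        print(f"t = {t}: histogram {est:.3f}  vs  (8.2) {f(t):.3f}")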
9. CONVOLUTIONS AND COVERING THEOREMS

The results of this section have a mild amusement value in themselves and some obvious applications. Furthermore, they turn up rather unexpectedly in connection with seemingly unrelated topics, such as significance tests in harmonic analysis [example III,3(f)], Poisson processes [XIV,2(a)], and random flights [example 10(e)]. It is therefore not surprising that all the formulas, as well as variants of them, have been derived repeatedly by different methods. The method used in the sequel is distinguished by its simplicity and applicability to related problems.

Let a > 0 be fixed, and denote by X₁, X₂, ... mutually independent random variables distributed uniformly over 0,a. Let S_n = X₁ + ··· + X_n. Our first problem consists in finding the distribution U_n of S_n and its density u_n = U_n'. By definition u₁(x) = a^{-1} for 0 < x < a and u₁(x) = 0 elsewhere. The density of S_{n+1} = S_n + X_{n+1} is given by the convolution

(9.1)   u_{n+1}(x) = (1/a) ∫_{x-a}^{x} u_n(y) dy,

which is the same as

(9.2)   u_{n+1}(x) = (1/a)[U_n(x) - U_n(x-a)].

It is convenient to introduce the notation

(9.3)   x₊ = x for x > 0, x₊ = 0 for x ≤ 0.

Note that (x-a)₊ is zero for x ≤ a and equals x - a for x ≥ a. With this notation the uniform distribution may be written in the form

(9.4)   U₁(x) = [x₊ - (x-a)₊]/a.

Theorem 1. Let S_n be the sum of n independent variables distributed uniformly over 0,a. Let U_n(x) = P{S_n ≤ x} and denote by u_n = U_n' the density of this distribution. Then for n = 1, 2, ... and x > 0

(9.5)   U_n(x) = (1/(a^n n!)) Σ_{ν=0}^{n} (-1)^ν C(n,ν) (x - νa)₊^n,

(9.6)   u_n(x) = (1/(a^n (n-1)!)) Σ_{ν=0}^{n} (-1)^ν C(n,ν) (x - νa)₊^{n-1}.

(These formulas remain true also for x < 0, and for n = 0, provided x₊⁰ is defined to equal 0 on the negative half-axis and 1 on the positive.)

Note that for a point x between (k-1)a and ka only k terms of the sum are different from zero. In practical calculations it is convenient to disregard the limits of summation and to pretend that ν runs from -∞ to ∞. This is possible because, with the standard convention, the binomial coefficients in (9.5) vanish for ν < 0 and ν > n (see 1; II,8).

Proof. For n = 1 the assertion (9.5) reduces to (9.4) and is obviously true. We now prove the two assertions simultaneously by induction. Assume (9.5) to be true for some n ≥ 1. Substituting into (9.2) we get u_{n+1} as the difference of two sums. Changing the summation index ν in the second sum to ν - 1 we get

u_{n+1}(x) = (1/(a^{n+1} n!)) Σ_ν (-1)^ν [C(n,ν) + C(n,ν-1)] (x - νa)₊^n,

which is identical with (9.6) with n replaced by n + 1. Integrating this relation leads to (9.5) with n replaced by n + 1, and this completes the proof. ▶

(An alternative proof, using a passage to the limit from the discrete model, is contained in problem 20 of 1; XI,7.)

Let a = 2b. The variables X_k - b are then distributed uniformly over the symmetric interval -b,b, and hence the sum of n such variables has the same distribution as S_n - nb, which is given by U_n(x + nb). Our theorem may therefore be reformulated in the following equivalent form.

Theorem 1a. The density of the sum of n independent variables distributed uniformly over -b,b is given by

(9.7)   u_n(x + nb) = (1/((2b)^n (n-1)!)) Σ_ν (-1)^ν C(n,ν) (x + (n-2ν)b)₊^{n-1}.

We turn to a theorem which admits of two equivalent formulations, both of which are useful in many special problems arising in applications. By unexpected good luck the required probability can be expressed simply in terms of the density u_n. We prove this analytically by a method of wide applicability. For a proof based on geometric arguments see problem 23.

Theorem 2. On a circle of length t there are given n ≥ 2 arcs of length a whose centers are chosen independently and at random. The probability φ_n(t) that these n arcs cover the whole circle is

(9.8)   φ_n(t) = (n-1)! a^n t^{-(n-1)} u_n(t),

which is the same as

(9.9)   φ_n(t) = Σ_{ν=0}^{n} (-1)^ν C(n,ν) (1 - νa/t)₊^{n-1}.

Before proving it we reformulate the theorem in a form to be used later. Choose one of the n centers as origin and open the circle into an interval of length t. The remaining n - 1 centers are then randomly distributed in 0,t, and theorem 2 obviously expresses the same thing as

Theorem 3. Let the interval 0,t be partitioned into n subintervals by choosing independently at random n - 1 points X₁, ..., X_{n-1} of division. The probability φ_n(t) that none of these subintervals exceeds a in length equals (9.9).

Note that φ_n(t), considered for fixed t as a function of a, represents the distribution function of the maximal length among the n intervals into which 0,t is partitioned. For related questions see problems 22-27.

Proof. It suffices to prove theorem 3. We prove the recursion formula

(9.10)   φ_n(t) = ((n-1)/t) ∫_0^a φ_{n-1}(t-x) ((t-x)/t)^{n-2} dx.

Its truth follows directly from the definition of φ_n as an (n-1)-tuple integral, but it is preferable to read (9.10) probabilistically as follows. The smallest among X₁, ..., X_{n-1} must be less than a, and there are n - 1 choices for it. Given that X₁ = x, the probability that X₁ is leftmost equals [(t-x)/t]^{n-2}. The remaining variables are then distributed uniformly over x,t, and the conditional probability that they satisfy the conditions of the theorem is φ_{n-1}(t-x). Summing over all possibilities we get (9.10).

Let us for the moment define ũ_n by solving (9.8), that is, put ũ_n(t) = t^{n-1}φ_n(t)/(a^n (n-1)!). Then (9.10) reduces to

(9.11)   a ũ_n(t) = ∫_{t-a}^{t} ũ_{n-1}(y) dy,

which is exactly the recursion formula (9.1) which served to define u_n. It suffices therefore to verify the theorem for n = 2. But it is obvious that φ₂(t) = 1 for 0 < t ≤ a and φ₂(t) = (2a-t)/t for a ≤ t ≤ 2a, in agreement with (9.9). This completes the proof. ▶
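Theorem 3, and with it theorem 2 and formula (9.9), lends itself to a direct Monte Carlo check. In the sketch below (NumPy assumed) the values of n, t, a are arbitrary illustrations.

    import numpy as np
    from math import comb

    def phi(n, t, a):
        """Formula (9.9): P{no subinterval of 0,t exceeds a}, n-1 division points."""
        return sum((-1)**v * comb(n, v) * max(1 - v * a / t, 0)**(n - 1)
                   for v in range(n + 1))

    rng = np.random.default_rng(0)
    n, t, a, trials = 5, 1.0, 0.4, 200_000

    pts = np.sort(rng.random((trials, n - 1)) * t, axis=1)
    edges = np.hstack([np.zeros((trials, 1)), pts, np.full((trials, 1), t)])
    max_gap = np.diff(edges, axis=1).max(axis=1)

    print("simulated:", np.mean(max_gap <= a), "  formula (9.9):", phi(n, t, a))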
10. RANDOM DIRECTIONS

Choosing a random direction in the plane R² is the same as choosing at random a point on the circle. If one wishes to specify the direction by its angle with the positive x-axis, the circle should be referred to its arc length θ, with 0 ≤ θ < 2π. For random directions in the space R³ the unit sphere serves as sample space; each domain has a probability equal to its area divided by 4π. Choosing a random direction in R³ is thus equivalent to choosing a point at random on the unit sphere.

Denote by L the length of the projection on a fixed line of a unit vector with random direction in R³, and by L' the length of its projection on a fixed plane. Since the area of a spherical zone is proportional to its height, the event {L > t} corresponds on the unit sphere to two caps of total height 2 - 2t, and the event {L' ≤ t} to two caps of total height 2 - 2√(1-t²). This determines the two distribution functions up to numerical factors, and these follow easily from the condition that both distributions equal 1 at t = 1: the length of the projection on a line is distributed uniformly in 0,1, while the length of the projection on a plane has distribution function 1 - √(1-t²).

   ¹¹ Readers who feel uneasy about the use of conditional probabilities in connection with densities should replace a hypothesis of the form X = x by x - h < X ≤ x + h and pass to the limit.

Examples. (a) Passage through spheres. Let Σ be a sphere of radius r and N a point on it. A line drawn through N in a random direction intersects Σ in P. Then: the length of the segment NP is a random variable distributed uniformly between 0 and 2r. To see this consider the diameter NS of the sphere and the triangle NPS, which has a right angle at P and an angle Θ at N. The length of NP is then 2r cos Θ. But cos Θ is also the projection on the diameter NS of a unit vector in the line NP, and therefore cos Θ is uniformly distributed in 0,1. In physics this model is used to describe the passage of light through "randomly distributed spheres." The resulting absorption of light was used as one example for the random-splitting process in the last section. (See problem 28.)

(b) Circular objects under the microscope. Through a microscope one observes the projection of a cell on the x₁,x₂-plane rather than its actual shape. In certain biological experiments the cells are lens-shaped and may be treated as circular disks. Only the horizontal diameter of the disk projects in its natural length, and the whole disk projects into an ellipse whose minor axis is the projection of the steepest radius. Now it is generally assumed that the orientation of the disk is random, meaning that the direction of its normal is chosen at random. In this case the projection of the unit normal on the x₃-axis is distributed uniformly in 0,1. But the angle between this normal and the x₃-axis equals the angle between the steepest radius and the x₁,x₂-plane, and hence the ratio of the minor to the major axis is distributed uniformly in 0,1. Occasionally the evaluation of experiments was based on the erroneous belief that the angle between the steepest radius and the x₁,x₂-plane should be distributed uniformly.

(c) Why are two violins twice as loud as one? (The question is serious because the loudness is proportional to the square of the amplitude of the vibration.) The incoming waves may be represented by random unit vectors, and the superposition effect of two violins corresponds to the addition of two independent random vectors. By the law of cosines the square of the length of the resulting vector is 2 + 2 cos Θ. Here Θ is the angle between the two random vectors, and hence cos Θ is uniformly distributed in -1,1 and has zero expectation. The expectation of the square of the resultant length is therefore indeed 2. In the plane cos Θ is not uniformly distributed, but for reasons of symmetry its expectation is still zero. Our result therefore holds in any number of dimensions. See also example V,4(e). ▶

By a random vector in R³ is meant a vector drawn in a random direction with a length L which is a random variable independent of its direction.
The probabilistic properties of a random vector are completely determined by those of its projection on the x-axis, and using the latter it is frequently possible to avoid analysis in three dimensions. For this purpose it is important to know the relationship between the distribution function V of the true length L and the distribution function F of the length L_x of the projection on the x-axis. Now L_x = XL, where X is the length of the projection of a unit vector in the given direction. Accordingly, X is distributed uniformly over 0,1 and is independent of L. Given X = x, the event {L_x ≤ t} occurs iff L ≤ t/x, and so¹²

(10.2)   F(t) = ∫_0^1 V(t/x) dx, t > 0.

   ¹² This argument repeats the proof of (8.3).

For the corresponding densities we get by differentiation

(10.3)   f(t) = ∫_0^1 v(t/x) dx/x = ∫_t^∞ v(s) ds/s, t > 0,

and differentiation of the last form leads to

(10.4)   v(t) = -t f'(t), t > 0.

We have thus found the analytic relationship between the density v of the length of a random vector in R³ and the density f of the length of its projection on a fixed direction. The relation (10.3) is used to find f when v is known, and (10.4) in the opposite direction. (The asymmetry between the two formulas is due to the fact that the direction is not independent of the length of the projection.)

Examples. (d) Maxwell distribution for velocities. Consider random vectors in space whose projections on the x-axis have the normal density with zero expectation and unit variance. Since lengths are taken positive we have

(10.5)   f(t) = 2n(t) = √(2/π) e^{-t²/2}, t > 0.

From (10.4) then

(10.6)   v(t) = √(2/π) t² e^{-t²/2}, t > 0.

This is the Maxwell density for velocities in statistical mechanics. The usual derivation combines the preceding argument with a proof that f must be of the form (10.5). (For an alternative derivation see III,4.)

(e) Lord Rayleigh's random flights in R³. Consider n unit vectors whose directions are chosen independently and at random. We seek the distribution of the length L_n of their resultant (vector sum). Instead of studying this resultant directly we consider its projection on the x-axis. This projection is obviously the sum of n independent random variables distributed uniformly over -1,1. The density of this sum is given by (9.7) with b = 1. Substituting into (10.4) one sees that the density of the length L_n is given by¹³

(10.7)   v_n(t) = -(t/(2^n (n-2)!)) Σ_ν (-1)^ν C(n,ν) (t + n - 2ν)₊^{n-2}, t > 0.

This problem occurs in physics and chemistry (the vectors representing, for example, plane waves or molecular links). The reduction to one dimension seems to render this famous problem trivial. The same method applies to random vectors with arbitrary lengths, and thus (10.4) enables us to reduce random-walk problems in R³ to simpler problems in R¹. Even when explicit solutions are hard to get, the central limit theorem provides valuable information [see example VIII,4(b)]. ▶

   ¹³ The standard reference is to a paper by S. Chandrasekhar (reprinted in Wax (1954)), who calculated v_n for small n and the Fourier transform of v_n. Because he used polar coordinates, his W(r) must be multiplied by 4πr² to obtain our v_n.
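The passage from (10.5) to (10.6) can be run backwards experimentally: if speeds follow the Maxwell density, the projections L_x = XL must be half-normal. A minimal check (NumPy assumed; sample size arbitrary):

    import numpy as np
    from math import erfc, sqrt

    rng = np.random.default_rng(0)
    trials = 300_000
    # the norm of a standard normal vector in R^3 has the Maxwell density (10.6)
    speed = np.linalg.norm(rng.standard_normal((trials, 3)), axis=1)
    proj = speed * rng.random(trials)     # L_x = X L, X uniform in 0,1 as in (10.2)

    for t in (0.5, 1.0, 2.0):
        print(f"t = {t}: simulated P{{L_x > t}} = {np.mean(proj > t):.4f},",
              f"half-normal tail = {erfc(t / sqrt(2)):.4f}")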
Random vectors in R² are defined in like manner. The distribution V of the true length and the distribution F of the length of the projection are related by the obvious analogue to (10.2), namely

(10.8)   F(t) = (2/π) ∫_0^1 V(t/x) (1-x²)^{-½} dx.

However, the inversion formula (10.4) has no simple analogue, and to express V in terms of F we must depend on the relatively deep theory of Abel's integral equation.¹⁴ We state without proof that if F has a continuous density f, then

(10.9)   V(t) = 1 - t ∫_0^{π/2} f(t/sin θ) dθ/sin²θ.

(See problems 29-30.)

   ¹⁴ The reduction of (10.8) to Abel's classical integral equation is effected by an appropriate change of variables.

Example. (f) Binary orbits. In observing a spectroscopic binary orbit astronomers can measure only the projections of vectors onto a plane perpendicular to the line of sight. An ellipse in space projects into an ellipse in this plane. The major axis of the true ellipse lies in the plane determined by the line of sight and its projection, and it is therefore reasonable to assume that the angle between the major axis and its projection is uniformly distributed. Measurements determine (in principle) the distribution of the projection. The distribution of the true major axis is then given by the solution (10.9) of Abel's integral equation. ▶

11. THE USE OF LEBESGUE MEASURE

If a set A in 0,1 is the union of finitely many non-overlapping intervals I₁, I₂, ... of lengths λ₁, λ₂, ..., the uniform distribution attributes to it the probability

(11.1)   P{A} = λ₁ + λ₂ + ···.

The following examples will show that some simple, but significant, problems lead to unions of infinitely many non-overlapping intervals. The definition (11.1) is still applicable and identifies P{A} with the Lebesgue measure of A. It is consistent with our program to identify probabilities with the integral of the density f(x) = 1, except that we use the Lebesgue integral rather than the Riemann integral (which need not exist). Of the Lebesgue theory we require only the fact that if A is the union of possibly overlapping intervals I₁, I₂, ..., the measure P{A} exists and does not exceed the sum λ₁ + λ₂ + ··· of the lengths; for non-overlapping intervals the equality (11.1) holds. The use of Lebesgue measure conforms to uninhibited intuition and simplifies matters inasmuch as many formal passages to the limit are justified.

A set N is called a null set if it is contained in sets of arbitrarily small measure, that is, if to each ε there exists a set A ⊃ N such that P{A} < ε. In this case P{N} = 0.

In the following, X stands for a random variable distributed uniformly in 0,1.

Examples. (a) What is the probability of X being rational? The sequence ½, ⅓, ⅔, ¼, ¾, ... contains all the rationals in 0,1 (ordered according to increasing denominators). Choose ε < ½ and denote by I_k an interval of length ε^k centered at the kth point of the sequence. The sum of the lengths of the I_k is ε + ε² + ··· < 2ε, and their union covers the rationals. Therefore, by our definition, the set of all rationals has probability zero, and so X is irrational with probability one.

It is pertinent to ask why such sets should be considered in probability theory. One answer is that nothing can be gained by excluding them and that the use of the Lebesgue theory actually simplifies matters without requiring new techniques. A second answer may be more convincing to beginners and non-mathematicians: the following variants lead to problems of undoubted probabilistic nature.

(b) With what probability does the digit 7 occur in the decimal expansion of X? In the decimal expansion of each x in the open interval between 0.7 and 0.8 the digit 7 appears at the first place. For each n there are 9^{n-1} intervals of length 10^{-n} containing only numbers such that the digit 7 appears at the nth place but not before. (For n = 2 their endpoints are 0.07 and 0.08, next 0.17 and 0.18, etc.)
These intervals are non-overlapping, and their total length is (1/10)(1 + 9/10 + (9/10)² + ···) = 1. Thus our event has probability 1.

Notice that certain numbers have two expansions, for example 0.7 = 0.6999.... To make our question unequivocal we should therefore specify whether the digit 7 must or may occur in the expansion, but our argument is independent of the difference. The reason is that only rationals can have two expansions, and the set of all rationals has probability zero.

(c) Coin tossing and random choice. Let us now see how a "random choice of a point X between 0 and 1" can be described in terms of discrete random variables. Denote by X_k(x) the kth decimal of x. (To avoid ambiguities let us use terminating expansions when possible.) The random variable X_k assumes the values 0, 1, ..., 9, each with probability 1/10, and the X_k are mutually independent. By the definition of a decimal expansion we have the identity

(11.2)   X = Σ_{k=1}^∞ 10^{-k} X_k.

This formula reduces the random choice of a point X to successive choices of its decimals. For further discussion we switch from decimal to dyadic expansions, that is, we replace the basis 10 by 2. Instead of (11.2) we have now

(11.3)   X = Σ_{k=1}^∞ 2^{-k} X_k,

where the X_k are mutually independent random variables assuming the values 0 and 1 with probability ½. These variables are defined on the interval 0,1 on which probability is equated with Lebesgue measure (length). This formulation brings to mind the coin-tossing game of volume 1, in which the sample space consists of infinite sequences of heads and tails, or zeros and ones. A new interpretation of (11.3) is now possible in this sample space: in it, the X_k are coordinate variables, and X is a random variable defined by them; its distribution function is, of course, uniform. Note that the second formulation contains two distinct sample points 0111111... and 1000000... even though the corresponding dyadic expansions represent the same point ½. Nevertheless, the notion of zero probability enables us to identify the two sample spaces. Stated in more intuitive terms: neglecting an event of probability zero, the random choice of a point X between 0 and 1 can be effected by a sequence of coin tossings; conversely, the result of an infinite coin-tossing game may be represented by a point x of 0,1. Every random variable of the coin-tossing game may be represented by a function on 0,1, etc. This convenient and intuitive device has been used since the beginning of probability theory, but it depends on neglecting events of zero probability.

(d) Cantor-type distributions. A distribution with unexpected properties is found by considering in (11.3) the contribution of the even-numbered terms or, what amounts to the same, by considering the random variable

(11.4)   Y = 3 Σ_{k=1}^∞ 4^{-k} X_k.

(The factor 3 is introduced to simplify the discussion; it makes 1 the largest possible value of Y. The contribution of the odd-numbered terms has the same distribution as ⅔Y.) The distribution function F(x) = P{Y ≤ x} is continuous, since Y assumes no individual value with positive probability. On the other hand, the possible values of Y are the numbers whose expansions in the base 4 contain only the digits 0 and 3, and these form a set of Lebesgue measure zero. All the growth of F is thus concentrated on a null set: F is a continuous distribution function of the so-called singular type, and it possesses no density. ▶
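Example (c) is easily mechanized: a uniform variable can be assembled from simulated coin tossings as in (11.3), and the digit-7 question of example (b) can then be sampled. With only the first n decimal digits inspected the probability is 1 - (9/10)^n. (The truncation to 50 binary digits, and all sample sizes, are artifacts of this illustration, not of the text.)

    import numpy as np

    rng = np.random.default_rng(0)
    bits = rng.integers(0, 2, size=(100_000, 50))      # simulated coin tossings
    x = (bits * 0.5 ** np.arange(1, 51)).sum(axis=1)   # X per (11.3), truncated
    print("mean, variance:", x.mean(), x.var())        # near 1/2 and 1/12

    for n in (1, 2, 5, 10):
        digits = (x[:, None] * 10.0 ** np.arange(1, n + 1)).astype(np.int64) % 10
        print(n, np.mean((digits == 7).any(axis=1)), 1 - (9 / 10) ** n)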
12. EMPIRICAL DISTRIBUTIONS

The "empirical distribution function" F_n of n points a₁, ..., a_n on the line is the step function with jumps 1/n at a₁, ..., a_n. In other words, nF_n(x) equals the number of points a_k in -∞,x, and F_n is a distribution function. Given n random variables X₁, ..., X_n, their values at a particular point of the sample space form an n-tuple of numbers, and its empirical distribution function is called the empirical sample distribution. For each x, the value F_n(x) of the empirical sample distribution defines a new random variable, and the empirical distribution of (X₁, ..., X_n) represents a whole family of random variables depending on the parameter x. (In technical language we are concerned with a stochastic process with x as time parameter.) No attempt will be made here to develop the theory of empirical distributions, but the notion may be used to illustrate the occurrence of complicated random variables in simple applications. Furthermore, the uniform distribution will appear in a new light.

Let X₁, ..., X_n stand for mutually independent random variables with a common continuous distribution F. The probability that any two variables assume the same value is zero, and we can therefore restrict our attention to samples of n distinct values. For fixed x the number of variables X_k such that X_k ≤ x has a binomial distribution with probability of "success" p = F(x), and so the random variable F_n(x) has a binomial distribution with possible values 0, 1/n, ..., 1. For large n and x fixed, F_n(x) is therefore likely to be close to F(x), and the central limit theorem tells us more about the probable deviations.

More interesting is the (chance-dependent) graph of F_n as a whole and how close it is to F. A measure for this closeness is the maximum discrepancy, that is,

(12.1)   D_n = sup_x |F_n(x) - F(x)|.

This is a new random variable of great interest to statisticians because of the following property: the probability distribution of the random variable D_n is independent of F (provided, of course, that F is continuous). For the proof it suffices to verify that the distribution of D_n remains unchanged when F is replaced by the uniform distribution. We begin by showing that the variables Y_k = F(X_k) are distributed uniformly in 0,1. For that purpose we restrict t to the interval 0,1, and in this interval we define v as the inverse function of F. The event {F(X_k) ≤ t} is then identical with the event {X_k ≤ v(t)}, and its probability equals F(v(t)) = t, as asserted. Now X_k ≤ x iff Y_k ≤ F(x), and hence the discrepancy at x between the empirical distribution of (X₁, ..., X_n) and F equals the discrepancy at F(x) between the empirical distribution of the uniform sample (Y₁, ..., Y_n) and the uniform distribution; the two maximal discrepancies D_n therefore coincide. The same argument in reverse completes the proof. ▶

The corresponding two-sample problem concerns independent samples X₁, ..., X_n and Y₁, ..., Y_n from a common continuous distribution and the maximal discrepancy D_{n,n} between the two empirical distribution functions. Its possible values are the numbers k/n, and B. V. Gnedenko and V. S. Koroljuk reduced its distribution to a random-walk problem: arrange the 2n sample values in increasing order and record a step +1 for each X and a step -1 for each Y; all orderings being equally likely, one obtains a symmetric random-walk path of 2n steps, and D_{n,n} < r/n iff this path returns to the origin at epoch 2n without touching ±r.

[An explicit expression for the probability in question is contained in 1; XIV,(9.1). The condition of not touching ±r can be realized by putting absorbing barriers at ±r, and so P{D_{n,n} < r/n} is the probability of a return to the origin at epoch 2n when ±r are absorbing barriers. In 1; XIV,(9.1) the interval is 0,a rather than -r,r.]

It was shown in 1; XIV that a limiting procedure leads from random walks to diffusion processes, and in this way it is not difficult to see that the distribution of √n D_{n,n} tends to a limit. Actually this limit was discovered by N. V. Smirnov as early as 1939, and the similar limit for √n D_n by A. Kolmogorov in 1933. Their calculations are very intricate and do not explain the connection with diffusion processes, which is inherent in the Gnedenko-Koroljuk approach. On the other hand, they have given impetus to fruitful work on the convergence of stochastic processes (P. Billingsley, M. F. Donsker, Yu. V. Prohorov, A. V.
Skorohod, and others) Tt may be mentioned thatthe Smienov theorems apply equally to discrepancies Dyn of the empirical distributions of samples of diffrent sizes m' and The randomwalk approach carries over, but loss much of its elegance and simplicity (B. V. Gnedenko, E_L. Rvateva). A great many variants of Dy, have been investigated by statisticians (Gee problem 36) 13. PROBLEMS FOR SOLUTION nal problems it is understood that the given variables are mutually independent. 1, Let X and Y have densities a¢~# concentrated on 0,2. Find the densities of ox Gi) 342K Gi) X -¥ tiv) IXY! (v) The smaller of X and ¥® (vi) The larger of X and ¥*, 2. Do the same problem if the densities of X and ¥ equal} in =T,T ando elsewhere, 3. Find the densities for X + ¥ and X — Y if X has density ze*#( > 0) and the density of Y equals A! for 0 <2 0. 5. Find the distribution functions of X+Y/X and X+Y/Z if the variables X, Y, and Z have a common exponential distribution, 6. Derive the convolution formula (3.6) for the exponential distribution by a direct passage to the limit from the convolution formula for the “negative binomial” distribution of 1; VI8.1). 7. In the Poisson process of section 4, denote by Z the time between epoch and the last preceding arrival or 0 (the “age” of the current interarrival time). Find the distribution of Z and show that it tends to the exponential distribution as Io. 8. In example 5(a) show that the probability of the first record value occurring. at the mth place and being X,> 0°" > Xw-1 1} = =mim+n). {In example $(a) we had_m = 1") (0) If-N is the first index 1 such that Xin 2 Ximoreay SHOW that roo C/(7) For r>2 wehave E(N) < © and 1 PIN < ma} +1 ~ Tae (6) If N iis the first index such that Xjin falls outside the interval between Xqy and X;q) then s(n —1) ab = mn) E(N) < @ PIN 9 Gemear and BN) < 12, (Convolutions of exponential distributions). For j =0,...,n let X, have density Ae-%* for x >0 where A; x Ay unless j =k. Put Vem (g—Ag) > «Cnn AeMlngr Ag) Ga =A Show that Xp -+-*- +X, has a density given by o Pat) = Fy? Analoyne 38! $0 + ne Hint: Use induction, a symmetry argument, and (2.14). No calculations are necessary. 38G. F, Newell, Operations Research, vol. 7 (1959), pp. $89-598. 1_S, Wilks, J. Australian Math, Soe., vol. 1 (1959) pp. 106-112. 13 PROBLEMS FOR SOLUTION 4 13, (Continuation). f Y, has the density je, the density of the sum Zoom (a) flod= Using the proposition of example 6(6) conclude that fa. is the density of the spread Xia —Xyy of @ sample X;,...,Xq if the X; have the common density “7, 14, Pure birth processes. In the pure birth process of 1; XVII,3 the system passes, through a sequence of states Ey > E, +, staying at “Ey for a sojourn time Xy with density e+, Thus Sy = Xp +--+ +Xq is the epoch of the transition Ey > Eqay. Denote by Py(t) the probability Of E, at epoch f. Show that PAO = PIS, > 1) — P{S,-, > 1} and hence that Py. is given by formula (*) of problem 12. "The differential equations of the process, namely PEO) = APD, Prt) = —AgP alt) + AnxPnaaln n>h should be derived (a) from (1), and (b) from the properties of the sums. Sq. Hint: Using inductively @ symmetry argument it suffices to consider the factor of e-*, 15. In example 6(a) for parallel waiting fines we say that the system is in state k if k counters are free. Show that the birth process model of the last example applies with 4, = (n—k)a, Conclude that ao ~([laennerrins ts, 2>0. From this derive the distribution of Xi 16. 
Consider two independent queues of m and 1 > m_ persons respectively, assuming the same exponential distribution for the service times. Show that the probability of the longer queue finishing first equals the probability of obtaining heads before m tails in a fair coin-tossing game, Find the samme probability also by considering the ratio X/Y of two variables with gamma distributions Gy and Ga given in G.5). 17. Example of statistical estimation. tis assumed that the lifetimes of electric bulbs have an exponential distribution with an unknown expectation a“. To estimate a sample of nm bulbs is taken and one observes the lifetimes ey f}, which is the tail of the distribution function of the shortest among the intervals) Prove the recurrence relation o nna 2 fs Conclude that pa(t) = Mt ~ (r+ DANE 23. From a recurrence relation analogous to (*) prove without calculations that for arbitrary iy 20,0641 20 oy) PUL, > tye Dg > tasth SOME my mo taal {This elegant result was derived by B. de Finett*” from geometrical considerations. It contains many interesting special cases. When 2, = h forall { we get the preced- ing problem. Example 7(6) corresponds to the special case where exacily one among ‘the x, is different from zero. ‘The covering theorem 3 of section 9 follows from (##) and the formula 1; 1V.(1.5) for the realization of at least one among m +1 events] 24, Denote by qy(c) the probability that all mutual distances of the Xz exceed 4h. (This differs from problem 22 in that no restrictions are imposed on the end intervals Ly and Ly...) Find a relation analogous to (*) and hence derive 9,(¢). 25. Continuation. Without using the solution of the preceding problems show a Priori that pa(t) = (r—2h)"7-"9,(¢—2h).. 26. Formulate the analogue to problem 24 for a circle and show that problem 23 furnishes its solution. Ap, (2) de. 27. Am isosceles triangle is formed by a unit vector in the z-direction and another in.a random direction. ‘Find the distribution of the length of the third side i) in Rand (il) in 8°. ¥ Giornale Istituto Italiano degli Attuari, vol. 27 (1964) pp. 151-173, in Italian, 113 PROBLEMS FOR SOLUTION 43 28. A unit circle (sphere) about 0 has the north pole on the positive z-axis. A ray enters at the north pole and its angle with the x-axis is distributed uniformly over —Jr, Tr. Find the distribution of the length of the chord within the circle (sphere). "Nore. In % the ray has a random direction and we are concerned with the analogue to example 10(@). In? the problem is new. 29. The ratio of the expected lengths of @ random vector and of its projection fon the x-axis equals 2 in x® and =/2 in 8? Hint: Use (10.2) and (10.8). 30. The length of a random vector is distributed uniformly over 0,7. Find the density of the length of its projection on the z-axis (a) in X%, and (6) in 32, Hint: Use (10.4) and (10.9). 31. Find the distribution function of the projection on the 2-axis of a randomly chosen direction in. 4, 32, Find the analogue in R* to the relation (10.2) between the distributions of the lengths of a random vector and that of its projection on the z-axis. Specialize 10 a unit vector to verify the result of problem 31. 33. A limit theorem for order statistics. (a) Let. X, uniformly in OT. Prove that for k fixed and n-> © 11%, be distributed Pfs < 3 Gyo), > 0, where G. is the gamma distribution (3.5) {see example 7(¢]. (6) If the X, have an arbitrary continuous distribution function F, the same limit exists for P(X, < O(z/n)} where ® is the inverse function of F. (Smimov.) 34. 
A limit theorem for the sample median. The nth-order statistic Xiu) of (Xi,.++Xey-1) is called the sample median. If the X; are independent and uniformly distributed over 0,1 show that PIX iq) — 4 X; 2 -*+ 2 Xw.a . Two distributions F and G, and also their densities f and g, are said to be of the same type if they stand in the relationship a) G(x) = Flar+d), g(x) = af(ax+b), where a> 0. We shall frequently refer to b as a centering parameter, to @ asa scale parameter. These terms are readily understood from the fact that when F serves as distribution function of a random variable X then G is the distribution function of (2) Y= In many contexts only the type of a distribution really matters. ing to common usage the closed interval I should be called the support of f. A new term is introduced because it will be used in the more general sense that a distribution ‘may be concentrated on the set of integers or rationals. 4s 46 SPECIAL DENSITIES, RANDOMIZATION mi The expectation m and variance o* of f (or of F) are defined by (2.3) ma [sees @=[Temmsede= [ese ae—m, provided the integrals converge absolutely. It is clear from (1.2) that in this case g has expectation (m—6)/a and variance oa’. It follows that for each type there exists at most one density with zero expectation and unit variance. ‘We recall from 1,(2.12) that the convolution fi and fy is the probability aeasiy defined by fife of two densities (14) f(z) = “he-nsen dy. When f, and fz are concentrated on 0,0 this formula reduces to as f= (‘Hew ninnan D0, ‘The former represents the density of the sum of two independent random var- iables with densities f, and f;. Note that for g,(z) = f(z+6,) the con- volution g = g, « goisgiven by g(x) = f(x-+b,+b,) asis obvious from (1.2) Finally we recall the standard normal distribution function and its density defined by Lit “ 1.6) n(z) = Ra) = dy. Vie B Our old acquaintance, the normal density with expectation m and variance o*, is given by (=) ome o\e Implicit in the central limit theorem is the basic fact that the family of normal densities is closed under convolutions; in other words, the convolution of two normal densities with expectations m, m, and variances o?, of is the normal density with expectation »m + mz and variance o? = 0? + a3. In view of what has been said it suffices to prove it for m, = m, = 0. It is asserted that a Jue ap [- a ~ fens f op [- « af ~ 4 “ and the truth of this assertion becomes obvious by the change of vs 2 = y(oloy0,) — x(a/00,) where x is fixed. (See problem 1.) m2 GAMMA DISTRIBUTIONS a7 2, GAMMA DISTRIBUTIONS ‘The gamma function T is defined by 1) re = [ererae, 1>0. [See 1; 11,(12.22).] It interpolates the factorials in the sense that P(an+l) =n! for n=0,1,.... Integration by parts shows that T(1) = (1-1) PU—1) for all 1>0. (Problem 2.) ‘The gamma densities concentrated on 0, % are defined by Lrg tense 2.2) frst) = Ft y>0, x>0. Here a > 0 isthe trivial scale parameter, but » > 0 isessential. The special case f,, represents the exponential density, and the densities g, of 1,(3,4) coincide with f,,, (1=1,2,.... A trite calculation shows that the expectation of f,, equals v/a, the variance v/a? The family of gamma densities is closed under convolutions: 3) San * Savy = Saynt H>0, ¥>0. This important property generalizes the theorem of I,3 and will be in constant use; the proof is exceedingly simple. By ([.5) the left side equals. 
(2.4)   (α^{μ+ν}/(Γ(μ)Γ(ν))) e^{-αx} ∫_0^x (x-y)^{μ-1} y^{ν-1} dy.

After the substitution y = xt this expression differs from f_{α,μ+ν}(x) by a numerical factor only, and this factor equals unity since both f_{α,μ+ν} and (2.4) are probability densities. The value of the last integral for x = 1 is the so-called beta integral B(μ,ν), and as a by-product of the proof we have found that

(2.5)   B(μ,ν) = ∫_0^1 (1-y)^{μ-1} y^{ν-1} dy = Γ(μ)Γ(ν)/Γ(μ+ν)

for all μ > 0, ν > 0. [For integral μ and ν this formula is used in 1; VI,(10.8) and (10.9). See also problem 3 of the present chapter.]

As to the graph of f_{α,ν}: it is clearly monotone if ν ≤ 1, and unbounded near the origin when ν < 1. For ν > 1 the graph of f_{α,ν} is bell-shaped, attaining at x = (ν-1)/α its maximum α(ν-1)^{ν-1}e^{-(ν-1)}/Γ(ν), which is close to α[2π(ν-1)]^{-½} (Stirling's formula, problem 12 of 1; II,12). It follows from the central limit theorem that, suitably centered and scaled, the gamma densities approach normality:

(2.6)   (√ν/α) f_{α,ν}((ν + x√ν)/α) → n(x), ν → ∞.
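The convolution rule (2.3) can be spot-checked by comparing quantiles of a simulated sum with quantiles of the claimed gamma law. In the sketch below (NumPy assumed) α, μ, ν are arbitrary illustrative values.

    import numpy as np

    rng = np.random.default_rng(0)
    alpha, mu, nu, trials = 2.0, 0.7, 1.8, 300_000

    s = rng.gamma(mu, 1 / alpha, trials) + rng.gamma(nu, 1 / alpha, trials)
    t = rng.gamma(mu + nu, 1 / alpha, trials)   # the law claimed by (2.3)

    qs = np.linspace(0.05, 0.95, 10)
    print(np.quantile(s, qs).round(3))
    print(np.quantile(t, qs).round(3))          # the two rows should agree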
*3. RELATED DISTRIBUTIONS OF STATISTICS

   * This section treats special topics and is not used in the sequel.

The gamma densities play a crucial, though sometimes disguised, role in mathematical statistics. To begin with, in the classical (now somewhat outdated) system of densities introduced by K. Pearson (1894) the gamma densities appear as "type III." A more frequent appearance is due to the fact that for a random variable X with normal density n the square X² has density x^{-½} n(x^{½}) = f_{½,½}(x). In view of the convolution property (2.3) it follows that if X₁, ..., X_n are mutually independent normal variables with expectation 0 and variance σ², then X₁² + ··· + X_n² has density f_{α,n/2} with α = (2σ²)^{-1}. To statisticians X₁² + ··· + X_n² is the "sample variance from a normal population," and its distribution is in constant use. For reasons of tradition (going back to K. Pearson) in this connection f_{α,n/2} is called the chi-square density with n degrees of freedom.

In statistical mechanics X₁² + X₂² + X₃² appears as the square of the speed of particles. Hence v(x) = 2x f_{½,3/2}(x²) represents the density of the speed itself. This is the Maxwell density found by other methods in I,(10.6). (See also the example in III,4.) In queuing theory the gamma distribution is sometimes called Erlangian.

Several random variables (or "statistics") of importance to statisticians are of the form T = X/Y, where X and Y are independent random variables, Y > 0. Denote their distributions by F and G, respectively, and their densities by f and g. As Y is supposed positive, g is concentrated on 0,∞, and so

(3.1)   P{T ≤ t} = P{X ≤ tY} = ∫_0^∞ F(ty) g(y) dy.

By differentiation it is found that the ratio T = X/Y has density

(3.2)   w(t) = ∫_0^∞ f(ty) y g(y) dy.

Examples. (a) If X and Y have densities f_{α,m} and f_{α,n}, then X/Y has density

(3.3)   w(t) = (Γ(m+n)/(Γ(m)Γ(n))) t^{m-1} (1+t)^{-(m+n)}, t > 0.

In fact, the integral in (3.2) equals

(3.4)   (α^{m+n} t^{m-1}/(Γ(m)Γ(n))) ∫_0^∞ e^{-α(1+t)y} y^{m+n-1} dy,

and the substitution α(1+t)y = s reduces it to (3.3).

In the analysis of variance one considers the special case X = X₁² + ··· + X_m² and Y = Y₁² + ··· + Y_n², where X₁, ..., X_m, Y₁, ..., Y_n are mutually independent variables with the common normal density n. The random variable F = (nX)/(mY) is called Snedecor's statistic, and its density (m/n)w((m/n)t) is Snedecor's density, or the F-density. The variable Z = log √F is Fisher's Z-statistic, and its density Fisher's Z-density. The two statistics are, of course, merely notational variants of each other.

(b) Student's T-density. Let X, Y₁, ..., Y_n be independent with the common normal density n. The variable

(3.5)   T = X / √((Y₁² + ··· + Y_n²)/n)

is known to statisticians as Student's T-statistic. We show that its density is given by

(3.6)   w(t) = C_n (1 + t²/n)^{-(n+1)/2}, where C_n = Γ(½(n+1)) / (√(πn) Γ(½n)).

In fact, the numerator in (3.5) has a normal density with zero expectation and variance n, while the density of the denominator is given by 2x f_{½,n/2}(x²). Thus (3.2) takes on the form

(3.7)   w(t) = (2^{1-n/2}/(√(2πn) Γ(½n))) ∫_0^∞ y^n e^{-½(1+t²/n)y²} dy.

The substitution s = ½(1+t²/n)y² reduces the integral to a gamma integral and yields (3.6). ▶

4. SOME COMMON DENSITIES

In the following it is understood that all densities vanish identically outside the indicated interval.

(a) The bilateral exponential is defined by ½αe^{-α|x|}, where α is a scale parameter. It has zero expectation and variance 2α^{-2}. This density is the convolution of the exponential density αe^{-αx} (x > 0) with the mirrored density αe^{αx} (x < 0). In other words, the bilateral exponential is the density of X₁ - X₂ when X₁ and X₂ are independent and have the common exponential density αe^{-αx} (x > 0). In the French literature it is usually referred to as the "second law of Laplace," the first being the normal distribution.

(b) The uniform (or rectangular) density ρ_a and the triangular density τ_a concentrated on -a,a are defined by

(4.1)   ρ_a(x) = 1/(2a), τ_a(x) = (1/a)(1 - |x|/a), |x| < a.

It is easily verified that ρ_a ★ ρ_a = τ_{2a}.

(c) The beta density with parameters μ > 0, ν > 0 is defined by

(4.2)   β_{μ,ν}(x) = (Γ(μ+ν)/(Γ(μ)Γ(ν))) (1-x)^{μ-1} x^{ν-1}, 0 < x < 1.

That (4.2) indeed defines a probability density follows from (2.5). By the same formula it is seen that β_{μ,ν} has expectation ν/(μ+ν) and variance μν/[(μ+ν)²(μ+ν+1)]. If μ < 1, ν < 1 the graph of β_{μ,ν} is U-shaped, tending to ∞ at the limits. If μ > 1, ν > 1 the graph is bell-shaped. For μ = ν = 1 we get the uniform density as a special case. A simple variant of the beta density is defined by

(4.3)   (Γ(μ+ν)/(Γ(μ)Γ(ν))) x^{μ-1} (1+x)^{-(μ+ν)}, x > 0.

If the variable X has density (4.2) then Y = X^{-1} - 1 has density (4.3). In the Pearson system the densities (4.2) and (4.3) appear as types I and VI; the Snedecor density (3.3) is a special case of (4.3). The densities (4.3) are sometimes called after the economist Pareto. It was thought (rather naively from a modern statistical standpoint) that income distributions should have a tail with a density ~ Ax^{-λ} as x → ∞, and (4.3) fulfills this requirement.

(d) The so-called arc sine density

(4.4)   (1/π) x^{-½}(1-x)^{-½}, 0 < x < 1,

is actually the same as the beta density β_{½,½}, but deserves special mention because of its repeated occurrence in fluctuation theory. (It was introduced in 1; III,4 in connection with the unexpected behavior of sojourn times.) The misleading name is unfortunately in general use; actually the distribution function is given by 2π^{-1} arc sin √x. (The beta densities with μ + ν = 1 are sometimes referred to as "generalized arc sine densities.")

(e) The Cauchy density centered at the origin is defined by

(4.5)   γ_t(x) = (1/π) · t/(t² + x²), -∞ < x < ∞,

where t > 0 is a scale parameter. The corresponding distribution function is ½ + π^{-1} arc tan (x/t). The graph of γ_t resembles that of the normal density but approaches the axis so slowly that an expectation does not exist. The importance of the Cauchy densities is due to the convolution formula

(4.6)   γ_s ★ γ_t = γ_{s+t}.

It states that the family of Cauchy densities (4.5) is closed under convolutions. Formula (4.6) can be proved in an elementary (but tedious) fashion by a routine decomposition of the integrand into partial fractions; a simpler proof depends on Fourier analysis. The convolution formula (4.6) has the amazing consequence that for independent variables X₁, ..., X_n with the common density (4.5) the average (X₁ + ··· + X_n)/n has the same density as the X_j.
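This "amazing consequence" is easy to witness numerically. Below (NumPy assumed, n = 50 an arbitrary choice) the empirical distribution of an average of 50 independent Cauchy variables is compared with that of a single one.

    import numpy as np

    rng = np.random.default_rng(0)
    trials = 200_000

    x1 = rng.standard_cauchy(trials)
    xbar = rng.standard_cauchy((trials, 50)).mean(axis=1)

    for t in (0.5, 1.0, 4.0):
        print(f"P{{X <= {t}}} = {np.mean(x1 <= t):.4f},",
              f"P{{average <= {t}}} = {np.mean(xbar <= t):.4f}")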
Example. Consider a laboratory experiment in which a vertical mirror projects a horizontal light ray on a wall. The mirror is free to rotate about a vertical axis through A. We assume that the direction of the reflected ray is chosen "at random," that is, the angle φ between it and the perpendicular AO to the wall is distributed uniformly between -½π and ½π. The light ray intersects the wall at a point at distance X = t tan φ from O (where t is the distance AO of the center A from the wall). It follows at once that the random variable X has density (4.5). If the experiment is repeated n times the average (X₁ + ··· + X_n)/n has the same density, and so the averages do not cluster around 0 as one should expect by analogy with the law of large numbers. ▶

The Cauchy density has the curious property that if X has density γ_t, then 2X has density γ_{2t} = γ_t ★ γ_t. Thus 2X = X + X is the sum of two dependent variables, but its density is given by the convolution formula. More generally, if U and V are two independent variables with common density γ_t and X = aU + bV, Y = cU + dV (with positive coefficients), then X + Y has density γ_{(a+b+c+d)t}, which is the convolution of the densities γ_{(a+b)t} of X and γ_{(c+d)t} of Y; nevertheless, X and Y are not independent. (For a related example see problem 1 in III,9.)

A simple reformulation of this experiment leads to a physical interpretation of the convolution formula (4.6). Our argument shows that if a unit light source is situated at the origin, then γ_t represents the distribution of the intensity of light along the line y = t of the x,y-plane. Then (4.6) expresses Huygens' principle, according to which the intensity of light along y = s + t is the same as if the source were distributed along the line y = s following the density γ_s. (I owe this remark to J. W. Walsh.)

[The Cauchy density corresponds to the special case n = 1 of the family (3.6) of Student's T-densities. In other words, if X and Y are independent random variables with the normal density n, then X/|Y| has the Cauchy density (4.5) with t = 1. For some related densities see problems 5-6.]

The convolution property (2.3) of the gamma densities looks exactly like (4.6), but there is an important difference in that the parameter ν of the gamma densities is essential, whereas (4.6) contains only a scale parameter. With the Cauchy density the type is stable. This stability under convolutions is shared by the normal and the Cauchy densities; the difference is that the scale parameters compose according to the rules σ² = σ₁² + σ₂² and t = t₁ + t₂, respectively. There exist other stable densities with similar properties, and with a systematic terminology we should call the normal and Cauchy densities "symmetric, stable of exponent 2 and 1." (See VI.)

(f) One-sided stable distribution of index ½. If N is the normal distribution function of (1.6), then

(4.7)   F(x) = 2[1 - N(1/√x)], x > 0,

defines a distribution function with density

(4.8)   f(x) = (1/√(2π)) x^{-3/2} e^{-1/(2x)}, x > 0.

Obviously no expectation exists. This distribution was found in 1; III,(7.7) and again in 1; X,1 as limit of the distribution of recurrence times, and this derivation implies a composition rule: writing f_a(x) = (a/√(2π)) x^{-3/2} e^{-a²/(2x)}, so that f₁ = f,

(4.9)   f_a ★ f_b = f_{a+b}.

(A verification by elementary, but rather cumbersome, integrations is possible; the Fourier analytic proof is simpler.) If X₁, ..., X_n are independent random variables with the distribution (4.7), then (4.9) implies that (X₁ + ··· + X_n)n^{-2} has the same distribution, and so the averages (X₁ + ··· + X_n)n^{-1} are likely to be of the order of magnitude of n; instead of converging, they increase over all bounds. (See problems 7 and 8.)
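For (f), note that (4.7) is precisely the distribution of 1/Z² with Z standard normal, which makes the composition rule (4.9) easy to test by simulation; here two summands are used, so the sum scaled by 1/4 should reproduce the original law. (Language and sample size are choices of this illustration.)

    import numpy as np

    rng = np.random.default_rng(0)
    trials = 400_000

    x = 1 / rng.standard_normal(trials) ** 2     # law (4.7): P{1/Z**2 <= s} = 2[1-N(1/sqrt(s))]
    y = 1 / rng.standard_normal(trials) ** 2

    qs = np.linspace(0.1, 0.9, 9)
    print(np.quantile((x + y) / 4, qs).round(3))  # by (4.9): same law as x itself
    print(np.quantile(x, qs).round(3))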
(g) Distributions of the form e-2"*(r > 0,2 > 0) appear in connection with order statistics (see problem 8). Together with the variant 1—e~® they appear (rather mysteriously) under the name of Weibull disteibutions in statistical reliability theory. (h) The logistic distribution function 1 (4.10) Fu) a>o Tee ‘may serve as a warning. An unbelievably huge literature tried to establish a transcendental “law of logistic growth”; measured in appropriate units, practically all growth processes Ws RANDOMIZATION AND MIXTURES 33 ‘were supposed to be represented by a function of the form (4.10) with ¢ representing time. Lengthy tables, complete with chi-square tests, supported this thesis for human populations, for bacterial colonies, development of railroads, etc. Both height and weight of plants and animals were found to follow the logistic law even though itis theoretically ‘lear that these two variables cannot be subject to the same distribution. Laboratory ‘experiments on bacteria showed that not even systematic disturbances can produce other results, Population theory relied on logistic extrapolations (even though they were demonstrably unreliable). ‘The only trouble with the theory is that not only the logistic distribution but also the normal, the Cauchy, and other distributions can be fitted to the same material with the same or better goodness of fit In this competition the logistic distribution plays no distinguished role whatever; most contradictory theoretical models ‘can be supported by the same observational material ‘Theories of this nature are short-lived because they open no new ways, and new con- firmations ofthe same old thing soon grow boring. But the naive reasoning as such has not been superseded by common sense, and so it may be useful to have an explicit demonstration of how misleading a mere goodness of fit can be. 5, RANDOMIZATION AND MIXTURES Let F be a distribution function depending on a parameter 9, and w a probability density. Then (1) Wea) =| ” Fa, 6) u(0) a9 isa monotone function of x increasing from 0 to 1 and hence a distribution function. If F has a continuous density f, then W has. density w given by (52) w(2) -f "fx, 0) u(0) d0, Instead of integrating with respect to a density u we can sum with respect to a discrete probability distribution: if 0,,0,,... are chosen arbitrarily and if py 0, Epp = 1, then (5:3) wa) = LI, 9) Pe defines a new probability density. The process may be described proba- bilistically as randomization; the parameter 6 is treated as random variable and a new probability distribution is defined in the z, -plane, which serves as sample space. Densities of the form (5.3) are called mixtures, and the term is now used generally for distributions and densities of the form (5.1) and (2), We do not propose at this juncture to develop a general theory. Our aim is rather to illustrate by a few examples the scope of the method and its 3 W. Feller, On the lagistic law of growth and its empirical verifications in biology, Acta Biotheoretica, vol. 5 (1940) pp. 31-66. 54 SPECIAL DENSITIES. RANDOMIZATION WS probabilistic content. The examples serve also as preparation for the notion of conditional probabilities. The next section is devoted to examples of discrete distributions obtained by randomization of a continuous parameter. Finally, section 7 illustrates the construction of continuous processes out of random walks; as a by-product we shall obtain distributions occurring in many applications and otherwise requiring hard calculations. 
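As a first concrete illustration of (5.1)-(5.2): mixing the exponential distribution F(x,θ) = 1 - e^{-θx} over a gamma density u for θ yields, by direct integration, W(x) = 1 - (λ/(λ+x))^ν, a Pareto-type law as in (4.3). The ingredients (exponential F, gamma u, the parameter values) are choices of this sketch, not of the text.

    import numpy as np

    rng = np.random.default_rng(0)
    nu, lam, trials = 2.0, 1.0, 300_000

    theta = rng.gamma(nu, 1 / lam, trials)   # randomized parameter, density u
    x = rng.exponential(1 / theta)           # X given theta is exponential

    for t in (0.5, 1.0, 3.0):
        print(t, np.mean(x <= t), 1 - (lam / (lam + t)) ** nu)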
Examples. (a) Ratios. If X is a random variable with density f, then for fixed y > 0 the variable X/y has density f(xy)y. Treating the parameter y as a random variable with density g we get the new density

(5.4)   w(x) = ∫_0^∞ f(xy) y g(y) dy.

This is the same as formula (3.2) on which the discussion in section 3 was based. In probabilistic language, randomizing the denominator y in X/y means considering the random variable X/Y, and we have merely rephrased the derivation of the density (3.2) of X/Y. In this particular case the terminology is a matter of taste.

(b) Random sums. Let X₁, X₂, ... be mutually independent random variables with a common density f. The sum S_n = X₁ + ··· + X_n has the density f^{n★}, namely the n-fold convolution of f with itself. [See I,2.] The number n of terms is a parameter which we now randomize by a probability distribution P{N = n} = p_n. The density of the resulting sum S_N with the random number N of terms is

(5.5)   w = Σ_{n=1}^∞ p_n f^{n★}.

As an example take for {p_n} the geometric distribution p_n = q p^{n-1}, and for f an exponential density. Then f^{n★} = g_n is given by (2.2), and

(5.6)   w(x) = qα e^{-αx} Σ_{n=1}^∞ (pαx)^{n-1}/(n-1)! = qα e^{-qαx}.

(c) Application to queuing. Consider a single server with exponential servicing time distribution (density f(t) = μe^{-μt}) and assume the incoming traffic to be Poisson, that is, the inter-arrival times are independent with density λe^{-λt}, λ < μ. The model is described in 1; XVII,7(b). Arriving customers join a (possibly empty) "waiting line" and are served in order of arrival without interruption. Consider a customer who on his arrival finds n ≥ 0 other customers in the line. The total time that he spends at the server is the sum of the service times of these n customers plus his own service time: a random variable with density f^{(n+1)★}. We saw in 1; XVII,(7.10) that in the steady state the probability of finding exactly n customers in the waiting line equals qp^n with p = λ/μ. Assuming this steady state, we see that the total time T spent by a customer at the server is a random variable with density

Σ_{n=0}^∞ q p^n f^{(n+1)★}(t) = qμ e^{-μt} Σ_{n=0}^∞ (pμt)^n/n! = (μ-λ) e^{-(μ-λ)t}.

Thus E(T) = 1/(μ-λ). (See also problem 10.)

(d) Waiting lines for buses. A bus is supposed to appear every hour on the hour, but is subject to delays. We treat the successive delays X_k as independent random variables with a common distribution F and density f. For simplicity we assume 0 ≤ X_k < 1. Denote by T_x the waiting time of a person arriving at epoch x < 1 after noon. The probability that the bus scheduled for noon has already departed is F(x), and for 0 < t < 1 - x the probability that the person catches the noon bus within time t equals F(t+x) - F(x). (The waiting time beyond 1 - x involves in like manner the delay of the next scheduled bus.) ▶

6. DISCRETE DISTRIBUTIONS

This section is devoted to a quick glance at some results of randomizing binomial and Poisson distributions. The number S_n of successes in n Bernoulli trials has a distribution depending on the probability p of success. Treating p as a random variable with density u leads to the new distribution

(6.1)   P{S_n = k} = C(n,k) ∫_0^1 p^k (1-p)^{n-k} u(p) dp.

Examples. (a) When u(p) = 1, repeated integrations by parts show (6.1) to be independent of k, and (6.1) reduces to the discrete uniform distribution P{S_n = k} = (n+1)^{-1}. More illuminating is an argument due to Bayes. Consider n + 1 independent variables X₀, ..., X_n distributed uniformly between 0 and 1. The integral in (6.1) (with u = 1) equals the probability that exactly k among the variables X₁, ..., X_n will be smaller than X₀; by symmetry the rank of X₀ among the n + 1 variables is equally likely to be any of the n + 1 possible values, and so this probability equals (n+1)^{-1}. In gambling language (6.1) corresponds to the situation when a skew coin is picked by a chance mechanism and then n trials are performed with this coin of unknown structure.
To a gambler the trials do not look independent; indeed, if a Jong sequence of heads is observed it becomes likely that for our coin p is close to 1 and so it is safe to bet on further occurrences of heads. Two formal examples may illustrate estimation and prediction problems of this type, Examples. (6) Given that 1 trials resulted in k successes (= hypothesis H), what is the probability of the event that p Turning to the Poisson distribution let us interpret it as regulating the number of “arrivals” during a time interval of duration 1. ‘The expected m6 DISCRETE DISTRIBUTIONS 37 number of arrivals is af. We illustrate two conceptually different ran- domization procedures. Examples. (d) Randomized time. If the dur random variable with density u, the probability p, of exactly & arrivals becomes 0 n= [eran For example, if the time interval is exponentially distributed, the probability of k=0,1,... new arrivals equals which is a geometric distribution. (€) Stratification. Suppose there are several independent sources for random artivals, each source having a Poisson output, but with different parameters. For example, accidents in a plant during a fixed exposure time # may be assumed to represent Poisson variables, but the parameter will vary from plant to plant. Similarly, telephone calls originating at an individual unit may be Poissonian with the expected number of calls varying from unit to unit. In such processes the parameter a appears as random variable with a density u, and the probability of exactly n arrivals during time ¢ is given by or) rol (65) ne f "¢ 7! u(x) da, For the special case of a gamma density u = fy... we get 61 ro= ("*) GEG nl\B+d Wore which is the limiting form of the Polya distribution as given in problem 24 of 4; V8 and 1; XVI1,(10.2) (setting B = at, » = a? — 1). » ‘Note on spurious contagion. A curious and instructive history attaches to the distribution 462) and its dual nature, ‘The Polya urn model and the Polya process which lead to (6.7) ate models for true contagion where every accident effectively inereases the probability of future accidents. ‘This model enjoyed great popularity, and (6.7) was fitted empirically to a variety of phenomena, a good fit being taken as an indication of true contagion By coincidence, the same distribution (6.7) has been derived previously (in 1920) by M. Greenwood and G. U. Yule with the intent that a good fit should disprove presence of contagion. Their derivation is roughly equivalent to our stratification model, which starts 58 SPECIAL DENSITIES. RANDOMIZATION 7 from the assumption underlying the Poisson process, namely, that there is no aftereffect whatever. We Rave thus the curious fact that a good fit ofthe same distribution may be interpreted in two ways diametccally opposite in their nature as well asin their practical implications. ‘This should serve as a warning against too hasty interpretations of statistical data. ‘The explanation lies in the phenomenon of spurious contagion, described in 1; V,2(d) and above in connection with (6.1). Tn the present situation, having observed m accidents uring a time interval of length + one may estimate the probability of m accidents during ‘a future exposure of duration 1 by a formula analogous to (6.3). 
Note on spurious contagion. A curious and instructive history attaches to the distribution (6.7) and its dual nature. The Polya urn model and the Polya process, which lead to (6.7), are models for true contagion, where every accident effectively increases the probability of future accidents. This model enjoyed great popularity, and (6.7) was fitted empirically to a variety of phenomena, a good fit being taken as an indication of true contagion. By coincidence, the same distribution (6.7) had been derived previously (in 1920) by M. Greenwood and G. U. Yule with the intent that a good fit should disprove the presence of contagion. Their derivation is roughly equivalent to our stratification model, which starts from the assumption underlying the Poisson process, namely, that there is no aftereffect whatever. We have thus the curious fact that a good fit of the same distribution may be interpreted in two ways diametrically opposite in their nature as well as in their practical implications. This should serve as a warning against too hasty interpretations of statistical data. The explanation lies in the phenomenon of spurious contagion, described in 1; V,2(d) and above in connection with (6.1). In the present situation, having observed m accidents during a time interval of length s, one may estimate the probability of n accidents during a future exposure of duration t by a formula analogous to (6.3). The result will depend on m, but this dependence is due to the method of sampling rather than to nature itself; the information concerning the past enables us to make better predictions concerning the future behavior of our sample, and this should not be confused with the future of the whole population.

7. BESSEL FUNCTIONS AND RANDOM WALKS

Surprisingly many explicit solutions in diffusion theory, queuing theory, and other applications involve Bessel functions. It is usually far from obvious that the solutions represent probability distributions, and the analytic theory required to derive their Laplace transforms and other relations is rather complex. Fortunately, the distributions in question (and many more) may be obtained by simple randomization procedures. In this way many relations lose their accidental character, and much hard analysis can be avoided.

By the Bessel function of order p > -1 we shall understand the function I_p defined for all real x by*

(7.1)    I_p(x) = \sum_{k=0}^\infty \frac{(x/2)^{2k+p}}{k!\, Γ(k+p+1)}.

* According to standard usage I_p is the "modified" Bessel function or Bessel function "with imaginary argument." The "ordinary" Bessel function, always denoted by J_p, is defined by inserting the factor (-1)^k on the right in (7.1). Our use of the term Bessel function should be understood as an abbreviation rather than an innovation.

We proceed to describe three procedures leading to three different types of distributions involving Bessel functions.

(a) Randomized Gamma Densities

For fixed p > -1 consider the gamma density f_{1,p+1} of (2.2). Taking the parameter k in f_{1,p+1+k} as an integral-valued random variable subject to a Poisson distribution we get, in accordance with (5.3), the new density

(7.2)    w_p(x) = \sum_{k=0}^\infty e^{-t} \frac{t^k}{k!} f_{1,p+1+k}(x) = e^{-t-x} \sum_{k=0}^\infty \frac{t^k x^{p+k}}{k!\, Γ(p+k+1)}.

Comparing terms in (7.1) and (7.2) one sees that

(7.3)    w_p(x) = e^{-t-x} (x/t)^{p/2} I_p(2\sqrt{tx}),    x > 0.

If p > -1 then w_p is a probability density concentrated on 0, ∞. (For p = -1 the right side is not integrable with respect to x.) Note that t is not a scale parameter, so that these densities are of different types. Incidentally, from this construction and the convolution formula (2.3) for the gamma densities it is clear that

(7.4)    w_p ⋆ w_q = w_{p+q+1},

where the parameter equals t on the left and 2t on the right.
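A direct numerical check of (7.2) and (7.3) is reassuring. The sketch below (order p and parameter t chosen arbitrarily; the Poisson series is truncated after 200 terms) compares the randomized gamma density with the closed Bessel form, using scipy's modified Bessel function:

    import numpy as np
    from scipy import special, stats

    p, t = 0.5, 2.0
    x = np.linspace(0.1, 10.0, 5)

    # Left side of (7.2): Poisson mixture of the gamma densities f_{1, p+1+k}.
    ks = np.arange(200)
    weights = stats.poisson.pmf(ks, t)
    mixture = np.array([np.sum(weights * stats.gamma.pdf(xi, a=p + 1 + ks))
                        for xi in x])

    # Right side of (7.3): closed form involving the Bessel function I_p.
    bessel = np.exp(-t - x) * (x / t) ** (p / 2) * special.iv(p, 2 * np.sqrt(t * x))

    print(np.max(np.abs(mixture - bessel)))   # of the order of rounding errors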
(b) Randomized Random Walks

In discussing random walks one pretends usually that the successive jumps occur at epochs 1, 2, .... It should be clear, however, that this convention merely lends color to the description and that the model is entirely independent of time. An honest continuous-time stochastic process is obtained from the ordinary random walk by postulating that the time intervals between successive jumps correspond to independent random variables with the common density e^{-t}. In other words, the epochs of the jumps are regulated by a Poisson process, but the jumps themselves are random variables assuming the values +1 and -1 with probabilities p and q, independent of each other and of the Poisson process. To each distribution connected with the random walk there corresponds a distribution for the continuous-time process, which is obtained formally by randomization of the number of jumps.

To see the procedure in detail consider the position at a given epoch t. In the basic random walk the nth step leads to the position r > 0 iff among the first n jumps ½(n+r) are positive and ½(n-r) negative. This is impossible unless n - r = 2ν is even. In this case the probability of the position r just after the nth jump is

(7.5)    \binom{2ν+r}{ν} p^{ν+r} q^ν,    n = 2ν + r.

In our Poisson process the probability that up to epoch t exactly n = 2ν + r jumps occur is e^{-t} t^n/n!, and so in our time-dependent process the probability of the position r ≥ 0 at epoch t equals

(7.6)    a_r(t) = \sum_{ν=0}^\infty e^{-t} \frac{t^{2ν+r}}{(2ν+r)!} \binom{2ν+r}{ν} p^{ν+r} q^ν = (p/q)^{r/2}\, e^{-t} I_r(2\sqrt{pq}\, t),

and we reach two conclusions.

(i) If we define I_{-r} = I_r for r = 1, 2, 3, ..., then for fixed t > 0, p, q,

(7.7)    a_r(t) = (p/q)^{r/2}\, e^{-t} I_r(2\sqrt{pq}\, t),    r = 0, ±1, ±2, ...,

represents a probability distribution (that is, a_r ≥ 0 and Σ a_r = 1).

(ii) In our time-dependent random walk a_r(t) equals the probability of the position r at epoch t.

Two famous formulas for Bessel functions are immediate corollaries of this result. First, with the change of notations 2\sqrt{pq}\, t = x and p/q = u², the identity Σ a_r(t) = 1 becomes

(7.8)    e^{\frac{1}{2} x (u + u^{-1})} = \sum_{r=-\infty}^{+\infty} u^r I_r(x).

This is the so-called generating function for Bessel functions, or Schlömilch's formula (which sometimes serves as definition for I_r).

Second, it is clear from the nature of our process that the probabilities a_r(t) must satisfy the Chapman-Kolmogorov equation

(7.9)    a_r(s+t) = \sum_{k=-\infty}^{+\infty} a_k(s)\, a_{r-k}(t),

which expresses the fact that at epoch s the particle must be at some position k and that a transition from k to r is equivalent to a transition from 0 to r-k. We shall return to this relation in XVII,3. [It is easily verified directly from the representation (7.6) and the analogous formula for the probabilities in the random walk.] The Chapman-Kolmogorov relation (7.9) is equivalent to

(7.10)    I_r(s+t) = \sum_{k=-\infty}^{+\infty} I_k(s)\, I_{r-k}(t),

which is known as K. Neumann's identity.

(c) First Passages

For simplicity let us restrict our attention to symmetric random walks, p = q = ½. According to 1; III,(7.5), the probability that the first passage through the point r > 0 occurs at the jump number 2n-r is

(7.11)    \frac{r}{2n-r} \binom{2n-r}{n} 2^{-(2n-r)}.

The random walk being recurrent, such a first passage occurs with probability one, that is, for fixed r the quantities (7.11) add up to unity. In our time-dependent process the epoch of the kth jump has the gamma density f_{1,k} of (2.2). It follows that the epoch of the first passage through r > 0 has density

(7.12)    \sum_n \frac{r}{2n-r} \binom{2n-r}{n} 2^{-(2n-r)} \frac{t^{2n-r-1}}{(2n-r-1)!}\, e^{-t} = \frac{r}{t}\, e^{-t} I_r(t).

Thus:

(i) For fixed r = 1, 2, ...,

(7.13)    v_r(t) = \frac{r}{t}\, e^{-t} I_r(t)

defines a probability density concentrated on 0, ∞.

(ii) The epoch of the first passage through r > 0 has density v_r. (See problem 15.)

This derivation permits another interesting conclusion. A first passage through r+ρ at epoch t presupposes a previous first passage through r at some epoch s < t. Because of the independence of the jumps in the time intervals 0, s and s, t and the lack of memory of the exponential waiting times we must have

(7.14)    v_{r+ρ} = v_r ⋆ v_ρ.

[A computational verification of this relation from (7.12) is easy if one uses the corresponding convolution property of the probabilities (7.11).] Actually the proposition (i) and the relation (7.14) are true for all positive values of the parameters r and ρ.*

* W. Feller, Infinitely divisible distributions and Bessel functions associated with random walks, J. Soc. Indust. Appl. Math., vol. 14 (1966), pp. 864-875.

8. DISTRIBUTIONS ON A CIRCLE

The half-open interval 0, 1 may be taken as representing the points of a circle of unit length, but it is preferable to wrap the whole line around the circle. The circle then receives an orientation, and the arc length runs from -∞ to ∞, but x, x±1, x±2, ... are interpreted as the same point. Addition is modulo 1, just as addition of angles is modulo 2π. A probability density on the circle is a periodic function f ≥ 0 such that

(8.1)    \int_0^1 f(x)\, dx = 1.

Examples. (a) Buffon's needle problem (1777). The traditional formulation is as follows. A plane is partitioned into strips of unit width parallel to the y-axis. A needle of unit length is thrown at random. What is the probability that it lies athwart two strips?

To state the problem formally consider first the center of the needle. Its position is determined by two coordinates, but y is disregarded and x is reduced modulo 1. In this way "the center of the needle" becomes a random variable X on the circle with a uniform distribution. The direction of the needle may be described by the angle (measured clockwise) between the needle and the y-axis. A turn through π restores the position of the needle, and hence the angle is determined only up to a multiple of π. We denote it by Zπ. In Buffon's needle problem it is implied that X and Z are independent and uniformly distributed variables** on the circle with unit length.

** The sample space of the pair (X, Z) is a torus.

If we choose to represent X by values between 0 and 1 and Z by values between -½ and ½, the needle crosses a boundary iff ½ cos Zπ > X or ½ cos Zπ > 1 - X. For a given value z between -½ and ½ the probability that X < ½ cos zπ is the same as the probability that 1 - X < ½ cos zπ, namely ½ cos zπ. Thus the required probability is

(8.2)    \int_{-1/2}^{1/2} \cos zπ\, dz = \frac{2}{π}.
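A simulation sketch of the needle experiment (the sample size is arbitrary) reproduces the value 2/π ≈ 0.6366:

    import numpy as np

    rng = np.random.default_rng(3)
    trials = 1_000_000

    x = rng.uniform(size=trials)               # center of the needle, reduced modulo 1
    z = rng.uniform(-0.5, 0.5, size=trials)    # the angle variable Z

    half_cos = 0.5 * np.cos(np.pi * z)
    crossings = (half_cos > x) | (half_cos > 1 - x)
    print(crossings.mean(), 2 / np.pi)         # both ≈ 0.6366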
A random variable X on the line may be reduced modulo 1 to obtain a variable °X on the circle. Rounding errors in numerical calculations are random variables of this kind. If X has density f, the density of °X is given by*

(8.3)    g(x) = \sum_{n=-\infty}^{+\infty} f(x+n).

Every density on the line thus induces a density on the circle. [It will be seen in XIX,5 that the same g admits of an entirely different representation in terms of Fourier series. For the special case of normal densities see example XIX,5(e).]

* Readers worried about convergence should consider only densities f concentrated on a finite interval. The uniform convergence is obvious if f is monotone for x and -x sufficiently large. Without any conditions on f the series may diverge at some points, but g always represents a density because the partial sums in (8.3) represent a monotone sequence of functions whose integrals tend to 1. (See IV,2.)

Examples. (b) Poincaré's roulette problem. Consider the number of rotations of a roulette wheel as a random variable X with a density f concentrated on the positive half-axis. The observed net result, namely the point °X at which the wheel comes to rest, is the variable X reduced modulo 1. Its density is given by (8.3).

One feels instinctively that "under ordinary circumstances" the density of °X should be nearly uniform. In 1912 H. Poincaré put this vague feeling on the solid basis of a limit theorem. We shall not repeat this analysis because a similar result follows easily from (8.3). The tacit assumption is, of course, that the given density f is spread out effectively over a long interval so that its maximum m is small. Assume for simplicity that f increases up to a point a where it assumes its maximum m = f(a), and that f decreases for x > a. For the density g of the reduced variable °X we have then

(8.4)    g(x) - 1 = \sum_{n=-\infty}^{+\infty} f(x+n) - \int_{-\infty}^{+\infty} f(s)\, ds.

For fixed x the points x + n partition the line into intervals of unit length; on each side of the maximum the function f is monotone, so that every term f(x+n), with the possible exception of the two terms nearest to a, is majorized by the integral of f over an adjacent interval of unit length. Comparing the sum in (8.4) with the integral term by term one finds in this way that |g(x) - 1| ≤ 2m: the density of °X is indeed nearly uniform whenever the maximum m of f is small.

(c) First significant digits. Let Y > 0 be a random variable with some unknown distribution. The first significant digit of Y equals k iff 10^n k ≤ Y < 10^n (k+1) for some integer n, that is, iff the variable X = log_{10} Y reduced modulo 1 lies between log_{10} k and log_{10} (k+1). Whenever the density of °X is nearly uniform, the first significant digit of Y therefore equals k with probability near log_{10} ((k+1)/k); thus the digit 1 occurs with probability near log_{10} 2 ≈ 0.301 rather than the naively expected 1/9.

9. PROBLEMS FOR SOLUTION

8. If X_1, ..., X_n are independent with the common stable density (4.8), then

P{n^{-2} \max(X_1, ..., X_n) ≤ x} → e^{-\sqrt{2/(πx)}},    x > 0.

9. Let X and Y be independent with densities f and g concentrated on 0, ∞. If E(X) < ∞, the ratio X/Y has a finite expectation iff \int_0^\infty y^{-1} g(y)\, dy < ∞.
10. In example 5(c) find the density of the waiting time to the next discharge (a) if at epoch 0 the server is empty, (b) under steady-state conditions.

11. In example 5(d) show that

E(T_x) = F(x)(μ + 1 - x) + \int_x^1 (s-x) f(s)\, ds,

where μ is the expectation of F. From this verify the assertion concerning E(T_x) when x is uniformly distributed.

12. In example 5(d) find the waiting time distribution when f(s) = 1 for 0 < s < 1.

15. Show that in the randomized random walk of section 7 a first passage through r > 0 occurs with probability one provided p ≥ q. Show that the only change in (7.11) is that 2^{-(2n-r)} is replaced by p^n q^{n-r}, and that the conclusion is: for p ≥ q and r = 1, 2, ...,

v_r(t) = \frac{r}{t}\, e^{-t} (p/q)^{r/2} I_r(2\sqrt{pq}\, t)

defines a probability density concentrated on t > 0.

16. Let X and Y be independent variables, and °X and °Y the same variables reduced modulo 1. Show that °(X+Y) is obtained by reducing °X + °Y modulo 1. Verify the corresponding formula for convolutions by direct calculation.

CHAPTER III

Densities in Higher Dimensions. Normal Densities and Processes

For obvious reasons multivariate distributions occur less frequently than one-dimensional distributions, and the material of this chapter will play almost no role in the following chapters. On the other hand, it covers important material, for example, a famous characterization of the normal distribution and tools used in the theory of stochastic processes. Their true nature is best understood when divorced from the sophisticated problems with which they are sometimes connected.

1. DENSITIES

For typographical convenience we refer explicitly to the Cartesian plane R², but it will be evident that the number of dimensions is immaterial. We refer the plane to a fixed coordinate system with coordinate variables X_1, X_2. (A more convenient single-letter notation will be introduced in section 5.)

A non-negative integrable function f defined in R² and such that its integral equals one is called a probability density, or density for short. (All the densities occurring in this chapter are piecewise continuous, and so the concept of integration requires no comment.) The density f attributes to the region Ω the probability

(1.1)    P{Ω} = \iint_Ω f(x_1, x_2)\, dx_1\, dx_2,

provided, of course, that Ω is sufficiently regular for the integral to exist. All such probabilities are uniquely determined by the probabilities of rectangles parallel to the axes, that is, by the knowledge of

(1.2)    P{a_1 < X_1 ≤ b_1,\ a_2 < X_2 ≤ b_2} = \int_{a_1}^{b_1} \int_{a_2}^{b_2} f(x_1, x_2)\, dx_2\, dx_1

for all combinations a_j < b_j. Letting a_1 = a_2 = -∞ we get the distribution function F of f, namely

(1.3)    F(x_1, x_2) = P{X_1 ≤ x_1,\ X_2 ≤ x_2}.

Obviously F(b_1, x_2) - F(a_1, x_2) is the probability of a semi-infinite strip of width b_1 - a_1 and, the rectangle appearing in (1.2) being the difference of two such strips, the probability (1.2) equals the so-called mixed difference F(b_1, b_2) - F(a_1, b_2) - F(b_1, a_2) + F(a_1, a_2). It follows that the knowledge of the distribution function F uniquely determines all probabilities (1.1). Despite the formal analogy with the situation on the line, the concept of distribution function is much less useful in the plane, and it is best to concentrate on the assignment of probabilities (1.1) in terms of the density itself. This assignment differs from the joint probability distribution of two discrete random variables (1; IX,1) in two respects. First, integration replaces summation and, second, probabilities are now assigned only to "sufficiently regular" regions, whereas in discrete sample spaces all sets had probabilities.
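For a concrete check of the mixed-difference rule, take independent exponential coordinate variables, for which the distribution function factors. The sketch below (the rectangle is an arbitrary choice) computes the probability of a rectangle once as a double integral of the density and once as a mixed difference of F:

    import numpy as np
    from scipy import integrate

    # f(x1, x2) = exp(-x1 - x2) on the positive quadrant, with
    # F(x1, x2) = (1 - exp(-x1)) * (1 - exp(-x2)).
    f = lambda x1, x2: np.exp(-x1 - x2)
    F = lambda x1, x2: (1 - np.exp(-x1)) * (1 - np.exp(-x2))

    a1, b1, a2, b2 = 0.3, 1.2, 0.5, 2.0

    prob_integral, _ = integrate.dblquad(f, a1, b1, a2, b2)
    prob_mixed = F(b1, b2) - F(a1, b2) - F(b1, a2) + F(a1, a2)
    print(prob_integral, prob_mixed)   # equal up to quadrature error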
As the present chapter treats only simple examples in which the difference is hardly noticeable, the notions and terms of the discrete theory carry over in a self-explanatory manner. Just as in the preceding chapters, we employ therefore a probabilistic language without any attempt at a general theory (which will be supplied in chapter V).

It is apparent from (1.3) that*

(1.4)    P{X_1 ≤ x_1} = F(x_1, ∞).

Thus F_1(x_1) = F(x_1, ∞) defines the distribution function of X_1, and its density f_1 is given by

(1.5)    f_1(x_1) = \int_{-\infty}^{+\infty} f(x_1, x_2)\, dx_2.

* Here and in the following U(∞) = lim U(x) as x → ∞, and the use of the symbol U(∞) implies the existence of the limit.

When it is desirable to emphasize the connection between X_1 and the pair (X_1, X_2) we again speak of F_1 as marginal distribution** and of f_1 as marginal density. The expectation μ_1 and variance σ_1² of X_1 (if they exist) are given by

(1.6)    μ_1 = E(X_1) = \int\int x_1 f(x_1, x_2)\, dx_1\, dx_2

and

(1.7)    σ_1² = Var(X_1) = \int\int (x_1 - μ_1)² f(x_1, x_2)\, dx_1\, dx_2.

** Projection on the axes is another accepted term.

By symmetry these definitions apply also to X_2. Finally, the covariance of X_1 and X_2 is

(1.8)    Cov(X_1, X_2) = \int\int (x_1 - μ_1)(x_2 - μ_2) f(x_1, x_2)\, dx_1\, dx_2.

The normalized variables X_j σ_j^{-1} are dimensionless, and their covariance, namely ρ = Cov(X_1, X_2) σ_1^{-1} σ_2^{-1}, is the correlation coefficient of X_1 and X_2 (see 1; IX,8).

A random variable U is a function of the coordinate variables X_1 and X_2; again we consider for the present only functions such that the probabilities P{U ≤ t} can be evaluated by integrals of the form (1.1). Thus each random variable will have a unique distribution function, each pair will have a joint distribution, etc.

In many situations it is expedient to change the coordinate variables, that is, to let two variables Y_1, Y_2 play the role previously assigned to X_1, X_2. In the simplest case the Y_j are defined by a linear transformation

(1.9)    X_1 = a_{11} Y_1 + a_{12} Y_2,    X_2 = a_{21} Y_1 + a_{22} Y_2,

with determinant Δ = a_{11} a_{22} - a_{12} a_{21} > 0. Generally a transformation of the form (1.9) may be described either as a mapping from one plane to another or as a change of coordinates in the same plane. Introducing the change of variables (1.9) into the integral (1.1) we get

(1.10)    P{Ω} = \iint_{Ω_1} f(a_{11} y_1 + a_{12} y_2,\ a_{21} y_1 + a_{22} y_2)\, Δ\, dy_1\, dy_2,

the region Ω_1 containing all points (y_1, y_2) whose image (x_1, x_2) is in Ω. Since the events (X_1, X_2) ∈ Ω and (Y_1, Y_2) ∈ Ω_1 are identical, it is seen that the joint density of (Y_1, Y_2) is given by

(1.11)    g(y_1, y_2) = f(a_{11} y_1 + a_{12} y_2,\ a_{21} y_1 + a_{22} y_2)\, Δ.

All this applies equally to higher dimensions. A similar argument applies to more general transformations, except that the determinant Δ is replaced by the Jacobian. We shall use explicitly only the change to polar coordinates

(1.12)    X_1 = R cos Θ,    X_2 = R sin Θ,

with (R, Θ) restricted to R ≥ 0, -π < Θ ≤ π. Here the density of (R, Θ) is given by

(1.13)    g(r, θ) = f(r cos θ,\ r sin θ)\, r.

In three dimensions one uses the geographic longitude φ and latitude θ (with -π < φ ≤ π and -½π ≤ θ ≤ ½π). The coordinate variables in the polar system are then defined by

(1.14)    X_1 = R cos Φ cos Θ,    X_2 = R sin Φ cos Θ,    X_3 = R sin Θ.

For their joint density one gets

(1.15)    g(r, φ, θ) = f(r cos φ cos θ,\ r sin φ cos θ,\ r sin θ)\, r² cos θ.

In the transformation (1.14) the "planes" Θ = -½π and Θ = ½π correspond to the half axes in the x_3-direction, but this singularity plays no role since these half axes have zero probability. A similar remark applies to the origin for polar coordinates in the plane.
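Formula (1.13) is easily tested numerically. For the familiar bivariate normal density f(x_1, x_2) = (2π)^{-1} e^{-(x_1²+x_2²)/2} it yields g(r, θ) = (2π)^{-1} r e^{-r²/2}, that is, Θ is uniform and R has density r e^{-r²/2}, the two being independent. A simulation sketch (sample size arbitrary; the correlation test is of course only a necessary condition for independence):

    import numpy as np

    rng = np.random.default_rng(4)
    x1, x2 = rng.standard_normal((2, 500_000))

    r = np.hypot(x1, x2)
    theta = np.arctan2(x2, x1)

    # (1.13) predicts the marginal density r * exp(-r**2 / 2) for R.
    hist, edges = np.histogram(r, bins=50, range=(0, 4), density=True)
    mid = 0.5 * (edges[:-1] + edges[1:])
    print(np.max(np.abs(hist - mid * np.exp(-mid**2 / 2))))  # small binning error
    print(np.corrcoef(r, theta)[0, 1])                       # ≈ 0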
Examples. (a) Independent variables. In the last chapters we considered independent variables X_1 and X_2 with densities f_1 and f_2. This amounts to defining a bivariate density by f(x_1, x_2) = f_1(x_1) f_2(x_2), and the f_j represent the marginal densities.

(b) "Random choice." Let Γ be a bounded region; for simplicity we assume Γ convex. Denote the area of Γ by γ and put f equal to γ^{-1} within Γ and equal to 0 outside Γ. Then f is a density, and the probability of any region Ω ⊂ Γ equals the ratio of the areas of Ω and Γ. By obvious analogy with the one-dimensional situation we say that the pair (X_1, X_2) is distributed uniformly over Γ. The marginal density of X_1 at the abscissa x_1 equals γ^{-1} times the width of Γ at x_1, in the obvious sense of the word. (See problem 1.)

(c) Uniform distribution on a sphere. The unit sphere Σ in three dimensions may be represented in terms of the geographic longitude φ and latitude θ by the equations

(1.16)    x_1 = cos φ cos θ,    x_2 = sin φ cos θ,    x_3 = sin θ.

To each pair (φ, θ) with -π < φ ≤ π and -½π ≤ θ ≤ ½π there corresponds exactly one point of the sphere.

(c) Let the interval 0, 1 be partitioned into n+1 subintervals by n points chosen in it independently and uniformly, and denote the lengths of these subintervals by U_1, ..., U_{n+1}. For the joint density of (U_1, ..., U_n) we get u(u_1, ..., u_n) = n! in the region u_1 > 0, ..., u_n > 0, u_1 + ⋯ + u_n < 1, and hence (U_1, ..., U_n) is distributed uniformly over this region. This result is stronger than the previously established fact that the U_k have a common distribution function [example I,7(b) and problems in I,13].

(d) Once more the randomness of the exponential distribution. Let X_1, ..., X_{n+1} be independent with the common density αe^{-αx} for x > 0. Put S_k = X_1 + ⋯ + X_k. Then (S_1, S_2, ..., S_{n+1}) is obtained from (X_1, ..., X_{n+1}) by a linear transformation of the form (1.9) with determinant 1. Denote by Ω the "octant" of points with x_1 > 0, ..., x_{n+1} > 0. The density of (X_1, ..., X_{n+1}) is concentrated on Ω and is given there by α^{n+1} e^{-α(x_1 + ⋯ + x_{n+1})}. The variables S_1, ..., S_{n+1} map Ω onto the region Ω* defined by 0 < s_1 ≤ s_2 ≤ ⋯ ≤ s_{n+1} < ∞, and [see (1.11)] within Ω* the density of (S_1, ..., S_{n+1}) is given by α^{n+1} e^{-α s_{n+1}}. The marginal density of S_{n+1} is known to be the gamma density α^{n+1} s^n e^{-αs}/n!, and hence the conditional density of the n-tuple (S_1, ..., S_n) given that S_{n+1} = s equals n! s^{-n} for 0 < s_1 < ⋯ < s_n < s (and zero elsewhere). In other words, given that S_{n+1} = s, the variables (S_1, ..., S_n) are uniformly distributed over their possible range. Comparing this with example (b) we may say that given S_{n+1} = s, the variables (S_1, ..., S_n) represent n points chosen independently and at random in the interval 0, s, numbered in their natural order from left to right.

(e) Another distribution connected with the exponential. With a view to a surprising application we give a further example of a transformation. Let again X_1, ..., X_n be independent variables with a common exponential distribution and S_n = X_1 + ⋯ + X_n. Consider the variables U_1, ..., U_n defined by

(3.3)    U_k = X_k/S_n   for k < n,    U_n = S_n,

or, what amounts to the same,

(3.4)    X_k = U_k U_n   for k < n,    X_n = (1 - U_1 - ⋯ - U_{n-1}) U_n.

The density of (X_1, ..., X_n) is concentrated on the octant x_1 > 0, ..., x_n > 0, and in it this density is given by α^n e^{-α(x_1 + ⋯ + x_n)}. The Jacobian of (3.4) equals u_n^{n-1}, and it follows that the joint density of (U_1, ..., U_n) is given by α^n u_n^{n-1} e^{-α u_n} in the region Ω* defined by

u_1 > 0, ..., u_{n-1} > 0,    u_1 + ⋯ + u_{n-1} < 1,    u_n > 0,

and that it vanishes outside Ω*. An integration with respect to u_n shows that the joint density of (U_1, ..., U_{n-1}) equals (n-1)! in the corresponding region and 0 elsewhere. Comparing with example (c) we see that (U_1, ..., U_{n-1}) has the same distribution as if U_k were the length of the kth interval in a random partition of 0, 1 by n-1 points.
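The conclusion of example (e) lends itself to a quick experimental check. The sketch below (arbitrary n and sample size) generates the two (n-1)-tuples, normalized exponentials on the one hand and lengths of a random partition on the other, and compares the distributions of their maxima at a few quantiles:

    import numpy as np

    rng = np.random.default_rng(5)
    n, trials = 6, 100_000

    # Construction of example (e): exponential variables normalized by their sum.
    x = rng.exponential(size=(trials, n))
    u_exp = x[:, :-1] / x.sum(axis=1, keepdims=True)

    # Random partition of (0, 1) by n-1 uniform points: first n-1 interval lengths.
    points = np.sort(rng.uniform(size=(trials, n - 1)), axis=1)
    grid = np.hstack([np.zeros((trials, 1)), points, np.ones((trials, 1))])
    u_part = np.diff(grid, axis=1)[:, :-1]

    # The two (n-1)-tuples should share one distribution; compare the maxima.
    for q in (0.5, 0.9, 0.99):
        print(q, np.quantile(u_exp.max(axis=1), q), np.quantile(u_part.max(axis=1), q))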
Conversely, reasonable assumptions on the random variables X,, ¥, lead to a stochastic process with sample functions given by (3.5). For a time it was fashionable to introduce models of this form and to detect “hidden periodicities” for sunspots, wheat prices, poetic creativity, etc. Such hidden periodicities used to be discovered as easily as witches in medieval times, but even strong faith must be fortified by a statistical test. The method is roughly as follows. A trigonometric polynomial of the form (3.5) with well-chosen frequencies Gy. +5 Wy is fitted to some observational data, and a particularly large amplitude R, is observed. One wishes to prove that this cannot be due to U4 CHARACTERIZATION OF THE NORMAL DISTRIBUTION 1 chance and hence that «, is a true period. To test this conjecture one asks whether the large observed value of R, is plausibly compatible with the hypothesis that all_ components play the same role. For a test one assumes, accordingly, that the coefficients X,,...,¥, are mutually independent with a common normal distribution with zero expectation and variance o. In this case (see 11,3) the R? are mutually independent and have a common exponential distribution with expectation 20% If an observed value R? deviated “significantly” from this predicted expectation it was customary to jump to the conclusion that the hypothesis of equal weights was untenable, and R, represented a “hidden periodicity. The fallacy of this reasoning was exposed by R. A. Fisher (1929) who pointed out that the maximum among m independent observations does not obey the same probability distribution as each variable taken separately. ‘The error of treating the worst case statistically as if it had been chosen at random is still common in medical statistics, but the reason for discussing the matter here is the surprising and amusing connection of Fisher's test of significance with covering theorems. ‘As only the ratios of the several components are significant we normalize the coefficients by letting G6) y, a Rif + RE Since the R? have a common exponential distribution we can use the preceding example with X, = 3. Then Vy = Uy,..-,Vya = Uy, but V,=1—U,—+:+-U,4. Accordingly, the ntuple (Vy....,V,) és distributed as the length of the n intervals into which 0,1 is partitioned by a random distribution of n—\ points. The probability that all V, be less than a is therefore given by formula 1,(9.9) of the covering theorem. This result illustrates the occurrence of unexpected relations between apparently unconnected problems.” » *4, A CHARACTERIZATION OF THE NORMAL DISTRIBUTION Con: jer a non-degenerate linear transformation of coordinate variables 4.) y= ayXy + aXe, Yp = Xs + aerXey 7 Fisher derived the distribution of the maximal term among the V, in 1929 without kknowiedge of the covering theorem, and explained in 1940 the equivalence with the covering theorem after W. L. Stevens had proved the latter. (See papers No, 16 and 37 in Fisher's Contributions to Mathematical Statistics, Jobn Wiley, New York (1950) For an alternative derivation using Fourier analysis see U. Grenander and M. Rosenblatt 950, ™ This section treats a special topic and is not used in the sequel. ® DENSITIES IN HIGHER DIMENSIONS m4 and suppose (without loss of generality) that the determinant A= 1. If X and X, are independent normal variables with variances o? and of the distribution of the pair (¥,, ¥.) is normal with covariance dnt + Miedo [sce example 1(d)]. 
In this case there exist non-trivial choices of the coeffi- cients aj, such that Y, and Y, are independent. The following theorem shows that this property of the univariate normal distribution is not shared by any other distribution. We shall here prove it only for distributions with continuous densities, in which case it reduces to a lemma concerning the functional equation (4.3). By the use of characteristic functions the most general case is reduced to the same equation, and so our proof will really yield the theorem in its greatest generality (see XV,8). ‘The elementary treatment of densities reveals better the basis of the theorem. ‘The transformation (4.1) is meaningful only if no coefficient a,, vanishes. Indeed, suppose for example that a,, = 0. Without loss of generality we may choose the scale parameters so that ay = 1. Then Y, = Xs, and a glance at (4.4) shows that in this case Y, must have the same density as X,. In other words, such a transformation amounts to a mere renaming of the variables, and need not be considered. Theorem. Suppose that X, and X, are independent of each other, and that the same is true of the pair Y,, Ys. If no coefficient aj, vanishes then all four variables are normal. ‘The most interesting special case of (4.1) is presented by rotations, namely transformations of the form (42) Y=Xpcosw+Xsinw, Y¥,= —X sinw + X,cos o where @ is not a multiple of 47. Applying the theorem to them we get Corollary. If X, and Xz are independent and there exists one rotation (4.2) such that Y, and Y, are also independent, then X, and Xz have normal distributions with the same variance. In this ease Y, and Y, are independent for every 1. Example, Maxwell distribution of velocities. In his study of the velocity distributions of molecules in R? Maxwell assumed that in every Cartesian coordinate system the three components of the velocity are mutually independent random variables with zero expectation. Applied to rotations leaving one axis fixed our corollary shows immediately that the three com- ponents are normally distributed with the same variance. As we saw in 11,3 this implies the Maxwell distribution for velocities. > un CHARACTERIZATION OF THE NORMAL DISTRIBUTION 9 ‘The theorem has a long history going back to Maxwell's investigations. Purely prob- abilistic studies were initiated by M. Kac (1940) and S. Bernstein (1941), who proved ‘our corollary assuming finite variances, An impressive number of authors contributed improvements and variants, sometimes by rather deep methods. The development ‘culminates in a result proved by V. P. Skitovig.® Now to the proof in the case of continuous densities. We denote the densities of X, and Y, respectively by u; and fj. For abbreviation we put (43) Ya = yt, + Ae, Ya = Ont, + det, Under the conditions of the theorem we must have 44) Aly falve) = aC) ol). We shall show that this relation implies that (4.5) Fy) = te, u(z) = be" where the exponents are polynomials of degree 2 or lower. The only probability densities of this form are the normal densities. For distributions with continuous densities the theorem is therefore contained in the following Lemma. Suppose that four continuous functions f, and u, are connected by the functional equation (4.4), and that no coefficient aj, vanishes. The functions are then of the form (4.5) where the exponents are polynomials of degree <2. (It is, of course, assumed that none of the functions vanishes identically.) Proof. We note first that none of our functions can have a zero. 
Indeed, otherwise there would exist a domain Q in the x, zyplane in which the two members of (4.4) have no zeros and on whose boundary they vanish. But the two sides require on the one hand that the boundary consists of segments parallel to the axes, on the other hand of segments parallel to the lines y, = const. This contradiction shows that no such boundary exists. ‘We may therefore assume our functions to be strictly positive. Passing to logarithms we can rewrite (4.4) in the form (4.6) rs) + P2(Y2) = 4(%) + alr 9). For fixed fy and hy define the mixed difference operator A by (AI) Ales ta) = Oly, tot) — Oe hy, 24h) — = OG, tythy) + (hy, Zh). * Lavestia Acad. Nauk SSSR, vol. 18 (1954) pp. 185-200. The theorem: Let Xyy.-. 1 Xq ‘be mutually independent, Y, = Eo,X,, and Y, = Eb,X, where no coefficient is 0. If Yq and Y¥, are independent the X, are normally distributed. 80 DENSITIES IN HIGHER DIMENSIONS MLS Because each «, depends on the single variable 2, it follows that Ac, Also (4.8) AgiQn) = GiQh +h) ~ A+) — Ge) + HAH) where we put for abbreviation (4.9) 1 = Oyhy + Ayala, fy = dyhy — Ayah We have thus Ay, + Ag, =0 with , depending on the single variable Ys Keeping yz fixed one sees that Ay,(y,) is a constant depending only on hy and h,, We now choose /, and hy so that 4 =f and f,=0, where 1 is arbitrary, but fixed. The relation Ag, = const. then takes on the form (4.10) PUA+D + PAD — 2a) = AC. Near a point y, at which , assumes a minimum the left side is >0, and hence such a point can exist only if 2(¢) > 0 for all rin some neighborhood of the origin. But in this case g, cannot assume a maximum. Now a continuous function vanishing at three points assumes both a maximum and a minimum. We conclude that if a continuous solution of (4.10) vanishes at three distinct points, then it is identically zero. Every quadratic polynomial q(y,) = ay? + fy; + y satisfies an equation of the form (4.10) (with a different right side), and hence the same is true of the difference gy(y,) — g(y,). But q can be chosen such that this difference vanishes at three prescribed points, and then ,(y,) is identical with g. The same argument applies to g.. and this proves the assertion concerning, fi and fz, Since the variables X, and Y, play the same role, the same ‘argument applies to the densities u, » 5, MATRIX NOTATION. THE COVARIANCE MATRIX ‘The notation employed in section | is messy and becomes more so in higher dimensions. Elegance and economy of thought may be achieved by the use of matrix notation. For ease of reference we summarize the few facts of matrix theory and the notations used in the sequel. The basic rue i: first rows, then columns. Thus an a by @ matrix A has a rowsand columns; its elements are denoted by aya, the first index indicating the row. If B isa fby y matrix withelements 6, the product 4B isthe eeby "matrix with elements aj,byu + dyabau +--+ aipbgy. No product is defined if the number of columns fof A does not agree with the number of rows of B. The associative law (AB)C = ABC) holds, whereas in general AB x BA. The transpose AT isthe (fby matrix with elements af = ays, Obviously (ABT) = BAT. ‘A one by 2 matrix with a single row is called a row vector; a matrix with a single ‘column, a column vector A row vector r= (Fy... +a) is easily printed, but a column * This is veally an abuse of language. 
In a concrete case *, may represent pounds and 2, cows; then (2,2) is no “vector” in the strict sense LS MATRIX NOTATION, THE COVARIANCE MATRIX 8t vector is better indicated by its transpose eT = (cy,....¢,). Note that er is an a by & ‘matrix (of the “multiplication table” type) whereas re is a one by one matrix, oF scalar. ee ‘The zero vector has all components equal to 0. (revtrece) Matrices with the same number of rows and columns are called square matrices. With fa square matrix A there is associated its determinant, a number which will be denoted by |A\. For our purposes it sufies to know that the determinants ate multiplicative: if A find B fare square matrices and C = AB, then |C| = |4\- |B}. The transpose A? has the same determinant as. By identity mairix is meant a square matrix with ones in the main diagonal and zeros atall other places. If 1 isthe identity matrix with r rowsand columnsand A an r by r matrix, obviously JA = Al = A. By inverse of A_is meant a matrix A"? such that AA? & AMA = 1. [Only square matrices can have inverses. The inverse is unique, for if B isany inverse of A wehave AB = 1 and by the astociativelaw A! = (4-14)B = B.) AA square matrix without inverse is called singular. The multiplicative property of deter- ‘minants implies that 2 matrix with zero determinant is singular. The converse is also true if |A| #0 then is non-singular. In other words, a matrix A is singular iff there exists a non-zero vector © such that x4 = 0. ‘A square matrix A is symmetric if aye = days that is. 4 ‘associated with a symmetric r by matrix A is defined by A. The quadratic form past = Sayer where 2... yy are indeterminates, ‘The matrix is positive definite if xAx > O for alt ‘on-2ro veciors 2. If follows from the last criterion that a postive definite matrix is non Singular. ‘Rotations in 84, For completeness we mention briefly a geometric application of matrix calculus although it will not be used inthe sequel “The mer product of tW0 FOW Vectors 2 = (Fy. .»- sy) andy = Cy.) is dined by ryt aye = Sey, ‘The length L of 2 is given by L® = a27. If x and y are vectors of unit length the angle 5 between them is given by €os 8 = 2y?. An a by a matrix A induces a transformation mapping = into $=-r4; for the transpose one has 7 = AT2?. The matrix A is orthogonal ifthe induced transformation preserves lengths and angles, that isto say, if any two row vectors have the same inner product as their images: Thus A is orthogonal iff for any pair of row vectors =, AAT YE my? ‘This implies that AAT is the identity matrix J as can be seen by choosing for = and y vectors with 2 — 1 vanishing components. We have thus found that 4 is orthogonal if” AAT ws J. Since A and AT have the same determinant it follows that it equals +1 or =I. An orthogonal matrix with determinant {is called a rovation matrix and the induced transformation isa rotation, 82 DENSITIES IN HIGHER DIMENSIONS WLS From now on we denote a point of the rdimensional space R’ by a single letter to be interpreted as a row vector. Thus 2 = (z,...,%,) and fle) = flz,...,,), ete. Inequalities are to be interpreted coordinatewise: tZ is the product of the determinants of A and the transformation Y-+ Z and hence it is positive. > Theorem 3. The matrices Q and M are inverses of each other and 66) y= Qny (MI where |M|= |Ql" is the determinant of M. Proof. With the notations of the preceding theorem put (62) D = E(Z"Z) = C™MC. This is a matrix with diagonal elements E(Z3) = a} and zero elements outside the diagonal. 
The density of Z is the product of the normal densities n(z_k σ_k^{-1}) σ_k^{-1} and hence is induced by the matrix D^{-1} with diagonal elements σ_k^{-2}. Now the density of Z is obtained from the density (6.2) of X by the substitution x = zC^{-1} and multiplication by the determinant |C^{-1}|. Accordingly

(6.8)    z D^{-1} z^T = x Q x^T

and

(6.9)    (2π)^r |D| = γ^{-2} |C|².

From (6.8) it is seen that

(6.10)    Q = C D^{-1} C^T,

and in view of (6.7) this implies Q = M^{-1}. From (6.7) it follows also that |D| = |M|·|C|², and hence (6.9) is equivalent to (6.6).

The theorem implies in particular that a factorization of M corresponds to an analogous factorization of Q, and hence we have the

Corollary. If (X_1, X_2) is normally distributed, then X_1 and X_2 are independent iff Cov(X_1, X_2) = 0, that is, iff X_1 and X_2 are uncorrelated.

More generally, if (X_1, ..., X_r) has a normal density, then (X_1, ..., X_n) and (X_{n+1}, ..., X_r) are independent iff Cov(X_j, X_k) = 0 for j ≤ n < k.

Warning. The corollary depends on the joint density of (X_1, X_2) being normal and does not apply if it is only known that the marginal densities of X_1 and X_2 are normal. In the latter case the density of (X_1, X_2) need not be normal and, in fact, need not exist. This fact is frequently misunderstood. (See problems 2-3.)

Theorem 4. A matrix M is the covariance matrix of a normal density iff it is positive definite.

Since the density is induced by the matrix Q = M^{-1}, an equivalent formulation is: A matrix Q induces a normal density (6.2) iff it is positive definite.

Proof. We saw at the end of section 5 that every covariance matrix of a density is positive definite. The converse is trivial when r = 1, and we proceed by induction. Assume Q positive definite. For x_1 = ⋯ = x_{r-1} = 0 we get q(x) = q_{rr} x_r², and hence q_{rr} > 0. Under this hypothesis, we saw that q may be reduced to the form (6.5). Choosing x_r such that y_r = 0 we see that the positive definiteness of Q implies q_1 > 0 for all choices of x_1, ..., x_{r-1} that are not all zero. By the induction hypothesis, therefore, q_1 corresponds to a normal density in r-1 dimensions. From (6.5) it is now obvious that q corresponds to a normal density in r dimensions, and this completes the proof.
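Theorem 3 is easy to verify numerically: sampling from the normal density determined by a covariance matrix M recovers M empirically, and the determinants behave as (6.6) requires. A sketch (the matrix M below is an arbitrary positive definite choice; a Cholesky factor plays the role of the matrix C of the preceding theorem):

    import numpy as np

    rng = np.random.default_rng(7)

    # An arbitrary positive definite covariance matrix M (r = 3).
    a = rng.standard_normal((3, 3))
    M = a @ a.T + 3.0 * np.eye(3)
    Q = np.linalg.inv(M)

    # Build row vectors X with covariance matrix M from independent normal components.
    c = np.linalg.cholesky(M)
    x = rng.standard_normal((500_000, 3)) @ c.T

    print(np.round(np.cov(x.T), 3))   # ≈ M, the inverse of Q
    print(np.round(M, 3))

    # |M| = |Q|**-1, so the normalization constant (6.6) is well defined.
    print(np.isclose(np.linalg.det(M), 1.0 / np.linalg.det(Q)))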
We conclude this general theory by an interpretation of (6.5) in terms of conditional densities which leads to a general formulation of the regression theory explained for the two-dimensional case in example 2(a). Put for abbreviation a_k = -q_{rk}/q_{rr}, so that

(6.11)    y_r = q_{rr} (x_r - a_1 x_1 - ⋯ - a_{r-1} x_{r-1}).

For a probabilistic interpretation of the coefficients a_k we recall that Y_r was found to be independent of X_1, ..., X_{r-1}. In other words, the a_k are numbers such that

(6.12)    T = X_r - a_1 X_1 - ⋯ - a_{r-1} X_{r-1}

is independent of (X_1, ..., X_{r-1}), and this property uniquely characterizes the coefficients a_k.

To obtain the conditional density of X_r for given X_1 = x_1, ..., X_{r-1} = x_{r-1} we must divide the density of (X_1, ..., X_r) by the marginal density of (X_1, ..., X_{r-1}). In view of (6.5) we get an exponential with exponent -½ y_r²/q_{rr}. It follows that the conditional density of X_r for given X_1 = x_1, ..., X_{r-1} = x_{r-1} is normal with expectation a_1 x_1 + ⋯ + a_{r-1} x_{r-1} and variance 1/q_{rr}. Accordingly

(6.13)    E(X_r | X_1, ..., X_{r-1}) = a_1 X_1 + ⋯ + a_{r-1} X_{r-1}.

We have thus proved the following generalization of the two-dimensional regression theory embodied in (2.6).

Theorem 5. If (X_1, ..., X_r) has a normal density, the conditional density of X_r for given X_1, ..., X_{r-1} is again normal. Furthermore, the conditional expectation (6.13) is the unique linear function of X_1, ..., X_{r-1} making T independent of (X_1, ..., X_{r-1}). The conditional variance equals Var(T) = 1/q_{rr}.

Example. Sample mean and variance. In statistics the random variables

(6.14)    \bar X = \frac{1}{n}(X_1 + ⋯ + X_n),    S² = \frac{1}{n} \sum_{k=1}^n (X_k - \bar X)²

are called the sample mean and sample variance of X = (X_1, ..., X_n). It is a curious fact that if X_1, ..., X_n are independent normal variables with E(X_k) = 0 and E(X_k²) = σ², the random variables \bar X and S² are independent. The proof illustrates the applicability of the preceding results. We put Y_k = X_k - \bar X for k < n. Then Cov(\bar X, Y_k) = Cov(\bar X, X_k) - Var(\bar X) = σ²/n - σ²/n = 0, and since (\bar X, Y_1, ..., Y_{n-1}) has a normal distribution, it follows from the corollary that \bar X is independent of (Y_1, ..., Y_{n-1}). But X_n - \bar X = -(Y_1 + ⋯ + Y_{n-1}), so that S² is a function of (Y_1, ..., Y_{n-1}); hence \bar X and S² are independent.

If ρ < r the distribution of Y is degenerate in ρ dimensions.
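The independence of \bar X and S² asserted in the last example can be observed directly. The sketch below (arbitrary n and sample size) computes the correlation of \bar X and S², which vanishes for normal samples; for comparison, the same correlation is clearly positive for exponential samples, to which the example does not apply:

    import numpy as np

    rng = np.random.default_rng(8)
    n, trials = 5, 400_000

    x = rng.standard_normal((trials, n))
    mean = x.mean(axis=1)
    var = x.var(axis=1)                   # sample variance with divisor n, as in (6.14)

    # Independence implies zero correlation (a necessary-condition check).
    print(np.corrcoef(mean, var)[0, 1])   # ≈ 0 for normal samples

    y = rng.exponential(size=(trials, n))
    print(np.corrcoef(y.mean(axis=1), y.var(axis=1))[0, 1])   # clearly nonzero

Zero correlation alone would of course not prove independence, but for normal samples the preceding theory guarantees it.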
