Introduction to Statistics by Ewa Paszek - HTML preview

PLEASE NOTE: This is an HTML preview only and some elements such as links or page numbers may be incorrect.
Download the book in PDF, ePub for a complete version.

Chapter 2Continuous Distributions

2.1CONTINUOUS DISTRIBUTION*

CONTINUOUS DISTRIBUTION

RANDOM VARIABLES OF THE CONTINUOUS TYPE

Random variables whose spaces are not composed of a countable number of points but are intervals or a union of intervals are said to be of the continuous type. Recall that the relative frequency histogram h( x ) associated with n observations of a random variable of that type is a nonnegative function defined so that the total area between its graph and the x axis equals one. In addition, h( x ) is constructed so that the integral

(2.1)
_autogen-svg2png-0003.png

is an estimate of the probability P( a<X<b ) , where the interval ( a,b ) is a subset of the space R of the random variable X.

Let now consider what happens to the function h( x ) in the limit, as n increases without bound and as the lengths of the class intervals decrease to zero. It is to be hoped that h( x ) will become closer and closer to some function, say f( x ) , that gives the true probabilities , such as P( a<X<b ) , through the integral

(2.2)
_autogen-svg2png-0010.png
Definition: PROBABILITY DENSITY FUNCTION

1. Function f(x) is a nonnegative function such that the total area between its graph and the x axis equals one.

2. The probability P( a<X<b ) is the area bounded by the graph of f( x ) , the x axis, and the lines x=a and x=b .

3. We say that the probability density function (p.d.f.) of the random variable X of the continuous type, with space R that is an interval or union of intervals, is an integrable function f( x ) satisfying the following conditions:

  • f( x )>0 , x belongs to R,

  • _autogen-svg2png-0017.png

  • The probability of the event A belongs to R is _autogen-svg2png-0018.png

Example 2.1

Let the random variable X be the distance in feet between bad records on a used computer tape. Suppose that a reasonable probability model for X is given by the p.d.f.

_autogen-svg2png-0019.png

Note That
R=( x:0≤x<∞ ) f( x ) xR_autogen-svg2png-0022.png

The probability that the distance between bad records is greater than 40 feet is

_autogen-svg2png-0023.png

The p.d.f. and the probability of interest are depicted in FIG.1.

Figure (Fig_ex1.gif)
Figure 2.1
The p.d.f. and the probability of interest.

We can avoid repeated references to the space R of the random variable X, one shall adopt the same convention when describing probability density function of the continuous type as was in the discrete case.

Let extend the definition of the p.d.f. f( x ) to the entire set of real numbers by letting it equal zero when, x belongs to R. For example,

_autogen-svg2png-0025.png

has the properties of a p.d.f. of a continuous-type random variable x having support ( x:0≤x<∞ ) . It will always be understood that f(x)=0 , when x belongs to R, even when this is not explicitly written out.

Definition: PROBABILITY DENSITY FUNCTION

1. The distribution function of a random variable X of the continuous type, is defined in terms of the p.d.f. of X, and is given by _autogen-svg2png-0028.png

2. For the fundamental theorem of calculus we have, for x values for which the derivative F'( x ) exists, that F’(x)=f(x).

Example 2.2

continuing with Example 1

If the p.d.f. of X is

_autogen-svg2png-0030.png

The distribution function of X is F( x )=0 for x≤0

_autogen-svg2png-0033.png

Note That
_autogen-svg2png-0034.png

Also F'( 0 ) does not exist. Since there are no steps or jumps in a distribution function F( x ) , of the continuous type, it must be true that P( X=b )=0 for all real values of b. This agrees with the fact that the integral

_autogen-svg2png-0038.png is taken to be zero in calculus. Thus we see that P( aXb )=P( a<X<b )=P( aX<b )=P( a<Xb )=F( b )−F( a ), provided that X is a random variable of the continuous type. Moreover, we can change the definition of a p.d.f. of a random variable of the continuous type at a finite (actually countable) number of points without alerting the distribution of probability.

For illustration, _autogen-svg2png-0040.png and _autogen-svg2png-0041.png

are equivalent in the computation of probabilities involving this random variable.

Example 2.3

Let Y be a continuous random variable with the p.d.f. g( y )=2y , 0<y<1 . The distribution function of Y is defined by

_autogen-svg2png-0044.png

Figure 2 gives the graph of the p.d.f. g( y ) and the graph of the distribution function G( y ) .

Figure (Cont_ex_3.gif)
Figure 2.2
The p.d.f. and the probability of interest.

For illustration of computations of probabilities, consider

_autogen-svg2png-0047.png

and _autogen-svg2png-0048.png

Recall That

The p.d.f. f( x ) of a random variable of the discrete type is bounded by one because f( x ) gives a probability, namely f( x )=P( X=x ) .

For random variables of the continuous type, the p.d.f. does not have to be bounded. The restriction is that the area between the p.d.f. and the x axis must equal one. Furthermore, it should be noted that the p.d.f. of a random variable X of the continuous type does not need to be a continuous function.

For example,

_autogen-svg2png-0052.png

enjoys the properties of a p.d.f. of a distribution of the continuous type, and yet f( x ) had discontinuities at x=0,1,2, and 3. However, the distribution function associates with a distribution of the continuous type is always a continuous function. For continuous type random variables, the definitions associated with mathematical expectation are the same as those in the discrete case except that integrals replace summations.

FOR ILLUSTRATION, let X be a random variable with a p.d.f. f( x ) . The expected value of X or mean of X is

_autogen-svg2png-0056.png The variance of X is _autogen-svg2png-0057.png

The standard deviation of X is _autogen-svg2png-0058.png

Example 2.4

For the random variable Y in the Example 3.

_autogen-svg2png-0059.png and

_autogen-svg2png-0060.png

2.2THE UNIFORM AND EXPONENTIAL DISTRIBUTIONS*

THE UNIFORM AND EXPONENTIAL DISTRIBUTIONS

The Uniform Distribution

Let the random variable X denote the outcome when a point is selected at random from the interval [ a,b ] , −∞<a<b<∞ . If the experiment is performed in a fair manner, it is reasonable to assume that the probability that the point is selected from the interval [ a,x ] , ax<b is ( xa )( ba ) . That is, the probability is proportional to the length of the interval so that the distribution function of X is

_autogen-svg2png-0006.png

Because X is a continuous-type random variable, F'( x ) is equal to the p.d.f. of X whenever F'( x ) exists; thus when a<x<b , we have f( x )=F'( x )=1/( ba ).

Definition: DEFINITION OF UNIFORM DISTRIBUTION

The random variable X has a uniform distribution if its p.d.f. is equal to a constant on its support. In particular, if the support is the interval [ a,b ] , then

(2.3)
_autogen-svg2png-0012.png

Moreover, one shall say that X is U( a,b ) . This distribution is referred to as rectangular because the graph of f( x ) suggest that name. See Figure1. for the graph of f( x ) and the distribution function F(x).

Figure (UniformDistPlot.png)
Figure 2.3
The graph of the p.d.f. of the uniform distriution.
Note that

We could have taken f( a )=0 or f( b )=0 without alerting the probabilities, since this is a continuous type distribution, and it can be done in some cases.

The mean and variance of X are as follows:

_autogen-svg2png-0018.png and _autogen-svg2png-0019.png

An important uniform distribution is that for which a=0 and b =1, namely U( 0,1 ) . If X is U( 0,1 ) , approximate values of X can be simulated on most computers using a random number generator. In fact, it should be called a pseudo-random number generator (see the pseudo-numbers generation) because the programs that produce the random numbers are usually such that if the starting number is known, all subsequent numbers in the sequence may be determined by simple arithmetical op