Questions or comments concerning this laboratory should be directed to Prof. Charles A. Bouman, School of Electrical and Computer Engineering, Purdue University, West Lafayette IN 47907; (765) 494-0340; bouman@ecn.purdue.edu
In this section, we will study the concept of a bivariate distribution. We will see that bivariate distributions characterize how two random variables are related to each other. We will also see that correlation and covariance are two simple measures of the dependencies between random variables, which can be very useful for analyzing both random variables and random processes.
Sometimes we need to account for not just one random variable, but several. In this section, we will examine the case of two random variables (the so-called bivariate case), but the theory is easily generalized to accommodate more than two.
The random variables X and Y have cumulative distribution functions (CDFs) FX(x) and FY(y), also known as marginal CDFs. Since there may be an interaction between X and Y, the marginal statistics may not fully describe their behavior. Therefore we define a bivariate, or joint CDF as

FXY(x,y) = P(X ≤ x, Y ≤ y) .
If the joint CDF is sufficiently “smooth”, we can define a joint probability density function,

fXY(x,y) = ∂²FXY(x,y) / (∂x ∂y) .
Conversely, the joint probability density function may be used to calculate the joint CDF:

FXY(x,y) = ∫_{−∞}^{x} ∫_{−∞}^{y} fXY(s,t) dt ds .
The random variables X and Y are said to be independent if and only if their joint CDF (or PDF) is a separable function, which means

FXY(x,y) = FX(x) FY(y)   (equivalently, fXY(x,y) = fX(x) fY(y)) .
Informally, independence between random variables means that one random variable does not tell you anything about the other. As a consequence of the definition, if X and Y are independent, then the expectation of their product equals the product of their expectations: E[XY] = E[X]E[Y].
While the joint distribution contains all the information about X and Y, it can be very complex and is often difficult to calculate. In many applications, a simple measure of the dependencies of X and Y can be very useful. Three such measures are the correlation, covariance, and the correlation coefficient.
Correlation: E[XY]

Covariance: cov(X,Y) = E[(X − μX)(Y − μY)]

Correlation coefficient: ρXY = cov(X,Y) / (σX σY)
If the correlation coefficient is 0, then X and Y are said to be uncorrelated. Notice that independence implies uncorrelatedness; however, the converse is not true.
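The distinction can be seen numerically. The following is a NumPy sketch (the lab itself uses Matlab) of the classic example: X standard normal and Y = X², where Y is completely determined by X yet the two are uncorrelated, since E[XY] = E[X³] = 0 = E[X]E[Y].

```python
import numpy as np

# X standard normal, Y = X^2: dependent but uncorrelated.
rng = np.random.default_rng(0)
x = rng.standard_normal(100_000)
y = x ** 2

# Sample correlation coefficient should be close to 0.
rho = np.corrcoef(x, y)[0, 1]
print(f"sample correlation coefficient: {rho:.4f}")
```

Even though knowing X pins down Y exactly, the linear measure of dependence vanishes.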
In the following experiment, we will examine the relationship between the scatter plots for pairs of random samples and their correlation coefficient. We will see that the correlation coefficient determines the shape of the scatter plot.
Let X and Y be independent Gaussian random variables, each with mean 0 and variance 1. We will consider the correlation between X and Z, where Z is equal to the following:
Notice that since Z is a linear combination of two Gaussian random variables, Z will also be Gaussian.
Use Matlab to generate 1000 i.i.d. samples of X, denoted as X1, X2, ..., X1000. Next, generate 1000 i.i.d. samples of Y, denoted as Y1, Y2, ..., Y1000. For each of the four choices of Z, perform the following tasks:
Use Equation 11.8 to analytically calculate the correlation coefficient ρXZ between X and Z. Show all of your work. Remember that independence between X and Y implies that E[XY]=E[X]E[Y]. Also remember that X and Y are zero-mean and unit variance.
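All four cases follow the same pattern. As a template (using a generic linear combination Z = aX + bY rather than the lab's specific choices of Z), with X and Y independent, zero-mean, and unit-variance:

```latex
\sigma_Z^2 = \operatorname{var}(aX + bY) = a^2 + b^2 ,\qquad
\operatorname{cov}(X,Z) = E[X(aX+bY)] = a\,E[X^2] + b\,E[X]E[Y] = a ,
```
```latex
\rho_{XZ} = \frac{\operatorname{cov}(X,Z)}{\sigma_X \sigma_Z}
          = \frac{a}{\sqrt{a^2 + b^2}} .
```

Substituting the coefficients of each of the four given Z's into this template yields the four theoretical values of ρXZ.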
Create samples of Z using your generated samples of X and Y.
Generate a scatter plot of the ordered pairs of samples (Xi, Zi). Do this by plotting the points (X1, Z1), (X2, Z2), ..., (X1000, Z1000). To plot points without connecting them with lines, use the '.' format, as in plot(X,Z,'.').
Use the command subplot(2,2,n) (n=1,2,3,4) to plot the four cases for Z in the same figure. Be sure to label each plot using the title command.
Empirically compute an estimate of the correlation coefficient using your samples Xi and Zi and the following formula:

ρ̂XZ = Σi (Xi − X̄)(Zi − Z̄) / sqrt( Σi (Xi − X̄)² · Σi (Zi − Z̄)² )
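A NumPy sketch of the sample correlation coefficient computation (the lab itself uses Matlab). The choice Z = (X + Y)/2 below is a hypothetical example for illustration only, not necessarily one of the lab's four cases; for it, the template ρXZ = a/sqrt(a² + b²) gives 1/√2 ≈ 0.707.

```python
import numpy as np

def sample_corrcoef(x, z):
    """Sample correlation coefficient of two equal-length sample vectors."""
    xm = x - x.mean()
    zm = z - z.mean()
    return np.sum(xm * zm) / np.sqrt(np.sum(xm ** 2) * np.sum(zm ** 2))

rng = np.random.default_rng(1)
x = rng.standard_normal(1000)
y = rng.standard_normal(1000)
z = (x + y) / 2  # hypothetical Z, for illustration only

print(sample_corrcoef(x, z))  # should be near 1/sqrt(2) ≈ 0.707
```

With 1000 samples, the estimate typically lands within a few hundredths of the theoretical value.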
Hand in your derivations of the correlation coefficient ρXZ along with your numerical estimates ρ̂XZ of the correlation coefficient.
Why are ρXZ and ρ̂XZ not exactly equal?
Hand in your scatter plots of (Xi, Zi) for the four cases. Note the theoretical correlation coefficient ρXZ on each plot.
Explain how the scatter plots are related to ρXZ.
In this section, we will generate discrete-time random processes and then analyze their behavior using the correlation measure introduced in the previous section.
A discrete-time random process Xn is simply a sequence of random variables. So for each n, Xn is a random variable.
The autocorrelation is an important function for characterizing the behavior of random processes. If X is a wide-sense stationary (WSS) random process, the autocorrelation is defined by

rXX(m) = E[Xn Xn+m] .
Note that for a WSS random process, the autocorrelation does not vary with n. Also, since rXX(−m) = E[Xn Xn−m] = E[Xn−m Xn] = rXX(m), the autocorrelation is an even function of the “lag” value m.
Intuitively, the autocorrelation determines how strong a relation there is between samples separated by a lag value of m. For example, if X is a sequence of independent identically distributed (i.i.d.) random variables each with zero mean and variance σX², then the autocorrelation is given by

rXX(m) = σX² δ(m) ,

where δ(m) is the Kronecker delta function (1 for m = 0, and 0 otherwise).
We use the term white or white noise to describe this type of random process. More precisely, a random process is called white if its values Xn and Xn+m are uncorrelated for every m≠0.
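This can be checked empirically: the estimated autocorrelation of i.i.d. samples should be near σX² at lag 0 and near 0 at every other lag. A NumPy sketch (the lab itself uses Matlab):

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.standard_normal(10_000)  # i.i.d. samples: white, mean 0, variance 1
N = len(x)

# Estimate r_XX(m) = E[X_n X_{n+m}] at a few lags by averaging products.
r = {m: np.mean(x[:N - m] * x[m:]) for m in (0, 1, 5, 20)}
print(r)  # r[0] near 1; other lags near 0
```

The lag-0 value estimates the variance; the nonzero lags fluctuate around zero with magnitude on the order of 1/sqrt(N).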
If we run a white random process Xn through an LTI filter as in Figure 11.1, the output random variables Yn may become correlated. In fact, it can be shown that the output autocorrelation rYY(m) is related to the input autocorrelation rXX(m) through the filter's impulse response h(m): rYY(m) = h(m) ∗ h(−m) ∗ rXX(m), where ∗ denotes convolution.
Consider a white Gaussian random process Xn with mean 0 and variance 1 as input to the following filter.
Calculate the theoretical autocorrelation of Yn using Equation 11.15 and Equation 11.16. Show all of your work.
Generate 1000 independent samples of a Gaussian random variable X with mean 0 and variance 1. Filter the samples using Equation 11.17. We will denote the filtered signal Yi, i=1,2,⋯,1000.
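The filtering step can be sketched in NumPy (the lab itself uses Matlab). Since Equation 11.17 specifies the lab's actual filter, the two-tap filter yn = xn + xn−1 below is a hypothetical stand-in for illustration; for that filter with unit-variance white input, rYY(0) = 2, rYY(±1) = 1, and rYY(m) = 0 otherwise.

```python
import numpy as np

rng = np.random.default_rng(3)
x = rng.standard_normal(1000)

# Hypothetical filter, for illustration only (the lab's filter is Eq. 11.17):
# y_n = x_n + x_{n-1}
h = np.array([1.0, 1.0])
y = np.convolve(x, h)[:len(x)]  # keep the first 1000 output samples

# Empirical check of the theoretical autocorrelation for this filter:
r0 = np.mean(y * y)             # should be near r_YY(0) = 2
r1 = np.mean(y[:-1] * y[1:])    # should be near r_YY(1) = 1
print(r0, r1)
```

The same pattern (convolve, then average lagged products) applies with the lab's actual filter coefficients substituted for h.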
Draw 4 scatter plots using the form subplot(2,2,n), (n=1,2,3,4). The first scatter plot should consist of the points (Yi, Yi+1), (i=1,2,⋯,900). Notice that this correlates samples that are separated by a lag of “1”. The other 3 scatter plots should consist of the points (Yi, Yi+m) for three successively larger lag values m, (i=1,2,⋯,900), respectively.
What can you deduce about the random process from these scatter plots?
For real applications, the theoretical autocorrelation may be unknown. Therefore, rYY(m) may be estimated by the sample autocorrelation, r′YY(m), defined by

r′YY(m) = 1/(N − |m|) Σ_{n=1}^{N−|m|} Yn Yn+|m| ,
where N is the number of samples of Y.
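A NumPy sketch of this estimate (the lab itself uses Matlab). The 1/(N − m) normalization below is an assumption; check it against the lab's Equation 11.18, which may normalize by 1/N instead.

```python
import numpy as np

def sample_autocorr(y, max_lag):
    """Sample autocorrelation r'_YY(m) for m = 0..max_lag.

    Assumes the unbiased 1/(N - m) normalization; verify against
    the lab's Equation 11.18 before handing in results.
    """
    n = len(y)
    return np.array([np.sum(y[:n - m] * y[m:]) / (n - m)
                     for m in range(max_lag + 1)])

# Sanity check on unit-variance white noise: r'(0) near 1, other lags near 0.
rng = np.random.default_rng(4)
y = rng.standard_normal(1000)
r = sample_autocorr(y, 10)
print(r[0])
```

Running this on the filtered signal Yi instead of white noise, and overlaying the theoretical rYY(m), completes the exercise.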
Use Matlab to calculate the sample autocorrelation of Yn using Equation 11.18. Plot both the theoretical autocorrelation rYY(m) and the sample autocorrelation r′YY(m).