Calculating Basic Statistical Procedures in SPSS by John R. Slate, Ana Rojas-LeBouef - HTML preview

/ Home / Mathematics / Calculating Basic Statistical Procedures in SPSS

PLEASE NOTE: This is an HTML preview only and some elements such as links or page numbers may be incorrect.
Download the book in PDF, ePub, Kindle for a complete version.

Chapter 4. Calculating Correlations: Parametric and Non Parametric

This Chapter has been peer-reviewed, accepted, and endorsed by the National Council of Professors of Educational Administration (NCPEA) as a significant contribution to the scholarship and practice of education administration. Formatted and edited in Connexions by Theodore Creighton and Brad Bizzell, Virginia Tech, Janet Tareilo, Stephen F. Austin State University, and Thomas Kersten, Roosevelt University.

This chapter is part of a larger Collection (Book) and is available at: Calculating Basic Statistical Procedures in SPSS: A Self-Help and Practical Guide to Preparing Theses, Dissertations, and Manuscripts

Slate and LeBouef have written a "companion book" which is available at: Preparing and Presenting Your Statistical Findings: Model Write Ups

John R. Slate is a Professor at Sam Houston State University where he teaches Basic and Advanced Statistics courses, as well as professional writing, to doctoral students in Educational Leadership and Counseling. His research interests lie in the use of educational databases, both state and national, to reform school practices. To date, he has chaired and/or served over 100 doctoral student dissertation committees. Recently, Dr. Slate created a website Writing and Statistical Help to assist students and faculty with both statistical assistance and in editing/writing their dissertations/theses and manuscripts.

Ana Rojas-LeBouef is a Literacy Specialist at the Reading Center at Sam Houston State University where she teaches developmental reading courses. She recently completed her doctoral degree in Reading, where she conducted a 16-year analysis of Texas statewide data regarding the achievement gap. Her research interests lie in examining the inequities in achievement among ethnic groups. Dr. Rojas-LeBouef also assists students and faculty in their writing and statistical needs on the website Writing and Statistical Help.

Theodore B. Creighton, is a Professor at Virginia Tech and the Publications Director for NCPEA Publications, the Founding Editor of Education Leadership Review, and the Senior Editor of the NCPEA Connexions Project.

Brad E. Bizzell, is a recent graduate of the Virginia Tech Doctoral Program in Educational Leadership and Policy Studies, and is a School Improvement Coordinator for the Virginia Tech Training and Technical Assistance Center. In addition, Dr. Bizzell serves as an Assistant Editor of the NCPEA Connexions Project in charge of technical formatting and design.

Janet Tareilo, is a Professor at Stephen F. Austin State University and serves as the Assistant Director of NCPEA Publications. Dr. Tareilo also serves as an Assistant Editor of the NCPEA Connexions Project and as a editor and reviewer for several national and international journals in educational leadership.

Thomas Kersten is a Professor at Roosevelt University in Chicago. Dr. Kersten is widely published and an experienced editor and is the author of Taking the Mystery Out of Illinois School Finance, a Connexions Print on Demand publication. He is also serving as Editor in Residence for this book by Slate and LeBouef.

Calculating Correlations: Parametric and Nonparametric

In this set of steps, readers will calculate either a parametric or a nonparametric statistical analysis, depending on whether the data reflect a normal distribution. A parametric statistical procedure requires that its data be reflective of a normal curve whereas no such assumption is made in the use of a nonparametric procedure. Of the two types of statistical analyses, the parametric procedure is the more powerful one in ascertaining whether or not a statistically significant relationship, in this case, exists. As such, parametric procedures are preferred over nonparametric procedures. When data are not normally distributed, however, parametric analyses may provide misleading and inaccurate results. Accordingly, nonparametric analyses should be used in cases where data are not reflective of a normal curve. In this set of steps, readers are provided with information on how to make the determination of normally or nonnormally distributed data. For detailed information regarding the assumptions underlying parametric and nonparametric procedures, readers are referred to the Hyperstats Online Statistics Textbook at http://davidmlane.com/hyperstat/ or to the Electronic Statistics Textbook (2011) at http://www.statsoft.com/textbook/

Research questions for which correlations are appropriate involve asking for relationships between or among variables. The research question, “What is the relationship between study skills and grades for high school students?” could be answered through use of a correlation.

Step One:

Perform ScatterPlots

√ Graphs

√ Legacy Dialogs

√ Scatter/Dot

√ The Simple Scatter icon should be highlighted

√ Define

√ Drag one of the two variables of interest to the first box (Y axis) on the right hand side and the other variable of interest to the second box (X axis) on the right hand side. It does not matter which variable goes in the X or Y axis because your scatterplot results will be the same.

Once you have a variable in each of the two boxes, click on the OK tab on the bottom left hand corner of the screen.

√ Look at the scatterplots to see whether a linear relationship is present.

In the screenshot below, the relationship is very clearly linear.

Step Two:

Calculate Descriptive Statistics on Variables

√ Analyze

* Descriptive Statistics

* Frequencies

* Click on the variables for which you want descriptive statistics (your dependent variables)

* You may click on each variable separately or highlight several of them

* * Once you have a variable in the left hand cell highlighted, click on the arrow in the middle to send the variable to the empty cell titled Variable(s)

√ Statistics

* Click on as many of the options you would like to see results

* At the minimum, click on: M, SD, Skewness, and Kurtosis

* Continue

* Charts (these are calculated only if you wish to have visual depictions of skewness and of kurtosis-they are not required)

* Histograms (not required, optional) with Normal Curve

* Continue

* OK

Step Three:

Check for Skewness and Kurtosis values falling within/without the parameters of normality (-3 to +3)

* Skewness [Note. Skewness refers to the extent to which the data are normally distributed around the mean. Skewed data involve having either mostly high scores with a few low ones or having mostly low scores with a few high ones.] Readers are referred to the following sources for a more detailed definition of skewness: http://www.statistics.com/index.php?page=glossary&term_id=356 and http://www.statsoft.com/textbook/basic-statistics/#Descriptive%20statisticsb

To standardize the skewness value so that its value can be constant across datasets and across studies, the following calculation must be made: Take the skewness value from the SPSS output (in this case it is -.177) and divide it by the Std. error of skewness (in this case it is .071). If the resulting calculation is within -3 to +3, then the skewness of the dataset is within the range of normality (Onwuegbuzie & Daniel, 2002). If the resulting calculation is outside of this +/-3 range, the dataset is not normally distributed.

* Kurtosis [Note. Kurtosis also refers to the extent to which the data are normally distributed around the mean. This time, the data are piled up higher than normal around the mean or piled up higher than normal at the ends of the distribution.] Readers are referred to the following sources for a more detailed definition of kurtosis: http://www.statistics.com/index.php?page=glossary&term_id=326 and http://www.statsoft.com/textbook/basic-statistics/#Descriptive%20statisticsb

To standardize the kurtosis value so that its value can be constant across datasets and across studies, the following calculation must be made: Take the kurtosis value from the SPSS output (in this case it is .072) and divide it by the Std. error of kurtosis (in this case it is .142). If the resulting calculation is within -3 to +3, then the kurtosis of the dataset is within the range of normality (Onwuegbuzie & Daniel, 2002). If the resulting calculation is outside of this +/-3 range, the dataset is not normally distributed.

Table 4.1. Performance IQ (Wechsler Performance Intelligence 3)

Statistics

Performance IQ (Wechsler Performance Intelligence 3)

Valid	1180
Missing	2

Mean

81.14

Std. Deviation

14.005

Skewness

-.177

Std. Error of Skewness

.071

Kurtosis

.072

Std. Error of Kurtosis

.142

Standardized Coefficients Calculator

Copy variable #1 and #2 into the skewness and kurtosis calculator

Step Four:

Calculate a Correlation Procedure on the Data

√ Analyze

√ Correlate

√ Bivariate

√ Send Over Variables on which you want to calculate a correlation by clicking on the variables in the left hand cell and then clicking on the middle arrow to send them to the right hand cell.

√ Perform a Pearson r if the standardized skewness coefficients and standardized kurtosis coefficients are within normal limits—the Pearson r is the default

√ Calculate a Spearman rho if the standardized skewness coefficients and standardized kurtosis coefficients are outside of the normal limits of +/- 3

√ To calculate a Spearman rho, click on the Spearman button and unclick the Pearson r

√ Use the default two-tailed test of significance

√ Use the Flag significant Correlation

√ OK

Step Five:

Check for Statistical Significance

1. Go to the correlation box

2. Follow Sig. (2-tailed) row over to chosen variable column

3. If you have any value less than .05 or less than your Bonferroni adjustment, if you are calculating multiple correlations on the same sample in the same study, then you have statistical significance.

Table 4.2. Correlations

Verbal IQ (Wechsler Verbal Intelligence 3)

Performance IQ (Wechsler Performance Intelligence 3)

Verbal IQ (Wechsler Verbal Intelligence 3)

Pearson Correlation	1	.664^**
Sig. (2-tailed)		.000
N	1182	1180

Performance IQ (Wechsler Performance Intelligence 3)

Pearson Correlation	.664^**	1
Sig. (2-tailed)	.000
N	1180	1180

**. Correlation is significant at the 0.01 level (2-tailed).

[In this matrix, it appears that four unique correlations are present, one per cell. In fact, only one unique correlation, or r, is present in this four cell matrix.]

Step Six:

Check For Effect Size

1. Go to the correlation box

2. Find Pearson’s Correlation Row or Spearman rho’s and follow it to the variable column.

3. Your effect size will be located in the cell where the above intersect.

4. The effect size is calculated as:

Cohen's criteria for correlations (1998)

.1 = small (range from .1 to .29)

.3 = moderate (range from .3 to .49)

.5 = large (range from .5 to 1.0)

Correlations cannot be greater than 1.00, therefore a 0 should not be placed in front of the decimal.

Step Seven:

Check the Level of Variance the Variables Have in Common

1. Square the Pearson Correlation Value or Spearman rho value to find the variance

2. In this example, the Verbal IQ and the Performance IQ share 44.09% of the variance in common (see correlation value of .664).

Table 4.3. Correlations

Verbal IQ (Wechsler Verbal Intelligence 3)

Performance IQ (Wechsler Performance Intelligence 3)

Verbal IQ (Wechsler Verbal Intelligence 3)

Pearson Correlation	1	.664^**
Sig. (2-tailed)		.000
N	1182	1180

Performance IQ (Wechsler Performance Intelligence 3)

Pearson Correlation	.664^**	1
Sig. (2-tailed)	.000
N	1180	1180

** Correlation is significant at the 0.01 level (2-tailed).

Step Eight:

Write the Numerical Sentence

1. r(n)_sp=_spcorrelation coefficient,_spp_sp<_sp.001 (or Bonferroni-adjusted alpha significance error rate).

2. Using this example: r(1180) = .66, p < .001

[sp means to insert a space.] Remember that all mathematical symbols are placed in italics.

Step Nine:

Narrative and Interpretation