What kinds of research questions can the Chi-Square statistic address?
Researchers are interested in how religious fundamentalism affects views on teaching sex education in public schools. They use data from the 2008 General Social Survey to examine the association between the ordinal variable of religious fundamentalism (coded as fundamentalist, moderate, and liberal) and the dichotomous variable of teaching sex education in public schools (coded as oppose or favor). Their null hypothesis is that no relationship exists between these variables in the population; they are independent. The research hypothesis is that a relationship does exist between these variables in the population; the variables are dependent.
How does the Chi-Square statistic work?
The Chi-Square statistic is most commonly used to evaluate a Test of Independence in reference to a crosstabulation (also known as a bivariate table). Crosstabulation presents the distributions of two variables simultaneously, with the intersections of the categories of the variables appearing in the cells of the table. The Test of Independence assesses whether an association exists between the two variables by carefully examining the pattern of responses in the cells; calculating the Chi-Square statistic and comparing it against a critical value from the Chi-Square distribution allows the researcher to assess whether the association seen between the variables in a particular sample is likely to represent an actual relationship between those variables in the population.
The calculation of the Chi-Square statistic is quite straight-forward and intuitive:
where fo = the observed frequency (what is actually seen in the cells)
and fe = the expected frequency if NO relationship existed between the variables
As depicted in the formula, the Chi-Square statistic is based on the difference between what is actually observed in the table and what would be expected if there was truly no relationship between the variables; by evaluating the size of this difference, the researcher can determine whether the variables are dependent in the population.
How is the Chi-Square statistic run in SPSS and how is the output interpreted?
The Chi-Square statistic appears as an option when requesting a crosstabulation in SPSS. The output is labeled Chi-Square Tests; the Chi-Square statistic used in the Test of Independence is in the first row labeled Pearson Chi-Square. This statistic can be evaluated by comparing the actual value against a critical value found in a Chi-Square distribution (where degrees of freedom is calculated as # of rows – 1 x # of columns – 1), but it is easier to simply examine the p-value provided by SPSS. To make a conclusion about the hypothesis with 95% confidence, the value labeled Asymp. Sig. (which is the p-value of the Chi-Square statistic) should be less than .05 (which is the alpha level associated with a 95% confidence level).
Is p-value (labeled Asymp. Sig.) < .05? If so, conclude that the variables are dependent in the population.
In this example, there is an association between fundamentalism and views on teaching sex education in public schools. While 17.2% of fundamentalists oppose teaching sex education, only 6.5% of liberals are opposed. The p-value indicates that these variables are dependent in the population.
What are special concerns with regard to the Chi-Square statistic?
There are a number of important considerations when using the Chi-Square statistic to evaluate a crosstabulation. Because of how the Chi-Square value is calculated, it is extremely sensitive to sample size – when the sample size is sufficiently large (~500), almost any small difference appears significant. It is also sensitive to the distribution within the cells, and SPSS gives a warning message if cells have fewer than 5 cases. This can be addressed by always using categorical variables with a limited number of categories (by combining categories if necessary to produce a smaller table).
Statistics Solutions can assist with your quantitative analysis by assisting you to develop your methodology and results chapters. The services that we offer include:
- Edit your research questions and null/alternative hypotheses
- Write your data analysis plan; specify specific statistics to address the research questions, the assumptions of the statistics, and justify why they are the appropriate statistics; provide references
- Justify your sample size/power analysis, provide references
- Explain your data analysis plan to you so you are comfortable and confident
- Two hours of additional support with your statistician
Quantitative Results Section (Descriptive Statistics, Bivariate and Multivariate Analyses, Structural Equation Modeling, Path analysis, HLM, Cluster Analysis)
- Clean and code dataset
- Conduct descriptive statistics (i.e., mean, standard deviation, frequency and percent, as appropriate)
- Conduct analyses to examine each of your research questions
- Write-up results
- Provide APA 6th edition tables and figures
- Explain chapter 4 findings
- Ongoing support for entire results chapter statistics
*Please call 877-437-8622 to request a quote based on the specifics of your research, or email Info@StatisticsSolutions.com.





