Conduct and Interpret an Independent Sample T-Test

What is the Independent Sample T-Test?

The independent sample t-test is a member of the t-test family, which consists of tests that compare mean value(s) of continuous-level(interval or ratio data), normally distributed data.  The independent sample t-test compares two means.  It assumes a model where the variables in the analysis are split into independent and dependent variables.  The model assumes that a difference in the mean score of the dependent variable is found because of the influence of the independent variable.  Thus, the independent sample t-test is an analysis of dependence.  It is one of the most widely used statistical tests, and is sometimes erroneously called the independent variable t-test.

The t-test family is based on the t-distribution, because the difference of mean score for two multivariate normal variables approximates the t-distribution.  The t-distribution and also the t-test is sometimes also called Student’s t.  Student is the pseudonym used by W.  S.  Gosset in 1908 to publish the t-distribution based on his empirical findings on the height and the length of the left middle finger of criminals in a local prison.

Within the t-test family, the independent samples t-test compares the mean scores of two groups in a given variable, that is, two mean scores of the same variable, whereby one mean represents the average of that characteristic for one group and the other mean represents the average of that specific characteristic in the other group.  Generally speaking, the independent samples t-test compares one measured characteristic between two groups of observations or measurements.  It tells us whether the difference we see between the two independent samples is a true difference or whether it is just a random effect (statistical artifact) caused by skewed sampling.

The independent samples t-test is also called unpaired t-test.  It is the t-test to use when two separate independent and identically distributed variables are measured.  Independent samples are easiest obtained when selecting the participants by random sampling.

The independent samples t-test is similar to the dependent sample t-test, which compares the mean score of paired observations these are typically obtained when either re-testing or conducting repeated measurements, or when grouping similar participants in a treatment-control study to account for differences in baseline.  However the pairing information needs to be present in the sample and therefore a paired sample can always be analyzed with an independent samples t-test but not the other way around.

Examples of typical questions that the independent samples t-test answers are as follows:

• Medicine – Has the quality of life improved for patients who took drug A as opposed to patients who took drug B?
• Sociology – Are men more satisfied with their jobs than women? Do they earn more?
• Biology – Are foxes in one specific habitat larger than in another?
• Economics – Is the economic growth of developing nations larger than the economic growth of the first world?
• Marketing: Does customer segment A spend more on groceries than customer segment B?

The Independent Sample T-Test in SPSS

The independent samples t-test, or Student’s t-test, is the most popular test to test for the difference in means.  It requires that both samples are independently collected, and tests the null hypothesis that both samples are from the same population and therefore do not differ in their mean scores.

Our research question for the independent sample t-test is as follows:

Does the standardized test score for math, reading, and writing differ between students who failed and students who passed the final exam?

Let’s start by verifying the assumptions of the t-test to check whether we made the right choices in our decision tree.  First, we are going to create some descriptive statistics to get an impression of the distribution.  In order to do this, we open the Frequencies menu in Analyze/Descriptive Statistics/Frequencies…

Next we add the two groups to the list of variables.  For the moment our two groups are stored in the variable A and B.  We deselect the frequency tables but add distribution parameters and the histograms with normal distribution curve to the output.

The histograms show quite nicely that the variables approximate a normal distribution and also their distributional difference.  We could continue with verifying this ‘eyeball’ test with a K-S test, however because our sample is larger than 30, we will skip this step.

The independent samples t-test is found in Analyze/Compare Means/Independent Samples T-Test.

In the dialog box of the independent samples t-test we select the variable with our standardized test scores as the three test variables and the grouping variable is the outcome of the final exam (pass = 1 vs.  fail = 0).  The independent samples t-test can only compare two groups (if your independent variable defines more than two groups, you either would need to run multiple t-tests or an ANOVA with post hoc tests).  The groups need to be defined upfront, for that you need to click on the button Define Groups… and enter the values of the independent variable that characterize the groups.

The dialog box Options… allows us to define how missing cases shall be managed (either exclude them listwise or analysis by analysis).  We can also define the width of the confidence interval that is used to test the difference of the mean scores in this independent samples t-test.