Question 1

What is the difference between a chi-square goodness-of-fit test and a chi-square test of independence?

Accepted Answer

A goodness-of-fit test compares observed frequencies in a single categorical variable against theoretically expected frequencies — for example, checking whether a die is fair. A test of independence uses a contingency table to determine whether two categorical variables are related — for example, whether gender and voting preference are associated. Both use the same χ² formula, but degrees of freedom differ: df = k − 1 for goodness-of-fit, and df = (r − 1)(c − 1) for independence, where r and c are the numbers of rows and columns.

Question 2

Why must expected frequencies be at least 5 in a chi-square test?

Accepted Answer

The chi-square distribution is a continuous approximation to the discrete distribution of the test statistic. When expected counts are very small (below 5), the approximation breaks down and the test produces inflated Type I error rates — meaning it rejects the null hypothesis too often. In such cases, Fisher's exact test is the preferred alternative for 2×2 tables, as it computes exact probabilities without relying on large-sample approximations. For larger tables with sparse cells, consider combining categories or collecting more data.

Question 3

How do I choose the correct degrees of freedom for a chi-square test?

Accepted Answer

Degrees of freedom control the shape of the chi-square distribution and therefore the critical value threshold. For a goodness-of-fit test with k categories, df = k − 1, because once k − 1 frequencies are known the last is determined by the total. For a contingency table test of independence with r rows and c columns, df = (r − 1)(c − 1). Using the wrong df will give you the wrong critical value, potentially leading to incorrect conclusions. Always verify the number of independent categories or cells before entering the degrees of freedom.

Chi-Square Test Calculator

About this calculator

How to use

Frequently asked questions

What is the difference between a chi-square goodness-of-fit test and a chi-square test of independence?

Why must expected frequencies be at least 5 in a chi-square test?

How do I choose the correct degrees of freedom for a chi-square test?