The Chi-Square Test uses the chi-square distribution of one or more sets of data, to test whether there is a significant difference between observed frequencies and expected frequencies.
The chi-square distribution is given by the formula:
where,
Aij | = | actual frequency in the i'th row & j'th column; |
Eij | = | expected frequency in the i'th row & j'th column; |
r | = | number of rows; |
c | = | number of columns. |
The chi-square test is then used to determine whether the value of this function is likely to have occurred by chance alone, in independent sets of data.
In Excel 2010, the Chitest function has been replaced by the Chisq.Test function, which has improved accuracy.
Although it has been replaced, the Chitest function is still available in Excel 2010 (stored in the list of compatibility functions), to allow compatibility with earlier versions of Excel.
However, the Chitest function may not be available in future versions of Excel, so it is advised that you use the Chisq.Test function if possible.
The Excel CHITEST function uses the chi-square test to calculate the probability that the differences between two supplied data sets (of observed and expected frequencies), are likely to be simply due to sampling error, or if they are likely to be real.
The syntax of the function is:
Where the function arguments are:
actual_range | - | An array of observed frequencies. |
expected_range | - | An array of expected frequencies (must have the same dimension as the actual_range array). |
You should bear in mind that the chi-square test is not reliable when the expected values are too small. As a guideline, if any of the expected values are less than 5, or if the total of the expected values is less than 50, the result of the chi-square test should not be relied upon.
Cells B3-C5 and F3-G5 of the spreadsheet below show the observed and expected frequencies of responses from men and women to a simple question.
The chi-square test for independence, for the above data sets, is calculated using the Excel Chitest function as follows:
This gives the result 0.000699103.
Generally, a probability of 0.05 or less is considered to be significant. Therefore, the returned value of 0.000699103 is highly significant.
For further examples of the Excel Chitest function, see the Microsoft Office website.
If you get an error from the Excel Chitest function this is likely to be one of the following:
#N/A | - |
Occurs if either:
|
#DIV/0! | - | Occurs if any of the values in the supplied expected_range is zero. |
#NUM! | - | Occurs if any of the values in the supplied expected_range is negative. |