Send the link below via email or IMCopy
Present to your audienceStart remote presentation
- Invited audience members will follow you as you navigate and present
- People invited to a presentation do not need a Prezi account
- This link expires 10 minutes after you close the presentation
- A maximum of 30 users can follow your presentation
- Learn more about this feature in our knowledge base article
Chi-Squared for Dummies
Transcript of Chi-Squared for Dummies
of Chi-squared 1. Test for Goodness of Fit
determine if population distribution is different
from specific distribution
Conditions: expected counts must be >1 and no more than 20% of them must be <5.
2. Test for Homogeneity
comparison of observed and expected values
as name suggests, it is to see their similarities
in regards to a specific variable
3. Test for Association/Independence
to see if there is an association between 2 categorical variables
requires a Two-Way table Components of
Chi-Squared Test 1. Hypotheses
4. Finding chi-squared
5. Interpretation Hypotheses 1st Step in any test
Point of test is to prove or disprove null hypothesis
In the case that null hypothesis is disproved, alternative hypothesis is correct.
2. Alternative Matrices
Is in r x c form, which is row by column form. You do not count the sum row or column.
Consists of expected and observed data, all in separate matrices.
Usually requires you to make a sum row and column Z-scores Chi-squared is actually the square of the Z statistic.
Can only be used for comparison of two proportions.
The greater the Z-score, the higher the probability
Null Hypothesis: proportions are equal
Alternative Hypotheses: the proportions are not equal. Finding Chi-Squared Make sure there is a sum row and column
Find expected counts by multiplying row total and column total and then dividing by the number of observations.
Plug in observed and expected counts into this equation, or your calculator.
You're done. Null Hypotheses 1. Goodness of Fit
The observed population proportions are equal to that of the expected
The distribution of the response variable (y-value) is the same in all the populations.
There is no association between the 2 categorical variables. Alternative Hypotheses 1. Goodness of Fit
The observed population proportions differ from that of the expected
The distribution of the response variable (y-value) is not the same in all the populations.
There is an association between the 2 categorical variables. Interpretation Our Example Let's say you surveyed 200 people about their opinion on the nation's economic status. This consisted of 123 women and 77 men. Twenty-three women and 10 men had no opinion and 33 women and 15 men were satisfied with the economic status. Equation Our Example Suppose a student gets a 60 in a numerical reasoning test. The class scores for the exam are normally distributed. The mean score on the test is 50 and standard deviation is 12. How did this student perform relative to everyone else?
The z-score for the numerical test is (60-50)/12 = 0.83. This means 17% of students in his class did better than him. You would put this in your calculator... In [A] In [B] At this point, you need to find the degrees of freedom (a.k.a. df) which is basically (r-1)(c-1).
You find this value on the far left column of the Chi-square distribution critical values.
The top row gives p-values (which are probability values). Match your df with a p-value by finding the number in the table that is the closest to your chi-squared value. If you find it is evenly between two p-values, find the average of this. Compare p-value to significance level (alpha), which is commonly 0.05.
If p-value is less than alpha, you cannot accept null hypothesis. And then: