MTH-361A | Spring 2025 | University of Portland
March 19, 2025
The guiding principle of statistics is statistical thinking.
Statistical Thinking in the Data Science Life Cycle
Types of Inference
Parameter Estimation | Hypothesis Testing | |
---|---|---|
Goal | Estimate an unknown population value | Assess claims about a population value |
Methods | Point Estimation: A single value estimate (e.g., sample
mean) Interval Estimation: A range of plausible values (e.g., confidence interval) |
State a null and an alternative hypothesis Compute a test statistic and compare it to a threshold (p-value or critical value) |
Key Concept | Focuses on precision in estimation (confidence intervals) | Focuses on decision-making based on evidence (reject or fail to reject the null hypothesis) |
Types of Decision Errors
Reality/Decision | Reject \(H_0\) | Fail to reject \(H_0\) |
---|---|---|
\(H_0\) is true | Type I error with probability \(\alpha\) (significance level) |
Correct decision with probability \(1-\alpha\) (confidence level) |
\(H_0\) is false | Correct decision with probability \(1-\beta\) (power of test) |
Type II error with probability \(\beta\) |
Conditions
Confidence Intervals for One Proportion
\[\hat{p} \pm z^{\star} SE_{\hat{p}}\]
Conditions
Confidence Intervals for Two Proportions
\[\hat{p}_B - \hat{p}_A \pm z^{\star} SE_{\hat{p}_B - \hat{p}_A}\]
Let \(p\) be the population parameter and \(p_0\) be the null value.
State Hypotheses
\(\star\) Note: The alternative hypothesis can be \(\ne\) (two-sided) and \(<\) or \(>\) (one-sided) depending on context.
Set Significance Value \(\alpha\)
Common values are \(\alpha = 0.10, 0.05, 0.01\). Note that \(\alpha\) is the Type I error rate.
Compute the Test Statistic
\[z = \frac{\hat{p}-p_0}{SE_p}\]
Determine the P-Value
Make a Decision and conclusion
Let \(p_A\) and \(p_B\) be the population parameters for groups \(A\) and \(B\) respectively and \(p_0\) be the null value.
State Hypotheses
\(\star\) Note: The alternative hypothesis can be \(\ne\) (two-sided) and \(<\) or \(>\) (one-sided) depending on context.
Set Significance Value \(\alpha\)
Common values are \(\alpha = 0.10, 0.05, 0.01\). Note that \(\alpha\) is the Type I error rate.
Compute the Test Statistic
\[z = \frac{\left(\hat{p}_B-\hat{p}_A\right)-p_0}{SE_{p_B - p_A}}\]
Determine the P-Value
Make a Decision and conclusion
.pdf
file.Data collection was done on 3/19/2025
Number of Wins
Blind (\(A\)) | Non-Blind (\(B\)) | Difference (\(B\)-\(A\)) | |
---|---|---|---|
Demonstration | \(6\) | \(5\) | \(-1\) |
Group 1 | \(8\) | \(4\) | \(-4\) |
Group 2 | \(6\) | \(5\) | \(-1\) |
Group 3 | \(7\) | \(5\) | \(-2\) |
Group 4 | \(4\) | \(6\) | \(2\) |
Total | \(31\) | \(25\) | \(-6\) |
Data collection was done on 3/19/2025
Proportions
Each group played 15 rounds for blind and 15 rounds for non-blind.
Blind (\(A\)) | Non-Blind (\(B\)) | Difference (\(B\)-\(A\)) | |
---|---|---|---|
Demonstration | \(\hat{p}_A = \frac{6}{15} \approx 0.40\) | \(\hat{p}_B = \frac{5}{15} \approx 0.3333\) | \(\hat{p}_B - \hat{p}_A = \frac{-1}{15} \approx -0.0667\) |
Group 1 | \(\hat{p}_A = \frac{8}{15} \approx 0.5333\) | \(\hat{p}_B = \frac{4}{15} \approx 0.2667\) | \(\hat{p}_B - \hat{p}_A = \frac{-4}{15} \approx -0.2667\) |
Group 2 | \(\hat{p}_A = \frac{6}{15} = 0.40\) | \(\hat{p}_B = \frac{5}{15} \approx 0.3333\) | \(\hat{p}_B - \hat{p}_A = \frac{-1}{15} \approx -0.0667\) |
Group 3 | \(\hat{p}_A = \frac{7}{15} \approx 0.4667\) | \(\hat{p}_B = \frac{5}{15} \approx 0.3333\) | \(\hat{p}_B - \hat{p}_A = \frac{-2}{15} \approx -0.1333\) |
Group 4 | \(\hat{p}_A = \frac{4}{15} \approx 0.2667\) | \(\hat{p}_B = \frac{6}{15} = 0.40\) | \(\hat{p}_B - \hat{p}_A = \frac{2}{15} \approx 0.1333\) |
Overall | \(\hat{p}_A = \frac{31}{75} \approx 0.4133\) | \(\hat{p}_B = \frac{25}{75} \approx 0.3333\) | \(\hat{p}_B - \hat{p}_A = \frac{-6}{75} \approx -0.08\) |