Inference for Proportions

Applied Statistics

MTH-361A | Spring 2025 | University of Portland

March 19, 2025

Objectives

Previously… (1/3)

The guiding principle of statistics is statistical thinking.

Statistical Thinking in the Data Science Life Cycle

Statistical Thinking in the Data Science Life Cycle

Previously… (2/3)

Types of Inference

Parameter Estimation Hypothesis Testing
Goal Estimate an unknown population value Assess claims about a population value
Methods Point Estimation: A single value estimate (e.g., sample mean)
Interval Estimation: A range of plausible values (e.g., confidence interval)
State a null and an alternative hypothesis
Compute a test statistic and compare it to a threshold (p-value or critical value)
Key Concept Focuses on precision in estimation (confidence intervals) Focuses on decision-making based on evidence (reject or fail to reject the null hypothesis)

Previously… (3/3)

Types of Decision Errors

Reality/Decision Reject \(H_0\) Fail to reject \(H_0\)
\(H_0\) is true Type I error
with probability \(\alpha\)
(significance level)
Correct decision
with probability \(1-\alpha\)
(confidence level)
\(H_0\) is false Correct decision
with probability \(1-\beta\)
(power of test)
Type II error
with probability \(\beta\)

Parameter Estimation for One Proportion

Conditions

Confidence Intervals for One Proportion

\[\hat{p} \pm z^{\star} SE_{\hat{p}}\]

Parameter Estimation for Two Proportions

Conditions

Confidence Intervals for Two Proportions

\[\hat{p}_B - \hat{p}_A \pm z^{\star} SE_{\hat{p}_B - \hat{p}_A}\]

Hypothesis Testing for One Proportion (1/2)

Let \(p\) be the population parameter and \(p_0\) be the null value.

State Hypotheses

\(\star\) Note: The alternative hypothesis can be \(\ne\) (two-sided) and \(<\) or \(>\) (one-sided) depending on context.

Set Significance Value \(\alpha\)

Common values are \(\alpha = 0.10, 0.05, 0.01\). Note that \(\alpha\) is the Type I error rate.

Hypothesis Testing for One Proportion (2/2)

Compute the Test Statistic

\[z = \frac{\hat{p}-p_0}{SE_p}\]

Determine the P-Value

Make a Decision and conclusion

Hypothesis Testing for Two Proportions (1/2)

Let \(p_A\) and \(p_B\) be the population parameters for groups \(A\) and \(B\) respectively and \(p_0\) be the null value.

State Hypotheses

\(\star\) Note: The alternative hypothesis can be \(\ne\) (two-sided) and \(<\) or \(>\) (one-sided) depending on context.

Set Significance Value \(\alpha\)

Common values are \(\alpha = 0.10, 0.05, 0.01\). Note that \(\alpha\) is the Type I error rate.

Hypothesis Testing for Two Proportions (2/2)

Compute the Test Statistic

\[z = \frac{\left(\hat{p}_B-\hat{p}_A\right)-p_0}{SE_{p_B - p_A}}\]

Determine the P-Value

Make a Decision and conclusion

Activity: Rock-Paper-Scissors Hypothesis Testing

  1. Make sure you have a copy of the W 3/19 Worksheet. This will be handed out physically and it is also digitally available on Moodle.
  2. Work on your worksheet by yourself for 10 minutes. Please read the instructions carefully. Ask questions if anything need clarifications.
  3. Get together with another student.
  4. Discuss your results.
  5. Submit your worksheet on Moodle as a .pdf file.

RPS Wins Data

Data collection was done on 3/19/2025

Number of Wins

Blind (\(A\)) Non-Blind (\(B\)) Difference (\(B\)-\(A\))
Demonstration \(6\) \(5\) \(-1\)
Group 1 \(8\) \(4\) \(-4\)
Group 2 \(6\) \(5\) \(-1\)
Group 3 \(7\) \(5\) \(-2\)
Group 4 \(4\) \(6\) \(2\)
Total \(31\) \(25\) \(-6\)

RPS Proportion of Wins Data

Data collection was done on 3/19/2025

Proportions

Each group played 15 rounds for blind and 15 rounds for non-blind.

Blind (\(A\)) Non-Blind (\(B\)) Difference (\(B\)-\(A\))
Demonstration \(\hat{p}_A = \frac{6}{15} \approx 0.40\) \(\hat{p}_B = \frac{5}{15} \approx 0.3333\) \(\hat{p}_B - \hat{p}_A = \frac{-1}{15} \approx -0.0667\)
Group 1 \(\hat{p}_A = \frac{8}{15} \approx 0.5333\) \(\hat{p}_B = \frac{4}{15} \approx 0.2667\) \(\hat{p}_B - \hat{p}_A = \frac{-4}{15} \approx -0.2667\)
Group 2 \(\hat{p}_A = \frac{6}{15} = 0.40\) \(\hat{p}_B = \frac{5}{15} \approx 0.3333\) \(\hat{p}_B - \hat{p}_A = \frac{-1}{15} \approx -0.0667\)
Group 3 \(\hat{p}_A = \frac{7}{15} \approx 0.4667\) \(\hat{p}_B = \frac{5}{15} \approx 0.3333\) \(\hat{p}_B - \hat{p}_A = \frac{-2}{15} \approx -0.1333\)
Group 4 \(\hat{p}_A = \frac{4}{15} \approx 0.2667\) \(\hat{p}_B = \frac{6}{15} = 0.40\) \(\hat{p}_B - \hat{p}_A = \frac{2}{15} \approx 0.1333\)
Overall \(\hat{p}_A = \frac{31}{75} \approx 0.4133\) \(\hat{p}_B = \frac{25}{75} \approx 0.3333\) \(\hat{p}_B - \hat{p}_A = \frac{-6}{75} \approx -0.08\)

References

Diez, D. M., Barr, C. D., & Çetinkaya-Rundel, M. (2012). OpenIntro statistics (4th ed.). OpenIntro. https://www.openintro.org/book/os/
Speegle, Darrin and Clair, Bryan. (2021). Probability, statistics, and data: A fresh approach using r. Chapman; Hall/CRC. https://probstatsdata.com/