MTH-361A | Spring 2026 | University of Portland
Geometric R.V. (Revisited)
\(\star\) In a time-based interpretation, if events occur in discrete time steps, the geometric r.v. represents the number of time steps required until an event of interest happens.
A professor allows students to take a short assessment quiz, and if they do not pass, they can revise their answers and retake the quiz in the next session. The probability that a student passes on any given attempt is \(p=0.40\), and attempts continue until the student passes.
Let \(X\) be the number of “fail” attempts before the student gets a “pass”.
Information Given:
\(\star\) On average, the student would accumulate \(1.5\) “fail” attempts before they get a “pass”. The probability of a “pass” within at most \(1\) “fail” attempt is \(P(X \le 1) = 0.64\).
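These values follow from the geometric mean and CDF with \(p = 0.40\) (a quick check, using the convention that \(X\) counts “fail” attempts before the first “pass”):

\[
\begin{aligned}
E[X] & = \frac{1-p}{p} = \frac{0.60}{0.40} = 1.5 \\
P(X \le 1) & = 1 - (1-p)^{2} = 1 - (0.60)^{2} = 0.64
\end{aligned}
\]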
Computing Probabilities:
Using R:
## [1] 0.64
\(\star\) Note that the R function pgeom is the Geometric CDF.
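The output above can be reproduced with a call like the following (a sketch; the original code chunk is not echoed in the notes):

```r
# P(X <= 1): probability of a "pass" with at most 1 "fail" attempt,
# where X ~ Geom(p = 0.40) counts failures before the first success
pgeom(1, prob = 0.40)
## [1] 0.64
```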
Exponential R.V.
\(\star\) In a time-based interpretation, if events occur in continuous time, the exponential r.v. represents the length of time required until an event of interest happens.
The exponential r.v. is a continuous r.v. that models the time until an event occurs, given that events happen at a constant rate \(\lambda\) over time: \[X \sim \text{Exp}(\lambda)\]
Sample Space:
Rate Parameter
The exponential r.v. \(X \sim \text{Exp}(\lambda)\) has infinite possible outcomes (or infinite sized sample space) where \(\lambda > 0\) is the rate of “success” with PDF given as \[f(x) = \lambda e^{-\lambda x}, \quad x \ge 0\]
\(\star\) The exponential r.v. models the unit length until an event happens.
A class of students is taking a quiz, and the time it takes for students to finish the quiz follows an exponential r.v., assuming unlimited quiz time allocation. On average, a student takes 15 minutes to complete the quiz.
Let \(X\) represent the time to finish the quiz.
Information Given:
\(\star\) The average completion time is \(15\) minutes; since the mean of an exponential r.v. is \(1/\lambda\), the rate is \(\lambda = 1/15\) per minute.
Computing Probabilities:
Using R:
## [1] 0.6321206
\(\star\) Note that the R function pexp is the Exponential CDF.
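The output above is consistent with \(\lambda = 1/15\) and a call like the following (a sketch; the original code chunk is not echoed in the notes):

```r
# P(X <= 15): probability a student finishes within 15 minutes,
# with rate lambda = 1/15 per minute (mean = 1/lambda = 15 minutes)
pexp(15, rate = 1/15)
## [1] 0.6321206
```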
A Probability Density Function (PDF) \(f(x)\) describes the likelihood of a continuous r.v. taking a specific value from a continuous interval.
The probability of a single point is zero for continuous distributions: \[P(X = x) = 0, \ \text{for any } x\] because continuous distributions are defined over an infinite number of possible values, and the probability at a single point is infinitesimally small.
Instead, we calculate probabilities over intervals using integration: \[P(a \le X \le b) = \int_a^b f(x) \, dx\]
\(\star\) The probability that \(X\) falls between \(a\) and \(b\) is the area under the PDF curve over that interval.
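As an illustration (a sketch reusing the exponential quiz example above), numerically integrating the PDF over an interval agrees with the CDF:

```r
# P(0 <= X <= 15) for X ~ Exp(rate = 1/15), computed two ways:
# 1) integrate the PDF over the interval
integrate(dexp, lower = 0, upper = 15, rate = 1/15)$value
# 2) use the CDF directly
pexp(15, rate = 1/15)
# both give approximately 0.6321206
```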
Suppose we conduct an experiment of flipping \(n\) fair coins. The sample space \(S\) contains all possible outcomes, where the number of outcomes is \(2^n\).
Visualizing the possible outcomes using Pascal’s triangle
\(\star\) Pascal’s Triangle helps us visualize the total possible sequences of “success” (\(H\)) outcomes given \(n\) independent trials.
Let \(X\) be the r.v. that counts the number of \(H\) outcomes in \(n\) trials.
Pascal’s triangle helps us count:
\(\star\) The binomial coefficient tells you how many ways \(k\) “success” outcomes can occur in \(n\) trials, corresponding to the \(k\)-th column in the \(n\)-th row of Pascal’s Triangle.
Pascal’s Triangle and Combinations
The binomial coefficient \[\binom{n}{k} = \frac{n!}{k!(n-k)!}\] calculates the number of ways to choose \(k\) elements from a set of \(n\). Each number in Pascal’s Triangle corresponds to a combination.
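In R, binomial coefficients are computed with `choose()`; for instance, row \(n = 4\) of Pascal’s Triangle (a quick illustration, not from the notes):

```r
# Row n = 4 of Pascal's Triangle: C(4, k) for k = 0, ..., 4
choose(4, 0:4)
## [1] 1 4 6 4 1
```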
Suppose we want to compute the probability of observing a certain number of “success” (\(H\)) outcomes in \(n\) trials. Note that for a fair coin the probability of “success” is \(\frac{1}{2}\).
Pascal’s triangle helps us compute these probabilities:
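For a fair coin, each probability is the Pascal’s Triangle count divided by the \(2^n\) equally likely outcomes; this matches R’s `dbinom` (a sketch with \(n = 4\), illustrative only):

```r
# P(X = k) for X ~ Binom(n = 4, p = 0.5), k = 0, ..., 4
n <- 4
k <- 0:4
choose(n, k) / 2^n                # Pascal's Triangle counts over 2^n outcomes
dbinom(k, size = n, prob = 0.5)   # same values from the Binomial PMF
# both print [1] 0.0625 0.2500 0.3750 0.2500 0.0625
```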
To best illustrate this idea, here is a video.
\(\star\) The video explains how random events can produce a predictable pattern, specifically, how many random outcomes together form a normal “bell-curve” distribution.
\(\star\) The Binomial distribution is approximately the normal distribution given large enough samples because of the Central Limit Theorem.
A normal r.v. is a type of continuous r.v. whose probability distribution follows the normal distribution, also known as the Gaussian distribution. The normal distribution is characterized by two parameters, \(\mu\) as the mean and \(\sigma^2\) as the variance: \[X \sim \text{N}(\mu,\sigma^2)\]
Sample Space:
Parameters
The normal r.v. \(X \sim \text{N}(\mu,\sigma^2)\) has infinite possible outcomes (or infinite sized sample space) where \(\mu\) is the mean and \(\sigma^2\) is the variance with PDF given as \[f(x) = \frac{1}{\sigma \sqrt{2 \pi}} \exp{\left(-\frac{1}{2} \left( \frac{x - \mu}{\sigma} \right)^2\right)}, \ -\infty < x < \infty\] where the term \(\exp{(\cdot)}\) is the exponential function \(e^{(\cdot)}\). We write it this way because the exponent term is complicated.
\(\star\) The normal r.v. often approximates the distribution of many types of data, especially when there are large numbers of independent factors contributing to the outcome.
A standardized score is a measure of how many standard deviations a value is from the mean. It is computed using the z-score formula: \[z = \frac{x - \mu}{\sigma}.\]
Using the z-score transformation, the normal PDF reduces to the standard normal PDF, \[f(z) = \frac{1}{\sqrt{2 \pi}} \exp{\left(-\frac{z^2}{2}\right)}, \ -\infty < z < \infty.\]
\(\star\) The standardized score is used to compare two normal distributions with different means and variances.
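A quick check of the z-score idea in R (a sketch with illustrative numbers, not from the notes): standardizing and using the standard normal CDF gives the same probability as working on the original scale.

```r
# P(X <= 75) for X ~ N(mu = 70, sigma = 4.583), two equivalent ways
mu <- 70
sigma <- 4.583
pnorm(75, mean = mu, sd = sigma)   # original scale
pnorm((75 - mu) / sigma)           # standardized z-score, N(0, 1)
# both print the same probability
```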
When can we use it?
The binomial distribution \(X \sim \text{Binom}(n,p)\) can be approximated by a normal distribution when both expected counts are large enough, \(np \ge 10\) and \(n(1-p) \ge 10\):
Approximation Formula
\(\star\) The normal approximation simplifies binomial probability calculations for large \(n\).
A large introductory statistics course has a final exam consisting of many independent multiple-choice questions. Each question has a probability of “success” \(p = 0.70\) (a typical student answers correctly with probability \(0.7\)). The exam has \(n = 100\) questions.
Let \(X\) represent the number of correct answers on the exam.
Information Given:
The number of correct answers follows a Binomial distribution: \[X \sim \text{Binom}(n,p)\] where \(n = 100\) and \(p = 0.70\).
Mean and standard deviation:
\[ \begin{aligned} \mu & = np \\ & = 100(0.70) = 70 \\ \sigma & = \sqrt{np(1-p)} \\ & = \sqrt{100(0.70)(1-0.70)} \approx 4.583 \end{aligned} \]
Normal Approximation:
Since both \[ \begin{aligned} np & = 100(0.70) = 70 \\ n(1-p) & = 100(1-0.70) = 30 \end{aligned} \] are greater than 10, the Binomial distribution can be approximated by a Normal distribution: \[X \approx \text{N}(np,np(1-p)).\]
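To see how close the approximation is (a sketch, not from the notes), compare the exact binomial CDF with its normal approximation at one point:

```r
# P(X <= 75): exact Binomial vs. Normal approximation
n <- 100
p <- 0.70
pbinom(75, size = n, prob = p)                       # exact Binomial CDF
pnorm(75, mean = n * p, sd = sqrt(n * p * (1 - p)))  # Normal approximation
# the two values are close; a continuity correction would bring them closer
```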
The \(90\)th percentile of the standard normal distribution is \(\displaystyle z_{0.90} \approx 1.282\). That is, a value at the \(90\)th percentile lies \(1.282\) standard deviations above the mean.
Computing Percentiles:
\[ \begin{aligned} x & = \mu + z \sigma \\ x & = 70 + (1.282)(4.583) \\ x & \approx 75.88 \end{aligned} \]
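The same percentile can be obtained directly in R with `qnorm` (a sketch consistent with the numbers above):

```r
# 90th percentile of N(mu = 70, sigma^2 = 21): the exam score such that
# approximately 90% of students score at or below it
qnorm(0.90, mean = 70, sd = sqrt(100 * 0.70 * 0.30))
# approximately 75.87
```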