
MATH 4720/MSSC 5720 Introduction to Statistics
Sep 13, 2025

Sep 19, 2022




Probability : We know the process generating the data and are interested in properties of observations.
Statistics : We observed the data (sample) and are interested in determining what is the process generating the data (population).


Frequency Relative Frequency
Heads 1 0.1
Tails 9 0.9
Total 10 1.0
---------------------
Frequency Relative Frequency
Heads 515 0.515
Tails 485 0.485
Total 1000 1.000
---------------------
If we repeat tossing the coin 10 times, the probability of obtaining heads is 10%.
If 1000 times, the probability is 51.5%.
Any issue of relative frequency probability?

Classical probability : The probability is based on the concept of equally likely outcomes.
If the outcome of some process must be one of \(n\) different outcomes, the probability of each outcome is \(1/n\).
Any issue of classical probability?

Subjective probability : The probability is assigned or estimated using people’s knowledge, beliefs and information about the data generating process.
A person’s subjective probability of an outcome, rather than the true probability of that outcome.


Any probability operations and rules do NOT depend on interpretation of probability!
Experiment: any process in which the possible outcomes can be identified ahead of time.
Event: a set of possible outcomes of the experiment.
Sample space \((\mathcal{S})\) of an experiment: the collection of ALL possible outcomes of the experiment.
| Experiment | Possible Outcomes | Some Events | Sample Space |
|---|---|---|---|
| Flip a coin 🪙 | Heads, Tails | {Heads}, {Heads, Tails}, … | {Heads, Tails} |
| Roll a die 🎲 | 1, 2, 3, 4, 5, 6 | {1, 3, 5}, {2, 4, 6}, {2}, {3, 4, 5, 6}, … | {1, 2, 3, 4, 5, 6} |
Is the sample space also an event?
Draw a Venn Diagram every time you get stuck!

\(B \subset D\) or \(D \subset B\)?
Denote the probability of an event \(A\) on a sample space \(\mathcal{S}\) as \(P(A)\).
Treat the probability of an event as the area of the event in the Venn diagram.


The makers of the candy M&Ms report that their plain M&Ms are composed of

If you randomly select an M&M, what is the probability of the following?
02:00
If you randomly select an M&M, what is the probability of the following?
\(P(\mathrm{Brown}) = 0.15\)
\(\small \begin{align} P(\mathrm{Red} \cup \mathrm{Green}) &= P(\mathrm{Red}) + P(\mathrm{Green}) - P(\mathrm{Red} \cap \mathrm{Green}) \\ &= 0.10 + 0.15 - 0 = 0.25 \end{align}\)
\(P(\text{Not Blue}) = 1 - P(\text{Blue}) = 1 - 0.25 = 0.75\)
\(P(\text{Red and Brown}) = P(\emptyset) = 0\)
By the way, which interpretation of probability is used in this question?
\[ P(A \mid B) = \frac{P(A \cap B)}{P(B)} \] if \(P(B) > 0\), and it is undefined if \(P(B) = 0\).

Multiplication Rule: \(P(A \cap B) = P(A \mid B)P(B) = P(B \mid A)P(A)\)
\(P(A)\) and \(P(B)\) are unconditional or marginal probabilities.

Suppose 80% of people like peanut butter, 89% like jelly, and 78% like both.
Given that a randomly sampled person likes peanut butter, what’s the probability that she also likes jelly?

We want \(P(J\mid PB) = \frac{P(PB \cap J)}{P(PB)}\).
From the problem we have \(P(PB) = 0.8\), \(P(J) = 0.89\), \(P(PB \cap J) = 0.78\)
\(P(J\mid PB) = \frac{P(PB \cap J)}{P(PB)} = \frac{0.78}{0.8} = 0.975\).
If we don’t know if the person loves peanut butter, the probability that she loves jelly is 89%.
If we do know she loves peanut butter, the probability that she loves jelly is going up to 97.5%.
\(A\) and \(B\) are independent if \(\begin{align} P(A \mid B) &= P(A) \text{ or }\\ P(B \mid A) &= P(B) \text{ or } \\P(A\cap B) &= P(A)P(B)\end{align}\) \(\text{ if } P(A) > 0 \text{ and } P(B) > 0\)
Intuition: Knowing \(B\) occurs does not change the probability that \(A\) occurs, and vice versa.
Can we compute \(P(A \cap B)\) if we only know \(P(A)\) and \(P(B)\)?
No, we cannot compute \(P(A \cap B)\) since we do not know if \(A\) and \(B\) are independent.
We could only if \(A\) and \(B\) were independent.
In general, we need the multiplication rule \(P(A \cap B) = P(A \mid B)P(B)\).

02:00
Often, we know \(P(B \mid A)\) but are much more interested in \(P(A \mid B)\).
Example: diagnostic tests provide \(P(\text{positive test result} \mid \text{COVID})\), but we are interested in \(P(\text{COVID} \mid \text{positive test result})\)


\[\begin{align*} P(A \mid B) &= \frac{P(A \cap B)}{P(B)} \quad ( \text{def. of cond. prob.}) \\ &= \frac{P(A \cap B)}{P((B \cap A) \cup (B \cap A^c))} \quad ( \text{partition } B) \\ &= \frac{P(B \mid A)P(A)}{P(B \mid A)P(A) + P(B \mid A^c)P(A^c)} \quad ( \text{multiplication rule}) \end{align*}\]

After taking MATH 4720, \(80\%\) of students understand the Bayes’ formula.

Calculate the probability that a student understand the Bayes’ formula given the fact that she passed.
\(80\%\) of students understand the Bayes’ formula.
Of those who understand the Bayes’ formula, \(95\%\) passed ( \(5\%\) failed).
Of those who do not understand the formula, \(60\%\) passed ( \(40\%\) failed).
\[P(A \mid B) = \frac{P(B \mid A)P(A)}{P(B \mid A)P(A) + P(B \mid A^c)P(A^c)}\]
\(P(\text{understand} \mid \text{passed})\)
Let \(A =\) understand. \(B =\) passed. Then \(A^c =\) don’t understand and \(P(\text{understand} \mid \text{passed}) = P(A \mid B)\).
\(80\%\) of students understand the Bayes’ formula.
Of those who understand the Bayes’ formula, \(95\%\) passed ( \(5\%\) failed).
Of those who do not understand the formula, \(60\%\) passed ( \(40\%\) failed).
\[P(A \mid B) = \frac{P(B \mid A)P(A)}{P(B \mid A)P(A) + P(B \mid A^c)P(A^c)}\]
\(P(B \mid A) = P(\text{passed} \mid \text{understand}) = 0.95\), \(P(B \mid A^c) = P(\text{passed} \mid \text{don't understand}) = 0.6\)
\(P(A) = P(\text{understand}) = 0.8\), \(P(A^c) = 1 - P(A) = 0.2\).
\(P(\text{understand} \mid \text{passed}) = P(A \mid B) = \frac{P(B \mid A)P(A)}{P(B \mid A)P(A) + P(B \mid A^c)P(A^c)} = \frac{(0.95)(0.8)}{(0.95)(0.8) + (0.6)(0.2)} = 0.86\)
\(80\%\) of students understand the Bayes’ formula.
Of those who understand the Bayes’ formula, \(95\%\) passed ( \(5\%\) failed).
Of those who do not understand the formula, \(60\%\) passed ( \(40\%\) failed).

\[\begin{align*} & P(\text{yes} \mid \text{pass}) \\ &= \frac{P(\text{yes and } \text{pass})}{P(\text{pass})} \\ &= \frac{P(\text{yes and } \text{pass})}{P(\text{pass and yes}) + P(\text{pass and no})}\\ &= \frac{0.76}{0.76 + 0.12} = 0.86 \end{align*}\]