Files
uni/year2/semester1/logseq-stuff/pages/Probability.md

155 lines
7.7 KiB
Markdown

- #[[ST2001 - Statistics in Data Science I]]
- **Previous Topic:** [[Sampling]]
- **Next Topic:** [[Random Variables]]
- **Relevant Slides:** ![Topic 4 - Probability.pdf](../assets/Topic_4_-_Probability_1664204337770_0.pdf)
-
- Probability provides the *framework* for the study & application of statistics.
- # What are Probabilities?
- Take, for example, a 6-sided die about to be tossed for the first time.
- **Classical:** 6 possible outcomes, by symmetry, each equally likely to occur,
- **Frequentist:** Empirical evidence shows that similar dice thrown in the past have landed on each side about equally often.
- **Subjective:** The degree of individual belief in occurrence of an event can be influenced by classical or frequentist arguments.
- Subjective probabilities are also influenced by other reasons when symmetry arguments don't apply & repeated trials are not possible.
-
- # Probability
- The probability of an event $A$ is the number of (equally likely & disjoint) outcomes in the event divided by the total number of (equally likely & disjoint) possible outcomes.
- $$P(A) = \frac{\text{\# of outcomes in A}}{\text{\# of possible outcomes}}$$
- $$(0 \leq P(A) \leq 1)$$
- ## Sample Spaces
- What is a **sample space**? #card
card-last-interval:: -1
card-repeats:: 1
card-ease-factor:: 2.7
card-next-schedule:: 2022-11-15T00:00:00.000Z
card-last-reviewed:: 2022-11-14T16:42:50.065Z
card-last-score:: 1
- The set of all possible outcomes of a random experiment is called the **sample space**, $S$.
- $S$ is **discrete** if it consists of a finite or countably infinite set of outcomes.
- $S$ is **continuous** if it contains an interval of real numbers.
- $$P(S) = 1$$
- ## Events
- What is an **event**? #card
card-last-interval:: 4.14
card-repeats:: 2
card-ease-factor:: 2.56
card-next-schedule:: 2022-11-21T23:15:48.008Z
card-last-reviewed:: 2022-11-17T20:15:48.008Z
card-last-score:: 5
- An **event** is a specific collection of sample points / possible outcomes.
- An event is denoted by $E$ or by capital letters, $A$, $B$, etc.
- What is a **SImple Event**? #card
card-last-interval:: 3.57
card-repeats:: 2
card-ease-factor:: 2.46
card-next-schedule:: 2022-11-22T07:34:13.841Z
card-last-reviewed:: 2022-11-18T18:34:13.841Z
card-last-score:: 5
- A **Simple Event** is a collection of only **one** sample point / possible outcomes.
- What is a **Compound Event**? #card
card-last-interval:: 4.14
card-repeats:: 2
card-ease-factor:: 2.56
card-next-schedule:: 2022-11-21T23:15:54.761Z
card-last-reviewed:: 2022-11-17T20:15:54.762Z
card-last-score:: 5
- A **Compound Event** is a collection of **more than one** sample point / possible outcomes.
- ## Permuatations
- A **permutation** is an arrangement of objects.
- It can also be an arrangement of $r$ objects chosen from $n$ distinct objects where replacement in the selection is not allowed.
- The symbol, $P^n_r$ represents the number of permutations of $r$ objects selected from $n$ objects.
- The calculation is given by the formula:
- $$P^n_r = \frac{n!}{(n-r)!}$$
- ## Joint Events (and / or)
- Probabilities of **joint events** can often be determined from the probabilities of the individual events that comprise them.
- Joint events are generated by applying basic set operations to individual events, specifically:
- **Complement** of event $A$ is $\bar{A} =$ all outcomes *not* in $A$.
- **Union** of events $A \cup B$; $A$ **or** $B$ or both.
- **Intersection** of events $A$ **and** $B$ -> $A \cap B$.
- **Disjoint** events cannot occur together -> $A \cap B = \empty$.
-
- ## Probability of a Union ($A$ **or** $B$) #card
card-last-interval:: 21.53
card-repeats:: 4
card-ease-factor:: 2.32
card-next-schedule:: 2022-12-06T08:01:39.281Z
card-last-reviewed:: 2022-11-14T20:01:39.281Z
card-last-score:: 3
- For any two events $A$ and $B$, the probability of union is given by:
- $$P(A \cup B) = P(A) + P(B) - P(A \cap B)$$
- For two **disjoint** (also called **mutually exclusive**) events $A$ and $B$, the probability that one *or* the other occurs is the sum of the probabilities of the two events (provided that $A$ and $B$ are disjoint).
- $$P(A \cup B) = P(A) + P(B)$$
- If $P(A \cup B)$ is greater than 1, then you know you have made a mistake and that the events were not mutually exclusive -> there is an intersection.
- ## Intersections ($A$ **and** $B$)
card-last-interval:: 3.51
card-repeats:: 2
card-ease-factor:: 2.6
card-next-schedule:: 2022-10-08T00:26:58.336Z
card-last-reviewed:: 2022-10-04T12:26:58.337Z
card-last-score:: 5
- #### Multiplication Rule for Independent Events #card
card-last-interval:: 33.64
card-repeats:: 4
card-ease-factor:: 2.9
card-next-schedule:: 2022-12-18T11:07:08.232Z
card-last-reviewed:: 2022-11-14T20:07:08.232Z
card-last-score:: 5
- For two **independent** events $A$ and $B$, the probability that *both* $A$ **and** $B$ occur is the product of the probabilities of the two events.
- $$P(A \cap B) = P(A) \times P(B)$$
- If two events are **independent**, that means that one event has no impact on the probability of occurrence of the other event.
- ## Conditional Probability
card-last-score:: 1
card-repeats:: 1
card-next-schedule:: 2022-10-04T23:00:00.000Z
card-last-interval:: -1
card-ease-factor:: 2.5
card-last-reviewed:: 2022-10-04T12:31:44.517Z
- What is **conditional probability**? #card
card-last-interval:: -1
card-repeats:: 1
card-ease-factor:: 2.36
card-next-schedule:: 2022-11-15T00:00:00.000Z
card-last-reviewed:: 2022-11-14T16:30:38.038Z
card-last-score:: 1
- $P(B | A)$ is the probability of event $B$ occurring, given that event $A$ has already occurred.
- The **conditional probability** of $B$ given $A$, denoted by $P(B | A)$, is defined by: #card
card-last-interval:: 0.98
card-repeats:: 2
card-ease-factor:: 2.36
card-next-schedule:: 2022-11-15T19:08:28.312Z
card-last-reviewed:: 2022-11-14T20:08:28.312Z
card-last-score:: 3
- $$P(B|A) = \frac{P(A \cap B)}{P(A)} \text{, provided } P(A) > 0$$
- **Note:** $P(A)$ cannot equal 0, since we know that $A$ *has* occurred.
- ### General Multiplication Rule for Dependent Events #card
card-last-interval:: -1
card-repeats:: 1
card-ease-factor:: 2.6
card-next-schedule:: 2022-11-15T00:00:00.000Z
card-last-reviewed:: 2022-11-14T16:33:46.273Z
card-last-score:: 1
- The conditional probability can be rewritten to further generalise the multiplication rule:
- $$P(A \cap B) = P(A) \cdot P(B|A)$$
- $$P(B \cap A) = P(B)B \cdot P(B|A)$$
- $$\text{As } P(A \cap B) = P(B \cap A) \text{ implies}$$
- $$P(A) \cdot P(B | A) = P(B) \cdot P(A |B)$$
- These results mean that $P(A |B)$ can be calculated once we know $P(A)$, $P(B)$, and $P(B | A)$.
- #### Bayes' Theorem #card
card-last-interval:: -1
card-repeats:: 1
card-ease-factor:: 2.36
card-next-schedule:: 2022-11-22T00:00:00.000Z
card-last-reviewed:: 2022-11-21T13:07:04.337Z
card-last-score:: 1
- **Bayes' Theorem** states that:
- $$P(A | B) = \frac{P(B | A) \cdot P(A)}{P(B)} \text{ for } P(B)>0$$
- ## Independence
- Two events, $A$ and $B$ are independent, if and only if:
- $$P(A \cap B) = P(A)\cdot P(B)$$
- Therefore, to obtain the probability that two independent events will occur, we simply find the product of their individual probabilities.
- Two events $A$ and $B$ are independent, if and only if:
- $$P(B | A) = P(B) \text{ or } P(A|B) = P(A)$$
- assuming the existence of the conditional probabilities.
- Otherwise, $A$ and $B$ are **dependent**.
- If in an experiment, the events $A$ and $B$ can both occur, then:
- $$P(A \cap B) = P(A)P(B|A) \text{, provided } P(A) > 0$$
-