Probability
| Value | Meaning |
|---|---|
| 0 | Impossible — will never happen |
| 0.5 | Equal chance of happening or not |
| 1 | Certain — will always happen |
Example: Rolling a die, flipping a coin, drawing a card.
Coin flip: S = {Heads, Tails} Rolling a die: S = {1, 2, 3, 4, 5, 6}
Event A = rolling an even number = {2, 4, 6}
Rolling a 3 is one outcome.
| Type | Definition | Example |
|---|---|---|
| Simple event | Single outcome | Rolling a 2 |
| Compound event | Multiple outcomes combined | Rolling an odd number |
| Complementary event | Everything not in the event | Not rolling a 2 |
| Mutually exclusive | Can't both happen at once | Rolling a 2 AND a 5 |
| Independent events | One doesn't affect the other | Two separate coin flips |
If P(rain) = 0.3, then P(no rain) = 0.7
| Approach | How it works | Example |
|---|---|---|
| Classical | Count equally likely outcomes | Rolling dice, card games |
| Empirical (Frequentist) | Based on observed data/experiments | 60 out of 100 patients recovered → P = 0.6 |
| Subjective | Based on personal judgment/belief | "I think there's a 70% chance it rains" |
How to find it
| Situation | Method to Use |
|---|---|
| Fair game, equal chances | Classical formula |
| Have real data/observations | Empirical formula |
| Easier to find the opposite | Complement rule |
| "Or" / at least one of two events | Addition rule |
| "And" / both events happening | Multiplication rule |
What if no of sample space and possible outcomes of events are same like in rolling a dice
| Event | Favorable Outcomes | P(E) | Meaning |
|---|---|---|---|
| Rolling a number less than 7 | {1,2,3,4,5,6} = 6 | 6/6 = 1 | Certain ✅ |
| Rolling a number greater than 0 | {1,2,3,4,5,6} = 6 | 6/6 = 1 | Certain ✅ |
| Rolling a number between 0 and 7 | {1,2,3,4,5,6} = 6 | 6/6 = 1 | Certain ✅ |
| Condition | Formula | Result | Meaning |
|---|---|---|---|
| Favorable = Total | P = n/n | 1 | Certain (always happens) |
| Favorable = 0 | P = 0/n | 0 | Impossible (never happens) |
| Favorable < Total | P = x/n | Between 0 and 1 | Possible but not guaranteed |
If every outcome in the sample space satisfies your event, then no matter what happens, your event occurs — so probability = 1.
Yes
Branches of statistics
STATISTICS
├── Descriptive Statistics
└── Inferential Statistics
├── Estimation
└── Hypothesis Testing
Describes and summarizes data in a meaningful way — no conclusions beyond the data itself.
| Type | Examples |
|---|---|
| Measures of Central Tendency | Mean, Median, Mode |
| Measures of Dispersion | Range, Variance, Standard Deviation |
| Graphs & Charts | Bar chart, Histogram, Pie chart, Box plot |
| Frequency Distributions | Tables showing how often values occur |
A teacher finds the average score of 30 students on a test = 72/100 That's descriptive — it summarizes the class performance.
Uses a sample to make conclusions (inferences) about a larger population.
| Type | Description | Example |
|---|---|---|
| Point Estimate | Single value used to estimate a parameter | Sample mean = 72, so population mean ≈ 72 |
| Interval Estimate | Range of values (Confidence Interval) | Mean is between 68 and 76 with 95% confidence |
| Term | Meaning |
|---|---|
| Null Hypothesis (H₀) | The default claim (nothing changed/no effect) |
| Alternative Hypothesis (H₁) | What you want to prove |
| p-value | Probability of getting results by chance |
| Significance level (α) | Threshold to reject H₀ (usually 0.05) |
"Is a new drug more effective than the old one?" You test data from a sample to accept or reject this claim.
| Branch | Focus |
|---|---|
| Applied Statistics | Using stats in real-world fields (medicine, business, sports) |
| Mathematical Statistics | Theoretical foundations, proofs, and formulas |
| Bayesian Statistics | Updating probability with new evidence |
| Biostatistics | Statistics applied to biology and medicine |
| Business Statistics | Decision-making in economics and business |
| Computational Statistics | Using computers/software to analyze large data |
| Feature | Descriptive | Inferential |
|---|---|---|
| Purpose | Summarize data | Draw conclusions |
| Data used | Entire dataset | Sample only |
| Goes beyond data? | No | Yes |
| Example | Average exam score | Predicting national exam scores from a sample |
| Tools | Mean, charts, tables | t-tests, confidence intervals, regression |
Imagine you taste a spoon of soup to check if it's salty:
- Descriptive = "This spoon is salty" (describes what you observed)
- Inferential = "The whole pot must be salty" (conclusion about the bigger picture)
Statistics vs parameters
| Parameter | Statistic | |
|---|---|---|
| Comes from | Entire population | A sample |
| Known? | Usually unknown | Usually known |
| Changes? | Fixed (doesn't change) | Varies sample to sample |
| Purpose | What we want to know | What we use to estimate it |
| Symbols | Greek letters (μ, σ, π) | Latin letters (x̄, s, p̂) |
A fixed numerical value that describes a characteristic of an entire population.
A numerical value calculated from a sample that is used to estimate a parameter.
| Measure | Parameter (Population) | Statistic (Sample) |
|---|---|---|
| Mean | μ (mu) | x̄ (x-bar) |
| Standard Deviation | σ (sigma) | s |
| Variance | σ² | s² |
| Proportion | p (or π) | p̂ (p-hat) |
| Size | N | n |
| Correlation | ρ (rho) | r |
A country has 5 million students who took a national exam.
| Value | Type | |
|---|---|---|
| Average score of ALL 5 million students | μ = 68 | Parameter |
| Average score of a sample of 500 students | x̄ = 71 | Statistic |
A city has 2 million voters.
| Value | Type | |
|---|---|---|
| % of ALL voters who support Candidate A | p = 54% | Parameter |
| % from a survey of 1,000 voters | p̂ = 57% | Statistic |
A factory produces 100,000 bulbs per day.
| Value | Type | |
|---|---|---|
| Actual % of defective bulbs out of all 100,000 | p = 2% | Parameter |
| % defective in a tested batch of 200 bulbs | p̂ = 3% | Statistic |
POPULATION (everyone/everything)
│
│ ← Parameter describes this whole group
│
[Sample] (a small selected group)
│
│ ← Statistic describes this sample
│
▼
Used to ESTIMATE the Parameter
We can't always measure the whole population, so we take a sample, calculate a statistic, and use it to estimate the parameter.
| Letter | Stands for |
|---|---|
| Parameter | Population |
| Statistic | Sample |
P goes with P, S goes with S — that's all you need to remember!