Probability

Asking for Preferences

Probability: Basic Concepts & Definitions

What Is Probability?

Probability is a measure of how likely an event is to occur, expressed as a number between 0 and 1 (or 0% to 100%).

Value	Meaning
0	Impossible — will never happen
0.5	Equal chance of happening or not
1	Certain — will always happen

Key Vocabulary

1. Experiment

Any process with an uncertain outcome.

Example: Rolling a die, flipping a coin, drawing a card.

2. Sample Space (S)

The set of all possible outcomes.

Coin flip: S = {Heads, Tails} Rolling a die: S = {1, 2, 3, 4, 5, 6}

3. Event (E)

A specific outcome or group of outcomes you care about — a subset of the sample space.

Event A = rolling an even number = {2, 4, 6}

4. Outcome

A single possible result from an experiment.

Rolling a 3 is one outcome.

The Probability Formula (Classical)

When all outcomes are equally likely:

$$P(E) = \frac{\text{Number of favorable outcomes}}{\text{Total number of outcomes}}$$

Example: Probability of rolling a 4 on a fair die: $$P(4) = \frac{1}{6} \approx 0.167$$

Example: Probability of drawing a red card from a standard deck: $$P(\text{red}) = \frac{26}{52} = \frac{1}{2} = 0.5$$

Types of Events

Type	Definition	Example
Simple event	Single outcome	Rolling a 2
Compound event	Multiple outcomes combined	Rolling an odd number
Complementary event	Everything not in the event	Not rolling a 2
Mutually exclusive	Can't both happen at once	Rolling a 2 AND a 5
Independent events	One doesn't affect the other	Two separate coin flips

Complement Rule

The probability of an event not happening:

$$P(\text{not } E) = 1 - P(E)$$

If P(rain) = 0.3, then P(no rain) = 0.7

Three Approaches to Probability

Approach	How it works	Example
Classical	Count equally likely outcomes	Rolling dice, card games
Empirical (Frequentist)	Based on observed data/experiments	60 out of 100 patients recovered → P = 0.6
Subjective	Based on personal judgment/belief	"I think there's a 70% chance it rains"

Quick Summary

Probability lives between 0 and 1.
The sample space lists every possible outcome.
An event is any outcome (or group) you're measuring.
Use P(E) = favorable / total when outcomes are equally likely.
The complement of any event E has probability 1 − P(E).

Want to go deeper into any of these — like compound probability, Venn diagrams, or conditional probability?

How to find it

How to Find Probability

There are different methods depending on the situation:

Method 1: Classical Formula (Equally Likely Outcomes)

Use this when outcomes are fair (coins, dice, cards).

$$P(E) = \frac{\text{Number of favorable outcomes}}{\text{Total number of possible outcomes}}$$

Steps:

Count the total outcomes in the sample space
Count how many outcomes match your event
Divide

Example: What is the probability of drawing a King from a deck of cards?

Favorable outcomes: 4 Kings
Total outcomes: 52 cards
P(King) = 4/52 = 1/13 ≈ 0.077

Method 2: Empirical Formula (Based on Data/Experiments)

Use this when you have real data or past observations.

$$P(E) = \frac{\text{Number of times event occurred}}{\text{Total number of trials}}$$

Example: A basketball player made 35 out of 50 free throws.

P(making a shot) = 35/50 = 0.70 = 70%

Method 3: Complement (Easier Reverse Approach)

Use this when it's easier to find what you DON'T want.

$$P(E) = 1 - P(\text{not } E)$$

Example: What's the probability of rolling at least one 6 when rolling two dice?

P(no 6 on one die) = 5/6
P(no 6 on either die) = 5/6 × 5/6 = 25/36
P(at least one 6) = 1 − 25/36 = 11/36 ≈ 0.306

Method 4: Addition Rule (Either/Or Events)

For mutually exclusive events (can't happen at the same time): $$P(A \text{ or } B) = P(A) + P(B)$$

For non-mutually exclusive events (can overlap): $$P(A \text{ or } B) = P(A) + P(B) - P(A \text{ and } B)$$

Example (exclusive): P(rolling a 2 or a 5) = 1/6 + 1/6 = 2/6 = 1/3

Example (non-exclusive): P(drawing a King or a Heart):

P(King) = 4/52, P(Heart) = 13/52, P(King of Hearts) = 1/52
P = 4/52 + 13/52 − 1/52 = 16/52 ≈ 0.308

Method 5: Multiplication Rule (Both Events Together)

For independent events (one doesn't affect the other): $$P(A \text{ and } B) = P(A) \times P(B)$$

For dependent events (one affects the other): $$P(A \text{ and } B) = P(A) \times P(B | A)$$

Example (independent): P(flipping Heads twice) = 1/2 × 1/2 = 1/4

Example (dependent): Drawing 2 Aces in a row (no replacement):

P(1st Ace) = 4/52, P(2nd Ace | 1st was Ace) = 3/51
P = 4/52 × 3/51 = 12/2652 ≈ 0.0045

Quick Decision Guide

Situation	Method to Use
Fair game, equal chances	Classical formula
Have real data/observations	Empirical formula
Easier to find the opposite	Complement rule
"Or" / at least one of two events	Addition rule
"And" / both events happening	Multiplication rule

Want to practice with some example problems, or dive deeper into any one method?

What if no of sample space and possible outcomes of events are same like in rolling a dice

When Favorable Outcomes = Total Outcomes

If the number of favorable outcomes equals the total number of outcomes, the probability is:

$$P(E) = \frac{6}{6} = 1$$

This means the event is CERTAIN — it will always happen.

Dice Example

Rolling a standard die: S = {1, 2, 3, 4, 5, 6} → 6 total outcomes

Event	Favorable Outcomes	P(E)	Meaning
Rolling a number less than 7	{1,2,3,4,5,6} = 6	6/6 = 1	Certain ✅
Rolling a number greater than 0	{1,2,3,4,5,6} = 6	6/6 = 1	Certain ✅
Rolling a number between 0 and 7	{1,2,3,4,5,6} = 6	6/6 = 1	Certain ✅

The 3 Special Cases to Know

Condition	Formula	Result	Meaning
Favorable = Total	P = n/n	1	Certain (always happens)
Favorable = 0	P = 0/n	0	Impossible (never happens)
Favorable < Total	P = x/n	Between 0 and 1	Possible but not guaranteed

Simple Way to Think About It

If every outcome in the sample space satisfies your event, then no matter what happens, your event occurs — so probability = 1.

Example: "What is the probability of rolling a number from 1 to 6 on a standard die?"

Every possible result is between 1 and 6
So it's guaranteed → P = 1

Key Rule to Remember

$$0 \leq P(E) \leq 1 \quad \text{(always)}$$

P = 0 → Impossible
P = 1 → Certain
P = 0.5 → Equal chance
Anything between → Some level of likelihood

So if you ever set up a problem where all outcomes match your event, don't be confused — it just means the event is certain, and that's a valid and correct answer!

Want to try some practice problems to test this?

Yes

Probability Practice Problems

Try solving these on your own first, then check the answers below!

🎲 Basic Level

Q1. A bag contains 3 red balls, 2 blue balls, and 5 green balls. What is the probability of picking a red ball?

Q2. A fair coin is flipped. What is the probability of getting Tails?

Q3. A die is rolled. What is the probability of rolling a number greater than 6?

🎯 Medium Level

Q4. A die is rolled. What is the probability of rolling an even number?

Q5. A card is drawn from a standard 52-card deck. What is the probability of drawing a Queen or a Club?

Q6. A class has 12 boys and 8 girls. One student is chosen randomly. What is the probability of NOT choosing a boy?

🔥 Challenge Level

Q7. A coin is flipped twice. What is the probability of getting at least one Head?

Q8. Two dice are rolled. What is the probability that the sum equals 7?

✅ Answers

(Scroll down slowly — try first!)

Answer 1

Total balls = 3 + 2 + 5 = 10
Favorable (red) = 3
P = 3/10 = 0.3

Answer 2

Total outcomes = {H, T} = 2
Favorable = {T} = 1
P = 1/2 = 0.5

Answer 3

Total outcomes = {1,2,3,4,5,6} = 6
Favorable (greater than 6) = none = 0
P = 0/6 = 0 → Impossible ❌

Answer 4

Total outcomes = 6
Favorable (even) = {2, 4, 6} = 3
P = 3/6 = 1/2 = 0.5

Answer 5 (Addition Rule)

P(Queen) = 4/52
P(Club) = 13/52
P(Queen of Clubs) = 1/52 (overlap)
P = 4/52 + 13/52 − 1/52 = 16/52 = 4/13 ≈ 0.308

Answer 6 (Complement Rule)

Total students = 12 + 8 = 20
P(boy) = 12/20
P(not boy) = 1 − 12/20 = 8/20 = 2/5 = 0.4

Answer 7 (Complement Rule)

P(no heads at all) = P(TT) = 1/2 × 1/2 = 1/4
P(at least one Head) = 1 − 1/4 = 3/4 = 0.75

Answer 8

Total outcomes when rolling 2 dice = 6 × 6 = 36
Combinations that sum to 7: (1,6)(2,5)(3,4)(4,3)(5,2)(6,1) = 6
P = 6/36 = 1/6 ≈ 0.167

How did you do? Want more practice, harder questions, or explanations on any specific answer?

Branches of statistics

Branches of Statistics

Statistics is broadly divided into 2 main branches, with several sub-branches under each.

🌐 Overview

STATISTICS
├── Descriptive Statistics
└── Inferential Statistics
        ├── Estimation
        └── Hypothesis Testing

1. Descriptive Statistics

Describes and summarizes data in a meaningful way — no conclusions beyond the data itself.

What it does:

Organizes raw data
Presents data visually or numerically
Answers: "What does the data look like?"

Tools Used:

Type	Examples
Measures of Central Tendency	Mean, Median, Mode
Measures of Dispersion	Range, Variance, Standard Deviation
Graphs & Charts	Bar chart, Histogram, Pie chart, Box plot
Frequency Distributions	Tables showing how often values occur

Example:

A teacher finds the average score of 30 students on a test = 72/100 That's descriptive — it summarizes the class performance.

2. Inferential Statistics

Uses a sample to make conclusions (inferences) about a larger population.

What it does:

Draws conclusions beyond the data
Makes predictions and generalizations
Answers: "What can we conclude about the population?"

Two Main Sub-branches:

2a. Estimation

Estimating population values from sample data.

Type	Description	Example
Point Estimate	Single value used to estimate a parameter	Sample mean = 72, so population mean ≈ 72
Interval Estimate	Range of values (Confidence Interval)	Mean is between 68 and 76 with 95% confidence

2b. Hypothesis Testing

Testing a claim or assumption about a population.

Term	Meaning
Null Hypothesis (H₀)	The default claim (nothing changed/no effect)
Alternative Hypothesis (H₁)	What you want to prove
p-value	Probability of getting results by chance
Significance level (α)	Threshold to reject H₀ (usually 0.05)

Example:

"Is a new drug more effective than the old one?" You test data from a sample to accept or reject this claim.

Other Notable Branches

Branch	Focus
Applied Statistics	Using stats in real-world fields (medicine, business, sports)
Mathematical Statistics	Theoretical foundations, proofs, and formulas
Bayesian Statistics	Updating probability with new evidence
Biostatistics	Statistics applied to biology and medicine
Business Statistics	Decision-making in economics and business
Computational Statistics	Using computers/software to analyze large data

Key Difference: Descriptive vs Inferential

Feature	Descriptive	Inferential
Purpose	Summarize data	Draw conclusions
Data used	Entire dataset	Sample only
Goes beyond data?	No	Yes
Example	Average exam score	Predicting national exam scores from a sample
Tools	Mean, charts, tables	t-tests, confidence intervals, regression

Simple Analogy

Imagine you taste a spoon of soup to check if it's salty:

Descriptive = "This spoon is salty" (describes what you observed)

Inferential = "The whole pot must be salty" (conclusion about the bigger picture)

Want to go deeper into any branch — like how to calculate mean/median/mode, or how hypothesis testing works step by step?

Statistics vs parameters

Statistics vs Parameters

This is one of the most fundamental distinctions in statistics.

Core Difference

	Parameter	Statistic
Comes from	Entire population	A sample
Known?	Usually unknown	Usually known
Changes?	Fixed (doesn't change)	Varies sample to sample
Purpose	What we want to know	What we use to estimate it
Symbols	Greek letters (μ, σ, π)	Latin letters (x̄, s, p̂)

Definitions

📊 Parameter

A fixed numerical value that describes a characteristic of an entire population.

It's the "truth" about the whole group
Usually impossible to know exactly (too large to measure everyone)
Represented by Greek letters

📋 Statistic

A numerical value calculated from a sample that is used to estimate a parameter.

It's what we can actually measure
Changes depending on which sample you pick
Represented by Latin/Roman letters

Symbol Table

Measure	Parameter (Population)	Statistic (Sample)
Mean	μ (mu)	x̄ (x-bar)
Standard Deviation	σ (sigma)	s
Variance	σ²	s²
Proportion	p (or π)	p̂ (p-hat)
Size	N	n
Correlation	ρ (rho)	r

Real-World Examples

Example 1 — Student Exam Scores

A country has 5 million students who took a national exam.

	Value	Type
Average score of ALL 5 million students	μ = 68	Parameter
Average score of a sample of 500 students	x̄ = 71	Statistic

Example 2 — Election Poll

A city has 2 million voters.

	Value	Type
% of ALL voters who support Candidate A	p = 54%	Parameter
% from a survey of 1,000 voters	p̂ = 57%	Statistic

Example 3 — Factory Quality Control

A factory produces 100,000 bulbs per day.

	Value	Type
Actual % of defective bulbs out of all 100,000	p = 2%	Parameter
% defective in a tested batch of 200 bulbs	p̂ = 3%	Statistic

Visual Analogy

POPULATION  (everyone/everything)
     │
     │  ← Parameter describes this whole group
     │
  [Sample]  (a small selected group)
     │
     │  ← Statistic describes this sample
     │
     ▼
  Used to ESTIMATE the Parameter

Key Relationship

$$\text{Statistic} \xrightarrow{\text{estimates}} \text{Parameter}$$

We can't always measure the whole population, so we take a sample, calculate a statistic, and use it to estimate the parameter.

Simple Memory Trick

Letter	Stands for
Parameter	Population
Statistic	Sample

P goes with P, S goes with S — that's all you need to remember!

Want to move on to how we use statistics to estimate parameters (point estimates & confidence intervals)?

This is a shared conversation. Sign in to Orris to start your own chat.