Probability

Asking for Preferences

Probability: Basic Concepts & Definitions


What Is Probability?

Probability is a measure of how likely an event is to occur, expressed as a number between 0 and 1 (or 0% to 100%).
ValueMeaning
0Impossible — will never happen
0.5Equal chance of happening or not
1Certain — will always happen

Key Vocabulary

1. Experiment

Any process with an uncertain outcome.
Example: Rolling a die, flipping a coin, drawing a card.

2. Sample Space (S)

The set of all possible outcomes.
Coin flip: S = {Heads, Tails} Rolling a die: S = {1, 2, 3, 4, 5, 6}

3. Event (E)

A specific outcome or group of outcomes you care about — a subset of the sample space.
Event A = rolling an even number = {2, 4, 6}

4. Outcome

A single possible result from an experiment.
Rolling a 3 is one outcome.

The Probability Formula (Classical)

When all outcomes are equally likely:
$$P(E) = \frac{\text{Number of favorable outcomes}}{\text{Total number of outcomes}}$$
Example: Probability of rolling a 4 on a fair die: $$P(4) = \frac{1}{6} \approx 0.167$$
Example: Probability of drawing a red card from a standard deck: $$P(\text{red}) = \frac{26}{52} = \frac{1}{2} = 0.5$$

Types of Events

TypeDefinitionExample
Simple eventSingle outcomeRolling a 2
Compound eventMultiple outcomes combinedRolling an odd number
Complementary eventEverything not in the eventNot rolling a 2
Mutually exclusiveCan't both happen at onceRolling a 2 AND a 5
Independent eventsOne doesn't affect the otherTwo separate coin flips

Complement Rule

The probability of an event not happening:
$$P(\text{not } E) = 1 - P(E)$$
If P(rain) = 0.3, then P(no rain) = 0.7

Three Approaches to Probability

ApproachHow it worksExample
ClassicalCount equally likely outcomesRolling dice, card games
Empirical (Frequentist)Based on observed data/experiments60 out of 100 patients recovered → P = 0.6
SubjectiveBased on personal judgment/belief"I think there's a 70% chance it rains"

Quick Summary

  • Probability lives between 0 and 1.
  • The sample space lists every possible outcome.
  • An event is any outcome (or group) you're measuring.
  • Use P(E) = favorable / total when outcomes are equally likely.
  • The complement of any event E has probability 1 − P(E).

Want to go deeper into any of these — like compound probability, Venn diagrams, or conditional probability?

How to find it

How to Find Probability

There are different methods depending on the situation:

Method 1: Classical Formula (Equally Likely Outcomes)

Use this when outcomes are fair (coins, dice, cards).
$$P(E) = \frac{\text{Number of favorable outcomes}}{\text{Total number of possible outcomes}}$$
Steps:
  1. Count the total outcomes in the sample space
  2. Count how many outcomes match your event
  3. Divide
Example: What is the probability of drawing a King from a deck of cards?
  • Favorable outcomes: 4 Kings
  • Total outcomes: 52 cards
  • P(King) = 4/52 = 1/13 ≈ 0.077

Method 2: Empirical Formula (Based on Data/Experiments)

Use this when you have real data or past observations.
$$P(E) = \frac{\text{Number of times event occurred}}{\text{Total number of trials}}$$
Example: A basketball player made 35 out of 50 free throws.
  • P(making a shot) = 35/50 = 0.70 = 70%

Method 3: Complement (Easier Reverse Approach)

Use this when it's easier to find what you DON'T want.
$$P(E) = 1 - P(\text{not } E)$$
Example: What's the probability of rolling at least one 6 when rolling two dice?
  • P(no 6 on one die) = 5/6
  • P(no 6 on either die) = 5/6 × 5/6 = 25/36
  • P(at least one 6) = 1 − 25/36 = 11/36 ≈ 0.306

Method 4: Addition Rule (Either/Or Events)

For mutually exclusive events (can't happen at the same time): $$P(A \text{ or } B) = P(A) + P(B)$$
For non-mutually exclusive events (can overlap): $$P(A \text{ or } B) = P(A) + P(B) - P(A \text{ and } B)$$
Example (exclusive): P(rolling a 2 or a 5) = 1/6 + 1/6 = 2/6 = 1/3
Example (non-exclusive): P(drawing a King or a Heart):
  • P(King) = 4/52, P(Heart) = 13/52, P(King of Hearts) = 1/52
  • P = 4/52 + 13/52 − 1/52 = 16/52 ≈ 0.308

Method 5: Multiplication Rule (Both Events Together)

For independent events (one doesn't affect the other): $$P(A \text{ and } B) = P(A) \times P(B)$$
For dependent events (one affects the other): $$P(A \text{ and } B) = P(A) \times P(B | A)$$
Example (independent): P(flipping Heads twice) = 1/2 × 1/2 = 1/4
Example (dependent): Drawing 2 Aces in a row (no replacement):
  • P(1st Ace) = 4/52, P(2nd Ace | 1st was Ace) = 3/51
  • P = 4/52 × 3/51 = 12/2652 ≈ 0.0045

Quick Decision Guide

SituationMethod to Use
Fair game, equal chancesClassical formula
Have real data/observationsEmpirical formula
Easier to find the oppositeComplement rule
"Or" / at least one of two eventsAddition rule
"And" / both events happeningMultiplication rule

Want to practice with some example problems, or dive deeper into any one method?

What if no of sample space and possible outcomes of events are same like in rolling a dice

When Favorable Outcomes = Total Outcomes

If the number of favorable outcomes equals the total number of outcomes, the probability is:
$$P(E) = \frac{6}{6} = 1$$
This means the event is CERTAIN — it will always happen.

Dice Example

Rolling a standard die: S = {1, 2, 3, 4, 5, 6} → 6 total outcomes
EventFavorable OutcomesP(E)Meaning
Rolling a number less than 7{1,2,3,4,5,6} = 66/6 = 1Certain ✅
Rolling a number greater than 0{1,2,3,4,5,6} = 66/6 = 1Certain ✅
Rolling a number between 0 and 7{1,2,3,4,5,6} = 66/6 = 1Certain ✅

The 3 Special Cases to Know

ConditionFormulaResultMeaning
Favorable = TotalP = n/n1Certain (always happens)
Favorable = 0P = 0/n0Impossible (never happens)
Favorable < TotalP = x/nBetween 0 and 1Possible but not guaranteed

Simple Way to Think About It

If every outcome in the sample space satisfies your event, then no matter what happens, your event occurs — so probability = 1.
Example: "What is the probability of rolling a number from 1 to 6 on a standard die?"
  • Every possible result is between 1 and 6
  • So it's guaranteed → P = 1

Key Rule to Remember

$$0 \leq P(E) \leq 1 \quad \text{(always)}$$
  • P = 0 → Impossible
  • P = 1 → Certain
  • P = 0.5 → Equal chance
  • Anything between → Some level of likelihood

So if you ever set up a problem where all outcomes match your event, don't be confused — it just means the event is certain, and that's a valid and correct answer!
Want to try some practice problems to test this?

Yes

Probability Practice Problems

Try solving these on your own first, then check the answers below!

🎲 Basic Level

Q1. A bag contains 3 red balls, 2 blue balls, and 5 green balls. What is the probability of picking a red ball?
Q2. A fair coin is flipped. What is the probability of getting Tails?
Q3. A die is rolled. What is the probability of rolling a number greater than 6?

🎯 Medium Level

Q4. A die is rolled. What is the probability of rolling an even number?
Q5. A card is drawn from a standard 52-card deck. What is the probability of drawing a Queen or a Club?
Q6. A class has 12 boys and 8 girls. One student is chosen randomly. What is the probability of NOT choosing a boy?

🔥 Challenge Level

Q7. A coin is flipped twice. What is the probability of getting at least one Head?
Q8. Two dice are rolled. What is the probability that the sum equals 7?

✅ Answers

(Scroll down slowly — try first!)
 
 
 

Answer 1

  • Total balls = 3 + 2 + 5 = 10
  • Favorable (red) = 3
  • P = 3/10 = 0.3

Answer 2

  • Total outcomes = {H, T} = 2
  • Favorable = {T} = 1
  • P = 1/2 = 0.5

Answer 3

  • Total outcomes = {1,2,3,4,5,6} = 6
  • Favorable (greater than 6) = none = 0
  • P = 0/6 = 0 → Impossible ❌

Answer 4

  • Total outcomes = 6
  • Favorable (even) = {2, 4, 6} = 3
  • P = 3/6 = 1/2 = 0.5

Answer 5 (Addition Rule)

  • P(Queen) = 4/52
  • P(Club) = 13/52
  • P(Queen of Clubs) = 1/52 (overlap)
  • P = 4/52 + 13/52 − 1/52 = 16/52 = 4/13 ≈ 0.308

Answer 6 (Complement Rule)

  • Total students = 12 + 8 = 20
  • P(boy) = 12/20
  • P(not boy) = 1 − 12/20 = 8/20 = 2/5 = 0.4

Answer 7 (Complement Rule)

  • P(no heads at all) = P(TT) = 1/2 × 1/2 = 1/4
  • P(at least one Head) = 1 − 1/4 = 3/4 = 0.75

Answer 8

  • Total outcomes when rolling 2 dice = 6 × 6 = 36
  • Combinations that sum to 7: (1,6)(2,5)(3,4)(4,3)(5,2)(6,1) = 6
  • P = 6/36 = 1/6 ≈ 0.167

How did you do? Want more practice, harder questions, or explanations on any specific answer?

Branches of statistics

Branches of Statistics

Statistics is broadly divided into 2 main branches, with several sub-branches under each.

🌐 Overview

STATISTICS
├── Descriptive Statistics
└── Inferential Statistics
        ├── Estimation
        └── Hypothesis Testing

1. Descriptive Statistics

Describes and summarizes data in a meaningful way — no conclusions beyond the data itself.

What it does:

  • Organizes raw data
  • Presents data visually or numerically
  • Answers: "What does the data look like?"

Tools Used:

TypeExamples
Measures of Central TendencyMean, Median, Mode
Measures of DispersionRange, Variance, Standard Deviation
Graphs & ChartsBar chart, Histogram, Pie chart, Box plot
Frequency DistributionsTables showing how often values occur

Example:

A teacher finds the average score of 30 students on a test = 72/100 That's descriptive — it summarizes the class performance.

2. Inferential Statistics

Uses a sample to make conclusions (inferences) about a larger population.

What it does:

  • Draws conclusions beyond the data
  • Makes predictions and generalizations
  • Answers: "What can we conclude about the population?"

Two Main Sub-branches:


2a. Estimation

Estimating population values from sample data.
TypeDescriptionExample
Point EstimateSingle value used to estimate a parameterSample mean = 72, so population mean ≈ 72
Interval EstimateRange of values (Confidence Interval)Mean is between 68 and 76 with 95% confidence

2b. Hypothesis Testing

Testing a claim or assumption about a population.
TermMeaning
Null Hypothesis (H₀)The default claim (nothing changed/no effect)
Alternative Hypothesis (H₁)What you want to prove
p-valueProbability of getting results by chance
Significance level (α)Threshold to reject H₀ (usually 0.05)
Example:
"Is a new drug more effective than the old one?" You test data from a sample to accept or reject this claim.

Other Notable Branches

BranchFocus
Applied StatisticsUsing stats in real-world fields (medicine, business, sports)
Mathematical StatisticsTheoretical foundations, proofs, and formulas
Bayesian StatisticsUpdating probability with new evidence
BiostatisticsStatistics applied to biology and medicine
Business StatisticsDecision-making in economics and business
Computational StatisticsUsing computers/software to analyze large data

Key Difference: Descriptive vs Inferential

FeatureDescriptiveInferential
PurposeSummarize dataDraw conclusions
Data usedEntire datasetSample only
Goes beyond data?NoYes
ExampleAverage exam scorePredicting national exam scores from a sample
ToolsMean, charts, tablest-tests, confidence intervals, regression

Simple Analogy

Imagine you taste a spoon of soup to check if it's salty:
  • Descriptive = "This spoon is salty" (describes what you observed)
  • Inferential = "The whole pot must be salty" (conclusion about the bigger picture)

Want to go deeper into any branch — like how to calculate mean/median/mode, or how hypothesis testing works step by step?

Statistics vs parameters

Statistics vs Parameters

This is one of the most fundamental distinctions in statistics.

Core Difference

ParameterStatistic
Comes fromEntire populationA sample
Known?Usually unknownUsually known
Changes?Fixed (doesn't change)Varies sample to sample
PurposeWhat we want to knowWhat we use to estimate it
SymbolsGreek letters (μ, σ, π)Latin letters (x̄, s, p̂)

Definitions

📊 Parameter

A fixed numerical value that describes a characteristic of an entire population.
  • It's the "truth" about the whole group
  • Usually impossible to know exactly (too large to measure everyone)
  • Represented by Greek letters

📋 Statistic

A numerical value calculated from a sample that is used to estimate a parameter.
  • It's what we can actually measure
  • Changes depending on which sample you pick
  • Represented by Latin/Roman letters

Symbol Table

MeasureParameter (Population)Statistic (Sample)
Meanμ (mu)x̄ (x-bar)
Standard Deviationσ (sigma)s
Varianceσ²
Proportionp (or π)p̂ (p-hat)
SizeNn
Correlationρ (rho)r

Real-World Examples

Example 1 — Student Exam Scores

A country has 5 million students who took a national exam.
ValueType
Average score of ALL 5 million studentsμ = 68Parameter
Average score of a sample of 500 studentsx̄ = 71Statistic

Example 2 — Election Poll

A city has 2 million voters.
ValueType
% of ALL voters who support Candidate Ap = 54%Parameter
% from a survey of 1,000 votersp̂ = 57%Statistic

Example 3 — Factory Quality Control

A factory produces 100,000 bulbs per day.
ValueType
Actual % of defective bulbs out of all 100,000p = 2%Parameter
% defective in a tested batch of 200 bulbsp̂ = 3%Statistic

Visual Analogy

POPULATION  (everyone/everything)
     │
     │  ← Parameter describes this whole group
     │
  [Sample]  (a small selected group)
     │
     │  ← Statistic describes this sample
     │
     ▼
  Used to ESTIMATE the Parameter

Key Relationship

$$\text{Statistic} \xrightarrow{\text{estimates}} \text{Parameter}$$
We can't always measure the whole population, so we take a sample, calculate a statistic, and use it to estimate the parameter.

Simple Memory Trick

LetterStands for
ParameterPopulation
StatisticSample
P goes with P, S goes with S — that's all you need to remember!

Want to move on to how we use statistics to estimate parameters (point estimates & confidence intervals)?
This is a shared conversation. Sign in to Orris to start your own chat.