Excellent - now I have everything needed for a thorough, example-driven answer.
Denominator: "People at Risk at Start" vs. "Total Person-Time at Risk"
These are two different ways of building the denominator, used in two different situations. The choice depends on whether everyone in your study was followed for the same amount of time or not.
Part 1 - The Core Difference with a Simple Analogy
Imagine you're running a 5-year study on heart attacks among 5 people.
Think of it like tracking runners on a track. You want to know "how fast do people get heart attacks?" You can measure this two ways:
- Count-based (CI denominator): Count how many runners started - assume they all ran the full 5 years.
- Time-based (Incidence Rate denominator): Measure the actual metres each runner covered before they stopped.
The second way is more accurate when runners drop out or get injured at different times.
Part 2 - Side-by-Side Illustrated Example
The Setup
You follow 5 people for up to 5 years to study a disease.
| Person | What happened | When |
|---|
| A | Completed full 5 years - stayed healthy | Year 5 (end) |
| B | Developed disease | End of Year 3 |
| C | Lost to follow-up (moved away) | End of Year 2 |
| D | Died of unrelated cause (car accident) | End of Year 1 |
| E | Completed full 5 years - stayed healthy | Year 5 (end) |
Method 1: "People at Risk at Start" (Cumulative Incidence Denominator)
You simply count everyone who was disease-free on Day 1:
$$\text{Denominator} = 5 \text{ people}$$
$$\text{New cases} = 1 \text{ (Person B)}$$
$$\text{CI} = \frac{1}{5} = 20%$$
The problem: This assumes persons C and D each contributed 5 full years to the study. But Person C only contributed 2 years, and Person D only 1 year. You are overcounting the denominator. The real risk is being underestimated because you gave credit for time that was never actually observed.
This method only works cleanly when:
- Everyone is followed for the same fixed period
- No one drops out (no loss to follow-up)
- No competing deaths (no one dies of something else)
Method 2: "Total Person-Time at Risk" (Incidence Rate Denominator)
Now you add up the actual time each person contributed before they either got the disease, left, died, or the study ended:
| Person | Time contributed | Why it stopped |
|---|
| A | 5 years | Study ended |
| B | 3 years | Got disease - stops being "at risk" once they get it |
| C | 2 years | Lost to follow-up - we don't know what happened |
| D | 1 year | Died of other cause - can no longer get the disease |
| E | 5 years | Study ended |
$$\text{Total Person-Time} = 5 + 3 + 2 + 1 + 5 = \textbf{16 person-years}$$
$$\text{Incidence Rate} = \frac{1 \text{ case}}{16 \text{ person-years}} = 0.0625 \text{ cases/person-year}$$
= 6.25 cases per 100 person-years
Why Person B only contributes 3 years: Once B develops the disease, B is no longer "at risk" of developing it - B already has it. So B's contribution to the denominator ends at that point. - Kaplan & Sadock's Comprehensive Textbook of Psychiatry, p. 2643
Why Person D only contributes 1 year: D died of a car accident and can no longer develop the study disease. D is a "competing event" - D leaves the at-risk pool permanently.
Part 3 - How to Decide "Who is People at Risk at Start"?
This is the most important practical question. The CDC defines the population at risk as those who:
"have the potential to get the disease and be included in the numerator"
Park's Textbook of Preventive and Social Medicine states it precisely: "population at risk is restricted solely to those who are capable of having or acquiring the disease or condition in question."
The 3 Criteria for Being "At Risk":
1. Must NOT already have the disease
- Someone already diagnosed with diabetes on Day 1 is NOT at risk of "getting" diabetes - they already have it.
- They go in neither numerator nor denominator.
2. Must be biologically capable of getting the disease
- Studying ovarian cancer? Denominator = women only (men have no ovaries, so they cannot get it - they are not at risk)
- Studying prostate cancer? Denominator = men only
- Studying cervical cancer? Exclude women who had hysterectomy (no cervix = not at risk)
- Studying post-partum depression? Denominator = women who just gave birth
3. Must have had real exposure opportunity (in outbreak settings)
- At a dinner party food poisoning investigation: only people who attended the dinner are at risk - not the whole town
- Only those who ate the specific food are at risk for that food-borne illness
Decision Table
| Scenario | Who is "at risk"? | Who is EXCLUDED? |
|---|
| New flu cases in a school | All students/staff without current flu | Students already sick with flu |
| Post-surgery complications | All patients who had surgery | Patients who didn't have surgery |
| Ovarian cancer incidence | All women with ovaries | Men; women with oophorectomy |
| Second heart attack study | People who had 1st heart attack | People who never had a heart attack; people who already had 2+ |
| Pregnancy complications | Pregnant women | Non-pregnant women; men |
| Food outbreak on a cruise | Passengers who ate the food in question | Passengers who did NOT eat it |
Part 4 - Visual Timeline (What Person-Time Looks Like)
Year: 0 1 2 3 4 5
Person A: [====|====|====|====|====] → 5 years
Person B: [====|====|====X ] → 3 years (X = got disease)
Person C: [====|====L ] → 2 years (L = lost to follow-up)
Person D: [====† ] → 1 year († = died, unrelated)
Person E: [====|====|====|====|====] → 5 years
Total = 16 person-years
Each segment of the line = 1 year of actual observed at-risk time. The sum of all segments = your denominator for the incidence rate.
Part 5 - When to Use Which Denominator
| Use "People at Risk at Start" (CI) when... | Use "Person-Time" (Incidence Rate) when... |
|---|
| Short, fixed study period | Long follow-up (months to years) |
| Closed population (e.g., outbreak investigation) | Open/dynamic cohort (people join/leave) |
| No one drops out | Loss to follow-up expected |
| No competing deaths | Competing causes of death likely |
| e.g., attack rate during a 3-day festival | e.g., cancer cohort followed for 10 years |
Summary in One Sentence Each
-
"People at risk at start" = a simple headcount of disease-free individuals on Day 1. Assumes everyone contributed equally. Clean but fragile - fails when people drop out.
-
"Total person-time at risk" = the sum of each individual's actual observed at-risk time. Accounts for dropouts, competing deaths, and unequal follow-up. More accurate but requires tracking each person's timeline.
Both measure "who could have gotten the disease" - they just differ in whether that's measured as a count of people or a count of person-years.