Explain Mantel - Haenszel Method please.

Searching the Web

Mantel-Haenszel method epidemiology statistics explained

Finding Sources
Finding Sources
Reading Web Page

Mantel Haenszel method explained stratification confounding

Reading File
Reading File
Reading File
Reading File
Here is a thorough explanation of the Mantel-Haenszel method.

The Mantel-Haenszel (MH) Method

Background and Origin

Introduced by Nathan Mantel and William Haenszel in their landmark 1959 paper "Statistical Aspects of the Analysis of Data from Retrospective Studies of Disease" (Journal of the National Cancer Institute), the method was originally designed to analyze case-control (retrospective) data. It has since become a standard tool in epidemiology, clinical trials, and meta-analysis.

The Core Problem it Solves: Confounding

Suppose you want to measure the association between an exposure (e.g., smoking) and a disease (e.g., lung cancer). A simple 2x2 table gives you a crude odds ratio or risk ratio. But a third variable - say, age - might be associated with both smoking AND lung cancer. This is confounding: the crude estimate is distorted because the groups being compared differ on age, not just smoking.
The solution is stratification: split your data into strata (age groups), compute the association within each stratum (where the confounder is held constant), then combine those stratum-specific estimates into a single adjusted summary. The MH method is the standard algorithm for doing this.

The Setup: The 2x2xK Table

For each stratum i (i = 1, 2, ..., K), you have a 2x2 table:
Disease +Disease -Row Total
Exposeda_ib_in₁ᵢ
Unexposedc_id_in₀ᵢ
Col Totalm₁ᵢm₀ᵢNᵢ

The MH Pooled Odds Ratio

Each stratum contributes a weighted estimate of the odds ratio. The weight for each stratum is:
wᵢ = (bᵢ × cᵢ) / Nᵢ
The Mantel-Haenszel pooled odds ratio is:
OR_MH = Σ(aᵢdᵢ / Nᵢ) / Σ(bᵢcᵢ / Nᵢ)
That is: sum the "concordant" products weighted by stratum size, divided by the sum of "discordant" products weighted by stratum size. Strata with larger samples and more informative data receive more weight automatically.

The MH Pooled Risk Ratio (for cohort studies)

For cohort or cross-sectional data:
RR_MH = Σ(aᵢ × n₀ᵢ / Nᵢ) / Σ(cᵢ × n₁ᵢ / Nᵢ)
Again, each stratum is weighted by its contribution to the total, ensuring larger strata carry more influence.

The Mantel-Haenszel Chi-Square Test

Beyond just estimating the adjusted association, MH also provides a hypothesis test of whether the adjusted association is statistically significant (H₀: OR_MH = 1 or RR_MH = 1):
χ²_MH = [|Σ(aᵢ - Eᵢ)| - 0.5]² / Σ Var(aᵢ)
Where:
  • Eᵢ = n₁ᵢ × m₁ᵢ / Nᵢ (expected value of aᵢ under H₀)
  • Var(aᵢ) = n₁ᵢ × n₀ᵢ × m₁ᵢ × m₀ᵢ / (Nᵢ² × (Nᵢ - 1))
  • The 0.5 is a continuity correction (Yates correction)
  • This statistic follows a chi-square distribution with 1 degree of freedom, regardless of how many strata K you have

Worked Conceptual Example

(From Creasy & Resnik's Maternal-Fetal Medicine)
A case-control study examines alcohol use and preeclampsia. The crude OR is 2.26, suggesting alcohol raises risk. But parity (first vs. later pregnancy) could confound this. Stratifying:
  • Nulliparous stratum: OR = 1.0
  • Multiparous stratum: OR = 1.0
  • MH Summary OR = 1.0 (95% CI: 0.42 to 2.34)
The adjusted OR of 1.0 shows no true association - parity was confounding the original estimate of 2.26. The crude OR was inflated because parous women are more likely to drink alcohol AND less likely to get preeclampsia.

Assessing Confounding

To decide if confounding is present, compare:
  • Crude measure vs. MH adjusted measure
  • A common rule of thumb: if they differ by >10-15%, confounding is likely
There is no formal statistical test for confounding - it is a judgment call based on the magnitude of the difference.

Effect Modification (Interaction)

Before computing a pooled MH estimate, check whether the stratum-specific OR/RR values are homogeneous (similar across strata). If they differ substantially, the effect of exposure varies by level of the third variable - this is effect modification (also called interaction), NOT confounding.
To test this formally, use the Breslow-Day test:
  • H₀: All stratum-specific odds ratios are equal
  • If p < 0.05, effect modification is likely present
  • In that case, report stratum-specific estimates rather than a pooled MH estimate - pooling is misleading when there is true heterogeneity

The Cochran-Mantel-Haenszel (CMH) Extension

The CMH statistic generalizes the method to 2x2xK tables where K is any number of strata (not just 2). It:
  • Assumes a common odds ratio across all K strata
  • Tests the null hypothesis that the two main variables are conditionally independent after controlling for the stratifying variable
  • Still yields 1 degree of freedom regardless of K, giving it strong power
(Kaplan & Sadock's Comprehensive Textbook of Psychiatry)

Applications

ContextUse
EpidemiologyControlling confounders in case-control / cohort studies
Clinical trialsAdjusting treatment comparisons across center or baseline severity strata
Meta-analysisCombining odds ratios or risk ratios from multiple 2x2 tables (one per study)
PsychometricsDifferential item functioning (DIF) analysis in test development
Survival analysisMantel-Cox (log-rank) test for comparing survival curves

Assumptions and Limitations

  1. Common effect assumption: MH assumes the OR/RR is the same (or approximately the same) in each stratum. Violation = effect modification.
  2. No sparse data: Works best when each stratum has enough observations. Strata with very small counts contribute little but can distort results.
  3. Only one confounder at a time: Stratification handles one (or at most two) confounders simultaneously. For multiple confounders, use logistic regression (which generalizes MH).
  4. Binary exposure and outcome: Standard MH applies to 2x2 tables. Ordinal or continuous data require extensions.

MH vs. Other Methods

MethodWhen to Use
MHSimple confounder adjustment, one variable at a time; transparent and non-parametric
Logistic regressionMultiple confounders simultaneously; continuous covariates
Woolf methodAlternative pooling using log(OR) weights; less stable with sparse data
Random-effects meta-analysisWhen heterogeneity between strata/studies is substantial

Key Takeaway

The Mantel-Haenszel method produces a weighted average of stratum-specific effect estimates, where larger, more informative strata receive more weight. This makes it a simple, robust, and transparent way to control for a single confounding variable and obtain an adjusted measure of association - whether an OR, RR, or rate ratio - along with a valid significance test, all within a 1-df chi-square framework.

Please explain in simple and understandable terminology

The Mantel-Haenszel Method - Simply Explained


Start With a Story

Imagine you want to answer this question:
"Does drinking coffee cause heart disease?"
You collect data on 1,000 people - some drink coffee, some don't - and you check who got heart disease. You find that coffee drinkers get heart disease MORE often. Alarming, right?
But wait. Coffee drinkers also tend to smoke more. And smoking causes heart disease. So is it really the coffee... or is it the smoking sneaking into your results?
This "sneaky third variable" is called a confounder. The Mantel-Haenszel method is a tool to remove the confounder's influence and see the true picture.

The Simple Idea: Divide, Then Combine

The MH method works in two steps:

Step 1 - Divide (Stratify)

Split your data into groups based on the confounder.
  • Group 1: Smokers only
  • Group 2: Non-smokers only
Now compare coffee vs. no coffee within each group separately. Within a group of all-smokers, everyone smokes the same amount - so smoking can't distort the coffee comparison anymore. You've neutralized the confounder.

Step 2 - Combine (Pool)

You now have two separate results (one per group). The MH method combines them into one overall answer, giving more weight to the groups with more people (because bigger groups give more reliable estimates).
Think of it like averaging exam scores from two classes - but if one class has 100 students and the other has 10, you trust the bigger class more. MH does exactly this mathematically.

What Does "Weight" Mean Here?

Each group (stratum) gets a weight based on how many people are in it. A group with 500 people contributes more to the final answer than a group with 20 people. This makes the combined result more accurate and stable.

What Number Does It Produce?

The MH method produces an adjusted Odds Ratio (OR) or adjusted Risk Ratio (RR) - a single number that tells you how strongly the exposure is linked to the disease, after removing the confounder's effect.
ResultMeaning
OR = 1.0No association (exposure doesn't matter)
OR > 1.0Exposure increases the risk
OR < 1.0Exposure decreases the risk (protective)

A Concrete Example

Going back to the preeclampsia example from earlier:
  • Crude (unadjusted) OR = 2.26 - looks like alcohol doubles the risk of preeclampsia
  • You suspect parity (whether a woman has had children before) is confounding this
  • You split the data: first-time moms vs. women who've had babies before
  • In each group separately, the OR = 1.0
  • MH adjusted OR = 1.0 - no real association!
The original scary result of 2.26 was entirely due to parity sneaking in. Alcohol wasn't actually the problem - parity was. The MH method revealed this.

What About the Statistical Test?

After computing the adjusted OR, you want to know: is this result real or just due to chance?
The MH method includes a chi-square test that answers exactly this. It produces a p-value:
  • p < 0.05 = the association is statistically significant
  • p ≥ 0.05 = could be due to chance
The clever part: this test gives you just 1 single p-value, no matter how many groups you split into. So splitting into 10 age groups still gives one clean answer.

One Important Warning: Effect Modification

Before you combine the groups, you should check: are the results similar in all groups?
  • If coffee has OR = 3.0 in smokers but OR = 0.8 in non-smokers, that's very different.
  • This means the effect of coffee changes depending on smoking status - this is called effect modification.
  • In this case, you should NOT combine the groups into one number - you should report them separately, because they tell different stories.
You only use MH to combine results when the groups tell roughly the same story.

In a Nutshell

StepWhat You DoWhy
1Split data by confounderNeutralize its effect
2Calculate OR/RR in each groupGet a "clean" estimate per group
3Check groups are similarMake sure combining makes sense
4Combine using MH weightsGet one adjusted overall estimate
5Run the chi-square testCheck if the result is statistically real

When Is It Used?

  • Epidemiology: Controlling for confounders in disease studies
  • Clinical trials: Comparing treatments across different hospitals or patient subgroups
  • Meta-analysis: Combining results from multiple studies into one pooled estimate
  • Any situation where a third variable might be distorting a two-variable comparison

The Bottom Line

The Mantel-Haenszel method is like a fairness referee. It makes sure you're comparing apples to apples by holding confounders constant, then combines the fair comparisons into one trustworthy answer - giving more say to the groups with more data.

Can you explain with an example question?

Mantel-Haenszel Method: Step-by-Step Example


The Question

"Does smoking cause lung cancer?" But we suspect age might be a confounder (older people smoke more AND are more likely to get cancer anyway).
We collected data on 200 people and divided them into two age groups:
  • Young (under 50)
  • Old (50 and above)

Step 1: Look at the Raw (Crude) Data First

Here's the overall data, ignoring age:
Cancer +Cancer -Total
Smoker4060100
Non-smoker2080100
Total60140200
Crude Odds Ratio = (40 × 80) / (60 × 20) = 3200 / 1200 = 2.67
This says smokers have 2.67 times the odds of getting lung cancer. But is age distorting this?

Step 2: Stratify by Age (The Confounder)

Stratum 1 - Young People (Under 50)

Cancer +Cancer -Total
Smoker104050
Non-smoker54550
Total1585100
OR (Young) = (10 × 45) / (40 × 5) = 450 / 200 = 2.25

Stratum 2 - Old People (50 and above)

Cancer +Cancer -Total
Smoker302050
Non-smoker153550
Total4555100
OR (Old) = (30 × 35) / (20 × 15) = 1050 / 300 = 3.50

Step 3: Check - Are the Two ORs Similar?

  • Young group: OR = 2.25
  • Old group: OR = 3.50
They're in the same direction (both above 1, both suggesting smoking increases cancer risk). They're not wildly different, so it's reasonable to combine them. ✅
(If one was 2.25 and the other was 0.4, we'd say there's effect modification and we should NOT combine them.)

Step 4: Calculate the MH Weights

The MH formula weights each stratum by how much data it contributes. The weight for each stratum is:
Weight = (b × c) / N
Where:
  • b = exposed without disease
  • c = unexposed with disease
  • N = total in that stratum
Weight for Young group: = (40 × 5) / 100 = 200 / 100 = 2.0
Weight for Old group: = (20 × 15) / 100 = 300 / 100 = 3.0
The old group gets more weight because its numbers are more informative (the disease is more common there, so there's more statistical signal).

Step 5: Calculate the MH Adjusted Odds Ratio

The formula is:
OR_MH = Σ(a × d / N) / Σ(b × c / N)
Numerator (top of the fraction):
  • Young: (10 × 45) / 100 = 450/100 = 4.5
  • Old: (30 × 35) / 100 = 1050/100 = 10.5
  • Total numerator = 4.5 + 10.5 = 15.0
Denominator (bottom of the fraction):
  • Young: (40 × 5) / 100 = 200/100 = 2.0
  • Old: (20 × 15) / 100 = 300/100 = 3.0
  • Total denominator = 2.0 + 3.0 = 5.0
OR_MH = 15.0 / 5.0 = 3.0

Step 6: Interpret the Results

EstimateValueMeaning
Crude OR2.67Raw association before adjustment
MH Adjusted OR3.00Association after controlling for age
The adjusted OR (3.0) is slightly higher than the crude OR (2.67). This means age was actually masking some of the smoking effect (a phenomenon called negative confounding). After removing age's influence, the true association between smoking and lung cancer is a bit stronger.
Conclusion: Smokers have 3 times the odds of developing lung cancer compared to non-smokers, after adjusting for age.

Quick Visual Summary of What Just Happened

RAW DATA
(All mixed together)
Crude OR = 2.67  ← possibly distorted by age
        |
        ▼
SPLIT BY AGE
  Young group → OR = 2.25
  Old group   → OR = 3.50
        |
        ▼
COMBINE WITH WEIGHTS
  (Old group gets more weight = 3.0)
  (Young group gets less weight = 2.0)
        |
        ▼
MH ADJUSTED OR = 3.0  ← the honest, age-controlled answer

The Key Lesson From This Example

  • The crude OR was 2.67 - it looked like age was inflating the result
  • After controlling for age using MH, the true OR was actually 3.0 - slightly stronger
  • This showed age was suppressing the real effect, not inflating it
  • The MH method peeled away age's influence and revealed the cleaner truth
This is exactly why researchers never rely on a crude estimate alone in observational studies - confounders can hide, inflate, or reverse the real association.

What's the difference between confounding and effect modification?

Confounding vs. Effect Modification


The One-Line Difference

Confounding = a third variable is distorting your result. Fix it, remove it, adjust for it. Effect modification = a third variable is changing the true effect. Don't remove it — report it, because it's telling you something real and important.

Think of It This Way

Imagine you're trying to hear a song (the true exposure-disease relationship):
  • Confounding = background noise. It's interfering with what you're trying to hear. You want to remove it.
  • Effect modification = the song genuinely sounds different in different rooms. That difference is the finding. You want to describe it.

Confounding - Explained Simply

A confounder is a variable that:
  1. Is associated with the exposure
  2. Is associated with the outcome
  3. Is NOT on the causal pathway between them
It creates a fake or distorted association. The confounder is like a shadow - it makes things look different from what they really are.

Example of Confounding

Question: Does carrying a lighter cause lung cancer?
You find that lighter-carriers get lung cancer more often. Shocking? Not really.
  • Lighter-carriers tend to smoke
  • Smoking causes lung cancer
  • Carrying a lighter does NOT cause cancer - smoking does
Smoking is the confounder. Once you control for smoking, the lighter-cancer association disappears completely. The lighter was just a proxy for the real culprit.
Lighter ──────────────────────────── Cancer
    ↘                              ↗
      Smoking (the real cause)
The lighter-cancer association was fake - created entirely by the confounder.

Effect Modification - Explained Simply

Effect modification (also called interaction) is when the effect of an exposure on an outcome is genuinely different depending on a third variable.
It is NOT a bias or an error. It is a real biological or clinical phenomenon that you want to know about.

Example of Effect Modification

Question: Does aspirin prevent heart attacks?
You study two groups - men and women:
  • In men: Aspirin reduces heart attack risk significantly (OR = 0.6)
  • In women: Aspirin has almost no effect on heart attacks (OR = 0.97)
Sex is an effect modifier. The drug works differently in men vs. women. This is a real and important finding - not a statistical nuisance to remove.
If you pooled the data and computed one average OR, you'd miss this entirely and give wrong clinical advice to half the population.

Side-by-Side Comparison

FeatureConfoundingEffect Modification
What it isA distortion / biasA real biological difference
Is it a problem?Yes - it misleads youNo - it informs you
What to do with itAdjust for it, control it, remove itReport it, stratify by it
MH adjusted OR vs crude ORThey differ significantlyStratum-specific ORs differ from each other
Should you pool the ORs?Yes, after adjustmentNo - report separately
ExampleAge confounding coffee-cancer linkSex modifying aspirin's effect on heart attacks
Clinical implication"The crude result was misleading""The treatment works differently in subgroups"

How to Tell Them Apart in Practice

Here's the decision process when you stratify your data:
Stratify by the third variable
            │
            ▼
Are the stratum-specific ORs
    similar to each other?
        /           \
      YES             NO
       │               │
       ▼               ▼
Are they different    EFFECT MODIFICATION
from the crude OR?    → Report each stratum
    /       \           separately
  YES        NO
   │          │
   ▼          ▼
CONFOUNDING  No confounding,
→ Use the    no effect modification
  MH adjusted
  OR

A Worked Example Showing Both

Question: Does a new drug reduce mortality? Potential third variable: Disease severity (mild vs. severe)

Scenario A - Confounding

GroupDrug OR
Mild disease0.5
Severe disease0.5
Crude OR0.7
Both strata show the same OR (0.5). The crude was 0.7 - distorted by the fact that sicker people got the drug more often. The true effect is 0.5 in both groups. Severity was confounding the result.
✅ Use the MH adjusted OR = 0.5 and report that.

Scenario B - Effect Modification

GroupDrug OR
Mild disease0.9 (barely works)
Severe disease0.3 (works very well)
Crude OR0.6
The ORs are very different between strata. The drug works much better in severe disease. This is effect modification.
❌ Do NOT report a single pooled OR of 0.6 - it hides the truth. ✅ Report: "The drug reduces mortality by 70% in severe disease but has little effect in mild disease."

The Most Important Practical Point

SituationWrong ActionRight Action
ConfoundingReport the crude ORAdjust using MH or regression
Effect modificationPool into one numberStratify and report each group
Confounding is something you correct. Effect modification is something you communicate.

Memory Hook

Confounder = a liar in your data. Catch it, remove it. Effect modifier = a storyteller in your data. Listen to it, report it.
This is a shared conversation. Sign in to Orris to start your own chat.