Here is a thorough explanation of the Mantel-Haenszel method.
The Mantel-Haenszel (MH) Method
Background and Origin
Introduced by Nathan Mantel and William Haenszel in their landmark 1959 paper "Statistical Aspects of the Analysis of Data from Retrospective Studies of Disease" (Journal of the National Cancer Institute), the method was originally designed to analyze case-control (retrospective) data. It has since become a standard tool in epidemiology, clinical trials, and meta-analysis.
The Core Problem it Solves: Confounding
Suppose you want to measure the association between an exposure (e.g., smoking) and a disease (e.g., lung cancer). A simple 2x2 table gives you a crude odds ratio or risk ratio. But a third variable - say, age - might be associated with both smoking AND lung cancer. This is confounding: the crude estimate is distorted because the groups being compared differ on age, not just smoking.
The solution is stratification: split your data into strata (age groups), compute the association within each stratum (where the confounder is held constant), then combine those stratum-specific estimates into a single adjusted summary. The MH method is the standard algorithm for doing this.
The Setup: The 2x2xK Table
For each stratum i (i = 1, 2, ..., K), you have a 2x2 table:
| Disease + | Disease - | Row Total |
|---|
| Exposed | a_i | b_i | n₁ᵢ |
| Unexposed | c_i | d_i | n₀ᵢ |
| Col Total | m₁ᵢ | m₀ᵢ | Nᵢ |
The MH Pooled Odds Ratio
Each stratum contributes a weighted estimate of the odds ratio. The weight for each stratum is:
wᵢ = (bᵢ × cᵢ) / Nᵢ
The Mantel-Haenszel pooled odds ratio is:
OR_MH = Σ(aᵢdᵢ / Nᵢ) / Σ(bᵢcᵢ / Nᵢ)
That is: sum the "concordant" products weighted by stratum size, divided by the sum of "discordant" products weighted by stratum size. Strata with larger samples and more informative data receive more weight automatically.
The MH Pooled Risk Ratio (for cohort studies)
For cohort or cross-sectional data:
RR_MH = Σ(aᵢ × n₀ᵢ / Nᵢ) / Σ(cᵢ × n₁ᵢ / Nᵢ)
Again, each stratum is weighted by its contribution to the total, ensuring larger strata carry more influence.
The Mantel-Haenszel Chi-Square Test
Beyond just estimating the adjusted association, MH also provides a hypothesis test of whether the adjusted association is statistically significant (H₀: OR_MH = 1 or RR_MH = 1):
χ²_MH = [|Σ(aᵢ - Eᵢ)| - 0.5]² / Σ Var(aᵢ)
Where:
- Eᵢ = n₁ᵢ × m₁ᵢ / Nᵢ (expected value of aᵢ under H₀)
- Var(aᵢ) = n₁ᵢ × n₀ᵢ × m₁ᵢ × m₀ᵢ / (Nᵢ² × (Nᵢ - 1))
- The 0.5 is a continuity correction (Yates correction)
- This statistic follows a chi-square distribution with 1 degree of freedom, regardless of how many strata K you have
Worked Conceptual Example
(From Creasy & Resnik's Maternal-Fetal Medicine)
A case-control study examines alcohol use and preeclampsia. The crude OR is 2.26, suggesting alcohol raises risk. But parity (first vs. later pregnancy) could confound this. Stratifying:
- Nulliparous stratum: OR = 1.0
- Multiparous stratum: OR = 1.0
- MH Summary OR = 1.0 (95% CI: 0.42 to 2.34)
The adjusted OR of 1.0 shows no true association - parity was confounding the original estimate of 2.26. The crude OR was inflated because parous women are more likely to drink alcohol AND less likely to get preeclampsia.
Assessing Confounding
To decide if confounding is present, compare:
- Crude measure vs. MH adjusted measure
- A common rule of thumb: if they differ by >10-15%, confounding is likely
There is no formal statistical test for confounding - it is a judgment call based on the magnitude of the difference.
Effect Modification (Interaction)
Before computing a pooled MH estimate, check whether the stratum-specific OR/RR values are homogeneous (similar across strata). If they differ substantially, the effect of exposure varies by level of the third variable - this is effect modification (also called interaction), NOT confounding.
To test this formally, use the Breslow-Day test:
- H₀: All stratum-specific odds ratios are equal
- If p < 0.05, effect modification is likely present
- In that case, report stratum-specific estimates rather than a pooled MH estimate - pooling is misleading when there is true heterogeneity
The Cochran-Mantel-Haenszel (CMH) Extension
The CMH statistic generalizes the method to 2x2xK tables where K is any number of strata (not just 2). It:
- Assumes a common odds ratio across all K strata
- Tests the null hypothesis that the two main variables are conditionally independent after controlling for the stratifying variable
- Still yields 1 degree of freedom regardless of K, giving it strong power
(Kaplan & Sadock's Comprehensive Textbook of Psychiatry)
Applications
| Context | Use |
|---|
| Epidemiology | Controlling confounders in case-control / cohort studies |
| Clinical trials | Adjusting treatment comparisons across center or baseline severity strata |
| Meta-analysis | Combining odds ratios or risk ratios from multiple 2x2 tables (one per study) |
| Psychometrics | Differential item functioning (DIF) analysis in test development |
| Survival analysis | Mantel-Cox (log-rank) test for comparing survival curves |
Assumptions and Limitations
- Common effect assumption: MH assumes the OR/RR is the same (or approximately the same) in each stratum. Violation = effect modification.
- No sparse data: Works best when each stratum has enough observations. Strata with very small counts contribute little but can distort results.
- Only one confounder at a time: Stratification handles one (or at most two) confounders simultaneously. For multiple confounders, use logistic regression (which generalizes MH).
- Binary exposure and outcome: Standard MH applies to 2x2 tables. Ordinal or continuous data require extensions.
MH vs. Other Methods
| Method | When to Use |
|---|
| MH | Simple confounder adjustment, one variable at a time; transparent and non-parametric |
| Logistic regression | Multiple confounders simultaneously; continuous covariates |
| Woolf method | Alternative pooling using log(OR) weights; less stable with sparse data |
| Random-effects meta-analysis | When heterogeneity between strata/studies is substantial |
Key Takeaway
The Mantel-Haenszel method produces a weighted average of stratum-specific effect estimates, where larger, more informative strata receive more weight. This makes it a simple, robust, and transparent way to control for a single confounding variable and obtain an adjusted measure of association - whether an OR, RR, or rate ratio - along with a valid significance test, all within a 1-df chi-square framework.