I now have comprehensive material. Here is a detailed explanation:
Mantel-Haenszel Method: A Detailed Explanation
1. Background and Why It Was Developed
In community medicine and epidemiology, a core challenge is confounding - a situation where the observed association between an exposure and an outcome is distorted by a third variable (the confounder), which is independently associated with both the exposure and the outcome.
For example, if you study the association between obesity and cardiovascular disease (CVD), age can be a confounder - older people are both more likely to be obese and more likely to have CVD. If you ignore age, your crude odds ratio will be inflated beyond the true association.
The Mantel-Haenszel (MH) method, published by Nathan Mantel and William Haenszel in 1959, provides a statistically rigorous way to produce an adjusted (pooled) summary measure of association - odds ratio (OR), relative risk (RR), or rate ratio - after controlling for one or more confounders through stratification.
2. What Is Stratification?
Stratification means dividing the study population into homogeneous subgroups (strata) based on the levels of the confounding variable. Within each stratum, the confounder either does not vary or varies very little, so the association measured within each stratum is unconfounded.
For example, to control for age:
- Stratum 1: Age < 40
- Stratum 2: Age >= 40
You compute a separate 2x2 table and OR/RR for each stratum, then combine them using the MH formula.
3. The Cochran-Mantel-Haenszel Framework: The 2 x 2 x K Table
The MH method works with a 2 x 2 x K table structure:
- 2 rows: Exposure (Yes/No)
- 2 columns: Outcome (Yes/No)
- K layers: The K strata of the confounding variable
Notation for each stratum i:
| Outcome + | Outcome - | Total |
|---|
| Exposed | a_i | b_i | m1_i |
| Unexposed | c_i | d_i | m0_i |
| Total | n1_i | n0_i | n_i |
4. The MH Formula
For Odds Ratio (Case-Control Studies):
$$OR_{MH} = \frac{\sum_i \frac{a_i d_i}{n_i}}{\sum_i \frac{b_i c_i}{n_i}}$$
Each stratum contributes a weighted pair of products. The weight for each stratum is 1/n_i (the inverse of total subjects in that stratum), so larger strata contribute more to the final pooled estimate.
For Relative Risk / Risk Ratio (Cohort Studies):
$$RR_{MH} = \frac{\sum_i \frac{a_i \cdot m0_i}{n_i}}{\sum_i \frac{c_i \cdot m1_i}{n_i}}$$
For Rate Ratio (incidence rate data):
$$IRR_{MH} = \frac{\sum_i \frac{a_i \cdot T_{0i}}{T_i}}{\sum_i \frac{c_i \cdot T_{1i}}{T_i}}$$
where T represents person-time in each stratum.
5. Step-by-Step Application
Step 1: Calculate the Crude (Unadjusted) OR/RR
From the overall 2x2 table (ignoring the confounder):
OR = (a × d) / (b × c)
Step 2: Stratify by the suspected confounder
Create a separate 2x2 table for each stratum of the confounder.
Step 3: Calculate Stratum-Specific OR/RR
Compute OR or RR within each stratum. If the stratum-specific values are similar to each other but differ markedly from the crude estimate, confounding is present.
Step 4: Check for Homogeneity of Effect Across Strata
Before pooling, assess whether the stratum-specific measures are similar (homogeneous). This is the homogeneity assumption - you can use the Breslow-Day test to formally test this.
- If stratum-specific estimates are similar → pool them (MH is appropriate, confounding is present)
- If stratum-specific estimates are very different → do NOT pool; this is effect modification (interaction), and you should report stratum-specific results separately
Step 5: Calculate the MH Pooled OR/RR
Apply the MH formula to get the adjusted, unconfounded estimate.
Step 6: Assess Magnitude of Confounding
$$% \text{ Confounding} = \frac{Crude OR - Adjusted OR}{Crude OR} \times 100$$
A difference of >10-15% is generally considered meaningful confounding.
Step 7: MH Chi-Square Test
The MH chi-square statistic tests the null hypothesis that there is no association between exposure and outcome after adjusting for the confounder:
$$\chi^2_{MH} = \frac{\left[\sum_i \left(a_i - E(a_i)\right)\right]^2}{\sum_i Var(a_i)}$$
This is the Cochran-Mantel-Haenszel test of conditional independence.
6. Worked Example
Study: Association between obesity and CVD, with age as a potential confounder.
Crude data:
- OR (unadjusted) = (46 × 640) / (254 × 60) = 1.93
After stratifying by age (young vs. old):
| Stratum | Stratum OR |
|---|
| Age < 40 | ~1.50 |
| Age ≥ 40 | ~1.52 |
Both stratum-specific ORs are similar to each other (~1.5) but lower than the crude OR (1.93). This pattern indicates positive confounding by age (age inflated the crude estimate).
After applying MH formula:
Magnitude of confounding = (1.93 - 1.52) / 1.93 × 100 = ~21%
7. Key Assumptions of the MH Method
-
Homogeneity of effect: The association measure (OR or RR) must be approximately the same across all strata. If it is not, MH pooling is inappropriate - you are dealing with effect modification, not confounding.
-
No residual confounding within strata: Within each stratum, the confounder is assumed to be sufficiently controlled (subjects are homogeneous for the confounder).
-
Adequate cell counts: Each cell in each 2x2 stratum table should have sufficient numbers (sparse data reduces reliability; the Breslow-Day test may be underpowered with small samples).
8. Confounding vs. Effect Modification: A Critical Distinction
| Feature | Confounding | Effect Modification |
|---|
| Stratum-specific estimates | Similar to each other | Different from each other |
| Action | Pool using MH | Report separately |
| Crude vs. adjusted | Crude differs from adjusted | Crude is misleading |
| Biological meaning | Bias to remove | Real biological phenomenon |
9. Uses in Community Medicine
The MH method has wide applications across community medicine and public health:
a) Controlling Confounding in Observational Studies
- Case-control studies: Adjusting OR for age, sex, socioeconomic status
- Cohort studies: Adjusting RR or rate ratios for known confounders
- Example: Studying smoking and lung cancer while controlling for age and occupation
b) Multi-centre / Multi-site Studies
- When a study is conducted across multiple clinics or hospitals, "site" can confound results (sicker patients may be concentrated at one site)
- MH pools results across sites to give a single unconfounded summary estimate
- This is directly analogous to Mantel and Haenszel's original 1959 example
c) Meta-Analysis
- MH is the foundation for pooling 2x2 data across multiple independent studies
- In systematic reviews, MH gives a weighted pooled OR/RR across studies (inverse-variance or MH weighting)
- Used in Cochrane reviews and other evidence syntheses
d) Survival Analysis - Mantel-Haenszel Log-Rank Test
- The MH principle extends to survival data via the log-rank test (also called Mantel-Haenszel life table test)
- Compares survival curves between groups at each event time point
- Widely used to compare treatment arms in clinical trials and cohort studies
e) Programme Evaluation
- In health programme evaluation, MH helps assess whether an intervention effect holds across subgroups (rural vs. urban, different age groups)
- Adjusts for baseline differences between intervention and control populations
f) Disease Surveillance and Outbreak Investigation
- In field epidemiology, MH is used to adjust for age/sex when comparing attack rates across groups
- Helps identify the true exposure-disease association when demographic confounders exist
g) Vaccine Effectiveness Studies
- Adjusting vaccine effectiveness estimates for age, comorbidities, healthcare-seeking behaviour
h) Environmental Epidemiology
- Studying occupational exposures while controlling for smoking status, age, duration of employment
10. Advantages of the MH Method
| Advantage | Detail |
|---|
| Simple and intuitive | No complex software needed for basic calculations |
| Non-parametric | Does not assume a distribution for the data |
| Transparent | Each stratum's contribution is explicit and can be examined |
| Handles sparse data better | More robust than logistic regression with small samples |
| Widely accepted | Gold standard for stratified analysis in epidemiology |
| Works for multiple study designs | Case-control, cohort, cross-sectional, survival |
11. Limitations
- Limited to one or two confounders at a time (stratification becomes impractical with many confounders - you run into sparse cell counts)
- Cannot handle continuous confounders directly (must be categorized, which introduces residual confounding within categories)
- Homogeneity assumption may not hold - requires checking before pooling
- Sparse data in strata reduces precision
- For multiple confounders simultaneously, multivariable logistic regression or Poisson regression is preferred
12. MH vs. Multivariable Regression
| Feature | Mantel-Haenszel | Multivariable Regression |
|---|
| Number of confounders | 1-2 | Many |
| Model assumptions | Fewer | Distributional assumptions |
| Interpretability | High (transparent) | Moderate |
| Sparse data | Better | May be unstable |
| Standard use | Teaching, field epi | Complex analyses |
In practice, both methods are complementary. MH stratified analysis is often used first to check for confounding and effect modification, and then multivariable regression is used for the final adjusted analysis.
Summary
The Mantel-Haenszel method is one of the cornerstone techniques in community medicine and epidemiology. It provides a statistically valid way to calculate a weighted, pooled summary measure of association (OR or RR) that is adjusted for confounding, by combining stratum-specific 2x2 tables. It distinguishes confounding from effect modification, underpins meta-analytic methods, and is applicable across case-control studies, cohort studies, multi-site trials, and outbreak investigations. Its conceptual clarity makes it an essential teaching tool and practical analytical method in public health practice.
References:
- Mantel N, Haenszel W. Statistical aspects of the analysis of data from retrospective studies of disease. J Natl Cancer Inst. 1959.
- Creasy & Resnik's Maternal-Fetal Medicine, Chapter 15 (Stratified Analysis)
- Kaplan & Sadock's Comprehensive Textbook of Psychiatry (Cochran-Mantel-Haenszel statistic)
- Stratification for Confounding - The MH Formula - Nephron Clinical Practice