Strata Academy
Effect Measures for Meta-Analysis: RR, OR, HR, MD & SMD
Choosing and interpreting risk ratios, odds ratios, hazard ratios, mean differences, and standardised mean differences in systematic reviews
Quick answer
Choose effect measures that match your outcome type: RR or OR for dichotomous events (RR preferred when events are common), HR for time-to-event, MD for same-scale continuous outcomes, SMD when scales differ. Meta-analysis pools on a log scale for ratios; forest plot null lines are at 1 for RR/OR/HR and 0 for MD/SMD. Never mix incompatible measures in one forest plot.
1. Why effect measure choice matters
Meta-analysis combines estimates that measure the same treatment effect on the same scale. Choosing the wrong effect measure distorts pooled results, confuses readers, and triggers GRADE indirectness or inconsistency concerns.
The effect measure should be pre-specified in PROSPERO or the review protocol. Post-hoc switching from OR to RR after seeing results invites selective reporting criticism.
Software (RevMan, R meta, Stata metan) will pool numbers you provide — your methodological job is to ensure those numbers answer a coherent clinical question.
- Dichotomous outcomes → RR, OR, risk difference, or Peto OR (rare events)
- Time-to-event → hazard ratio (log HR)
- Continuous same scale → mean difference (MD)
- Continuous different scales → standardised mean difference (SMD)
2. Risk ratio (RR) vs odds ratio (OR)
Risk ratio compares event probabilities between groups: (events in intervention / n intervention) ÷ (events in control / n control). It is intuitive and aligns with absolute risk reduction when baseline risk is known.
Odds ratio compares odds of events, not probabilities. When events are common (>10%), OR diverges from RR and can exaggerate apparent benefit or harm. Cochrane prefers RR for interpretability when data allow.
Both are pooled on the log scale in meta-analysis. Forest plots display OR or RR with null line at 1.0 — values left of 1 favour control; right favour intervention (for beneficial outcomes framed as reduction).
3. Hazard ratio (HR) for time-to-event outcomes
Hazard ratio compares event rates over time between groups, accounting for censoring. Use HR when the outcome is time-to-death, time-to-relapse, or similar survival endpoints.
Meta-analysis pools log HRs with their standard errors — usually extracted from published Kaplan–Meier analyses, Cox regression tables, or via digitisation methods when only figures exist.
Do not substitute a significant log-rank p-value without a hazard ratio estimate. Do not pool HRs with RRs on the same forest plot.
- HR < 1 → lower hazard in intervention group (for adverse outcomes framed as events)
- Requires comparable follow-up and censoring assumptions across trials
- Check whether HR is from Cox model adjusted for covariates vs unadjusted
- Immature survival data (few events) → imprecise HRs and GRADE downgrades
4. Mean difference (MD) vs standardised mean difference (SMD)
Mean difference pools continuous outcomes measured on the same scale (e.g. mm Hg, points on the same questionnaire). The pooled MD is in original units — directly comparable to MCID thresholds.
Standardised mean difference (Cohen's d or Hedges' g) pools trials using different instruments measuring the same construct (e.g. multiple depression scales). SMD is unitless — harder to interpret clinically without back-translation.
Rule of thumb: if all trials use the same validated scale, prefer MD. If scales differ but construct is shared, use SMD and explain clinical meaning cautiously.
- MD null value on forest plot → 0
- SMD null value → 0; |0.2| small, |0.5| medium, |0.8| large — context-dependent
- Change scores vs follow-up scores — extract consistently
- Correlated outcomes in same trial — avoid double-counting in synthesis
5. Pooling rules and heterogeneity
Pool only studies estimating the same contrast on the same scale: intervention vs comparator, same time point, same outcome definition.
Random-effects models are Cochrane's default when clinical heterogeneity is plausible. Fixed-effect assumes one true effect — large trials dominate.
High I² does not automatically forbid pooling but demands investigation. If studies measure different constructs, switch to narrative synthesis rather than forcing SMD.
6. Reading effect measures on forest plots
Before interpreting any diamond, read the axis label: OR, RR, HR, MD, or SMD. The null line position follows the measure — 1 for ratios, 0 for differences.
Study weights reflect precision (and model choice), not study quality. A biased but large trial can pull the pool — pair forest reading with ROB 2.
Use our interactive forest plot to practise identifying measure type, null line, and pooled diamond crossing.
- Read effect measure label and null value
- Confirm all studies use compatible contrasts
- Check whether model is random or fixed effects
- Note whether diamond crosses null — statistical significance
- Translate ratio measures to absolute effects for clinical significance
- Read heterogeneity statistics below the plot
7. Extraction tips for student dissertations
Build your extraction spreadsheet with columns for events and totals (for RR/OR), means and SDs (for MD), or HR and log SE. Record whether estimates are adjusted or ITT.
When papers report multiple time points, extract the pre-specified primary time point from your protocol — not the most favourable post-hoc time point.
Contact authors for missing SDs or incomplete 2×2 tables before excluding studies. Document imputation rules if using published methods (e.g. Wan et al. for medians).
Interactive version (quizzes, walkthroughs) loads when JavaScript is enabled.