Random-Effects vs Fixed-Effect Meta-Analysis: When to Use Each Model
Random-effects vs fixed-effect meta-analysis explained: assumptions, weighting differences, confidence interval behavior, DerSimonian-Laird vs REML estimation. A practical decision guide with worked examples for choosing the right model.
Dr. Sarah Mitchell
March 30, 2026
Need help estimating heterogeneity or choosing a model? Our sample size and heterogeneity calculator computes I-squared, tau-squared, and prediction intervals for your meta-analysis.
Key Takeaways
Fixed-effect meta-analysis assumes every study estimates the same underlying true effect; random-effects assumes the true effect varies across studies
Random-effects models produce wider confidence intervals and more conservative pooled estimates than fixed-effect models
The Cochrane Handbook recommends random-effects as the default model for most clinical meta-analyses because some degree of between-study heterogeneity is almost always present
DerSimonian-Laird is the most commonly used tau-squared estimator, but REML provides more accurate variance estimates when the number of studies is small
Fixed-effect weights studies by precision alone (inverse variance); random-effects balances within-study precision against between-study variance (tau-squared)
When I-squared is near 0%, both models produce virtually identical pooled effect sizes and confidence intervals
The choice between a random-effects and a fixed-effect meta-analysis is one of the most consequential statistical decisions you will make when synthesizing evidence. The model you select determines how study weights are calculated, how wide your confidence intervals become, how your pooled effect size is interpreted, and ultimately whether your conclusions generalize beyond the specific studies in your review. Despite its importance, the decision is frequently made by default rather than by design: researchers select whichever model their software defaults to, or whichever their supervisor used last, without interrogating the assumptions underneath.
This guide explains the fundamental difference between the two models, walks through how each computes a pooled estimate, compares the three most common tau-squared estimators (DerSimonian-Laird, REML, and Paule-Mandel), and provides a practical decision framework for choosing the right model for your meta-analysis. In our meta-analyses, we default to random-effects because clinical diversity across studies is the norm; identical populations, interventions, and outcome measurement are the exception, not the rule.
What Is the Difference Between Fixed-Effect and Random-Effects Meta-Analysis?
Fixed-effect assumes a single true effect; random-effects assumes a distribution of true effects. Source: Borenstein et al., 2021.
The difference is rooted in what you believe about the studies you are combining. A fixed-effect model assumes that every study in your meta-analysis estimates the same single true effect. Differences between observed study results are attributed entirely to sampling error: the random variation that occurs because each study enrolls a finite number of participants. Under this assumption, there is one true effect size in the population, and every study is trying to estimate that same number.
A random-effects model assumes that the true effect size varies from study to study. Each study estimates its own true effect, and those true effects are drawn from a distribution of possible true effects. The variability between study-level true effects is called between-study heterogeneity, quantified by the variance parameter tau-squared. Under this assumption, your pooled estimate represents the mean of the distribution of true effects rather than a single common effect.
Pro Tip
Pre-specify your model in the protocol
Choosing between fixed-effect and random-effects after seeing your data introduces bias. Declare your model and justification in the protocol or PROSPERO registration before data extraction begins.
Pro Tip
Always report both I-squared and tau-squared alongside the model
I-squared tells you what proportion of variability is due to heterogeneity, while tau-squared tells you the absolute magnitude. Reviewers expect both, and they inform whether your model choice is defensible.
Frequently Asked Questions
Should random-effects be the default model?
Random-effects is recommended as the default for most meta-analyses. Clinical, methodological, and statistical diversity across studies is the norm, meaning the assumption that all studies share one true effect is rarely justified. The Cochrane Handbook supports this position for systematic reviews of healthcare interventions.

Can model choice change whether a result is statistically significant?
Yes. Random-effects models produce wider confidence intervals than fixed-effect models, especially when heterogeneity is present. A result that is statistically significant under a fixed-effect model may lose significance under random-effects because the additional between-study variance widens the interval around the pooled estimate.

What is the DerSimonian-Laird method?
DerSimonian-Laird (DL) is the most widely used method for estimating tau-squared, the between-study variance in a random-effects meta-analysis. It uses a method-of-moments approach and is computationally simple, but it can underestimate tau-squared when the number of studies is small.

What happens when I-squared is 0%?
When I-squared is 0%, both models produce virtually identical results because the estimated between-study variance (tau-squared) is zero. In this scenario, random-effects weights collapse to fixed-effect weights. However, I-squared of 0% does not prove homogeneity; it may reflect low statistical power to detect heterogeneity, especially with fewer than ten studies.

Should I report results from both models?
Best practice is to pre-specify one model as your primary analysis and report the alternative model as a sensitivity analysis. This demonstrates robustness. If the two models yield different conclusions, discuss the discrepancy and explain which result you consider more credible given the clinical context.

What is REML and why is it recommended?
Restricted Maximum Likelihood (REML) is an iterative method for estimating tau-squared that produces less biased variance estimates than DerSimonian-Laird, particularly when the number of studies is small (fewer than 15-20). It is increasingly recommended by methodologists and is the default estimator in several modern meta-analysis software packages.
Dr. Sarah Mitchell holds a PhD in Biostatistics from Johns Hopkins Bloomberg School of Public Health and has over 15 years of experience in systematic review methodology and meta-analysis. She has authored or co-authored 40+ peer-reviewed publications in journals including the Journal of Clinical Epidemiology, BMC Medical Research Methodology, and Research Synthesis Methods. A former Cochrane Review Group statistician and current editorial board member of Systematic Reviews, Dr. Mitchell has supervised 200+ evidence synthesis projects across clinical medicine, public health, and social sciences.
Research Gold's meta-analysis service handles model selection, heterogeneity assessment, sensitivity analyses, and publication-ready forest plots. Start your project today.
This distinction is not merely philosophical. It changes the mathematics of pooling, the interpretation of the result, and the width of the confidence interval around the pooled estimate.
Under a fixed-effect model, you are asking: "What is the best estimate of the single true effect, given these data?" Under a random-effects model, you are asking: "What is the mean of the distribution of true effects across the population of studies that could have been conducted?" The Cochrane Handbook for Systematic Reviews of Interventions (Higgins et al., 2023) frames the distinction precisely this way and notes that the random-effects question is usually the more clinically relevant one.
How the Fixed-Effect Model Works
In a fixed-effect meta-analysis, each study receives a weight that is the inverse of its within-study variance. Larger studies, those with smaller standard errors, receive proportionally greater weight because they provide more precise estimates of the single true effect. The pooled effect size is a weighted average of the individual study effects, where the weights are determined solely by each study's precision.
The formula for the weight of study i under a fixed-effect model is:
w_i = 1 / v_i
where v_i is the within-study variance (the square of the standard error) of study i. The pooled estimate is the sum of (w_i multiplied by the effect size of study i) divided by the sum of all w_i.
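The weighting and pooling arithmetic just described can be written out in a few lines of Python. This is a minimal sketch; the effect sizes and standard errors below are hypothetical values chosen for illustration.

```python
def fixed_effect_pool(effects, ses):
    """Inverse-variance (fixed-effect) pooling of study effect sizes."""
    weights = [1 / se ** 2 for se in ses]                 # w_i = 1 / v_i
    pooled = sum(w * y for w, y in zip(weights, effects)) / sum(weights)
    se_pooled = (1 / sum(weights)) ** 0.5                 # SE of the pooled estimate
    ci = (pooled - 1.96 * se_pooled, pooled + 1.96 * se_pooled)
    return pooled, se_pooled, ci

# Hypothetical effect sizes (e.g., standardized mean differences) and standard errors
pooled, se, ci = fixed_effect_pool([0.30, 0.45, 0.25], [0.10, 0.15, 0.08])
print(round(pooled, 3), round(se, 3))   # -> 0.296 0.058
```

Note how the third study, with the smallest standard error, pulls the pooled estimate toward its own value despite having the smallest effect size.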
Because all variability between studies is attributed to sampling error, the confidence interval around the pooled estimate reflects only within-study uncertainty. If studies are truly homogeneous (all estimating the same true effect), this approach is statistically efficient because it maximizes the precision of the pooled estimate.
The fixed-effect model works well in narrow contexts. When you are combining results from multiple centers within a single multi-center trial, or pooling nearly identical replications of the same experiment in the same population, the assumption of a common true effect may be justified. In these cases, the fixed-effect model provides the tightest possible confidence interval and the most precise pooled estimate.
However, in most systematic reviews, studies differ in patient populations, intervention dosing and delivery, comparator conditions, outcome measurement instruments, follow-up durations, and risk of bias. The Cochrane Handbook acknowledges that some degree of clinical and methodological heterogeneity is almost always present, which makes the fixed-effect assumption difficult to justify in practice.
How the Random-Effects Model Works
A random-effects meta-analysis adds a second source of variability: between-study variance, denoted tau-squared. Each study's weight now incorporates both within-study variance and between-study variance:
w_i = 1 / (v_i + tau-squared)
This adjustment has a profound effect on how studies are weighted. In a fixed-effect model, a very large study might receive 60-70% of the total weight. In a random-effects model, adding tau-squared to the denominator compresses the weights: large studies still receive more weight, but the gap between large and small studies narrows. Small studies gain relative influence because the between-study variance component is the same for all studies regardless of size.
The pooled estimate under random-effects represents the estimated mean of the distribution of true effects. The confidence interval reflects uncertainty about this mean, incorporating both within-study sampling error and between-study heterogeneity. As a result, random-effects confidence intervals are wider than fixed-effect confidence intervals whenever tau-squared is greater than zero.
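The weight compression is easy to see in code. A sketch with the same hypothetical three-study data used for illustration throughout: setting tau-squared to zero reproduces the fixed-effect result, while any positive value flattens the weights and inflates the pooled standard error.

```python
def random_effects_pool(effects, ses, tau2):
    """Random-effects pooling: tau2 is added to each study's within-study variance."""
    weights = [1 / (se ** 2 + tau2) for se in ses]        # w_i = 1 / (v_i + tau^2)
    pooled = sum(w * y for w, y in zip(weights, effects)) / sum(weights)
    se_pooled = (1 / sum(weights)) ** 0.5
    return pooled, se_pooled

effects, ses = [0.30, 0.45, 0.25], [0.10, 0.15, 0.08]     # hypothetical studies
fe_est, fe_se = random_effects_pool(effects, ses, 0.0)    # tau2 = 0: identical to fixed-effect
re_est, re_se = random_effects_pool(effects, ses, 0.05)   # tau2 > 0: flatter weights, larger SE
```

With tau-squared at 0.05, the pooled estimate moves toward the less precise studies and the standard error grows, exactly the behavior described above.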
DerSimonian-Laird (DL) Estimator
The DerSimonian-Laird method (DerSimonian and Laird, 1986) is the most widely used approach for estimating tau-squared. It uses a method-of-moments estimator derived from the Cochran Q statistic. DL is computationally simple (it requires only a single pass through the data) and is the default in many software packages, including RevMan and Comprehensive Meta-Analysis; recent versions of R's meta package have moved their default to REML.
The DL estimator computes tau-squared as:
tau-squared = max(0, (Q - df) / C)
where Q is the Cochran heterogeneity statistic, df is the number of studies minus one, and C = sum(w_i) - sum(w_i^2) / sum(w_i), computed from the fixed-effect weights. If Q is less than or equal to df, tau-squared is truncated to zero.
The main limitation of DerSimonian-Laird is that it can underestimate tau-squared, particularly when the number of studies is small (Veroniki et al., 2016). Because it is a moment-based estimator, it does not produce confidence intervals for tau-squared itself, and it can yield a point estimate of zero even when true heterogeneity exists. This underestimation means the random-effects confidence interval may be too narrow, partially defeating the purpose of using a random-effects model.
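A minimal Python sketch of the DL computation, following the Q, df, and C definitions given above. The four input studies are hypothetical, chosen to exhibit visible heterogeneity.

```python
def dersimonian_laird_tau2(effects, ses):
    """DerSimonian-Laird method-of-moments estimator of tau-squared."""
    w = [1 / se ** 2 for se in ses]                       # fixed-effect weights
    pooled = sum(wi * yi for wi, yi in zip(w, effects)) / sum(w)
    # Cochran's Q against the fixed-effect pooled estimate
    q = sum(wi * (yi - pooled) ** 2 for wi, yi in zip(w, effects))
    df = len(effects) - 1
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    return max(0.0, (q - df) / c)                         # truncated at zero

# Hypothetical effect sizes and standard errors with visible heterogeneity
tau2 = dersimonian_laird_tau2([0.10, 0.50, 0.30, 0.65], [0.12, 0.10, 0.15, 0.11])
print(round(tau2, 3))   # comes out near 0.045 for these inputs
```

The `max(0.0, ...)` truncation is exactly the behavior criticized in the text: whenever observed dispersion falls below its chance expectation, the estimate snaps to zero even if true heterogeneity exists.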
Restricted Maximum Likelihood (REML) Estimator
REML is an iterative likelihood-based method that provides less biased estimates of tau-squared compared to DerSimonian-Laird, especially with fewer than 15-20 studies. REML accounts for the fact that estimating the mean effect size "uses up" one degree of freedom, which the unrestricted maximum likelihood (ML) estimator does not. This correction reduces downward bias in the variance estimate.
REML is the default estimator in the metafor package in R and is increasingly recommended by methodological guidelines. Its main disadvantage is computational: it requires iterative optimization, although in practice this is negligible with modern software. Veroniki et al. (2016) conducted a comprehensive simulation study comparing 16 tau-squared estimators and found that REML performed well across a range of scenarios, particularly when the number of studies was small.
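Because REML has no closed form, a crude grid search over the REML log-likelihood (written out from its standard form for the random-effects model) is enough to show the idea. This is a sketch on hypothetical data; in practice you would call metafor's rma(yi, vi, method="REML") rather than roll your own.

```python
import math

def reml_tau2(effects, variances, grid_max=1.0, steps=10000):
    """Grid-search maximizer of the REML log-likelihood (illustrative, not production code)."""
    def reml_ll(t):
        w = [1 / (v + t) for v in variances]
        mu = sum(wi * yi for wi, yi in zip(w, effects)) / sum(w)
        # REML log-likelihood up to an additive constant
        return -0.5 * (sum(math.log(v + t) for v in variances)
                       + math.log(sum(w))
                       + sum(wi * (yi - mu) ** 2 for wi, yi in zip(w, effects)))
    candidates = [grid_max * i / steps for i in range(steps + 1)]
    return max(candidates, key=reml_ll)

# Hypothetical studies with visible heterogeneity (variances = SE squared)
tau2 = reml_tau2([0.10, 0.50, 0.30, 0.65], [0.0144, 0.0100, 0.0225, 0.0121])
```

The extra log(sum(w)) term is what distinguishes REML from ordinary maximum likelihood: it accounts for the degree of freedom spent estimating the mean, which is the source of REML's reduced downward bias.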
Paule-Mandel (PM) Estimator
The Paule-Mandel estimator is an iterative generalized method-of-moments estimator that has gained attention as a robust alternative to both DL and REML. It solves for the value of tau-squared such that the generalized Q statistic equals its expected value under the random-effects model. Simulation studies have shown that PM produces confidence intervals with close to nominal coverage, even with as few as five studies.
PM is less widely implemented in software than DL or REML, but it is available in the metafor package and is referenced as a recommended estimator in recent Cochrane methods guidance. For researchers who want a balance between statistical rigor and robustness, Paule-Mandel is worth considering.
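Because Paule-Mandel is defined by an estimating equation rather than a closed-form formula, a short bisection solver illustrates it. This is a sketch on hypothetical data: the generalized Q statistic is monotonically decreasing in tau-squared, so bisection reliably brackets the root.

```python
def paule_mandel_tau2(effects, variances, hi=10.0, tol=1e-8):
    """Paule-Mandel: choose tau2 so the generalized Q equals its expectation, k - 1."""
    def gen_q(t):
        w = [1 / (v + t) for v in variances]
        mu = sum(wi * yi for wi, yi in zip(w, effects)) / sum(w)
        return sum(wi * (yi - mu) ** 2 for wi, yi in zip(w, effects))
    k = len(effects)
    if gen_q(0.0) <= k - 1:      # no excess dispersion: estimate truncated at zero
        return 0.0
    lo = 0.0
    while hi - lo > tol:         # gen_q decreases in t, so bisection converges
        mid = (lo + hi) / 2
        if gen_q(mid) > k - 1:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

# Hypothetical effect sizes and variances (variances = SE squared)
tau2 = paule_mandel_tau2([0.10, 0.50, 0.30, 0.65], [0.0144, 0.0100, 0.0225, 0.0121])
```

For well-behaved inputs like these, PM lands close to the DL and REML estimates; the three estimators diverge more sharply when studies are few and unbalanced in size.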
Comparison Table: Fixed-Effect vs Random-Effects Meta-Analysis
| Feature | Fixed-Effect Model | Random-Effects Model |
| --- | --- | --- |
| Core assumption | One true effect across all studies | True effect varies across studies |
| Sources of variance | Within-study only | Within-study + between-study (tau-squared) |
| Weighting | Inverse of within-study variance | Inverse of (within-study variance + tau-squared) |
| Weight distribution | Large studies dominate | Weights more balanced across studies |
| Confidence interval | Narrower | Wider (unless tau-squared = 0) |
| Interpretation of pooled estimate | Estimate of the single common effect | Estimate of the mean of the distribution of true effects |
| Generalizability | Limited to the studies included | Generalizes to a population of similar studies |
| When I-squared = 0% | Standard result | Collapses to the fixed-effect result |
| Common estimator | Inverse variance (Mantel-Haenszel for binary) | DerSimonian-Laird, REML, Paule-Mandel |
| Cochrane recommendation | Use when studies are truly identical | Default for most clinical meta-analyses |
| Tau-Squared Estimator | Method | Strengths | Weaknesses |
| --- | --- | --- | --- |
| DerSimonian-Laird | Method of moments | Simple, fast, widely available | Underestimates tau-squared with few studies |
| REML | Iterative likelihood | Less biased, good small-sample properties | Requires iteration, less intuitive |
| Paule-Mandel | Iterative generalized moments | Robust, good CI coverage | Less widely implemented |
How Model Choice Affects Results: A Worked Example
Consider a meta-analysis of five randomized controlled trials examining the effect of an intervention on systolic blood pressure. Suppose the observed mean differences (mmHg) and standard errors are as follows:

| Study | Mean Difference (mmHg) | Standard Error | Fixed-Effect Weight (%) | Random-Effects Weight (%) |
| --- | --- | --- | --- | --- |
| Study A | -8.2 | 1.1 | 36.4 | 28.0 |
| Study B | -5.1 | 2.0 | 11.0 | 15.2 |
| Study C | -6.9 | 1.5 | 19.6 | 21.3 |
| Study D | -3.4 | 2.5 | 7.0 | 11.1 |
| Study E | -7.5 | 1.3 | 26.0 | 24.4 |

Under a fixed-effect model, Study A receives 36.4% of the weight because it has the smallest standard error. The pooled mean difference is approximately -7.1 mmHg (95% CI: -8.4 to -5.8). Notice that the three largest studies (A, C, E) together account for over 80% of the weight.

Under a random-effects model with a between-study variance of tau-squared = 2.1 (a value assumed here for illustration), the weights become more evenly distributed. Study A still receives the most weight, but 28.0% rather than 36.4%. The pooled mean difference shifts to approximately -6.7 mmHg (95% CI: -8.6 to -4.9). The point estimate has moved toward the results of the smaller, less precise studies, and the confidence interval has widened from 2.6 mmHg to 3.8 mmHg.

In this example, both models yield statistically significant results (neither confidence interval crosses zero). But the random-effects estimate is more conservative: smaller in magnitude and less precise. In some meta-analyses, this difference is enough to turn a statistically significant result into a non-significant one, which is precisely why model choice matters.
The forest plot under each model tells a visual story as well. A fixed-effect forest plot shows the diamond (pooled estimate) pulled toward the large-sample studies. A random-effects forest plot shows a wider diamond positioned closer to the center of all the individual study estimates. For guidance on reading and creating forest plots, see our step-by-step meta-analysis guide.
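For readers who want to verify the arithmetic, the pooled estimates in this example can be reproduced with a short script (tau-squared fixed at 2.1, as in the text):

```python
effects = [-8.2, -5.1, -6.9, -3.4, -7.5]      # mean differences (mmHg), Studies A-E
ses = [1.1, 2.0, 1.5, 2.5, 1.3]

def pool(effects, ses, tau2=0.0):
    """Inverse-variance pooling; tau2 = 0 gives the fixed-effect result."""
    w = [1 / (se ** 2 + tau2) for se in ses]
    est = sum(wi * yi for wi, yi in zip(w, effects)) / sum(w)
    se_pooled = (1 / sum(w)) ** 0.5
    return est, se_pooled

fe_est, fe_se = pool(effects, ses)             # fixed-effect
re_est, re_se = pool(effects, ses, tau2=2.1)   # random-effects, tau2 assumed
print(f"FE: {fe_est:.1f} +/- {1.96 * fe_se:.1f}")
print(f"RE: {re_est:.1f} +/- {1.96 * re_se:.1f}")
```

Running this shows the random-effects point estimate sitting closer to the smaller studies and carrying a wider margin of error than the fixed-effect result.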
When to Use Each Model
Pre-specify the model in the protocol, never select after seeing the data. Source: Cochrane Handbook v6.5 (2024) Chapter 10.
The decision between fixed-effect and random-effects should be pre-specified in your protocol and grounded in your understanding of the studies you are combining. Here is a decision framework.
Use a fixed-effect model when all studies share the same population, intervention, comparator, outcome, and design, and you believe the only reason for differences between study results is sampling variability. This is most common in multi-center trials where each site follows the same protocol, or in highly controlled laboratory replications.
Use a random-effects model when you expect clinical or methodological differences between studies. This includes differences in patient demographics, intervention dosing, comparator types, outcome measurement instruments, follow-up periods, or risk of bias. Because nearly all systematic reviews of independent studies involve some degree of heterogeneity, random-effects is the safer default.
Use a random-effects model when your goal is to generalize beyond the specific studies in your review to a broader population of similar studies that could have been conducted. If you want your conclusion to apply to future studies in different settings, random-effects is the appropriate model because it explicitly accounts for variation across settings.
Consider a fixed-effect model as a sensitivity analysis even when random-effects is your primary model. If both models yield the same conclusion, your result is robust. If they disagree, investigate why, the discrepancy usually reveals something important about heterogeneity in your evidence base.
Avoid choosing the model based on the observed I-squared value. Selecting random-effects when I-squared is high and fixed-effect when I-squared is low constitutes data-driven model selection, which inflates type I error and undermines the pre-specified analysis plan. Your model should be chosen based on your a priori expectations about between-study variability, not on the data you observe.
Common Mistakes in Model Selection
Several recurring errors undermine the credibility of meta-analyses, and many of them relate directly to the fixed-effect versus random-effects decision.
Selecting the model post hoc based on results. Some analysts run both models and report whichever gives a more favorable (usually more significant) result. This is a form of selective reporting. Pre-specify your model in the protocol and stick with it. Report the alternative model as a sensitivity analysis, but base your primary conclusions on the pre-specified model.
Equating I-squared of 0% with no heterogeneity. An I-squared of 0% means that the observed variability between study results is no greater than what you would expect from sampling error alone. It does not prove that the true effects are identical. With fewer than ten studies, the power to detect heterogeneity is low. Cochran's Q test is similarly underpowered in small meta-analyses. The absence of statistical evidence for heterogeneity is not evidence of absence.
Ignoring tau-squared and reporting only I-squared. I-squared is a relative measure: it tells you what percentage of the observed variability is attributable to heterogeneity rather than chance. But it does not tell you the magnitude of the heterogeneity on the scale of the outcome; tau-squared does. A meta-analysis of blood pressure reductions might have an I-squared of 60% but a tau-squared of only 1.5 mmHg-squared, meaning the standard deviation of true effects across studies is about 1.2 mmHg, which may or may not be clinically important. Always report both statistics.
Using DerSimonian-Laird without acknowledging its limitations. DerSimonian-Laird is the default in most software and most published meta-analyses. But if your meta-analysis includes fewer than 15 studies, consider using REML or Paule-Mandel as your primary estimator, or at minimum as a sensitivity analysis. The Veroniki et al. (2016) simulation study provides evidence that DL's underestimation of tau-squared can lead to confidence intervals that are too narrow, a problem that is more severe with fewer studies.
Confusing the fixed-effect model with the common-effect assumption in all contexts. In network meta-analysis and multivariate meta-analysis, the term "fixed effects" can refer to study-level fixed effects (treating each study as a separate parameter) rather than a common-effect assumption. Make sure you understand which "fixed" you are discussing.
Failing to consider prediction intervals. Even under a random-effects model, the confidence interval around the pooled mean tells you only about the uncertainty in the mean of the effect distribution. A prediction interval tells you where you would expect the true effect of a new study to fall. Prediction intervals are almost always wider than confidence intervals and provide a more honest assessment of the range of plausible treatment effects. If you want to know whether the intervention will be effective in the next clinical setting, the prediction interval is more informative than the confidence interval.
Not reporting a sensitivity analysis with the alternative model. Reviewers and editors increasingly expect to see both models reported. If your pre-specified model is random-effects, include a fixed-effect sensitivity analysis in your supplementary materials. This costs nothing and substantially strengthens the manuscript's credibility.
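The prediction-interval point above is easy to make concrete. A common approach (Higgins, Thompson, and Spiegelhalter) uses a t critical value with k - 2 degrees of freedom; the numbers below are illustrative values, not results from a real analysis.

```python
import math

def prediction_interval(mu, se_mu, tau2, t_crit):
    """Approximate 95% prediction interval for the true effect in a new study."""
    half_width = t_crit * math.sqrt(tau2 + se_mu ** 2)    # between- plus within-mean variance
    return mu - half_width, mu + half_width

# Illustrative values: pooled mean -6.75 mmHg, SE 0.96, tau2 = 2.1, k = 5 studies.
# For k - 2 = 3 degrees of freedom, the two-sided 95% t critical value is 3.182.
lo, hi = prediction_interval(mu=-6.75, se_mu=0.96, tau2=2.1, t_crit=3.182)
print(f"95% PI: ({lo:.1f}, {hi:.1f})")   # -> 95% PI: (-12.3, -1.2)
```

Note how much wider the prediction interval is than the corresponding confidence interval (-6.75 plus or minus 1.96 times 0.96, roughly -8.6 to -4.9): the mean effect is estimated fairly precisely, but the effect in the next individual setting is far less certain.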
Understanding the distinction between random-effects and fixed-effect meta-analysis is fundamental to producing trustworthy evidence synthesis. The model you choose shapes your weights, your confidence intervals, your point estimate, and the scope of your conclusions. By grounding the decision in a priori assumptions about between-study variability, and by reporting sensitivity analyses with the alternative model, you produce meta-analyses that withstand methodological scrutiny and provide the evidence base that clinical decision-makers need. For a comprehensive walkthrough of the entire meta-analysis process, including model selection within the broader workflow, see our step-by-step meta-analysis guide. And for deeper coverage of heterogeneity assessment, including subgroup analysis and meta-regression, see our I-squared and heterogeneity guide.
If you choose a random-effects model, the next decision is which tau-squared estimator to use. Our comparison of REML versus DerSimonian-Laird explains why REML is now the default recommendation.
Random-effects models should report prediction intervals alongside confidence intervals. Learn what prediction intervals add and why reviewers increasingly request them.
Pro Tip
Run sensitivity analysis with the alternative model
Even if you pre-specified random-effects, report the fixed-effect result as a sensitivity analysis. If conclusions differ, discuss why; this strengthens your manuscript and preempts reviewer criticism.