Guides

5 min read

REML vs DerSimonian-Laird: Choosing the Right Tau-Squared Estimator

The tau-squared estimator you choose shapes the weights, confidence intervals, and pooled effect in every random-effects meta-analysis. This guide compares REML and DerSimonian-Laird directly so you can make an informed, defensible choice.

Dr. Sarah Mitchell

March 30, 2026

Want to try this yourself? Use our free research tools, no sign-up required.

Key Takeaways

Tau-squared is the between-study variance in random-effects meta-analysis; its estimator directly controls study weights, pooled effect precision, and confidence interval width.

DerSimonian-Laird (1986) uses a single-pass method-of-moments formula that consistently underestimates tau-squared, producing confidence intervals that are on average too narrow.

REML uses iterative Fisher scoring on a restricted likelihood function, corrects for fixed-effects estimation bias, and shows lower mean squared error and better confidence interval coverage in simulation studies.

Need professional help with your research?

Our PhD experts deliver complete systematic reviews and meta-analyses, from protocol to manuscript.

Get a Free Quote

Written by

Dr. Sarah Mitchell

PhD, Biostatistics & Research Methodology

Dr. Sarah Mitchell holds a PhD in Biostatistics from Johns Hopkins Bloomberg School of Public Health and has over 15 years of experience in systematic review methodology and meta-analysis. She has authored or co-authored 40+ peer-reviewed publications in journals including the Journal of Clinical Epidemiology, BMC Medical Research Methodology, and Research Synthesis Methods. A former Cochrane Review Group statistician and current editorial board member of Systematic Reviews, Dr. Mitchell has supervised 200+ evidence synthesis projects across clinical medicine, public health, and social sciences.

Learn more about our team

Need professional help with your systematic review or meta-analysis? Get a free quote from our team of PhD researchers.

Ready to Start Your Research Project?

Our PhD-level methodologists handle every stage: systematic reviews, meta-analyses, scoping reviews, and more. PRISMA 2020-compliant, publication-ready.

Get a Free Quote View Pricing

Every random-effects meta-analysis rests on an estimate of tau-squared, the between-study variance that quantifies how much true effect sizes differ across your included studies. The estimator you choose to compute tau-squared is not a technical footnote. It determines the weights assigned to each study, the width of your pooled confidence interval, and ultimately the precision of your summary effect.

Two estimators dominate the field: DerSimonian-Laird (DL), introduced in 1986 and still the most commonly used, and REML (Restricted Maximum Likelihood), which simulation studies consistently show outperforms DL in most realistic meta-analysis scenarios.

Try our free Forest Plot Generator to visualize pooled effects under both estimators and compare how your forest plot changes.

How Tau-Squared Estimation Shapes Your Entire Random-Effects Analysis

In a random-effects synthesis, each study receives a weight proportional to the inverse of its total variance, which is the sum of within-study sampling variance and the between-study variance tau-squared.

When tau-squared is small, large studies dominate. When tau-squared is large, weights become more equal across studies. An overestimated tau-squared makes your weights artificially uniform and widens confidence intervals unnecessarily. An underestimated tau-squared produces overconfident results and artificially narrow intervals.

The DerSimonian-Laird Method: Strengths and Known Limitations

DerSimonian and Laird (1986) proposed a method-of-moments estimator that became the default in almost every meta-analysis software package. Its appeal is computational simplicity with a closed-form solution.

However, simulation studies identified consistent problems. The estimator is biased downward, tending to underestimate the true tau-squared, particularly when the number of studies is small (k less than 30) or when heterogeneity is moderate to high. Studies by Viechtbauer (2005) and Langan and colleagues (2019) showed that DL produces 95% confidence intervals with actual coverage closer to 92-93%.

REML: Fisher Scoring, Iterations, and Why It Performs Better

Restricted Maximum Likelihood estimation treats tau-squared as a variance component estimated via maximum likelihood applied to a restricted log-likelihood function. Unlike DL, REML uses iterative optimization with Fisher scoring iterations, updating the estimate at each step based on the gradient and curvature of the likelihood until convergence.

Simulation results consistently favor REML. Viechtbauer's comprehensive comparison showed REML has lower mean squared error than DL across most scenarios, particularly when k is small (5-20 studies). The metafor package in R uses REML as its default estimator, reflecting the methodological consensus.

When DerSimonian-Laird Remains Acceptable

When k is large (30 or more studies), DL's bias diminishes substantially. When tau-squared is very small or zero, both estimators agree by definition. For exploratory preliminary analyses where computational simplicity aids rapid iteration, DL remains useful.

However, if you are writing for a high-quality clinical journal, if your synthesis has fewer than 20 studies, or if reviewers are likely to scrutinize your heterogeneity estimate, REML is the defensible choice.

See our Sensitivity Analysis Tool to examine whether your pooled estimate is stable across individual study exclusions.

Other Estimators Worth Knowing

Paule-Mandel (PM) avoids DL's downward bias by iterating to find tau-squared such that the expected Q equals its observed value. Simulation studies suggest PM performs comparably to REML.

Hedges' estimator applies a bias correction to the DL estimate and outperforms DL when k is small.

ML (maximum likelihood) without the restriction component underestimates tau-squared more than REML and is generally not recommended.

Our Funnel Plot Generator complements this analysis by letting you visualize how publication bias might interact with your heterogeneity estimate.

Key Takeaways

Tau-squared is the between-study variance in random-effects meta-analysis; its estimator directly controls study weights, pooled effect precision, and confidence interval width.
DerSimonian-Laird (1986) uses a single-pass method-of-moments formula that consistently underestimates tau-squared, producing confidence intervals that are on average too narrow.
REML uses iterative Fisher scoring on a restricted likelihood function, corrects for fixed-effects estimation bias, and shows lower mean squared error and better confidence interval coverage in simulation studies.
REML is preferred when k is below 30, when heterogeneity is moderate to high, and when submitting to high-impact clinical or methodological journals.
DL remains acceptable when k is large (30+) or when software constraints prevent REML implementation.
Always report which estimator you used, cite its methodological source, and consider a sensitivity comparison.

FAQ

Why does REML produce wider confidence intervals than DerSimonian-Laird?

REML typically estimates a larger tau-squared than DL because it corrects for the downward bias in the method-of-moments approach. A larger tau-squared means more between-study variance is attributed to true heterogeneity, which widens the pooled confidence interval. These wider intervals are more accurate reflections of genuine uncertainty.

Does the choice of tau-squared estimator affect the I-squared statistic?

Yes. When tau-squared enters the calculation of study weights in iterative estimators, the Q statistic itself can differ from its DL-based counterpart. In practice, the effect on I-squared is usually modest.

How many Fisher scoring iterations does REML typically need to converge?

For most meta-analysis datasets, REML converges in 10 to 30 iterations. The metafor package uses a default maximum of 100 iterations. Non-convergence is rare but can occur with very few studies or extreme outliers.

Should I use REML for subgroup analyses as well?

Yes. If you are running within-subgroup random-effects models, REML is appropriate for each subgroup, provided each subgroup has at least 3 to 4 studies.

Is there a consensus recommendation on tau-squared estimators from Cochrane?

The Cochrane Handbook (version 6) recommends REML as the preferred estimator for random-effects meta-analyses in most circumstances, while acknowledging DL as a widely used alternative.

Need help with your systematic review or meta-analysis? Get a free quote from our team of PhD researchers.

REML vs DerSimonian-Laird: Choosing the Right Tau-Squared Estimator

Key Takeaways

Dr. Sarah Mitchell

Ready to Start Your Research Project?

How Tau-Squared Estimation Shapes Your Entire Random-Effects Analysis

The DerSimonian-Laird Method: Strengths and Known Limitations

REML: Fisher Scoring, Iterations, and Why It Performs Better

When DerSimonian-Laird Remains Acceptable

Other Estimators Worth Knowing

Key Takeaways

FAQ

Why does REML produce wider confidence intervals than DerSimonian-Laird?

Does the choice of tau-squared estimator affect the I-squared statistic?

How many Fisher scoring iterations does REML typically need to converge?

Should I use REML for subgroup analyses as well?

Is there a consensus recommendation on tau-squared estimators from Cochrane?

Frequently Asked Questions

Related Articles

Related Articles