What is the difference between subgroup analysis and meta-regression?

Subgroup analysis splits studies into discrete categories (for example, by continent, study design, or intervention dose category) and compares the pooled effect between groups. Meta-regression models the association between a study-level variable (which can be continuous, like mean participant age or intervention duration) and the effect size using weighted regression. Meta-regression is more flexible because it can handle continuous covariates and adjust for multiple covariates simultaneously.

How many studies do I need for subgroup analysis?

Each subgroup should contain at least 2 studies for a pooled estimate, but 5-10 per subgroup are recommended for meaningful between-group comparisons. The test for subgroup differences has very low statistical power when subgroups contain fewer than 5 studies, so findings should be interpreted as exploratory rather than confirmatory.

How many studies do I need for meta-regression?

The general rule of thumb is 10 studies per covariate included in the meta-regression model. With fewer than 10 studies total, meta-regression should not be attempted. With 10-20 studies, only one covariate should be examined. This rule exists because meta-regression with too few studies produces unstable estimates and inflated false positive rates.

Should subgroup analyses be prespecified?

Yes, subgroup analyses should be prespecified in the review protocol registered on PROSPERO. Prespecification prevents data-dredging, where researchers test multiple subgroups and selectively report statistically significant findings. Cochrane recommends that subgroup analyses be prespecified, limited in number, and supported by a biological or theoretical rationale.

Can subgroup analysis prove causation?

No, subgroup analysis in meta-analysis cannot prove causation. It is an observational analysis at the study level that identifies associations between study characteristics and effect sizes. Because studies are not randomized to subgroups, study-level confounding means that an apparent subgroup difference may be caused by other factors that differ between groups of studies, not the subgroup variable itself. Ecological bias is a separate problem: a relationship seen across study-level averages may not hold for individual participants.

Subgroup Analysis and Meta-Regression: Complete Guide

Subgroup analysis and meta-regression are the two primary methods for investigating why effect sizes vary across studies in a meta-analysis. When your I-squared value is high, indicating substantial heterogeneity beyond what chance alone would explain, these methods help identify which study-level characteristics are associated with larger or smaller effects. Understanding when and how to use each method is essential for producing informative, transparent meta-analyses that go beyond a single pooled estimate.

Both methods address the same fundamental question: do effect sizes differ depending on specific study characteristics? But they differ in how they model this relationship. Subgroup analysis is simpler, dividing studies into discrete groups and comparing pooled estimates. Meta-regression is more flexible, using weighted regression to model the relationship between covariates and effect sizes. The choice between them depends on the nature of your moderator variables, the number of included studies, and whether your investigation is prespecified or exploratory.

When to Investigate Heterogeneity

Before conducting subgroup analysis or meta-regression, confirm that meaningful heterogeneity exists. The Cochrane Handbook recommends investigating heterogeneity when:

I-squared exceeds 50%, suggesting moderate to substantial heterogeneity
The Q-test is statistically significant, indicating that between-study variation exceeds what would be expected by chance
Clinical or methodological diversity among included studies suggests that effect sizes may legitimately differ
The prediction interval around the pooled estimate is wide, indicating that the true effect in a new study could differ substantially from the average

If heterogeneity is low (I-squared below 30%) and studies are clinically similar, subgroup analysis and meta-regression add little value and may produce spurious findings through multiple testing.
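
The heterogeneity statistics named above can be computed directly from study effect sizes and their variances. The following is a minimal sketch, assuming inverse-variance weights and the DerSimonian-Laird estimator for the between-study variance (one common choice among several). The function name `heterogeneity` and the example numbers are hypothetical, not taken from any real review.

```python
def heterogeneity(effects, variances):
    """Return Cochran's Q, its degrees of freedom, I-squared (in percent),
    and the DerSimonian-Laird estimate of tau-squared."""
    weights = [1.0 / v for v in variances]  # inverse-variance weights
    sum_w = sum(weights)
    pooled = sum(w * y for w, y in zip(weights, effects)) / sum_w
    # Q: weighted squared deviations of study effects from the pooled effect
    q = sum(w * (y - pooled) ** 2 for w, y in zip(weights, effects))
    df = len(effects) - 1
    # I-squared: share of total variation attributed to heterogeneity
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    # DerSimonian-Laird between-study variance, truncated at zero
    c = sum_w - sum(w * w for w in weights) / sum_w
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0
    return q, df, i2, tau2


# Hypothetical effect sizes (e.g. standardized mean differences) and variances
example_effects = [0.10, 0.35, 0.60, 0.20, 0.55]
example_variances = [0.02, 0.03, 0.02, 0.04, 0.03]
q, df, i2, tau2 = heterogeneity(example_effects, example_variances)
print(f"Q = {q:.2f} on {df} df, I-squared = {i2:.1f}%, tau-squared = {tau2:.3f}")
```

In practice, a dedicated meta-analysis package would also report a p-value for Q and a prediction interval. Both are omitted here to keep the sketch dependency-free.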

Subgroup Analysis: Methodology

Figure: Subgroup analysis pipeline with the test for subgroup differences (4-step Q-between pipeline). Source: Cochrane Handbook v6.5, ch. 10.11; Borenstein et al., 2009.

How It Works

Subgroup analysis divides included studies into groups based on a categorical study-level characteristic and then calculates separate pooled effect estimates for each group. The key output is the test for subgroup differences (also called the interaction test or between-group Q-test), which evaluates whether the pooled estimates differ significantly between groups.

Example: A meta-analysis of exercise interventions for depression includes studies from high-income and low-income countries. Subgroup analysis pools the effect estimate separately for each country group and tests whether the pooled effects are statistically different.

Step-by-Step Process

Prespecify your subgroups in the protocol. Limit to 3-5 subgroups with strong clinical or theoretical rationale
Classify each study into the appropriate subgroup based on the moderator variable
Pool effect sizes within each subgroup using the same meta-analytical model as your main analysis
Conduct the test for subgroup differences to determine whether the between-group variation is statistically significant
Present results with separate forest plots or a single forest plot with subgroup sections
Interpret with caution, especially if the analysis was not prespecified
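
The classification, pooling, and testing steps above can be sketched as follows. This is an illustration under stated assumptions, not a definitive implementation: it pools with a fixed-effect (inverse-variance) model for simplicity, whereas a real analysis would use the same model as the main analysis. It reports a p-value only for the two-subgroup case (1 degree of freedom), because the standard library has no general chi-square distribution. The function names and the data are hypothetical.

```python
import math


def pool_fixed(effects, variances):
    """Inverse-variance pooled estimate and its variance (fixed-effect model)."""
    weights = [1.0 / v for v in variances]
    sum_w = sum(weights)
    estimate = sum(w * y for w, y in zip(weights, effects)) / sum_w
    return estimate, 1.0 / sum_w


def subgroup_test(studies):
    """studies: list of (subgroup_label, effect, variance) tuples.
    Returns per-subgroup pooled results, Q-between, df, and a p-value
    (p-value only when there are exactly two subgroups, else None)."""
    groups = {}
    for label, effect, variance in studies:  # classify studies by subgroup
        effects, variances = groups.setdefault(label, ([], []))
        effects.append(effect)
        variances.append(variance)
    # Pool within each subgroup
    pooled = {label: pool_fixed(e, v) for label, (e, v) in groups.items()}
    # Q-between: weighted squared deviations of subgroup estimates
    # from their own inverse-variance weighted mean
    overall, _ = pool_fixed([est for est, _ in pooled.values()],
                            [var for _, var in pooled.values()])
    q_between = sum((est - overall) ** 2 / var for est, var in pooled.values())
    df = len(pooled) - 1
    # For 1 df, the chi-square upper tail equals erfc(sqrt(Q / 2))
    p_value = math.erfc(math.sqrt(q_between / 2.0)) if df == 1 else None
    return pooled, q_between, df, p_value


# Hypothetical data: (income group, effect size, variance)
example_studies = [
    ("high-income", 0.25, 0.02), ("high-income", 0.30, 0.03),
    ("high-income", 0.20, 0.02), ("low-income", 0.55, 0.04),
    ("low-income", 0.45, 0.03), ("low-income", 0.60, 0.05),
]
pooled, q_between, df, p_value = subgroup_test(example_studies)
for label, (est, var) in pooled.items():
    print(f"{label}: pooled effect {est:.3f} (SE {math.sqrt(var):.3f})")
print(f"Q-between = {q_between:.2f} on {df} df, p = {p_value:.3f}")
```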

Interpreting the Test for Subgroup Differences

The correct way to evaluate subgroup differences is the interaction test, which directly compares the pooled estimates between groups. A common mistake is comparing whether individual subgroup estimates are statistically significant. This approach is flawed because a significant estimate in one subgroup and a non-significant estimate in another does not mean the effects are different; the confidence intervals may overlap substantially.
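
A small numerical sketch makes this pitfall concrete. The estimates and standard errors below are invented for illustration, and the helper names are hypothetical. With two subgroups, a z-test on the difference between the pooled estimates is equivalent to the 1-degree-of-freedom test for subgroup differences.

```python
import math


def two_sided_p(z):
    """Two-sided p-value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def compare_subgroups(est_a, se_a, est_b, se_b):
    """Difference between two independent subgroup estimates,
    its standard error, and the two-sided p-value for the difference."""
    diff = est_a - est_b
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)
    return diff, se_diff, two_sided_p(diff / se_diff)


# Hypothetical pooled estimates: subgroup A looks "significant", B does not
est_a, se_a = 0.40, 0.18
est_b, se_b = 0.25, 0.20
print(f"Subgroup A alone: p = {two_sided_p(est_a / se_a):.3f}")
print(f"Subgroup B alone: p = {two_sided_p(est_b / se_b):.3f}")

diff, se_diff, p_diff = compare_subgroups(est_a, se_a, est_b, se_b)
print(f"Difference A - B: {diff:.2f} (SE {se_diff:.3f}), p = {p_diff:.3f}")
```

With these numbers, subgroup A falls below the conventional 0.05 threshold and subgroup B does not, yet the test on the difference gives no evidence that the two effects differ.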

Meta-Regression: Methodology

Figure: Meta-regression bubble plot with weighted slope on a continuous moderator and R-squared analog. Source: Thompson & Higgins, 2002, Stat Med 21:1559-73.

How It Works

Meta-regression uses weighted least squares regression to model the relationship between one or more study-level covariates and the effect size. In a random-effects meta-regression, the residual between-study variance is typically estimated by restricted maximum likelihood. Each study is a data point, with the effect size as the dependent variable and the study-level characteristic as the independent variable. Studies are weighted by their precision (inverse variance).

Example: A meta-analysis includes studies with intervention durations ranging from 4 to 52 weeks. Meta-regression models whether longer intervention duration is associated with larger effect sizes, treating duration as a continuous variable.
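
The duration example can be sketched as a single-covariate weighted least squares fit. This is a minimal illustration, assuming the between-study variance `tau2` is supplied by the caller (zero gives a fixed-effect meta-regression). Estimating it by restricted maximum likelihood is left to dedicated software. The function name and data are hypothetical.

```python
import math


def meta_regression(effects, variances, covariate, tau2=0.0):
    """Weighted least squares fit of effect size on one study-level covariate.
    Weights are 1 / (within-study variance + tau2).
    Returns (intercept, slope, standard error of the slope)."""
    weights = [1.0 / (v + tau2) for v in variances]
    sum_w = sum(weights)
    x_bar = sum(w * x for w, x in zip(weights, covariate)) / sum_w
    y_bar = sum(w * y for w, y in zip(weights, effects)) / sum_w
    # Weighted sums of squares and cross-products around the weighted means
    sxx = sum(w * (x - x_bar) ** 2 for w, x in zip(weights, covariate))
    sxy = sum(w * (x - x_bar) * (y - y_bar)
              for w, x, y in zip(weights, covariate, effects))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    # With known inverse-variance weights, Var(slope) = 1 / sxx
    se_slope = math.sqrt(1.0 / sxx)
    return intercept, slope, se_slope


# Hypothetical studies: intervention duration in weeks, effect size, variance
durations = [4, 8, 12, 24, 36, 52]
example_effects = [0.15, 0.22, 0.20, 0.38, 0.41, 0.60]
example_variances = [0.04, 0.03, 0.02, 0.03, 0.05, 0.04]

intercept, slope, se_slope = meta_regression(
    example_effects, example_variances, durations)
z = slope / se_slope
p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal approximation
print(f"Effect = {intercept:.3f} + {slope:.4f} * weeks "
      f"(SE of slope {se_slope:.4f}, p = {p:.3f})")
```

The slope is the estimated change in effect size per additional week of intervention. As with subgroup analysis, it describes an association across studies, not a causal effect of duration.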

When Meta-Regression Is Better Than Subgroup Analysis

Continuous moderator variables. Subgroup analysis requires categorizing a continuous variable (e.g., splitting age into "young" and "old"), which loses information. Meta-regression can model the continuous relationship directly
Multiple covariates. Meta-regression can include multiple covariates simultaneously, allowing you to assess the independent association of each while controlling for others (although the number of studies rarely supports more than 2-3 covariates)
Dose-response relationships. Meta-regression can model whether effects increase linearly or non-linearly with dose, duration, or intensity

Subgroup Analysis and Meta-Regression in Meta-Analysis: When and How to Use Them

Dr. Sarah Mitchell
