A second reviewer for a systematic review is required by every major evidence synthesis guideline. The Cochrane Handbook (Higgins et al., 2023) states that at least two reviewers must independently screen titles, abstracts, and full texts to minimize selection bias. PRISMA 2020 requires authors to report how many reviewers screened records and how disagreements were resolved. The Joanna Briggs Institute (JBI) mirrors this standard, requiring independent dual screening across all stages of study selection, data extraction, and critical appraisal. If you are conducting a systematic review alone, you face a serious methodological gap that peer reviewers and journal editors will identify during submission.
What Happens When You Submit a Single-Reviewer Systematic Review
Journals that publish systematic reviews evaluate methodological rigor before scientific content. A single-reviewer systematic review raises immediate red flags during editorial screening, and many manuscripts never reach peer review as a result.
Desk rejection is the most common outcome. Editors at journals indexed in MEDLINE, Scopus, and Web of Science check whether the methods section describes independent dual screening with a documented conflict resolution process. When that description is absent, the manuscript is returned without review. Journals that follow the PRISMA 2020 reporting guideline (Page et al., 2021) expect item 8 to describe the selection process, including how many reviewers screened each record and whether they worked independently.
Peer reviewer criticism targets single-reviewer screening even when the manuscript passes editorial triage. Reviewers trained in evidence synthesis methodology will question the risk of selection bias, noting that a single reviewer introduces subjective judgment without any calibration check. They will typically recommend that the authors repeat screening with a second reviewer and report inter-rater agreement statistics.
Quality assessment tools penalize single-reviewer methods. The AMSTAR 2 critical appraisal tool (Shea et al., 2017) includes Item 5, which asks whether study selection was performed in duplicate. A "no" answer on this item downgrades the overall confidence rating of the review. Similarly, the ROBIS tool (Whiting et al., 2016) evaluates study selection within its assessment of bias in the review process.
The practical consequence is clear: investing months in a systematic review only to face rejection because of a single-reviewer design is an avoidable loss that proper planning eliminates from the start.
Why Dual Screening Reduces Bias in Study Selection
The requirement for a second reviewer is not bureaucratic. It addresses a well-documented source of error in evidence synthesis.
Confirmation bias causes a solo reviewer to favor studies that align with their hypothesis or prior expectations. When two reviewers screen independently, each serves as a check on the other's judgment, making it far more likely that borderline studies receive fair evaluation rather than reflexive exclusion.
Fatigue-related errors increase as the screening workload grows. A systematic review searching multiple databases can generate 2,000 to 15,000 records after deduplication. Screening thousands of titles and abstracts in a single session leads to declining attention, increased miss rates, and inconsistent application of inclusion and exclusion criteria. A second reviewer catches the studies that a fatigued primary reviewer overlooks.
Ambiguous eligibility decisions are inherent to systematic reviews. Studies that partially meet inclusion criteria, use non-standard outcome definitions, or employ mixed-methods designs create legitimate disagreement. Two independent reviewers surface these ambiguous cases for structured discussion rather than allowing one person's judgment call to determine study inclusion silently.
Reproducibility depends on dual screening. The Cochrane Handbook (Higgins et al., 2023) emphasizes that a systematic review should be reproducible by another team following the same protocol. When two reviewers document their independent decisions, the selection process becomes transparent and verifiable. A single reviewer's decisions are opaque by definition.
Research by Edwards et al. (2002) demonstrated that single-reviewer screening missed approximately 8 percent of relevant studies compared to dual-reviewer screening. In a review with 30 included studies, that represents two or three missing studies that could change the direction and magnitude of pooled effect estimates in a meta-analysis.
Inter-Rater Reliability: How to Measure and Report Screening Agreement
Inter-rater reliability quantifies how consistently two reviewers make the same screening decisions. Reporting this statistic is expected in the methods section of every systematic review, and calculating it correctly strengthens your manuscript.
Cohen's kappa is the standard measure of inter-rater agreement in systematic reviews. Introduced by Cohen (1960) and later extended to more than two raters by Fleiss (1971), kappa adjusts for agreement that would occur by chance alone. The statistic accounts for each reviewer's base rate of inclusion and exclusion, making it more informative than simple percent agreement.
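In its standard form,

\[
\kappa = \frac{p_o - p_e}{1 - p_e}
\]

where \(p_o\) is the observed proportion of records on which the two reviewers agree and \(p_e\) is the agreement expected by chance, calculated from each reviewer's marginal include and exclude rates.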
Kappa interpretation thresholds follow the scale published by Landis and Koch (1977). A kappa of 0.81 to 1.00 indicates almost perfect agreement. A kappa of 0.61 to 0.80 indicates substantial agreement, which most journals consider acceptable. A kappa of 0.41 to 0.60 indicates moderate agreement and signals the need for recalibration. Anything at or below 0.40 falls into the fair, slight, or poor bands and suggests fundamental problems with your eligibility criteria or reviewer training.
Calculate your kappa using our free Cohen's kappa calculator, which generates the statistic from a 2x2 contingency table of reviewer decisions and provides the interpretation alongside the raw value.
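If you prefer to check the arithmetic yourself, here is a minimal Python sketch that performs the same calculation from the four cells of the 2x2 decision table; the function names and the example counts are illustrative, not output from the calculator:

```python
def cohens_kappa(both_include, a_only, b_only, both_exclude):
    """Cohen's kappa for two reviewers from a 2x2 screening decision table."""
    n = both_include + a_only + b_only + both_exclude
    p_observed = (both_include + both_exclude) / n

    # Each reviewer's marginal inclusion rate
    a_include = (both_include + a_only) / n
    b_include = (both_include + b_only) / n

    # Chance-expected agreement from the marginals
    p_expected = a_include * b_include + (1 - a_include) * (1 - b_include)

    return (p_observed - p_expected) / (1 - p_expected)


def landis_koch_label(kappa):
    """Map kappa to the Landis and Koch (1977) interpretation bands."""
    if kappa > 0.80:
        return "almost perfect"
    if kappa > 0.60:
        return "substantial"
    if kappa > 0.40:
        return "moderate"
    if kappa > 0.20:
        return "fair"
    if kappa > 0.00:
        return "slight"
    return "poor"


# Illustrative screening result: 120 records included by both reviewers,
# 830 excluded by both, and 50 disagreements
k = cohens_kappa(both_include=120, a_only=30, b_only=20, both_exclude=830)
print(f"kappa = {k:.2f} ({landis_koch_label(k)})")  # kappa = 0.80 (substantial)
```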
When to calculate kappa depends on your screening volume. For reviews with fewer than 500 records, calculate kappa after completing all title-and-abstract screening. For larger reviews, calculate kappa after a pilot screening of 50 to 100 records, use the result to identify and resolve disagreements in criteria interpretation, then proceed with the remaining records. Report the pilot kappa and the final kappa separately in your methods section.
Percent agreement alone is insufficient. Two reviewers who exclude the same 950 of 1,000 records will achieve 95 percent agreement even if they disagree on every single included study. Kappa corrects for this prevalence-dependent inflation of raw agreement, which is why journals require it over simple percentage reporting.
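To make the inflation concrete, the short calculation below works through that scenario, assuming the 50 disagreements split evenly between the two reviewers (the split is illustrative; any split yields a kappa at or below zero):

```python
# Both reviewers exclude the same 950 of 1,000 records; the 50 remaining
# disagreements are split 25/25 between them (illustrative split).
n = 1000
p_observed = 950 / n                 # 95% raw agreement
a_include = 25 / n                   # reviewer A's inclusion rate
b_include = 25 / n                   # reviewer B's inclusion rate
p_expected = a_include * b_include + (1 - a_include) * (1 - b_include)
kappa = (p_observed - p_expected) / (1 - p_expected)
print(f"percent agreement = {p_observed:.0%}, kappa = {kappa:.3f}")
# percent agreement = 95%, kappa = -0.026
```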
Reporting template for your methods section: "Two reviewers [initials] independently screened all titles and abstracts against pre-defined eligibility criteria. Inter-rater agreement was measured using Cohen's kappa (kappa = [value], 95% CI [lower, upper]). Disagreements were resolved through discussion, with a third reviewer [initials] consulted when consensus could not be reached."