MGH Guides: Scoping + Systematic Reviews: How Do I Conduct the Data Extraction?

Data Extraction in Scoping vs Systematic Reviews

The Data Extraction Process

As with all of the other steps in the review process, best practice for data extraction is to have two independent reviewers extract data from each article.

Beyond that, here are some tips for a smooth data extraction process.

Follow a Framework
1. For systematic reviews, this is typically the same PICO(T) framework you used to create your question.
2. For scoping reviews, this is a bit trickier, since you often don't know what you're going to find until you find it. However, having a framework in mind can still help guide the extraction and analysis. For example, if your review involves social determinants of health, you might rely on the WHO's Social Determinants of Health.
Create a Template
Regardless of the type of review, a good template is essential. It helps everyone on your team know exactly what data they are looking for. Take a look at the section on data extraction templates in this guide.
Test Your Template
Clarity is essential with a template. A template only works if everyone interprets its categories the same way. Even if your entire team creates the template together and seems to share an understanding about its categories, questions about interpretation may only become clear once data extraction begins. Therefore, you'll want to test it out with several articles and compare results across your team before jumping in. You don't want to get to the end of the extraction process just to realize you need to start over.
Standardize Your Language
The more you and your teammates are alike in the data you extract, the easier it will be for you to move forward to data analysis. After you've tested your template, the conversation you have to compare results should include some discussion about the data you included in each of the template's categories to see if you can standardize that data as much as possible.

A practical guide to data extraction for intervention systematic reviews.
Covidence. (2024). A practical guide to data extraction for intervention systematic reviews.
Recommendations for the extraction, analysis, and presentation of results in scoping reviews
Pollock, D., et al. (2023). JBI Evidence Synthesis, 21(3), 520-532. https://doi.org/10.11124/JBIES-22-00123

Data Extraction and Evidence Tables

Data extraction is the process of taking the most important characteristics of each study in the review and placing them into a chart, so that the details can be easily compared across studies. Typically, at least two people independently extract the data and then compare the results afterward, resolving any conflicts in the data that may have arisen in the process. Their data is then combined to create one final table.

While there are no prescribed set of study characteristics to include in a review, the most frequently extracted data items are things like:

Authors and their affiliations
Title
Publication date
Geographic location of the study
Study design details
Aims/objectives of the study
Population studied
Major findings
Implications

There are many different ways to extract data. You can use a form that you build in Microsoft Forms, Google Forms, RedCap or in a tool like Covidence. (See the Review Tools and Applications tab.) You can enter the data directly into a spreadsheet program like Excel and then blend the charts later by copying and pasting. No matter how you extract the data, the important thing is to describe how you did it in the methods section and to reduce bias in the results by having the extractors work independently.

As mentioned earlier, you should also pilot your data extraction tool. To do this, have your extractors work on 2-3 articles to start. Compare answers and ensure that the results are similar. If your tool is lacking interrater reliability, it may be a sign that the extractors aren't understanding the data items the same way, and that definitions may need to be clarified.

More Advice

Recommendations for the extraction, analysis, and presentation of results in scoping reviews
Pollock, D., et al. (2023). JBI Evidence Synthesis, 21(3), 520-532. https://doi.org/10.11124/JBIES-22-00123
Cochrane Handbook - Chapter 5, Collecting Data
Büchter, R. B., Weise, A., & Pieper, D. (2020). Development, testing and use of data extraction forms in systematic reviews: A review of methodological guidance.
BMC Medical Research Methodology, 20(1), 259. https://doi.org/10.1186/s12874-020-01143-3

Examples of Evidence Tables

NICE Examples of evidence tables
The National Institute for Health and Care Excellence provides examples of evidence tables for both quantitative and qualitative studies, with definitions for each data item.
Completing ‘Summary of findings’ tables and grading the certainty of the evidence
The Cochrane Handbook provides guidance for choosing data items to extract and for building the evidence table.

Data Extraction Templates

Data Extraction from the John A. Burns School of Medicine Health Sciences Library
The John A. Burns School of Medicine Health Sciences Library at the University of Hawaii shares a few examples of templates for data extraction.
Data Collection form for RCTs from Cochrane

Can AI Help?

Yes! There are several tools that can help with data extraction. However, as with all AI products, please always verify the results-it is not a substitute for human reviewers.

Many of the tools recommended on the following tabs have been reviewed for their usefulness (Blaizot, et al, 2022; Khalil, 2022).

Covidence

Covidence helps automate several stages of a systematic and scoping review. Covidence uses both "study data from external repositories" and a "large language model extraction from full text articles" to collect data.

Learn More About Covidence

Covidence Self-Paced Workshop
Our self-paced and asynchronous D2L workshop makes it easy and convenient to learn how to use Covidence. The entire workshop takes about 1-2 hours to complete.
Ask a Librarian
If you need further assistance after taking the workshop or would prefer a live group training, please Ask a Librarian to set up a consultation/training.
Covidence Knowledge Base
Covidence has an extensive Knowledge Base with instructions and videos to help you learn how to use all of Covidence's features.
How to complete data extraction for a study using Extraction 1
This article from the Covidence Knowledge Base goes over how to complete data extraction for a study using Extraction 1.
Register for Covidence
To access Covidence, request an invitation using the above link and entering your mgh.harvard.edu or mghihp.edu e-mail address. Do not use your partners.org address.

Systematic Review Data Repository (SRDR+)

The Systematic Review Data Repository (SRDR+) is a collaborative, Web-based repository of systematic review data. This resource serves as both an archive and data extraction tool and is shared among organizations and individuals producing systematic reviews worldwide, enabling the creation of a central database of systematic review data which may be critiqued, updated, and augmented on an ongoing basis. This database is freely accessible to facilitate evidence reviews and thus improve and speed up policy-making with regards to healthcare.

Learn More About How to Use SRDR+ for Data Extraction

SRDR+
Access the SRDR+ site to create an account.

DistillerSR

DistillerSR is similar to Covidence in that it is a review management tool. DistillerSR "automates the management of literature collection, screening, and assessment using AI and intelligent workflows."

Regarding the data-extraction component, DistillerSR says, "Simplify data extraction through templates and configurable forms. Extract data easily with in-form validations and calculations, and easily capture repeating, complex data sets."

Please note that DistillerSR is a paid subscription.

Below is a video walkthrough from Evidence Partners on how to export your references and data from DistillerSR

DistillerSR
Learn more about DistillerSR and how it can help with Systematic Reviews.

MGH RedCap

Our final tool for data extraction that we wanted to introduce is MGH RedCap. While this is not an AI tool, it is a software application that can be used for electronic collection and management of research and clinical study data. More often used for surveys, but some of our faculty and students have used it to build data extraction forms.

MGH RedCap
Software application for electronic collection and management of research and clinical study data. More often used for surveys, but some of our faculty and students have used it to build data extraction forms.