Data Sources

Data Sources

ABOUT ALL OF US PARTICIPANTS

The All of Us Research Program aims to engage a community of one million or more volunteers who reflect the diversity of the United States.

This includes people who haven’t taken part in or have been left out of health research before. We recruit many types of participants—both those who are healthy and those experiencing illness—from across the country, including many communities that have historically been underrepresented in biomedical research.

These diverse participants contribute a wide variety of datatypes, including survey responses, measurements, biosamples, electronic health records (EHRs), and data from mobile health devices.

The All of Us Research Program’s Data and Research Center curates and validates these data as part of the data collection process.

Through the All of Us Research Hub, researchers can explore participant data by state, ethnicity, age, and more through our Data Snapshots or Data Browser.

We collect participant data from multiple sources, including surveys and electronic health records. We also collect biosamples, which we’ll use to generate genomic data (coming soon). As the program grows, additional data sources will become available.

Find out more about our protocol and data curation methodology, including our use of the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM).

SURVEYS

All participants are invited to complete surveys on the secure All of Us Participant Portal (learn more about how participants join). Initial surveys include questions on health and lifestyle. Additional surveys on more specific subjects are added regularly. You can view survey questions and learn more about where the questions come from with our Survey Explorer.

Survey responses are curated to protect personally identifiable information (PII). See the All of Us Registered Tier Data Dictionary for more information.

ELECTRONIC HEALTH RECORDS

Many participants agree to share their EHRs with the program. All of Us removes and/or generalizes personal identifiers from participants’ EHR data before adding this information to the Research Hub. See the All of Us Registered Tier Data Dictionary for more information.

We use the OMOP CDM to standardize all EHR data. Learn more about OMOP on our Methods page.

PHYSICAL MEASUREMENTS

Some participants will provide physical measurements at one of our All of Us Research Program partner locations. These measurements include: height, weight, BMI, waist circumference, hip circumference, blood pressure, and heart rate. Program staff also note participants’ pregnancy status and wheelchair use (if applicable) at the visit.

Explore aggregate data on our participants’ physical measurements in the Data Browser.

BIOSAMPLES

Some participants contribute blood, urine, and/or saliva samples for later analyses, including whole genome sequencing. These samples are stored in our biobank located at the Mayo Clinic. In the future, genomic data will become available on the Research Hub.

MOBILE HEALTH DATA (WEARABLE DEVICES)

All of Us participants can contribute information from wearable health tracking devices. Wearables track biometric data like heart rate and blood pressure.