Research Projects Directory

Research Projects Directory

Information about each research project within the Workbench is available in the Research Projects Directory below. Approved researchers provide their project’s research purpose, description, populations of interest and more. This information helps All of Us ensure transparency on the type of research being conducted.

At this time, all listed projects are using data in the Registered Tier. The Registered Tier contains individual-level data from electronic health records, survey answers, and physical measurements. These data have been altered to protect participant privacy.

Note: Researcher Workbench users provide information about their research projects independently. Any views expressed in the Research Projects Directory belong to the relevant users and do not necessarily represent those of the All of Us Research Program.

Information in the Research Projects Directory is also cross-posted on AllofUs.nih.gov in compliance with the 21st Century Cures Act.

There are currently 291 active workspaces. This information was updated on 12/5/2020.

Sort By Title:

Scratch Workspace

Project Purpose(s)

  • Educational ...

Scientific Questions Being Studied

This scratch workspace will be used to explore the All of Us data to assess the current and future feasibility of gene-environment mental health studies on a diverse population. Eventually, I hope to leverage this data to design a risk prediction model for major depression that takes into account genetic risk and environmental factors.

Scientific Approaches

Genomics, health record, survey, and wearable health device data may be used. Genomics data may be used to perform a genome-wide association study, and machine learning will be used for the remaining environmental data. Genetic risk factors may then be aggregated with the environmental models for a more complete risk scoring system.

Anticipated Findings

Anticipated findings for this study lie mostly in assessing the feasibility for what is described above, both on the short- and long-term. This workspace will be used for exploratory purposes.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Gordon Ye - Undergraduate Student, University of California, San Diego

Self-reported fractures in RA

Project Purpose(s)

  • Disease Focused Research (rheumatoid arthritis) ...

Scientific Questions Being Studied

We seek to determine the accuracy of self-reported fractures in both men and women with rheumatoid arthritis. We will describe variations in the validity of self-reported fracture according to sex, race, socioeconomic factors, and other clinical factors. Fragility fractures are a significant cause of morbidity and mortality in patients with rheumatoid arthritis. Self-report of fractures is often used to identify fracture outcomes in large-scale studies of fracture prevention or treatment; hence, research findings related to fracture are predicated on the accuracy of the self-report of fractures. In the general population, there is evidence of variation in accuracy of self-reported fractures by gender, race, age, and clinical characteristics such as smoking status and BMI. However, to the best of our knowledge, there are no prior studies evaluating the validity of self-reported fractures in rheumatoid arthritis patients, a particularly vulnerable population for fractures.

Scientific Approaches

For all persons with rheumatoid arthritis and a self-report of fracture in the last 5 years from the enrollment survey, their EMR data will be searched by querying for all ICD-9 and ICD-10 codes correlating to new incident fracture in the 5 years prior to survey completion. Descriptive analyses of baseline characteristics (age, race, annual household income, clinical characteristics) will be presented for all persons with self-reported fracture, stratified by whether that fracture was confirmed or unconfirmed in the EMR. To assess the accuracy of self-reported fractures, confirmation rates, overall and according to covariate levels, will be calculated as the proportion of self-reported fractures with a fracture confirmed at any site. False positive rates and positive predictive values were similarly computed. Logistic regression models will be used to compute odds ratio (OR) and 95% confidence interval (95% CI) for predictors of unconfirmed fractures (i.e. false positives).

Anticipated Findings

In previous studies of self-report fracture validation in the general population, the rate of false positives ranges widely from 5-50% depending on the fracture site. We anticipate there will be lower rates of false positive self-reports of fracture in rheumatoid arthritis patients because they are being followed more closely for musculoskeletal complaints. While men, black women, and lower educational attainment among other clinical characteristics have been associated with increased false positive rates of self-reported fractures in the general population, it is unclear if this will translate to the rheumatoid arthritis patient population. Our study will be the first to characterize validity of self-reported fractures specifically in rheumatoid arthritis patients, a particularly vulnerable population for fracture.

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Education Level
  • Income Level

Research Team

Owner:

  • Deepak Nag Ayyala - Early Career Tenure-track Researcher, Augusta University

Self-reported knowledge of family history of disease

Project Purpose(s)

  • Population Health
  • Methods Development ...
  • Ancestry

Scientific Questions Being Studied

Family history is used as a screening tool to refer patients for predictive and diagnostic genetic testing; reimbursement of said testing is often also based upon a family history of disease. Therefore, individuals who are less knowledgeable about their family history of disease are less likely to qualify for genetic counseling and/or testing.

I will seek to fulfill the following research aims:
1. Explore patterns of missingness in survey data in order to better understand populations that may or may not be represented among AoU survey respondents.
2. Characterize survey respondents with different levels of self-reported knowledge of a family history of disease, as indicated by the survey question "How much do you know about illnesses or health problems for your parents, grandparents, brothers, sisters, and/or children?"
3. Test whether self-reported family history of disease knowledge is associated with uptake of preventive screenings and/or genetic counseling.

Scientific Approaches

Inclusion criteria:
- Adult All of Us survey participants
Exclusion criteria:
- None

Methods:
I will compare the characteristics of survey respondents who completed "The Basics" survey with those who completed other AoU surveys, those who consented to linkage of their EHR data, and those who completed other AoU Research Program activities.

I will then characterize study participants who completed the family health questionnaire. Survey and EHR data will be linked. Those with different levels of self-rated family history knowledge will be compared by their sociodemographic characteristics, overall health, and access to healthcare using descriptive statistics and/or regression analyses. Use of genetic counseling, medical genetics, and genetic testing services, and routine preventive screenings will be assessed as a function of self-rated family history knowledge using univariate and multivariate regression modeling.

Anticipated Findings

I anticipate that individuals who completed only The Basics survey will differ from the populations that have high survey completion, consent to EHR linkage, and participate in other aspects of the AoU Research Program.

I also anticipate that individuals with lower socioeconomic status will be less likely to endorse higher levels of self-rate family history knowledge. Lower self-rated family history knowledge will be associated with less uptake of preventive screenings and genetic counseling and/or testing services, compared to higher self-rated family history knowledge.

Use of family history as a primary screening tool to determine who receives genetic testing may be inherently flawed if knowledge of one's family history of disease differs amongst different populations. This study will provide new insights into whether family history risks assessments may miss groups of individuals who might benefit from genetic testing because of poor family history knowledge.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Leland Hull - Early Career Tenure-track Researcher, The Broad Institute

Sensory Impairment & Access to Care

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

People with sensory impairment (vision and/or hearing) report greater barriers to access to care, and have different health care utilization patterns, including greater medical costs and hospitalizations. We are exploring the dataset to examine questions related to how people with sensory impairment perceive their access to care and differences in their health care utilization.

Scientific Approaches

The EHR and survey data will be used to examine how self-reported sensory impairment (vision and/or hearing) can impact access to care (i.e., unmet health care needs) and health care utilization (e.g. emergency room visits, preventive care utilization). Research methods include cross-sectional and longitudinal analyses.

Anticipated Findings

Based on previous literature, people with sensory impairment may have more barriers to access to care than people without. However, no studies have explored these associations in a data set that focused on underrepresented populations. Moreover, the linked survey and EHR data provides many opportunities for a wide range of outcomes.

Demographic Categories of Interest

  • Age
  • Disability Status
  • Access to Care

Research Team

Owner:

  • Lama Assi - Graduate Trainee, Johns Hopkins University

Sensory Impairment & Access to Care

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

People with sensory impairment (vision and/or hearing) report greater barriers to access to care, and have different health care utilization patterns, including greater medical costs and hospitalizations. We are exploring the dataset to examine questions related to how people with sensory impairment perceive their access to care and differences in their health care utilization.

Scientific Approaches

The EHR and survey data will be used to examine how self-reported sensory impairment (vision and/or hearing) can impact access to care (i.e., unmet health care needs) and health care utilization (e.g. emergency room visits, preventive care utilization). Research methods include cross-sectional and longitudinal analyses.

Anticipated Findings

Based on previous literature, people with sensory impairment may have more barriers to access to care than people without. However, no studies have explored these associations in a data set that focused on underrepresented populations. Moreover, the linked survey and EHR data provides many opportunities for a wide range of outcomes.

Demographic Categories of Interest

  • Age
  • Disability Status
  • Access to Care

Research Team

Owner:

  • Lama Assi - Graduate Trainee, Johns Hopkins University

Sensory Impairment & Access to Care

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

People with sensory impairment (vision and/or hearing) report greater barriers to access to care, and have different health care utilization patterns, including greater medical costs and hospitalizations. We are exploring the dataset to examine questions related to how people with sensory impairment perceive their access to care and differences in their health care utilization.

Scientific Approaches

The EHR and survey data will be used to examine how self-reported sensory impairment (vision and/or hearing) can impact access to care (i.e., unmet health care needs) and health care utilization (e.g. emergency room visits, preventive care utilization). Research methods include cross-sectional and longitudinal analyses.

Anticipated Findings

Based on previous literature, people with sensory impairment may have more barriers to access to care than people without. However, no studies have explored these associations in a data set that focused on underrepresented populations. Moreover, the linked survey and EHR data provides many opportunities for a wide range of outcomes.

Demographic Categories of Interest

  • Age
  • Disability Status
  • Access to Care

Research Team

Owner:

  • Lama Assi - Graduate Trainee, Johns Hopkins University

Sepsis

Project Purpose(s)

  • Disease Focused Research (Sepsis)
  • Drug Development ...
  • Ancestry

Scientific Questions Being Studied

I wish to look at specific genes of lipid metabolism and how they influence development of sepsis

Scientific Approaches

Initially we will investigate the data using univariate statistics and evaluate associations between genes of lipid metabolism and incidence of sepsis. We will then perform Lasso regression to determine which covariates are predictors of sepsis development.

Anticipated Findings

We will determine whether certain lipid-related genes when present increase or decrease risk of sepsis development.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Faheem Guirgis - Early Career Tenure-track Researcher, University of Florida

Sepsis, Septic Shock

Project Purpose(s)

  • Disease Focused Research (primary bacterial infectious disease) ...

Scientific Questions Being Studied

Determine predictors of outcomes for adults admitted with sepsis and septic shock.
Evaluate hospital survival, organ failures, infections and complications

Scientific Approaches

Retrospective institutional administrative dataset
Descriptive, univariate and multivariate modeling approaches

Anticipated Findings

Identify population levels of variation in outcomes related to the acute management of sepsis and septic shock

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Access to Care
  • Income Level

Research Team

Owner:

  • Ivor Douglas - Senior Researcher, Denver Health

SGM Demo Project

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

SGM demonstration project

Scientific Approaches

Not available.

Anticipated Findings

SGM demonstration project

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Mitchell Lunn - Early Career Tenure-track Researcher, Stanford University

Sickle cell

Project Purpose(s)

  • Disease Focused Research (Benign hematologic diseases) ...

Scientific Questions Being Studied

What is the population of individuals with benign hematologic conditions in the All of Us research program. There are phenotyping algorithms that exist for benign hematologic conditions, and knowing how adequate these algorithms are within the AoU program could lend to knowledge about what the population.

Scientific Approaches

We will look through all the records that exist and utilize EHR records, such as ICD codes and laboratory values to look for individuals with benign hematological conditions.

Anticipated Findings

Demonstration of phenotyping algorithms for benign hematologic conditions can help demonstrate the power of the All of Us Research Database.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Robert Cronin - Early Career Tenure-track Researcher, Vanderbilt University Medical Center

Smoking Cessation among Rural and Non-Rural Residents

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

Rural populations in the United States have higher smoking prevalence rates than their urban counterparts and consume a higher number of cigarettes per day. In the last 10 years, the proportion of smokers in rural populations has remained stable or even increased, emphasizing the need for further efforts to assess smoking cessation treatment access in rural communities. The specific aim of the proposed research is to examine receipt of smoking cessation treatment among rural and non-rural individuals in the United States who are current or former smokers. We hypothesize that:
H1: Rural individuals will be less likely to have received smoking cessation treatment than urban individuals.
H1: Rural individuals who have a lower household income will be less likely to have received smoking cessation treatment than rural individuals who have a higher household income.

Scientific Approaches

The All of US Research Program Dataset Ver. 3 will be used, including information from EHR , survey data, and medications (buproprion) data. Chi-square tests will be performed. In addition, logistic methods will be used to obtain odds ratios and 95% confidence intervals.

Anticipated Findings

The study will provide information about receipt of smoking cessation treatment by rural and non-rural individuals.

Demographic Categories of Interest

  • Geography

Research Team

Owner:

  • Steven Coughlin - Late Career Tenured Researcher, Augusta University

SmokingCessation

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

Examine receipt of smoking cessation treatment among rural and non-rural individuals in the United States who are current or former smokers.

Scientific Approaches

Data to be used:

(Survey data: Demographics/the Basics)
Limit cohort to participants age 18 years or older
Variables of interest:
Age
Sex
Race
Hispanic ethnicity
Health insurance

(Survey data: Health Care Access/Utilization)
Rural residence
There are many reasons people delay getting medical care. Have you delayed getting care for any of the following reasons in the PAST 12 MONTHS?
You live in a rural area where distance to the health care provider is too far.•Yes•No•Don’t know

(Data: Procedures)

Smoking Cessation
Smoking cessation education
Smoking and tobacco use cessation counseling visit

(Data: Medications)
Bupropion
Varenicline
nicotine

Anticipated Findings

The specific aim of the proposed research is to examine receipt of smoking cessation treatment among rural and non-rural individuals in the United States who are current or former smokers. We hypothesize that:
H1: Rural individuals will be less likely to have received smoking cessation treatment than urban individuals.
H1: Rural individuals who have a lower household income will be less likely to have received smoking cessation treatment than rural individuals who have a higher household income.

Demographic Categories of Interest

  • Geography
  • Access to Care

Research Team

Owner:

  • Jie Chen - Late Career Tenured Researcher, Augusta University

Social Determinants and Healthcare Access in Eye Conditions

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

We are planning to explore disparities in healthcare access and utilization for patients with eye conditions across different demographic groups. We would like to evaluate risk of developing advanced/severe disease in different eye conditions, and understand how social determinants contribute to this risk while adjusting for other known risk factors. We are also interested in understanding the availability of social determinants of health data in this data repository compared to EHR clinical data warehouses alone.

Scientific Approaches

We will build cohorts of patients with various eye diseases (i.e. diabetic retinopathy, retinal vein occlusions, glaucoma, etc.). Then we will develop concept sets and extract data on outcomes (i.e. development of complications), as well as predictors including clinical data and social data. We will draw on survey data and EHR data within All of Us. When genomic data and wearable data become available, we are interested in evaluating those data sources as well. We will use statistical modeling and machine learning to generate predictive models.

Anticipated Findings

We anticipate that there may be differential risk for developing complications based on disparities in healthcare access and utilization for patients with eye conditions.

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Geography
  • Disability Status
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Sally Baxter - Research Fellow, University of California, San Diego

Collaborators:

  • Terrence Lee - Graduate Trainee, University of California, San Diego
  • John McDermott - Graduate Trainee, University of California, San Diego
  • Gordon Ye - Undergraduate Student, University of California, San Diego
  • Alison Chan - Graduate Trainee, University of California, San Diego
  • Bita Shahrvini - Graduate Trainee, University of California, San Diego
  • Bharanidharan Radha Saseendrakumar - Project Personnel, University of California, San Diego

Social Determinants and Healthcare Access in Glaucoma

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

We are planning to explore disparities in healthcare access and utilization for patients with glaucoma across different demographic groups.

Scientific Approaches

We will use survey data, EHR data (diagnosis codes), and statistical modeling.

Anticipated Findings

We anticipate that there may be gender-related or race-related disparities in healthcare access and utilization for patients with glaucoma.

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Geography
  • Disability Status
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Sally Baxter - Research Fellow, University of California, San Diego

Social determinants of Hypertension

Project Purpose(s)

  • Disease Focused Research (hypertension)
  • Population Health ...
  • Social / Behavioral

Scientific Questions Being Studied

We want to understand the social determinants of health(income, education, race/ethnicity, healthcare access, nativity status, geography) that are associated with hypertension among United States adults. This research question is important because hypertension is a major public health problem in the US. Despite the availability of affordable medications available to improve blood pressure control, significant progress has not been made. Prior studies have shown that there are disparities in hypertension prevalence, treatment, and control. We would like to explore the question further to understand how social determinants contribute to the burden of hypertension in the US.

Scientific Approaches

We use descriptive statistics to describe the sociodemographic characteristics of adults who have been diagnosed with hypertension. We will use inferential statistics to determine social determinants are associated with hypertension prevalence, treatment, and control among US adults.

Anticipated Findings

We anticipate that social determinants of health will be associated with hypertension prevalence, treatment, and control.

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Geography
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Yvonne Commodore-Mensah - Early Career Tenure-track Researcher, Johns Hopkins University

Collaborators:

  • Ruth-Alma Turkson-Ocran - Research Fellow, Johns Hopkins University
  • Jarratt Pytell - Research Fellow, Johns Hopkins University

Sociodemographics and Health Status of Sexual and Gender Minorities in the AoURP

Project Purpose(s)

  • Population Health
  • Other Purpose (This work is the result of an All of Us Research Program Demonstration Project. Demonstration Projects are efforts by the All of Us Research Program designed to meet the goal of ensuring the quality and utility of the Research Hub as a resource for accelerating precision medicine. This work has been approved, reviewed, and overseen by the All of Us Research Program Science Committee and Data and Research Center to ensure compliance with program policy.) ...

Scientific Questions Being Studied

The All of Us Research Program is poised to be an important resource for studying questions related to the health and well-being of sexual and gender minority (SGM) people. Our primary goal is to investigate the sociodemographic characteristics and health conditions/behaviors of SGM participants compared with non-SGM participants and determine whether previously-reported health disparities are replicated in the AoURP data repository.

In addition, while SGM people are typically studied as a single group (i.e. "lumped together") in population-level health research studies due to their relatively smaller proportion and statistical sample size, we aim to leverage the large sample of SGM participants included in the AoURP to investigate the health disparities that may be unique to sexual minority (SM) and gender minority (GM) subgroups.

Scientific Approaches

We will characterize SGM participants on the following sociodemographic variables – age, sexual orientation, sex assigned at birth, gender identity, race, ethnicity, geography, educational attainment, employment status, annual household income, and insurance coverage – using participant-provided information (PPI). We will report the standardized mean difference in each demographic variable between SGM and non-SGM participants. We will characterize the prevalence of the following health conditions among SM and GM participants – anxiety, asthma, cancer, cardiovascular disease, chronic kidney disease, depression, diabetes mellitus, HIV infection, hypertension, tobacco use, substance use disorder, and elevated body weight – using EHR data from in conjunction with associated PPI and physical measurement data (PMI). Logistic regression models will be used to generate an odds ratio for each health condition/behavior in each SM and GM subgroup using an appropriate non-SGM reference group.

Anticipated Findings

Sexual and gender minority (SGM) people face numerous health and health care disparities compared to their heterosexual and cisgender peers including greater rates of depression, human immunodeficiency virus (HIV) infection, certain cancers, and tobacco use. AoURP collects sexual orientation, gender identity, and sex assigned at birth from all participants, and the comprehensiveness of this data collection is unique among federal research efforts. As the field of SGM health research grows, the AoURP data resource will undoubtedly serve as a highly valuable resource for studying SGM health as well as SGM health care disparities. This project will represent an important description of the sociodemographic and health condition/behavior differences between SGM and non-SGM people using a large, national research dataset containing participant-reported, EHR, and physical measurement data, and demonstrate the diversity and utility of AoURP data for SGM-related health research.

Demographic Categories of Interest

  • Gender Identity
  • Sexual Orientation
  • Geography
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Kate Vitale - Project Personnel, Stanford University

Collaborators:

  • David Schlueter - Other, All of Us Program Operational Use
  • Roxana Loperena Cortes - Other, All of Us Program Operational Use
  • Francis Ratsimbazafy - Other, All of Us Program Operational Use
  • Mitchell Lunn - Early Career Tenure-track Researcher, Stanford University
  • Haley Hedlin - Project Personnel, Stanford University
  • Andrea Ramirez - Other, All of Us Program Operational Use
  • Annesa Flentje - Early Career Tenure-track Researcher, University of California, San Francisco

Solid tumor Research

Project Purpose(s)

  • Disease Focused Research (Solid tumors)
  • Population Health ...

Scientific Questions Being Studied

Cancer is the second most common cause of death in the United States. More than 600,000 Americans are expected to die of cancer in 2020. About 58% of these deaths could be prevented if cancer is detected at an early stage. At present population based screening tests are in place for early diagnosis of cancer. It is in part responsible for an overall reduction in cancer rates by 25% from 1990 to 2005. However population based screening still remains to be an imperfect method. In many parts of the world, participation of subjects in screening programs are at a lower level than desired. So additional methods for early cancer detection need to be employed. Machine learning algorithms have shown to help improve early detection of cancer. Our research effort aims to identify, evaluate and validate machine learning algorithms to predict the incidence, prognosis and complications of cancer, so as to create a more proactive approach to the management of cancer.

Scientific Approaches

The purpose of this study is to utilize machine learning algorithms to predict the incidence, prognosis and complications of solid tumors in adults.
Aims:
- to create an alternative and efficient screening tool for cancer detection
-to diagnose solid tumors at an early stage
-to reduce cancer morbidity and mortality
-to utilize lab values and basic patient information to detect people at risk for cancer
-to reduce healthcare costs, in the long term

Anticipated Findings

As a result of this study we anticipate detection of cancer at an early stage and thereby reduce the morbidity and mortality associated with it. All significant findings will be published in a high-impact journal and presented at academic conferences. The results of this study may give way to a new screening test for cancer.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Bijun Kannadath - Early Career Tenure-track Researcher, University of Arizona

Collaborators:

  • Jiali Ling - Project Personnel, University of Arizona

South Asian cohort

Project Purpose(s)

  • Disease Focused Research (Cardiometabolic diseases)
  • Population Health ...
  • Social / Behavioral

Scientific Questions Being Studied

Understanding health disparities in dis-aggregated Asian American populations, particularly for chronic disease management and prevention, specifically cardiometabolic diseases.

Scientific Approaches

Standard epidemiologic methods, including descriptive statistics and statistical inference testing, will be used to analyze the data.

Anticipated Findings

The findings are expected to reveal heterogeneity in disaggregated Asian American health status that is masked in other datasets when Asian American participants are aggregated into one 'Asian' group.

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Nilay Shah - Early Career Tenure-track Researcher, Northwestern University

South Asian cohort

Project Purpose(s)

  • Disease Focused Research (Cardiometabolic diseases)
  • Population Health ...
  • Social / Behavioral

Scientific Questions Being Studied

Understanding health disparities in dis-aggregated Asian American populations, particularly for chronic disease management and prevention, specifically cardiometabolic diseases.

Scientific Approaches

Standard epidemiologic methods, including descriptive statistics and statistical inference testing, will be used to analyze the data.

Anticipated Findings

The findings are expected to reveal heterogeneity in disaggregated Asian American health status that is masked in other datasets when Asian American participants are aggregated into one 'Asian' group.

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Nilay Shah - Early Career Tenure-track Researcher, Northwestern University

South Asian cohort

Project Purpose(s)

  • Disease Focused Research (Cardiometabolic diseases)
  • Population Health ...
  • Social / Behavioral

Scientific Questions Being Studied

Understanding health disparities in dis-aggregated Asian American populations, particularly for chronic disease management and prevention, specifically cardiometabolic diseases.

Scientific Approaches

Standard epidemiologic methods, including descriptive statistics and statistical inference testing, will be used to analyze the data.

Anticipated Findings

The findings are expected to reveal heterogeneity in disaggregated Asian American health status that is masked in other datasets when Asian American participants are aggregated into one 'Asian' group.

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Nilay Shah - Early Career Tenure-track Researcher, Northwestern University

SPADE project

Project Purpose(s)

  • Drug Development
  • Methods Development ...

Scientific Questions Being Studied

Our central hypothesis is that various data (clinical, race, ethnicity, socioeconomic, medical history, health literacy, healthcare access, and lifestyle) collected within the All of Us program will allow us to effectively characterize ADE profiles among participants receiving chronic disease medications. Specific aims:
1. Determine the prevalence, specific types and characteristics of ADEs among participants who are receiving chronic disease medications.
2. Derive and validate a prediction model to identify characteristics that are associated with ADEs related to selected chronic disease medications.

ADEs are a serious public health problem. As the United States is growing older and more diverse, it is important to investigate the characteristics associated with ADEs. Through a diverse group of participants enrolled in All of Us, results of this study will allow the development of individualized plans for prevention of ADEs.

Scientific Approaches

In this nested case control study, we will evaluate patients who have taken drugs with known polymorphisms that result in clinically significant variation in drug response. Twenty drugs used to treat chronic disease states were selected due to the availability of clinically actionable pharmacogenetics recommendations and widespread use within our clinical setting at UCI as well as across the United States. Available data will be reviewed to examine the prevalence, specific types, and characteristics of ADEs among participants who are receiving one of the above chronic disease medications. Cases and matched controls will be selected from the database. To determine potential ADE risk factors, data about exposure to risk factors will be collected via abstraction from records. Once the descriptive analysis for all 20 drugs is complete, we will select up to three drugs to derive and validate prediction models for ADEs.

Anticipated Findings

Our study intends to fill a current research gap and present findings that will contribute an indispensable part of a future larger study that examines ADEs for association with both patient characteristics and pharmacogenetic information, when available, which will contribute an additional layer of information critical to the preventability of ADEs. While limitations such as inconsistency in recording of ADEs and risk factors is anticipated, advantages of this study are that cases and controls will be selected from a large participant pool (All of Us) rather than a specific site, which will
allow us to evaluate the impact of ADEs in a group that better mirrors the general patient population in the clinical setting.

Our long-term goal is to implement precision pharmacotherapy services that will be routinely used to assist in drug therapy decisions in clinical practice, incorporating various patient factors with genetic information to take a proactive approach to prevention of ADEs.

Demographic Categories of Interest

  • Race / Ethnicity
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Christine Cadiz - Other, University of California, Irvine

Spectrum of tumors after breast cancer

Project Purpose(s)

  • Disease Focused Research (breast cancer)
  • Population Health ...
  • Other Purpose (Interested to know what secondary or recurrent tumors developed after breast cancer and are there any demographic/behavioral/medications/medical conditions that modify/influence the risk of secondary tumors (recurrent/new tumors))

Scientific Questions Being Studied

Interested to know what secondary or recurrent tumors developed after breast cancer and are there any demographic/behavioral/medications/medical conditions that modify/influence the risk of secondary tumors (recurrent/new tumors)

Scientific Approaches

Plan to
Create a cohort of all participants diagnosed with breast cancer
This is case only study
For the above cohort collect all tumors that happened after diagnosis of breast cancer
Use cox model in order to determine the risk of subsequent tumors (outcome will be time to the second tumor)
Since the range of second/recurrent tumors is large we propose to look first at the spectrum of tumors and then do analyses for specific second tumors like a recurrence
Need to examine what are the factors associated with the second tumor

Anticipated Findings

We anticipate to determine guidelines for risk of second tumors after diagnosis of breast cancer

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Argyrios Ziogas - Late Career Tenured Researcher, University of California, Irvine

Collaborators:

  • Steven Hiek - Project Personnel, University of California, Irvine
  • Kathryn Campbell - Project Personnel, University of California, Irvine

Staph GWAS

Project Purpose(s)

  • Disease Focused Research (staphylococcus infection)
  • Control Set ...
  • Ancestry

Scientific Questions Being Studied

The question is: Are genetic variant (as identified by GWAS) associated with susceptibility to staphylococcus infection reproducible in the All Of Us cohort

The All of US cohort will be used as replication cohort for a GWAS on staphylococcus infection done on the Personalized Medicine Research Cohort at the Marshfield Clinic Research Institute.

Scientific Approaches

The All of US cohort will be used as replication cohort for a GWAS on staphylococcus infection done on the Personalized Medicine Research Cohort at the Marshfield Clinic Research Institute.

data set:
cases are participants with staphylococcus infection
control are age/sex/bmi matched particpants with no record of staphylococcus infection

method:
GWAS

Anticipated Findings

The finding would confirm genetic susceptibility associated with staphylococcus infection. The confirmed hits will be further studies in functional assays to determine the molecular pathway associated with the genetic susceptibility.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Patrick Allaire - Research Associate, Marshfield Clinic Research Institute

Statin-Myopathy

Project Purpose(s)

  • Educational ...

Scientific Questions Being Studied

Statins are widely prescribed for lowering cholesterol levels to help reduce the risk of strokes and heart attacks. One severe side effect of statins is myopathy that results in muscle pain. This study will characterize the incidence of myopathy in users of statins in the AoU cohort.

Scientific Approaches

We will define the study cohort to include any participant who was prescribed a statin. For this cohort we will create a data set that includes demographics, medications and diagnoses. In this data set we will estimate the incidence of myopathy.

Anticipated Findings

The incidence of statin myopathy is reported to be between 1.5–5.0%. We will compare the incidence of statin myopathy in the AoU cohort to the incidence that is reported in the literature.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Shyam Visweswaran - Mid-career Tenured Researcher, University of Pittsburgh

Stephanie H. Cook - Project 1

Project Purpose(s)

  • Disease Focused Research (CVD)
  • Population Health ...
  • Social / Behavioral

Scientific Questions Being Studied

Although cardiovascular disease (CVD) is the leading cause of death among men and women over the age of 65 in the United States, men account for more than half of the deaths due to CVD. Moreover, evidence suggests that cardiovascular disparities exist among men, such that those who identify as sexual minorities are more likely to experience poor cardiovascular health as compared to heterosexuals. The objective of this proposal is to examine physiological and behavioral mechanisms linking self-reported discrimination to ambulatory blood pressure, an intermediate outcome used to assess subclinical cardiovascular disease, in a racially diverse group of YSMM aged 18-35. The long-term objective of this research is to inform the creation of culturally relevant interventions to reduce the risk of CVD among YSMM. The proposed study specifically aims to:

Scientific Approaches

I will explore datasets that provide self report information on the constructs discussed above.
1. Descriptive statistics. Descriptive statistics for all variables will be calculated. Distributional properties will be examined using summary statistics (e.g., mean, median, SD) and graphs (e.g., histograms and box plots). I will confirm psychometric properties (e.g., internal consistency) of all scales. All management, basic analyses, and assumption testing will be done in Stata 16.23 Hypothesis testing will be conducted in Mplus v8.24
2. Multivariate linear regression (MLR) will be used to asses if discrimination predicts mean CVD risk.
3. MLR will be used to assess if each of the individual health behaviors of substance use, sleep , and physical activity predicts mean CVD risk.
4. Structural Equation Modeling (SEM) will be used. The Monte Carlo method for assessing mediation will be used

Anticipated Findings

the mechanisms linking discrimination and intermediate cardiovascular factors among racially diverse young sexual minority men (YSMM) remain underexplored. Conducting this research is vitally important considering the heightened rates of both stress from discrimination and subclinical cardiovascular disease among these potentially vulnerable populations. Secondly, understanding what the potential "buffers" are that mitigate the negative effects of intersectional discrimination on cardiovascular risk factors is critically important if we are going to create tailored interventions to reduce CVD among diverse YSMM.

Demographic Categories of Interest

  • Race / Ethnicity
  • Sexual Orientation

Research Team

Owner:

  • Stephanie Cook - Early Career Tenure-track Researcher, New York University

Collaborators:

  • Sallie Yassin - Graduate Trainee, New York University

Surgical Resilience

Project Purpose(s)

  • Methods Development ...

Scientific Questions Being Studied

Can the different trajectories of response to the stress of surgery among older adults be modeled over time using physiological and biochemical data?

Aging is known to be associated with decline in the function of all organ systems (Hall, 2020). The order of decline and impact on functional capacity varies (Townsend et al., 2016). While this fact alone would historically have precluded older adults from a range of surgical interventions, an increase in average adult lifespan and the recognition that surgical interventions on individuals over the age of 65 can be performed safely means that they now account for 40% of operations performed in the United States ( CDC, National Hospital Discharge Survey , 2010) .

Age-related decline in physiological function or homeostenosis (Davies, 2016) is thought to result in a state of vulnerability to minor stressors. While there are around 80 frailty tools, these do not capture the dynamic trait of resilience.

Scientific Approaches

Time series analysis of physiological and biochemical laboratory data to assess states and traits of resilience among surgical patients.

Resilience can be thought of as the ability of a system to return to a homeostatic baseline in response to stress (Hadley et al., 2017) compared with robustness which represents an ability to maintain system state within a fixed range in spite of stress. Based on these definitions it is clear that the study of resilience and robustness require multiple serial measures of physiological and functional parameters over time.

We will study the physiological and biochemical responses to stress at baseline, during and after a stressor- in this case surgery.

Anticipated Findings

There are limited data on longitudinal trends of physiological dysregulation among older surgical patients. We believe that using trended lab and vital sign data will provide information about the trajectories of robust and resilient individuals and allow better selection of surgical candidates and addressing of care needs.
We anticipate that certain patterns of low resilience will be associated with terminal decline in older adults. Among older adults who are classified as frail by existing instruments, many will still have enough resilience to tolerate surgical procedures

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • nikhil prasad - Research Fellow, University of Maryland, Baltimore

Systemic Disease and Glaucoma

Project Purpose(s)

  • Disease Focused Research (Primary open angle glaucoma)
  • Other Purpose (This work is the result of an All of Us Research Program Demonstration Project. Demonstration Projects are efforts by the All of Us Research Program designed to meet the goal of ensuring the quality and utility of the Research Hub as a resource for accelerating precision medicine. This work has been approved, reviewed, and overseen by the All of Us Research Program Science Committee and Data and Research Center to ensure compliance with program policy. ) ...

Scientific Questions Being Studied

We have previously published a predictive model of glaucoma progression using electronic health record (EHR) data pertaining to systemic attributes from a single institution. We aim to use the All of Us dataset to 1) serve as external validation for this single-center model and 2) to train new models focused on predicting glaucoma progression using systemic predictors. This is important to understand whether the original findings are generalizable and provide additional knowledge about the utility of systemic predictors on a national-level dataset.

Scientific Approaches

We plan to primarily work with EHR data contained in All of Us for a cohort of adult participants diagnosed with primary open-angle glaucoma. We will extract data on systemic conditions and medications for this cohort, as well as physical measurements and vital signs. We will clean the data such that the format is consistent with the data from our previously published model. Then, we will use this data as an external validation of a logistic regression model derived from our prior study that was based at a single academic center. Next, we will use All of Us data to train a new set of models, using techniques such as logistic regression, random forests, and artificial neural networks. We will optimize these models using feature selection methods and class balancing procedures. By evaluating performance metrics such as area under the curve (AUC), precision, recall, and accuracy, we will assess whether we can achieve superior predictive performance when training models using All of Us.

Anticipated Findings

We anticipate that the All of Us data will validate the findings from the model, which demonstrated that blood pressure-related metrics and certain medication classes had predictive value for glaucoma progression. In addition, we anticipate that the models trained with All of Us data will outperform the model trained with single institution data due to larger sample size and greater diversity. These findings will support further investigation in understanding the relationship between systemic conditions like blood pressure with glaucoma progression.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Bharanidharan Radha Saseendrakumar - Project Personnel, University of California, San Diego

Collaborators:

  • Tsung-Ting Kuo - Early Career Tenure-track Researcher, University of California, San Diego
  • Sally Baxter - Research Fellow, University of California, San Diego
  • Roxana Loperena Cortes - Other, All of Us Program Operational Use
  • Paulina Paul - Project Personnel, University of California, San Diego
  • Lucila Ohno-Machado
  • Luca Bonomi - Research Fellow, University of California, San Diego
  • Jihoon Kim - Project Personnel, University of California, San Diego
  • Francis Ratsimbazafy - Other, All of Us Program Operational Use

TBI Project

Project Purpose(s)

  • Disease Focused Research (Traumatic Brain Injuries)
  • Educational ...
  • Methods Development
  • Control Set

Scientific Questions Being Studied

In this workbench, I am trying to develop computational models to better predict the short-term and long-term outcome of the TBI patients. A huge number of people are diagnosed with this injury every year in the USA. The results of this research can significantly help physicians to better understand, predict, and control this injury.

Scientific Approaches

I am planning to use Machine Learning and Deep Learning tools to do the analysis. I am also focusing on using time series data.

Anticipated Findings

My research would probably lead to the better and new computational tools for the TBI outcome prediction.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Amin Nayebi - Graduate Trainee, University of Arizona

Collaborators:

  • Vignesh Subbian - Early Career Tenure-track Researcher, University of Arizona
  • Baran Balkan - Project Personnel, University of Arizona

Test

Project Purpose(s)

  • Other Purpose (Test) ...

Scientific Questions Being Studied

Test: to create a test project in order to demonstrate workbench to researchers from our institution who may be interested in leveraging the AoU data.

Scientific Approaches

Test: to create a test project in order to demonstrate workbench to researchers from our institution who may be interested in leveraging the AoU data.

Anticipated Findings

Test: to create a test project in order to demonstrate workbench to researchers from our institution who may be interested in leveraging the AoU data.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Kai Zheng - Mid-career Tenured Researcher, University of California, Irvine

Test

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

What are the secular trends on medication utilization among the various population based on age, location, ethnicity and race?

At what age do specific groups of individuals have to deal with poly-pharmacy. Poly-pharmacy is also known as taking multiple medications to treat one or more disease states.

Scientific Approaches

Descriptive data analysis will be used to describe the incidence of each group of individuals that use medication. The data will be broken down into age groups of 5 to 10 year increments and social/demographic information.

Anticipated Findings

The anticipated findings is that at patients grow older, they tend to have more medical problems that require more medication.

But, what is that age group and demographic look like. How has it changed or will it change over time. More and more medications are being produced annually. As patients take multiple medications, it puts them at risk for more adverse effect and drug interactions.

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Sex at Birth
  • Geography
  • Disability Status
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Lee Nguyen - Other, University of California, Irvine

test

Project Purpose(s)

  • Educational ...

Scientific Questions Being Studied

I'm exploring the workbench as part of an applied biomedical informatics graduate course. I will be conducting exploratory data analysis to gain a better understanding of the data available in the workbench.

Scientific Approaches

Overall I am reviewing tools, processes, and data across the longitudinal cohort of All of Us participants. This specific workbench will answer a few simple questions with the cohort builder.

Anticipated Findings

We want to understand what data are available from the All of Us workbench while at the same time gain familiarity with use of the cohort builder tool.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Barrett Jones - Graduate Trainee, Vanderbilt University

test

Project Purpose(s)

  • Educational ...

Scientific Questions Being Studied

Relationship of SES and incidence of Alzheimer's disease

Scientific Approaches

Case control study

Anticipated Findings

Low SES associated with increased Alzheimer's

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Shaista Malik - Mid-career Tenured Researcher, University of California, Irvine

Test

Project Purpose(s)

  • Population Health
  • Social / Behavioral ...
  • Educational

Scientific Questions Being Studied

social determinants of health - will further refine after gaining experience with the workspace

Scientific Approaches

tbd - will further refine after gaining experience with the workspace

Anticipated Findings

tbd - will further refine after gaining experience with the workspace

Demographic Categories of Interest

  • Race / Ethnicity
  • Geography
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Bryan Heckman - Mid-career Tenured Researcher, Meharry Medical College

Test

Project Purpose(s)

  • Population Health
  • Social / Behavioral ...
  • Educational

Scientific Questions Being Studied

social determinants of health - will further refine after gaining experience with the workspace

Scientific Approaches

tbd - will further refine after gaining experience with the workspace

Anticipated Findings

tbd - will further refine after gaining experience with the workspace

Demographic Categories of Interest

  • Race / Ethnicity
  • Geography
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Bryan Heckman - Mid-career Tenured Researcher, Meharry Medical College

test

Project Purpose(s)

  • Disease Focused Research (hypertension, alzheimers, heart disease)
  • Educational ...

Scientific Questions Being Studied

What is the link between risk factors for heart disease, like hypertension and development of Alzheimer's disease?

Scientific Approaches

Will do a case=control study, utilizing self reported disease conditions and genetic markers linked with Alzheimer's and hypertension

Anticipated Findings

Alzheimer's disease might be preventable if risk factors are controlled early in life.

Demographic Categories of Interest

  • Race / Ethnicity

Research Team

Owner:

  • Shaista Malik - Mid-career Tenured Researcher, University of California, Irvine

Test - concomitant colchicine-CYP3A4 inhibitor exposure

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

We will be exploring the data at this stage to study interactions between colchicine and drugs that inhibit cytochrome P450 3A4 . Clarithromycin is one drug but there are several others. The outcome of interest will be evidence of toxicity which will include lab measures and other indicators

Scientific Approaches

We plan to use a self-controlled pharmacoepidemiologic design comparing outcomes before and after exposure within patients. We might find this infeasible as we try to make progress. We will revise the approach if needed. We will compare outcomes to exposure to drugs that we expect will not have the outcome (negative controls).

Anticipated Findings

Colchicine toxicity is highly fatal and it is critical for researchers to identify which drug combinations place the patients at greatest risk. With only case evidence present in the scientific literature, and clinical trials unable to answer the question, researchers need to rapidly generate evidence to inform drug interaction decision support. This study will determine if it is feasible to use data from the All of Us Cohort to generate evidence for the research question.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Richard Boyce - Mid-career Tenured Researcher, University of Pittsburgh

Test Glaucoma

Project Purpose(s)

  • Disease Focused Research (glaucoma) ...

Scientific Questions Being Studied

glaucoma

Scientific Approaches

Not available.

Anticipated Findings

glaucoma

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Luca Bonomi - Research Fellow, University of California, San Diego

Test Workspace 2

Project Purpose(s)

  • Control Set ...

Scientific Questions Being Studied

Test

Scientific Approaches

Not available.

Anticipated Findings

Test

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Eric Song - Administrator, All of Us Program Operational Use

test_workspace_clv

Project Purpose(s)

  • Educational ...

Scientific Questions Being Studied

This is a test workspace to learn how to use the site. I am learning how to import selection of the dataset and use the notebooks for processing.

Scientific Approaches

This is a test workspace to learn how to use the site. I am learning how to import selection of the dataset and use the notebooks for processing.

Anticipated Findings

This is a test workspace to learn how to use the site. I am learning how to import selection of the dataset and use the notebooks for processing.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Colby Lewis V - Graduate Trainee, Columbia University

TGCT Exploration

Project Purpose(s)

  • Disease Focused Research (Testicular germ cell tumor (TGCT) epidemiology) ...

Scientific Questions Being Studied

I have been studying the epidemiology of TGCT for>20 years, principally using data from my local community. I am curious as to what I might be able to learn that is new by accessing TGCT information in All of Us.

Scientific Approaches

I plan to employ standard epidemiologic approaches such as estimating associations between lifestyle histories and TGCT. For example, I hope to be able to create the equivalent of a case-control study based on reported TGCT diagnoses in All of Us participants (cases) and All of Us participants who do not have TGCT diagnoses (controls).

Anticipated Findings

Ideally I will identify one or more new non-genetic risk factor for TGCT that has not yet been reported by prior research.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Stephen Schwartz - Mid-career Tenured Researcher, Fred Hutchinson Cancer Research Center

TGCT Exploration

Project Purpose(s)

  • Disease Focused Research (Testicular germ cell tumor (TGCT) epidemiology) ...

Scientific Questions Being Studied

I have been studying the epidemiology of TGCT for>20 years, principally using data from my local community. I am curious as to what I might be able to learn that is new by accessing TGCT information in All of Us.

Scientific Approaches

I plan to employ standard epidemiologic approaches such as estimating associations between lifestyle histories and TGCT. For example, I hope to be able to create the equivalent of a case-control study based on reported TGCT diagnoses in All of Us participants (cases) and All of Us participants who do not have TGCT diagnoses (controls).

Anticipated Findings

Ideally I will identify one or more new non-genetic risk factor for TGCT that has not yet been reported by prior research.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Stephen Schwartz - Mid-career Tenured Researcher, Fred Hutchinson Cancer Research Center

TGCT Exploration

Project Purpose(s)

  • Disease Focused Research (Testicular germ cell tumor (TGCT) epidemiology) ...

Scientific Questions Being Studied

I have been studying the epidemiology of TGCT for>20 years, principally using data from my local community. I am curious as to what I might be able to learn that is new by accessing TGCT information in All of Us.

Scientific Approaches

I plan to employ standard epidemiologic approaches such as estimating associations between lifestyle histories and TGCT. For example, I hope to be able to create the equivalent of a case-control study based on reported TGCT diagnoses in All of Us participants (cases) and All of Us participants who do not have TGCT diagnoses (controls).

Anticipated Findings

Ideally I will identify one or more new non-genetic risk factor for TGCT that has not yet been reported by prior research.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Stephen Schwartz - Mid-career Tenured Researcher, Fred Hutchinson Cancer Research Center

TGCT Exploration

Project Purpose(s)

  • Disease Focused Research (Testicular germ cell tumor (TGCT) epidemiology) ...

Scientific Questions Being Studied

I have been studying the epidemiology of TGCT for>20 years, principally using data from my local community. I am curious as to what I might be able to learn that is new by accessing TGCT information in All of Us.

Scientific Approaches

I plan to employ standard epidemiologic approaches such as estimating associations between lifestyle histories and TGCT. For example, I hope to be able to create the equivalent of a case-control study based on reported TGCT diagnoses in All of Us participants (cases) and All of Us participants who do not have TGCT diagnoses (controls).

Anticipated Findings

Ideally I will identify one or more new non-genetic risk factor for TGCT that has not yet been reported by prior research.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Stephen Schwartz - Mid-career Tenured Researcher, Fred Hutchinson Cancer Research Center

Training

Project Purpose(s)

  • Educational ...

Scientific Questions Being Studied

I would like to use this workspace purely for educational purposes only. It will be used to demonstrate to students various data analysis approaches using large datasets and to familiarize them with All of Us cloud storage workflow.

Scientific Approaches

To produce aggregate summary statistics and regression models for various measurement variables available in All of Us data.

Anticipated Findings

This exploratory analysis will enable us to explore heterogeneity in anthropometric measures among various racial-ethnic groups

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Amit Arora - Graduate Trainee, University of Arizona

Collaborators:

  • Yann Klimentidis - Mid-career Tenured Researcher, University of Arizona
  • Jo-el Banini - Undergraduate Student, University of Arizona
  • Heidi Steiner - Graduate Trainee, University of Arizona
  • Claire Jurecky - Undergraduate Student, University of Arizona

Transgender health conditions

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

Little is known about the general health of gender diverse communities (transgender and non-binary gender identities). We seek to characterize the health and well being of gender diverse individuals.

Scientific Approaches

To characterize the health of gender diverse individuals compared with cisgender individuals. The health conditions and beavhiors we will examine include: smoking, body size, alcohol use, substance use, mental health, stress, disability, HIV, hepatitis B and C, HPV, gallstones, hypertension, cholesterol, diabetes, heart disease, and receipt of preventative services (vaccinations and cancer screenings).

Anticipated Findings

This analysis will characterize the health and well-being of gender diverse communities in the US. This research can be used by clinicians and future researchers as targets for prevention and treatment.

Demographic Categories of Interest

  • Gender Identity

Research Team

Owner:

  • Sarah Jackson - Research Fellow, NIH

Transgender health conditions

Project Purpose(s)

  • Population Health ...

Scientific Questions Being Studied

Little is known about the general health of gender diverse communities (transgender and non-binary gender identities). We seek to characterize the health and well being of gender diverse individuals.

Scientific Approaches

To characterize the health of gender diverse individuals compared with cisgender individuals. The health conditions and beavhiors we will examine include: smoking, body size, alcohol use, substance use, mental health, stress, disability, HIV, hepatitis B and C, HPV, gallstones, hypertension, cholesterol, diabetes, heart disease, and receipt of preventative services (vaccinations and cancer screenings).

Anticipated Findings

This analysis will characterize the health and well-being of gender diverse communities in the US. This research can be used by clinicians and future researchers as targets for prevention and treatment.

Demographic Categories of Interest

  • Gender Identity

Research Team

Owner:

  • Sarah Jackson - Research Fellow, NIH

Traumatic brain injury

Project Purpose(s)

  • Disease Focused Research (Traumatic brain injury)
  • Educational ...

Scientific Questions Being Studied

Traumatic brain injury (TBI) causes a large burden of morbidity and mortality in the United States. There are multiple factors that may influence how individuals with TBI recover and interact with the healthcare system after the initial injury, including demographic factors (age, sex, gender, race, ethnicity, urban vs. rural locations, etc.), patient factors (medical history, family medical history, medications), and clinical factors (co-occurring injuries, surgeries, and medical treatments). The focus of this research is to identify factors that may contribute to individuals' post-injury time course including physical recovery, physical and mental health, and healthcare utilization, with the overall goal of identifying opportunities to improve health of patients who have sustained traumatic brain injuries.

Scientific Approaches

This research will seek to longitudinally characterize demographic and clinical information in patients with TBI, and then analyze how various factors are associated with post-injury health and healthcare utilization. The All of Us research program provides an unparalleled opportunity to evaluate these relationships on a large scale and in a diverse population of participants.

Anticipated Findings

It is anticipated that findings of this research will identify barriers to successful recovery after TBI. This could help physicians be more cognizant of these barriers and improve patient care, and help researchers uncover topics that require further investigation to further our understanding of TBI and post-injury health.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • David Barton - Research Fellow, University of Pittsburgh

Trends in the Use of Cancer Screening Tests

Project Purpose(s)

  • Population Health
  • Other Purpose (This work is a result of an All of Us Research Program Demonstration Project. The projects are efforts by the Program designed to meet the program's goal of ensuring the quality and utility of the Research Hub as a resource for accelerating discovery in science and medicine. This work was reviewed and overseen by the All of Us Research Program Science Committee and the Data and Research Center to ensure compliance with program policy, including policies for acceptable data access and use. ) ...

Scientific Questions Being Studied

This Demonstration Project assessed the availability of data needed to describe trends in the use of Prostate Specific Antigen (PSA) testing and mammography by age, race/ethnicity, and other demographic factors among US adults aged 40-89. We calculated the percentage of men receiving PSA tests and women undergoing mammography by year among All of Us participants. We then compared our findings with previously published estimates in order to demonstrate that data from the All of Us Research Program may be useful for analyses of these important cancer screening tests.

Scientific Approaches

For analyses of PSA testing, we included males aged 40-89, and searched for relevant codes for PSA in measurement and procedure datasets, including LOINC and CPT4 codes. For analyses of mammography, we included females aged 40-89, and searched for relevant CPT4 procedure codes. We then calculated the percentage of eligible participants who underwent each test by year, and examined trends by age group and race/ethnicity.

We collected demographic data on All of Us participants who had at least one PSA or mammography code in their electronic data, and compared the distrbution of sex, race, ethnicity, insurance status, education level, marital status, and geographic region to participants without any PSA or mammography codes in their record.

Anticipated Findings

For this study, we seek to leverage the electronic health record (EHR) and participant provided information to examine trends in the use of cancer screening tests by age, geography and race/ethnicity. We anticipate observing a decline in PSA testing in recent years, and a fairly consistent high use of mammography over time, with variation by age group and race/ethnicity.

Demographic Categories of Interest

  • Age
  • Geography

Research Team

Owner:

  • Mara Epstein - Early Career Tenure-track Researcher, University of Massachusetts Medical School

Collaborators:

  • Yanhua Zhou - Project Personnel, University of Massachusetts Medical School
  • Larry Garber - Other, Reliant Medical Group

Tutorial - Cancer

Project Purpose(s)

  • Disease Focused Research (cancer) ...

Scientific Questions Being Studied

Are there differences in cancer incidence and prevalence between SEER and AoU?

Scientific Approaches

Not available.

Anticipated Findings

No significant difference

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Jihoon Kim - Project Personnel, University of California, San Diego

Uncovering disease factors related to NF1, NF2 and Schwannomatosis

Project Purpose(s)

  • Disease Focused Research (neurofibromatosis type 1, neurofibromatosis type 2, schwannomatosis)
  • Drug Development ...
  • Methods Development
  • Ancestry

Scientific Questions Being Studied

We intend to study the relationship between genetic factors, health records, and the symptoms of neurofibromatosis type 1, type 2, and schwannomatosis (NF),. At the present, we are exploring the data to formalize a specific research question. We hope to identify specific predictive biomarkers or therapies for NF.

Scientific Approaches

We intend to use health record, survey, and (when available) genomic data from participants with NF and to analyze these datasets using statistical modeling and machine learning approaches for categorical and continuous data.

Anticipated Findings

We hope to and anticipate this study will increase our understanding of neurofibromatosis type 1, type 2, and schwannomatosis. Our findings would contribute to the body of scientific knowledge by revealing new biological causes for the symptoms associated with these disease, which may lead to new ways to treat these symptoms.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Research Team

Owner:

  • Robert Allaway - Senior Researcher, Sage Bionetworks

Uterine Fibroids

Project Purpose(s)

  • Disease Focused Research (Uterine Fibroids or Leiomyomas)
  • Population Health ...
  • Social / Behavioral
  • Methods Development
  • Other Purpose (“This work is a result of an All of Us Research Program Demonstration Project. The projects are efforts by the Program designed to meet the program's goal of ensuring the quality and utility of the Research Hub as a resource for accelerating discovery in science and medicine. This work was reviewed and overseen by the All of Us Research Program Science Committee and the Data and Research Center to ensure compliance with program policy, including policies for acceptable data access and use”.)

Scientific Questions Being Studied

From this analysis, we hope to observe if there are differences in age, racial and risk distribution between both fibroids diagnostic cohorts. We also will determine the odds of this condition in both cohorts, based on modifiable risk factors such as age, race and menopausal state and non-modifiable risk factors like obesity, use of hormones and smoking. These findings will be compared to findings from pre-existing data.

Scientific Approaches

Participants were eligible if they were assigned female sex at birth and had both Electronic Health Records (EHR) data and Patient Provided Information (PPI). Two fibroids cohorts were created based on:
1. Presence of at least one SNOMED code for uterine fibroids in their Electronic Health Records
2. Presence of one ICD-9 or CPT code for pelvic imaging (e.g., pelvic ultrasound) and one diagnostic code for uterine fibroids
Variables of interest were identified from the EHR and PPI and imported as concept sets into the notebook, otherwise they were created in the notebook. Data was analyzed in R software version 3.6.2, 2019.

Anticipated Findings

We anticipate that black females will have higher odds of fibroids compared to white women and that smokers will have lower odds of fibroids compared to non-smokers. While findings from this analysis are not novel, they validate existing knowledge and underscore the importance of the AoU data cohort in research. Furthermore, AoU cohort data represents females in the United States and Canada and, importantly, populations that are underrepresented in research.

Demographic Categories of Interest

  • Sex at Birth
  • Gender Identity
  • Sexual Orientation
  • Geography
  • Disability Status
  • Access to Care
  • Education Level
  • Income Level

Research Team

Owner:

  • Nyia Noel - Mid-career Tenured Researcher, Boston Medical Center

Collaborators:

  • Roxana Loperena Cortes - Other, All of Us Program Operational Use
  • Francis Ratsimbazafy - Other, All of Us Program Operational Use
  • Lizette Mendez - Project Personnel, Boston Medical Center
  • Confidence Achilike - Project Personnel, Boston Medical Center
  • Guohai Zhou - Early Career Tenure-track Researcher, Massachusetts General Hospital