Research Projects Directory

10,333 active projects

This information was updated 4/14/2024

The Research Projects Directory includes information about all projects that currently exist in the Researcher Workbench to help provide transparency about how the Workbench is being used. Each project specifies whether Registered Tier or Controlled Tier data are used.

Note: Researcher Workbench users provide information about their research projects independently. Views expressed in the Research Projects Directory belong to the relevant users and do not necessarily represent those of the All of Us Research Program. Information in the Research Projects Directory is also cross-posted on AllofUs.nih.gov in compliance with the 21st Century Cures Act.

Duplicate of SAS 101 Data Fundamentals

This workspace contains multiple SAS files to educate users on best practices for performing common SAS procedures and how to explore the All of Us dataset using SAS. This tutorial is meant to educate users on the general data structure…

Scientific Questions Being Studied

This workspace contains multiple SAS files to educate users on best practices for performing common SAS procedures and how to explore the All of Us dataset using SAS. This tutorial is meant to educate users on the general data structure and data model used by the All of Us program.

Project Purpose(s)

  • Educational

Scientific Approaches

No scientific approach is used in this workspace because it is meant for educational purposes only. We will cover all aspects of All of Us data availability and therefore will use most datasets available in the Workbench for demonstration.

Anticipated Findings

We do not anticipate any findings. Instead, we are educating Workbench users on the use of the SAS Studio application in the Workbench and on the data structure used by the All of Us Research Program, the OMOP Common Data Model.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Registered Tier

Research Team

Owner:

Duplicate of SAS 101 Data Fundamentals

This workspace contains multiple SAS files to educate users on best practices for performing common SAS procedures and how to explore the All of Us dataset using SAS. This tutorial is meant to educate users on the general data structure…

Scientific Questions Being Studied

This workspace contains multiple SAS files to educate users on best practices for performing common SAS procedures and how to explore the All of Us dataset using SAS. This tutorial is meant to educate users on the general data structure and data model used by the All of Us program.

Project Purpose(s)

  • Educational

Scientific Approaches

No scientific approach is used in this workspace because it is meant for educational purposes only. We will cover all aspects of All of Us data availability and therefore will use most datasets available in the Workbench for demonstration.

Anticipated Findings

We do not anticipate any findings. Instead, we are educating Workbench users on the use of the SAS Studio application in the Workbench and on the data structure used by the All of Us Research Program, the OMOP Common Data Model.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Registered Tier

Research Team

Owner:

Schizophrenia

Is there a significant correlation between early life environmental factors, such as childhood trauma or maternal infection, and the development of schizophrenia later in life? We chose Schizophrenia because it is a complex mental disorder that offers valuable insights into…

Scientific Questions Being Studied

Is there a significant correlation between early life environmental factors, such as childhood trauma or maternal infection, and the development of schizophrenia later in life? We chose schizophrenia because it is a complex mental disorder that offers valuable insights into the workings of the human brain and the complexities of mental health. Studying schizophrenia provides a unique opportunity to examine the intricate interactions between genetics, environment, and neurobiology that contribute to the development of psychiatric disorders. Understanding schizophrenia can have broader implications for mental health research and treatment strategies. Insights gained from studying schizophrenia can help unravel the mysteries of other mental illnesses and pave the way for more effective diagnostic tools and interventions. By studying schizophrenia, researchers can develop personalized medicine and targeted therapies that address the specific needs of individuals with psychiatric disorders.

Project Purpose(s)

  • Educational

Scientific Approaches

To dive deeper into the causes of schizophrenia, it is essential that we consider the environmental, genetic, and social aspects of the illness. We plan to use the All of Us data as well as well-vetted research journals to see if there is a correlation between early childhood or prenatal infections and/or trauma and the development of schizophrenia later in life.

Anticipated Findings

Based on previous research, we expect to find a direct correlation between prenatal or early childhood infection and/or adolescent trauma and the development of schizophrenia. Confirming this correlation would give the scientific community a better understanding of the causes of schizophrenia.

Demographic Categories of Interest

  • Age

Data Set Used

Registered Tier

Research Team

Owner:

  • Mariah Nessim - Undergraduate Student, Arizona State University

Collaborators:

  • Tara Tierney - Undergraduate Student, Arizona State University
  • Hafan Yan - Undergraduate Student, Arizona State University
  • Amariah Robinson - Undergraduate Student, Arizona State University

Blood biomarkers for detection of early-stage lung cancer

Research question: What is the diagnostic capability of blood biomarkers and their correlation with early-stage lung cancer detection, emphasizing the identification of specific biomarkers and their potential utility in improving early detection strategies? Ultimately, our efforts aim to transform how…

Scientific Questions Being Studied

Research question:
What is the diagnostic capability of blood biomarkers and their correlation with early-stage lung cancer detection, emphasizing the identification of specific biomarkers and their potential utility in improving early detection strategies?

Ultimately, our efforts aim to transform how we approach lung cancer diagnosis. We hope for earlier detection, better outcomes, and a brighter future for those at risk of this formidable disease.

We predict that there may be activation or presence of particular blood biomarkers which are a precursor to lung cancer, and that identifying and learning more in depth about them will allow us to potentially alter diagnosis and / or treatment of lung cancer.

Project Purpose(s)

  • Disease Focused Research (lung cancer)
  • Educational
  • Ancestry

Scientific Approaches

To address this issue, our research project aims to investigate the use of blood biomarkers to detect early-stage lung cancer. Biomarkers are biological molecules that can indicate the presence of a disease or condition. We want to understand how specific biomarkers are related to identifying lung cancer during its early stages. We aim to identify reliable indicators of the disease's onset by analyzing the biochemical signatures in blood samples. We will be investigating several biomarkers, including the epidermal growth factor receptor (EGFR), a tyrosine kinase receptor in the ERBB family.

Anticipated Findings

By identifying reliable biomarkers for early lung cancer detection, we can save lives and improve the quality of life for those affected. We may also re-examine previously identified potential biomarkers and whether they are in fact accurate for early identification of lung cancer. This research represents a significant effort to apply scientific knowledge to make tangible advancements that have profound implications for public health and clinical practice. Early detection of lung cancer can significantly improve the chances of successful treatment and survival rates, and we hope to contribute to this endeavor.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

Collaborators:

  • Ryan Kruize - Student, Arizona State University

NHPI cardiorenal workspace

We are studying genetic risk for cardiorenal disease, focusing on the genetic diversity of Native Hawaiian / Pacific Islander (NHPI) populations, which have been historically underrepresented in research and underserved in healthcare. We seek to understand and address the unique…

Scientific Questions Being Studied

We are studying genetic risk for cardiorenal disease, focusing on the genetic diversity of Native Hawaiian / Pacific Islander (NHPI) populations, which have been historically underrepresented in research and underserved in healthcare. We seek to understand and address the unique genetic vulnerabilities faced by these groups to empower precision approaches to drug discovery and healthcare.

Project Purpose(s)

  • Disease Focused Research (Cardiovascular and metabolic disease)

Scientific Approaches

We plan to use genome-wide association studies (GWAS) and regression modeling approaches to characterize disease prevalence and genetic risk architecture in NHPI people compared to other groups.

Anticipated Findings

We anticipate discovering that there is indeed a unique genetic basis for cardiometabolic disease risk in NHPI people, or the alternative that genetic risk is largely unchanged and disparities in health outcomes can be attributed to social determinants of health, which would warrant its own investigation.

Demographic Categories of Interest

  • Race / Ethnicity

Data Set Used

Controlled Tier

Research Team

Owner:

genetic analysis for afib, hf, and pfo

I will be looking into the mitochondrial genetic correlation to Atrial Fibrillation, Heart Failure, and Patent foramen ovale. I am trying to extract meaningful relationships that can help further the research in these fields and ultimately improve patient outcomes.

Scientific Questions Being Studied

I will be looking into the mitochondrial genetic correlation to Atrial Fibrillation, Heart Failure, and Patent foramen ovale. I am trying to extract meaningful relationships that can help further the research in these fields and ultimately improve patient outcomes.

Project Purpose(s)

  • Disease Focused Research (Atrial Fibrillation, Heart Failure, and Patent foramen ovale)

Scientific Approaches

Data used will be genetic data and clinical / observational data for patients with Atrial Fibrillation, Heart Failure, and Patent foramen ovale.

Research will be done in Python Jupyter notebooks using an exploratory framework approach.

Anticipated Findings

We hope to uncover unique mitochondrial variants that could help explain cases of Atrial Fibrillation, Heart Failure, and Patent foramen ovale.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

  • Brian Tao - Graduate Trainee, Boston University

Collaborators:

  • Nathnaniel Fisher - Graduate Trainee, Boston University

New Alzheimer workspace

The role that APOE mutation plays in the progression of Alzheimer's, given the effects of lifestyle and genetics

Scientific Questions Being Studied

The role that APOE mutation plays in the progression of Alzheimer's, given the effects of lifestyle and genetics

Project Purpose(s)

  • Educational

Scientific Approaches

I will use data provided by the All of Us database, get assistance from the classroom TAs and ATAs, and discuss the research with my group.

Anticipated Findings

We anticipate that APOE mutation advances the progression of Alzheimer's, together with the effects of lifestyle and genetics. Our findings would help clarify the role of the APOE mutation.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

  • Juan Martinez - Undergraduate Student, Arizona State University

Collaborators:

  • Nathan Roue - Undergraduate Student, Arizona State University

Breast Cancer Study

We want to understand how hormones, like estrogen, may affect breast cancer risk in individuals. Do elevated levels of these hormones correlate with an increase in breast cancer risk?

Scientific Questions Being Studied

We want to understand how hormones, like estrogen, may affect breast cancer risk in individuals. Do elevated levels of these hormones correlate with an increase in breast cancer risk?

Project Purpose(s)

  • Educational

Scientific Approaches

We plan on comparing data of individuals who have hormone levels within standard range against those with elevated hormone levels. Based on this data, we would also want to investigate how many individuals of each group have or have had a diagnosis of breast cancer.
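
The comparison described above could be sketched as follows. This is a minimal illustration, not the project's actual analysis: the function names and the counts in the usage example are hypothetical placeholders, and a real analysis would also need to account for confounders such as age.

```python
def proportion(diagnosed, total):
    """Share of a group with a breast cancer diagnosis."""
    if total <= 0:
        raise ValueError("group must contain at least one individual")
    return diagnosed / total


def compare_groups(dx_elevated, n_elevated, dx_standard, n_standard):
    """Compare diagnosis proportions between the elevated-hormone group
    and the standard-range group (hypothetical helper for illustration)."""
    p_elevated = proportion(dx_elevated, n_elevated)
    p_standard = proportion(dx_standard, n_standard)
    return {
        "elevated": p_elevated,
        "standard": p_standard,
        # Absolute difference in diagnosis proportion between the groups.
        "difference": p_elevated - p_standard,
        # Risk ratio; undefined when the standard group has no diagnoses.
        "ratio": p_elevated / p_standard if p_standard > 0 else None,
    }


if __name__ == "__main__":
    # Placeholder counts, made up purely to show the call.
    print(compare_groups(30, 100, 15, 100))
```

A ratio above 1 would be consistent with a higher share of diagnoses in the elevated-hormone group, though it would not by itself establish a causal effect.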

Anticipated Findings

Since many cancers use hormones to grow more quickly, we would anticipate that individuals with elevated hormone levels would have a higher percentage of breast cancer diagnoses than those with hormone levels in standard range.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

Tukey's 1-df robust regression

Question: How can gene-gene and gene-environment interactions be effectively incorporated into statistical hypothesis testing methods for association studies with continuous outcomes of different characteristics? This question is crucial because understanding these interactions can significantly enhance our ability to identify SNPs…

Scientific Questions Being Studied

Question: How can gene-gene and gene-environment interactions be effectively incorporated into statistical hypothesis testing methods for association studies with continuous outcomes of different characteristics?

This question is crucial because understanding these interactions can significantly enhance our ability to identify SNPs that are closely related to disease and to fine-map these regions for better understanding of complex diseases. Recognizing these interactions allows researchers to address the complicated nature of genetic contributions to disease. This could be important for developing targeted interventions and personalized medicine, thus having significant implications for public health and the prevention and treatment of diseases.

Project Purpose(s)

  • Methods Development
  • Ancestry

Scientific Approaches

Datasets:

Real case-control data - Specifically, phenotype (health outcome) data and their related genetic and environmental covariates.

Research Methods and Tools:

Method and Tools: A robust testing method for Tukey’s 1-degree-of-freedom (df) model of interaction. This approach aims to efficiently exploit the linkage disequilibrium (LD) pattern among SNPs within a gene while accounting for interactions with other SNPs or environmental exposures. It works robustly for various kinds of continuous phenotypes.

Anticipated Findings

Anticipated Findings:
The novel hypothesis testing method is expected to demonstrate significant advantages in power with continuous outcomes of different characteristics over traditional methods, which are more sensitive to outliers and require Gaussian residual assumptions.
The use of robust testing with Tukey’s 1-df model for interactions should allow for a more parsimonious and statistically robust analysis, potentially reducing the complexity and computational demand of analyzing large datasets.

Contributions to Scientific Knowledge:
The development of a novel methodological approach to incorporate gene-gene and gene-environment interactions for various kinds of health outcomes in association studies could set a new standard for genetic analysis in complex disease research.
By revealing how specific genetic and environmental factors interact to influence disease risk, these findings could lead to a deeper understanding of disease etiology.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

DRAFT 2 GRAPHING RESEARCH MARIA PALMER

Questions such as whether allergies have increased or decreased over the last 50 years across various populations, sexes, and genders are essential to understanding the research data for this project.

Scientific Questions Being Studied

Questions such as whether allergies have increased or decreased over the last 50 years across various populations, sexes, and genders are essential to understanding the research data for this project.

Project Purpose(s)

  • Educational

Scientific Approaches

Creating graphs based on the question above, using research from this program, credible approved sites, and credible facts.

Anticipated Findings

Seeing which group is most impacted by allergies and whether allergies have increased or decreased over the years.

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Sex at Birth
  • Gender Identity
  • Sexual Orientation
  • Geography
  • Disability Status
  • Access to Care
  • Education Level
  • Income Level

Data Set Used

Registered Tier

Research Team

Owner:

  • Maria Palmer - Undergraduate Student, Arizona State University

Trajectory of Cystic Fibrosis

The scientific question my research group and I intend to study is, "how has the introduction of Trikafta altered the trajectory of treatment and life expectancy for cystic fibrosis patients compared to pre-existing medications?" This question is relevant to science…

Scientific Questions Being Studied

The scientific question my research group and I intend to study is, "how has the introduction of Trikafta altered the trajectory of treatment and life expectancy for cystic fibrosis patients compared to pre-existing medications?"
This question is relevant to science due to the prevalence of cystic fibrosis amongst the population and the lack of an official treatment to target the root cause of the disease thus far.

Project Purpose(s)

  • Educational

Scientific Approaches

Credible sources, including All of Us, will be used to formulate data to support the research question. I will be using mixed method research to collect various forms of data including diagnostic statistics, life expectancy trends over the years, quality of life versus drug impacts, and treatment trajectories. Tools used will be scientific journals, the All of Us database, as well as other databases.

Anticipated Findings

The anticipated findings are an increase in life expectancy as well as quality of life due to novel treatment options. The findings would contribute to increased knowledge and awareness of cystic fibrosis management and treatment methods.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

Collaborators:

  • Sierra Weatherly - Undergraduate Student, Arizona State University
  • Jamie Buckingham - Undergraduate Student, Arizona State University
  • Andrea Mitchell - Undergraduate Student, Arizona State University
  • Amaya Alvis - Undergraduate Student, Arizona State University

Heart Failure GWAS

Introduction: The All of Us dataset is a growing dataset of biomedical data from a diverse group of participants across the United States. This dataset contains more than 245,000 WGS samples as well as health data which includes specific conditions…

Scientific Questions Being Studied

Introduction:
The All of Us dataset is a growing dataset of biomedical data from a diverse group of participants across the United States. This dataset contains more than 245,000 WGS samples as well as health data, which includes specific conditions such as acute heart failure, cardiac arrhythmia, and atrial fibrillation. While there are many exciting questions to ask using the All of Us dataset, my question will attempt to identify variants in nuclear-encoded oxidative phosphorylation (OXPHOS) genes that are associated with heart failure and to identify putative causal variants among the associated variants. Previous studies have identified associations between heart failure phenotypes and mitochondrial dysfunction, but more investigation with a large dataset like All of Us may allow for discovery of new associated variants.

Project Purpose(s)

  • Disease Focused Research (heart disease)
  • Educational

Scientific Approaches

1. Label individuals as controls (non-heart failure phenotype) and cases (heart failure phenotype).
2. Quality control: HWE testing on controls, prune and test all variants for heterozygosity, test for non-random missingness, and sex checks.
3. PCA to determine population structure and look at patterns of clusters by race and phenotype of interest (heart-failure). I will include significant PCs as covariates.
4. Perform a GWAS using the REGENIE method. The model will be a logistic regression:
Logit(pi) = B0 + B1(Additively coded allele) + B2(sex) + B3(age) + … + Bi(PCj)

5. Significant SNPs will have a p-value <= 5E-8 and I will visualize using QQ and Manhattan plots.
6. Perform fine mapping to determine independent signals and determine putative causal variants.
7. Perform a gene set enrichment analysis on significant variants using the canonical MSigDB gene sets as well as a curated gene set of nuclear-encoded OXPHOS genes.
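
As an illustration of the HWE quality-control step in the list above, the following is a minimal sketch of a chi-square test for Hardy-Weinberg equilibrium from genotype counts. It is not the project's pipeline: the function name `hwe_chi_square` is hypothetical, and genetics tools often use an exact test instead of the chi-square approximation shown here.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test (1 df) for Hardy-Weinberg equilibrium.

    n_aa, n_ab, n_bb are the observed counts of the two homozygous
    genotypes and the heterozygous genotype at one biallelic variant.
    Returns (chi-square statistic, p-value).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes observed")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of the first allele
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    # Skip genotype classes with zero expected count (monomorphic sites).
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of a chi-square distribution with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


if __name__ == "__main__":
    # Counts are placeholders chosen only to show the call.
    print(hwe_chi_square(25, 50, 25))
```

In a case-control design such as the one described, a test like this would typically be run on controls only, since a true disease association can itself shift genotype frequencies in cases.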

Anticipated Findings

Identification of nuclear and mitochondrial variants associated with heart disease could provide lines of inquiry for follow-up validation studies on causal variants.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

Duplicate of Demo - Polygenic_Risk_Score_Genetic_Ancestry_Calibration

Polygenic risk scores (PRS) are available for a wide array of traits and conditions, offering many potential applications. There is a serious concern that clinical use of PRS could contribute to health disparities due to the poorer performance of PRS…

Scientific Questions Being Studied

Polygenic risk scores (PRS) are available for a wide array of traits and conditions, offering many potential applications. There is a serious concern that clinical use of PRS could contribute to health disparities due to the poorer performance of PRS in non-European ancestry individuals. We aim to improve our ability to correct the genetic ancestry-dependent bias in PRS for 10 conditions (Asthma, Atrial fibrillation, Breast Cancer, Chronic Kidney Disease, Coronary heart disease, Hypercholesterolemia, Obesity/BMI, Prostate cancer, Type 1 Diabetes, Type 2 Diabetes). We will use the AoU dataset to produce a resource that can be used to reduce the ancestry-dependent bias in these 10 PRS. The citation for this work is:
Lennon, N.J., Kottyan, L.C., Kachulis, C. et al. Selection, optimization and validation of ten chronic disease polygenic risk scores for clinical implementation in diverse US populations. Nat Med 30, 480–487 (2024). https://doi.org/10.1038/s41591-024-02796-z

Project Purpose(s)

  • Control Set

Scientific Approaches

Arrays will be imputed using the phasing and imputation tools Eagle2 and Minimac4. Polygenic risk scores will then be calculated using the population genomics tool PLINK. A simple linear model will then be fit to the scores, which attempts to describe the macroscopic relationship between genetic ancestry and observed polygenic scores. The fitted parameters of this model can then be used to reduce genetic ancestry-dependent bias when calculating these scores in a clinical setting.
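
The idea of fitting a linear model to the scores and reusing its parameters can be sketched as below. This is a simplified illustration under stated assumptions, not the published method: it uses a single ancestry principal component as the only predictor and adjusts only the mean of the score, and the names `fit_line` and `calibrate` are hypothetical.

```python
def fit_line(pcs, scores):
    """Ordinary least squares fit of score = intercept + slope * pc."""
    n = len(pcs)
    if n != len(scores) or n < 2:
        raise ValueError("need at least two paired observations")
    mean_pc = sum(pcs) / n
    mean_score = sum(scores) / n
    sxx = sum((x - mean_pc) ** 2 for x in pcs)
    if sxx == 0:
        raise ValueError("principal component values are all identical")
    sxy = sum((x - mean_pc) * (y - mean_score) for x, y in zip(pcs, scores))
    slope = sxy / sxx
    intercept = mean_score - slope * mean_pc
    return intercept, slope


def calibrate(scores, pcs, intercept, slope):
    """Subtract the ancestry-predicted mean from each raw score."""
    return [s - (intercept + slope * x) for s, x in zip(scores, pcs)]


if __name__ == "__main__":
    # Placeholder values, made up only to show the workflow.
    reference_pcs = [0.0, 1.0, 2.0, 3.0]
    reference_scores = [1.0, 3.0, 5.0, 7.0]
    intercept, slope = fit_line(reference_pcs, reference_scores)
    print(calibrate([4.0], [1.0], intercept, slope))
```

The fitted intercept and slope play the role of the reusable parameters: once estimated on a reference dataset, they can be applied to a new individual's score without refitting.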

Anticipated Findings

We will produce a set of fitted parameters for a simple model which attempts to describe the macroscopic relationship between genetic ancestry and observed polygenic scores. The fitted parameters of this model can then be used as a resource to reduce genetic ancestry-dependent bias when calculating these scores in a clinical setting.

Demographic Categories of Interest

  • Race / Ethnicity

Data Set Used

Controlled Tier

Research Team

Owner:

assignment

Exploring data at this stage to formalize a specific research question for a class assignment. My aim is to determine if a specific antibiotic (e.g., Doxepin) truly has an effect on symptoms.

Scientific Questions Being Studied

Exploring data at this stage to formalize a specific research question for a class assignment. My aim is to determine if a specific antibiotic (e.g., Doxepin) truly has an effect on symptoms.

Project Purpose(s)

  • Educational

Scientific Approaches

My data set will focus on one antibiotic (e.g., Doxepin), and focus on a demographic population (e.g., African American, Asian).

Anticipated Findings

I anticipate determining whether a specific antibiotic truly has effects on certain symptoms.

Demographic Categories of Interest

  • Race / Ethnicity

Data Set Used

Registered Tier

Research Team

Owner:

Thalassemia-BIO340

Researching more into Thalassemia for a college project. Thalassemia is a disease that can hold different severity levels and has complexities. Thalassemia is a disease that affects hemoglobin. There are two different forms: Alpha Thalassemia and Beta Thalassemia. With that…

Scientific Questions Being Studied

Researching more into Thalassemia for a college project. Thalassemia is a disease that can hold different severity levels and has complexities. Thalassemia is a disease that affects hemoglobin. There are two different forms: Alpha Thalassemia and Beta Thalassemia. With that being said, how do the different forms of Thalassemia affect individuals within their genetic makeup and phenotypic characteristics? Additionally, do the different forms of Thalassemia affect different lifestyle changes, such as diet, treatment, and physical activity?

Project Purpose(s)

  • Educational

Scientific Approaches

The school's library, Google Scholar, All of Us research data and NIH will be just a few databases that we will use to complete our research for our school project.

Anticipated Findings

We intend to learn the background of this disease, as we have little knowledge of it. Learning more about it is one of our goals. Additionally, answering our research questions will guide our research and help us expand those questions.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Registered Tier

Research Team

Owner:

  • India Cole - Undergraduate Student, Arizona State University
  • Emma Moody - Undergraduate Student, Arizona State University
  • Carlos Castaneda - Undergraduate Student, Arizona State University

HIV respiratory conditions

This project is in the early stages of formalizing a specific research question and methods. Broadly we will look at the respiratory health outcomes in people with HIV compared to a matched group of people without HIV, and assess how…

Scientific Questions Being Studied

This project is in the early stages of formalizing a specific research question and methods. Broadly we will look at the respiratory health outcomes in people with HIV compared to a matched group of people without HIV, and assess how these outcomes interact with cigarette smoking status and smoking behaviour. We hope this study will inform the services people with HIV need and develop smoking cessation treatment plans to improve respiratory health in people living with HIV.

Research questions:
1. What is the incidence of respiratory disease diagnoses, respiratory symptoms, and treatments for respiratory diseases among people living with HIV (PLWH) compared to people without an HIV diagnosis?
2. Do these respiratory health outcomes depend on cigarette smoking status, dependence, and smoking behaviour?
3. Are there differences in the causes of death due to smoking-related diseases (including respiratory diseases specifically) between PLWH who smoke and people who smoke without an HIV diagnosis?

Project Purpose(s)

  • Disease Focused Research (Human immunodeficiency virus infectious disease)
  • Population Health
  • Social / Behavioral

Scientific Approaches

Datasets:
1. People living with HIV and a matched group of people without HIV.
2. People living with HIV who smoke, and a matched group of people living without HIV who do not smoke.

Respiratory health outcomes to be measured
1. Prevalence and incidence of Chronic Respiratory disease (COPD)
2. Pulmonary function testing results
3. Respiratory symptoms (difficulty breathing, cough, phlegm production, use of inhalers)

Smoking related measures
1. Smoking status
2. Smoking cessation
3. Smoking reduction
4. Cigarette smoking dependence
5. Use of non-combusted nicotine products
6. Use of smoking cessation pharmacotherapy

Other variables measured (to be confirmed)
1. Race/ethnicity
2. Socio-economic position
3. Age
4. Sex at birth
5. Gender identity
6. Substance use

Anticipated Findings

This study will provide estimates of respiratory disease outcomes in PLWH who smoke and do not smoke, and explore how these outcomes are affected by smoking behaviour. These data will help inform understanding of whether respiratory outcomes are worse among PLWH who smoke, and how this relates to smoking behaviour.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

  • Loren Kock - Research Fellow, University of Vermont

oligodendroglioma

How does early detection of IDH mutation and 1p/19q co-deletion lead to effective prevention of oligodendroglioma?

Scientific Questions Being Studied

How does early detection of IDH mutation and 1p/19q co-deletion lead to effective prevention of oligodendroglioma?

Project Purpose(s)

  • Educational

Scientific Approaches

Genetically, this tumor is caused by IDH mutation and 1p/19q co-deletion. These tumors typically develop in the frontal lobe, leading to seizures. Seizures can decrease quality of life and, in more severe cases, can lead to death (Paleologos, 2019). One way to combat this is to understand genetic mutations and deletions. The medical field has started to analyze patients more thoroughly through biometric trends and genetic testing to allow for more personalized diagnosis and treatment. This could include a push for more preventative measures in healthcare. This research is intended to address the need for early detection through genetic analysis and preventative treatment plans for potentially affected patients. The goal of this research is to emphasize the need for preventative action against oligodendrogliomas.

Anticipated Findings

Knowing more about the benefits of testing for IDH mutation and 1p/19q co-deletion will help extend patients' life spans.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Registered Tier

Research Team

Owner:

  • Minh Pham - Undergraduate Student, Arizona State University
  • Hannah Sydow - Undergraduate Student, Arizona State University

Collaborators:

  • Kris Oshiro - Undergraduate Student, Arizona State University

Impact of social and environmental exposures on T2D Severity

How do social and environmental exposures affect the severity of Type 2 diabetes (T2D) in the All of Us Cohort ? We will explore how social burdens (food and housing insecurity, social isolation, discrimination, perception of security) and environmental exposures…

Scientific Questions Being Studied

How do social and environmental exposures affect the severity of Type 2 diabetes (T2D) in the All of Us Cohort?
We will explore how social burdens (food and housing insecurity, social isolation, discrimination, perception of security) and environmental exposures (built environments) influence the progression of T2D with genetic markers (Polygenic Risk Scores) associated with increased risk for T2D. By fostering a deeper understanding of chronic health's social and environmental drivers, we can empower communities to advocate for policies and initiatives that promote health equity and well-being that disproportionately affect the communities Federally Qualified Health Centers (FQHCs) serve.

Project Purpose(s)

  • Disease Focused Research (type 2 diabetes mellitus)
  • Population Health
  • Ancestry

Scientific Approaches

Population of Interest: Participants with 3+ years of EHR history at the same location and documented T2D in the EHR, or participants who have self-identified as having T2D on a survey.
The Outcome/Response Variable is the severity of T2D, based on A1C values over time (above/below a threshold). Predictors include social and environmental exposures (SDoH survey responses to specific questions of interest to the study, Area Deprivation Index from Neighborhood Atlas), treatment data (medication sequencing, density of EHR information) and other data as needed (Census, air quality, built environment, C-Reactive Protein, length of disease). Confounding Factors / Control Variables include Polygenic Risk Score (PRS) for diabetes, age, weight, and number of medications.
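The outcome definition above (A1C values over time, above or below a threshold) could be sketched roughly as follows. This is a minimal illustration and not the study team's method: the function name, the record layout, and the 7.0 cutoff in the example call are all assumptions, since the description does not state which threshold is used.

```python
from datetime import date

def classify_a1c_series(measurements, threshold):
    """Flag each A1C measurement as at/above or below a threshold.

    measurements: list of (date, a1c_percent) tuples for one participant.
    threshold: cutoff in percent; the project description does not give one,
               so the caller must supply it.
    Returns the measurements sorted by date, each with a boolean flag, plus
    the fraction of measurements at or above the threshold.
    """
    if not measurements:
        raise ValueError("at least one A1C measurement is required")
    ordered = sorted(measurements, key=lambda m: m[0])
    flagged = [(d, value, value >= threshold) for d, value in ordered]
    fraction_above = sum(1 for _, _, above in flagged if above) / len(flagged)
    return flagged, fraction_above

# Hypothetical participant with three A1C readings (values are made up).
readings = [
    (date(2021, 6, 1), 7.4),
    (date(2020, 1, 15), 6.8),
    (date(2022, 3, 10), 8.1),
]
# 7.0 is an illustrative cutoff only, not a value taken from the study.
flagged, fraction_above = classify_a1c_series(readings, threshold=7.0)
```

Keeping the threshold as a required argument makes it easy to rerun the classification under different cutoffs once the study team fixes one.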

Anticipated Findings

Identification of social and environmental risk factors associated with specific chronic disease.
Early intervention strategies based on knowledge of disease associated with specific social stressors and environmental factors, which may lead to early genetic testing, policy changes and educational programs.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

  • Soumya Kini - Project Personnel, The MITRE Corporation
  • Justin Hill - Project Personnel, The MITRE Corporation

Substance Use and Eye Disease

We want to determine the association between substance use and vision-threatening eye disease. We aim to determine how this cluster of conditions might impact eye disease.

Scientific Questions Being Studied

We want to determine the association between substance use and vision-threatening eye disease. We aim to determine how this cluster of conditions might impact eye disease.

Project Purpose(s)

  • Disease Focused Research (substance use, eye disease)

Scientific Approaches

We plan to create cohorts that will allow us to determine the association between substance use and eye diseases like epiphora. Depending on the associations found, we might perform subgroup analysis to determine the combined influence of other factors.

Anticipated Findings

There has been an association shown between substance use and eye disease risk. However, there is limited research with a data set this large. We do expect to find some association, but hope that the data from this database will allow more detailed conclusions.

Demographic Categories of Interest

  • Age
  • Education Level
  • Income Level

Data Set Used

Registered Tier

Research Team

Owner:

Oli

How does early detection of IDH mutation and 1p/19q co-deletion lead to effective prevention of Oligodendroglioma?

Scientific Questions Being Studied

How does early detection of IDH mutation and 1p/19q co-deletion lead to effective prevention of Oligodendroglioma?

Project Purpose(s)

  • Educational

Scientific Approaches

For example, if a patient can be tested early for IDH mutation and 1p/19q co-deletion, then perhaps they can complete screening MRI scans to determine if there is an abnormal white matter growth in their frontal lobe. Diagnosis of Oligodendroglioma is predicted to significantly increase in the United States. This research is intended to address the need for early detection through genetic analysis and preventative treatment plans for potentially affected patients. The goal of this research is to emphasize the need for preventative action against Oligodendrogliomas.

Anticipated Findings

Early detection of IDH mutation and 1p/19q co-deletion leads to effective prevention of Oligodendroglioma.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

  • Minh Pham - Undergraduate Student, Arizona State University

Wernicke's, Thiamine and alcohol use disorder

Analyze Wernicke encephalopathy, the genetic components of alcohol use disorder, and thiamine deficiency, and how all three relate to one another.

Scientific Questions Being Studied

Analyze Wernicke encephalopathy, the genetic components of alcohol use disorder, and thiamine deficiency, and how all three relate to one another.

Project Purpose(s)

  • Educational

Scientific Approaches

We hope to gather samples of participants with Wernicke encephalopathy, alcohol use disorder (AUD) and thiamine deficiency and see how those populations correlate with one another.

Anticipated Findings

I expect that those who have both thiamine deficiency and AUD are more likely to be susceptible to Wernicke encephalopathy.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Registered Tier

Research Team

Owner:

  • Cody Hamm - Undergraduate Student, Arizona State University

Introduction to healthcare access among those diagnosed w/ obesity

“Differences in healthcare access among those diagnosed with obesity.” The topic is relevant to public health, and the final research question that my partner and I developed centers on population health.

Scientific Questions Being Studied

“Differences in healthcare access among those diagnosed with obesity.” The topic is relevant to public health, and the final research question that my partner and I developed centers on population health.

Project Purpose(s)

  • Educational

Scientific Approaches

  • Identify the disparities in access to healthcare among individuals who are obese.
  • Understand the elements influencing variations in healthcare access according to socioeconomic, geographic, and demographic considerations (SDOH).
  • Investigate the effects of discriminatory access to healthcare on the health and quality of life of obese people.
  • Make recommendations for strategies and objectives that will reduce the disparities regarding this population's access to healthcare.

Anticipated Findings

When comparing individuals with obesity to those without the diagnosis, there are notable discrepancies in the healthcare access that these individuals receive; these variances are largely shaped by the individual’s social determinants of health. Including but not limited to a person's socioeconomic level, location, and health

Demographic Categories of Interest

  • Race / Ethnicity
  • Age
  • Sex at Birth
  • Gender Identity
  • Sexual Orientation
  • Geography
  • Disability Status
  • Access to Care
  • Education Level
  • Income Level

Data Set Used

Registered Tier

Research Team

Owner:

breast cancer srWGS asn euro afr replicate gwas

We are interested in studying genetic variants associated with breast cancer in Hispanic/Latinas (H/L). The specific questions are: Are there additional ancestry-specific genetic variants associated with breast cancer in H/L? Can we use these genetic variants to develop a better PRS…

Scientific Questions Being Studied

We are interested in studying genetic variants associated with breast cancer in Hispanic/Latinas (H/L). The specific questions are: Are there additional ancestry-specific genetic variants associated with breast cancer in H/L? Can we use these genetic variants to develop a better PRS model for H/L?

Project Purpose(s)

  • Disease Focused Research (breast cancer)
  • Ancestry

Scientific Approaches

We would like to use the genomic data and phenotype data to perform a genome-wide association study (GWAS). We will use Hail, PLINK, Python and/or R scripts to do the analysis.

Anticipated Findings

The anticipated findings may include novel genetic variants associated with breast cancer in H/L samples and a novel PRS model that performs better in H/L samples. Our findings will contribute to improved prediction of breast cancer risk in H/L samples and to better prevention, diagnosis and therapeutics development for breast cancer patients among Hispanic/Latinas.

Demographic Categories of Interest

  • Race / Ethnicity

Data Set Used

Controlled Tier

Research Team

Owner:

HAP 823 Antidepressants Project

Exploring the duration of antidepressant use and reasons for discontinuation can provide insights into the effectiveness of antidepressant treatment within this population. This information is essential for optimizing treatment strategies and improving mental health outcomes. Antidepressant use is widespread, and…

Scientific Questions Being Studied

Exploring the duration of antidepressant use and reasons for discontinuation can provide insights into the effectiveness of antidepressant treatment within this population. This information is essential for optimizing treatment strategies and improving mental health outcomes.
Antidepressant use is widespread, and its impact extends beyond individual patients to public health outcomes. By examining antidepressant usage patterns and associated comorbidities among African Americans, we can identify areas for intervention and improve population-level mental health outcomes.

Project Purpose(s)

  • Educational

Scientific Approaches

Obtain a dataset containing patient demographics, including date of birth, race, and ethnicity.
Acquire a dataset containing information on deceased individuals, including date of death.
Gather data on antidepressant usage.
Utilize a dataset containing information on diseases, including standard concept codes and associated disease groups.
Extract a list of all antidepressant codes from the dataset, including the selected antidepressant.
Create columns for the start date, end date, and duration of antidepressant use for the selected antidepressant.
Descriptive Analysis: Describe the population studied in Table 1, including demographic characteristics, antidepressant usage patterns, and comorbidity profiles.
Model Parameters: Assess the fit between the observed data and expected outcomes based on the research hypotheses.
Python Programming Language: Utilize libraries such as pandas, numpy, and scikit-learn for data manipulation and analysis.
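The step above that creates start date, end date, and duration columns for a selected antidepressant could look roughly like the sketch below. It is a minimal illustration under stated assumptions: the tuple layout, the function name, and the placeholder codes "DRUG_A" and "DRUG_B" are hypothetical and not taken from the workspace. It uses only the Python standard library; the same logic could be expressed as a pandas group-by over person ID.

```python
from datetime import date

def summarize_exposure(records, drug_code):
    """Return {person_id: (start, end, duration_days)} for one drug code.

    records: iterable of (person_id, drug_code, exposure_date) tuples.
    Start is the earliest exposure date, end is the latest, and duration is
    the number of days between them (0 if there is a single record).
    """
    spans = {}
    for person_id, code, exposure_date in records:
        if code != drug_code:
            continue  # ignore every drug other than the selected one
        if person_id not in spans:
            spans[person_id] = (exposure_date, exposure_date)
        else:
            start, end = spans[person_id]
            spans[person_id] = (min(start, exposure_date), max(end, exposure_date))
    return {
        pid: (start, end, (end - start).days)
        for pid, (start, end) in spans.items()
    }

# Made-up rows; "DRUG_A" and "DRUG_B" are placeholders, not real concept codes.
rows = [
    (1, "DRUG_A", date(2020, 1, 1)),
    (1, "DRUG_A", date(2020, 3, 1)),
    (1, "DRUG_B", date(2019, 5, 5)),
    (2, "DRUG_A", date(2021, 7, 4)),
]
summary = summarize_exposure(rows, "DRUG_A")
```

Note that this treats the first-to-last record span as continuous use; gaps between records would need separate handling if discontinuation is to be studied.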

Anticipated Findings

The study will likely reveal information about the frequency of antidepressant use among African American patients, including the duration of use, adherence rates, and reasons for discontinuation.
By analyzing the comorbidities present among African American patients receiving antidepressant treatment, the study can identify common health issues within this demographic.
The study may uncover factors influencing antidepressant use among African American patients, including socioeconomic status, cultural beliefs, and healthcare provider practices. Understanding these factors can guide interventions aimed at improving access to and utilization of mental health services among African American communities.
The findings will have implications for public health, as they can inform policies and programs aimed at addressing mental health disparities and promoting mental well-being among African American populations.

Demographic Categories of Interest

  • Race / Ethnicity

Data Set Used

Registered Tier

Research Team

Owner:

Understanding Pneumonia and its Environmental Determinants

The study aims to investigate the relationship between environmental factors, including air quality, and the incidence of pneumonia. Understanding the impact of household air pollution, ambient air pollution, and other health risks on pneumonia development is crucial for public health…

Scientific Questions Being Studied

The study aims to investigate the relationship between environmental factors, including air quality, and the incidence of pneumonia. Understanding the impact of household air pollution, ambient air pollution, and other health risks on pneumonia development is crucial for public health interventions and policies aimed at reducing pneumonia burden globally.

Project Purpose(s)

  • Educational

Scientific Approaches

The study will employ epidemiological methods to analyze population-based data from the All of Us Registered Tier Dataset v7. Statistical analyses will explore correlations between air quality indicators, demographic factors, and pneumonia incidence. Geospatial analysis may be utilized to assess regional variations in pneumonia prevalence and environmental exposures.
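As one hedged illustration of the correlation analysis mentioned above, the sketch below computes a Pearson correlation coefficient between an air quality indicator and a pneumonia rate. The variable names and numbers are invented for the example, and the description does not say which correlation measure the study will use, so Pearson's r is an assumption.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of centered products and squares (the 1/n factors cancel in r).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)

# Invented regional values: an air quality indicator and a pneumonia rate.
air_quality_indicator = [8.0, 10.0, 12.0, 14.0]
pneumonia_rate = [1.0, 2.0, 3.0, 4.0]
r = pearson_r(air_quality_indicator, pneumonia_rate)
```

Because the made-up values are perfectly linear, r here is 1 up to floating-point error; real data would give a value strictly between -1 and 1.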

Anticipated Findings

Anticipated findings include identifying significant associations between exposure to air pollution and pneumonia incidence, elucidating vulnerable populations at heightened risk, and informing targeted public health interventions. The study findings would contribute to the existing body of scientific knowledge by providing valuable insights into the environmental determinants of pneumonia and guiding evidence-based policies for mitigating pneumonia burden.

Demographic Categories of Interest

  • Race / Ethnicity
  • Geography

Data Set Used

Registered Tier

Research Team

Owner:
