Xuelu (Jeff) Liu

Project Personnel, Dana-Farber Cancer Institute

2 active projects

Phenotypic and Genomic Data Summarization

Scientific Questions Being Studied

Investigators, scientists, and trainees in the Division of Population Sciences at Dana-Farber Cancer Institute are conducting diverse cancer research programs. The All of Us research data is a valuable new resource that we can use to improve our understanding of cancer.

The primary objective is to characterize the genomic data: its sources, the specific genomic data types, their availability for participants with different cancer types and subtypes, and other cohort characteristics.

Project Purpose(s)

  • Educational

Scientific Approaches

We will examine data sources in the Controlled and Registered Tiers and categorize the cohort by cancer type across different populations. We will develop Jupyter notebooks to evaluate data access and integration.
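
As an illustration of the kind of cohort categorization described above, the following is a minimal sketch of a notebook cell, not the project's actual code. It assumes a participant-level table has already been extracted; the column names (`person_id`, `cancer_type`, `population_group`), the function `summarize_cohort`, and all values are hypothetical and invented for the example.

```python
import pandas as pd

# Hypothetical extract: one row per participant and cancer type.
# Column names and values are invented for illustration only.
cohort = pd.DataFrame({
    "person_id": [1, 2, 3, 4, 5, 6],
    "cancer_type": ["breast", "breast", "lung", "lung", "lung", "colorectal"],
    "population_group": ["A", "B", "A", "A", "B", "B"],
})


def summarize_cohort(df: pd.DataFrame) -> pd.DataFrame:
    """Count distinct participants per cancer type and population group."""
    # Drop repeated rows so a participant is counted once per cancer type.
    unique_rows = df.drop_duplicates(subset=["person_id", "cancer_type"])
    return pd.crosstab(
        unique_rows["cancer_type"],
        unique_rows["population_group"],
        margins=True,          # add row and column totals
        margins_name="Total",
    )


summary = summarize_cohort(cohort)
print(summary)
```

A cross-tabulation keeps the summary in one table with row and column totals, which suits a descriptive report; a real notebook would read the table from the workspace's data rather than define it inline.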

Anticipated Findings

We expect to generate descriptive reports and Jupyter Notebooks for data summary and data type demonstration. We will document data models and data element mapping for research and EHR data. Data flow diagrams and best practices will also be created for data access and data integration.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Controlled Tier

Research Team

Owner:

Phenotypic Data Summarization (V6, Registered Tier)

Scientific Questions Being Studied

Investigators, scientists, and trainees in the Division of Population Sciences at Dana-Farber Cancer Institute are conducting diverse cancer research programs. The All of Us research data is a valuable new resource that we can use to improve our understanding of cancer.

The primary objective of this study is to summarize the curated datasets in the Registered Tier, i.e., survey data, EHRs, and wearables data.

The secondary objective is to evaluate and document best practices for data access, storage, analysis, and the infrastructure environment.

The third objective is to understand the data normalization/harmonization process with the OMOP CDM. How is participants’ external EHR data harmonized and integrated into the All of Us Research Hub? What data elements are included in specific data sources, such as wearable devices and family health history?

Project Purpose(s)

  • Educational

Scientific Approaches

We will start by examining data sources in the Registered Tier and categorizing the cohort by cancer type across different populations. We will develop Jupyter notebooks to evaluate data access and integration.

Anticipated Findings

We expect to generate descriptive reports and Jupyter Notebooks for data summary and data type demonstration. We will document data models and data element mapping for research and EHR data. Data flow diagrams and best practices will also be created for data access and data integration.

Demographic Categories of Interest

This study will not center on underrepresented populations.

Data Set Used

Registered Tier

Research Team

Owner:
