Genomic Datasets for Cancer Research
A variety of datasets from genome-wide association studies of cancer and other genotype-phenotype studies, including sequencing and molecular diagnostic assays, are available to approved investigators through the Extramural National Cancer Institute (NCI) Data Access Committee (DAC).
The Committee is charged with implementing the National Institutes of Health (NIH) data sharing policy for many NCI-supported and -conducted genome-wide association and genomic studies. It reviews all requests from the research community (including NIH intramural staff) for controlled access to genomic data, as well as to other cancer-related datasets for which it is responsible.
Learn more about the Data Access Request Process.
Available Datasets
Click the link below each of the following cancer types to learn more about the datasets available, including the type of data and consent groups.
-
Bladder Cancer
View Datasets- Whole Genome and Exon Capture Sequencing of Bladder Cancers
- Data type: individual-level, whole-genome sequencing, custom target sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Whole Genome and Exon Capture Sequencing of Bladder Cancers
-
Brain Cancer
View Datasets- Gene fusion discovery through RNA sequencing of nine human glioblastoma stem cell lines
- Data type: individual level data, RNA sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Genomic Analysis of Medulloblastoma
- Data type: individual level, whole genome genotyping
- Consent group: general research use
- Link to dataset details in dbGaP

- Genomic Characterization of Meningiomas
- Data type: individual level data, whole genome sequencing, whole exome sequencing
- Consent group: brain cancer and brain disorders research and methods
- Link to dataset details in dbGaP

- Gene fusion discovery through RNA sequencing of nine human glioblastoma stem cell lines
-
Breast Cancer
View Datasets- California Pacific Medical Center Research Breast Health Cohort
- Data type: individual level data, whole genome genotyping
- Consent group: general research use
- Link to dataset details in dbGaP

- GWAS in African Americans, Latinos, and Japanese
- Data type: individual level data, whole genome genotyping
- Consent group: multiple consent groups
- Link to dataset details in dbGaP

- Genome-Wide Association Study of Breast Cancer in the African Diaspora - the ROOT study
- Data type: individual level data, whole genome genotyping
- Consent group: health research and general methods
- Link to dataset details in dbGaP

- Rearrangements of the MAST Kinase and Notch Gene Families in Breast Cancer
- Data type: individual level data, RNA sequencing
- Consent group: cancer research and general methods
- Link to dataset details in dbGaP

- Sequence Analysis of Mutations and Translocations Across Breast Cancer Subtypes
- Data type: individual level data, whole genome and whole exome sequencing
- Consent group: general research use, cancer research and general methods
- Link to dataset details in dbGaP

- California Pacific Medical Center Research Breast Health Cohort
-
Childhood Cancers
View Datasets- Analysis of Somatic Mutations in Pediatric AML FAB-M7 Subtype by Whole Transcriptome Sequencing
- Data type: individual level, whole genome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Genome-Wide Analysis of Hypodiploid Acute Lymphoblastic Leukemia
- Data type: individual level, whole genome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Genomic Complexity of Sporadic and Inherited Retinoblastoma: Matched Orthotopic Xenograft
- Data type: individual level data, whole genome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Genomic Complexity of Early T-Cell Progenitor Acute Lymphoblastic Leukemia
- Data type: individual level data, whole genome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Genomic Sequencing of Pediatric Rhabdoid Cancers
- Data type: individual level data, whole exome sequencing, whole genome genotyping
- Consent group: pediatric cancer research and pediatric methods, cancer research
- Link to dataset details in dbGaP

- Hyperdiploid Acute Lymphoblastic Leukemia RNA-Seq
- Data type: individual level data, RNA sequencing
- Consent group: pediatric cancer research
- Link to dataset details in dbGaP

- National Cancer Institute (NCI) Therapeutically Applicable Research to Generate Effective Treatments (TARGET)
- Data type: individual level data, genomic and transcriptomic sequencing
- Consent group: pediatric cancer research
- Link to dataset details in dbGaP

- Whole Genome Sequencing of Core-Binding Factor Leukemia
- Data type: individual level, whole genome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Analysis of Somatic Mutations in Pediatric AML FAB-M7 Subtype by Whole Transcriptome Sequencing
-
Colorectal Cancer
View Datasets- Epigenetic Profiling of Human Colorectal Cancer
- Data type: individual level data, whole genome genotyping, whole genome bisulfite sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Epigenetic Profiling of Human Colorectal Cancer
-
Fibrous Tumors
View Datasets- Genomic Sequencing of Solitary Fibrous Tumors
- Data type: individual level data, whole exome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Identification of Recurrent NAB2-STAT6 Gene Fusions in Solitary Fibrous Tumor by Integrative Sequencing
- Data type: individual level data, RNA sequencing
- Consent group: cancer research and methods
- Link to dataset details in dbGaP

- Genomic Sequencing of Solitary Fibrous Tumors
-
Kidney Cancer
View Datasets- Whole Genome and Exome Sequencing in Clear- Cell Renal Cell Carcinoma
- Data type: individual level, whole genome sequencing, whole genome genotyping, exome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Whole Genome and Exome Sequencing in Clear- Cell Renal Cell Carcinoma
-
Leukemia
View Datasets- Adult Leukemia
- Acquired Copy Number Alterations in Adult Acute Myeloid Leukemia Genomes
- Data type: individual level data, whole genome genotyping
- Consent group: research related to hematologic diseases and methods
- Link to dataset details in dbGaP

- High Density Copy Number Analysis and Whole Exome Sequencing of Chronic Lymphocytic Leukemia
- Data type: individual level data, whole genome genotyping, whole exome sequencing, high-density SNP exome array
- Consent group: cancer research and methods
- Link to dataset details in dbGaP

- Acquired Copy Number Alterations in Adult Acute Myeloid Leukemia Genomes
- Childhood Leukemia
- Analysis of Somatic Mutations in Pediatric AML FAB-M7 Subtype by Whole Transcriptome Sequencing
- Data type: individual level, whole genome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Genome-Wide Analysis of Hypodiploid Acute Lymphoblastic Leukemia
- Data type: individual level, whole genome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Genomic Complexity of Early T-Cell Progenitor Acute Lymphoblastic Leukemia
- Data type: individual level data, whole genome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Hyperdiploid Acute Lymphoblastic Leukemia RNA-Seq
- Data type: individual level data, RNA sequencing
- Consent group: pediatric cancer research
- Link to dataset details in dbGaP

- Whole Genome Sequencing of Core-Binding Factor Leukemia
- Data type: individual level, whole genome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Analysis of Somatic Mutations in Pediatric AML FAB-M7 Subtype by Whole Transcriptome Sequencing
- Adult Leukemia
-
Liver Cancer
View Datasets- Genentech Whole Genome and Transcriptome Sequencing of Four Hepatocellular Carcinoma Patients
- Data type: individual level data, whole genome sequencing, RNA sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Genentech Whole Genome and Transcriptome Sequencing of Four Hepatocellular Carcinoma Patients
-
Lung Cancer
View Datasets- The Mutation Spectrum Revealed by Paired Genome Sequences from a Lung Cancer Patient
- Data type: individual level data, whole genome genotyping, whole genome sequencing
- Consent group: biomedical research use
- Link to dataset details in dbGaP

- The Mutation Spectrum Revealed by Paired Genome Sequences from a Lung Cancer Patient
-
Lymphoma
View Datasets- Genome-Wide Analysis of Splenic Marginal Zone Lymphoma
- Data type: Individual level data, whole genome genotyping, whole exome sequencing
- Consent group: cancer research and methods
- Link to dataset details in dbGaP

- High Density Copy Number Analysis and Whole Exome Sequencing of Diffuse Large B-Cell Lymphoma
- Data type: individual level data, whole exome sequencing, genome-wide high-density SNP array analysis
- Consent group: cancer research and methods
- Link to dataset details in dbGaP

- The Genetic Landscape of Mutations in Burkitt Lymphoma
- Data type: exome sequencing, gene expression
- Consent group: general research use
- Link to dataset details in dbGaP

- Genome-Wide Analysis of Splenic Marginal Zone Lymphoma
-
Melanoma
View Datasets- High Density SNP Association Analysis of Melanoma: Case-Control and Outcomes Investigation
- Data type: individual level data, whole genome genotyping
- Consent group: general research use
- Link to dataset details in dbGaP

- High Density SNP Association Analysis of Melanoma: Case-Control and Outcomes Investigation
-
Multiple Cancers
View Datasets- National Cancer Institute Cancer Genome Characterization Initiative (CGCI)
- Data type: individual level data, whole genome and transcriptome sequencing
- Consent group: cancer research and methods, pediatric Cancer
- Link to dataset details in dbGaP

- National Cancer Institute (NCI) Therapeutically Applicable Research to Generate Effective Treatments (TARGET)
- Data type: individual level data, genomic and transcriptomic sequencing
- Consent Group: pediatric cancer research
- Link to dataset details in dbGaP

- Temporal Dissection of Tumorigenesis in Primary Cancers
- Data type: individual level, exome sequencing
- Consent group: cancer research and general methods
- Link to dataset details in dbGaP

- National Cancer Institute Cancer Genome Characterization Initiative (CGCI)
-
Multiple Myeloma
View Datasets- Towards a Genomic Understanding of Myeloma
- Data type: individual level data, whole genome and whole exome sequencing
- Consent group: general research use, cancer, cancer-related disorders, and methods
- Link to dataset details in dbGaP

- Towards a Genomic Understanding of Myeloma
-
Pancreatic Cancer
View Datasets- Genome-Wide Characterization of Pancreatic Adenocarcinoma Patients Using Next Generation Sequencing
- Data type: whole genome sequencing, RNA sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Genome-Wide Characterization of Pancreatic Adenocarcinoma Patients using Next Generation Sequencing
- Data type: individual level data, RNA sequencing, whole genome sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- Genome-Wide Characterization of Pancreatic Adenocarcinoma Patients Using Next Generation Sequencing
-
Prostate Cancer
View Datasets- A Multiethnic Genome-Wide Scan of Prostate Cancer
- Data type: individual level data, whole genome genotyping
- Consent group: multiple consent groups
- Link to dataset details in dbGaP

- Functionally Active Copy Number Variants Associated with Prostate Cancer Risk
- Data type: individual level data, whole genome genotyping
- Consent group: prostate cancer research and cancer methods
- Link to dataset details in dbGaP

- Transcriptome Sequencing Across a Prostate Cancer Cohort Identifies PCAT-1, An Unannotated lincRNA Implicated in Disease Progression
- Data type: individual level data, RNA sequencing
- Consent group: general research use
- Link to dataset details in dbGaP

- A Multiethnic Genome-Wide Scan of Prostate Cancer