Genomic Datasets for Cancer Research

A variety of datasets from genome-wide association studies of cancer and other genotype-phenotype studies, including sequencing and molecular diagnostic assays, are available to approved investigators through the Extramural National Cancer Institute (NCI) Data Access Committee (DAC).

The Committee is charged with implementing the National Institutes of Health (NIH) data sharing policy for many NCI-supported and -conducted genome-wide association and genomic studies. It reviews all requests from the research community (including NIH intramural staff) for controlled access to genomic data, as well as to other cancer-related datasets for which it is responsible.

Request Data Access


Available Datasets

Use the table below to learn more about the datasets available, including the type of data and consent groups. Click on the name of the dataset to visit the dbGaP study page for more information.

Sort/Filter Datasets

Sort By:


Filter By:

Accession Number Dataset Cancer Type Individual or Aggregate Level Data Data Types Germline or Somatic Tumor or Normal
1. phs000201 Aquired Copy Number Alterations in Adult Acute Myeloid Leukemia GenomesExternal Web Site Policy Acute Myeloid Leukemia Individual Level Data Whole Genome Genotyping Germline N/A
2. phs000395 California Pacific Medical Center Research Breast Health CohortExternal Web Site Policy Breast Cancer Individual Level Data Whole Genome Genotyping Germline N/A
3. phs000385 Epigenetic Profiling of Human Colorectal CancerExternal Web Site Policy Colorectal Cancer Individual Level Data Whole Genome Genotyping, Whole Genome Bisulfite Sequencing Somatic Both
4. phs000525 Expressed Pseudogenes in the Translational Landscape of Human CancersExternal Web Site Policy Multiple Cancers Individual Level Data RNA Sequencing Both Both
5. phs000487 Functionally Active Copy Number Variants Associated with Prostate Cancer RiskExternal Web Site Policy Prostate Cancer Individual Level Data Whole Genome Genotyping Germline N/A
6. phs000517 GWAS in African Americans, Latinos, and JapaneseExternal Web Site Policy Breast Cancer Individual Level Data Whole Genome Genotyping Germline N/A
7. phs000505 Gene Fusion Discovery Through RNA Sequencing of Nine Human Glioblastoma Stem Cell LinesExternal Web Site Policy Glioblastoma Individual Level Data RNA sequencing Somatic Tumor
8. phs000384 Genentech Whole Genome and Transcriptome Sequencing of Four Hepatocellular Carcinoma PatientsExternal Web Site Policy Hepatocellular Carcinoma Individual Level Data Whole Genome Sequencing, RNA Sequencing Both Tumor
9. phs000299 Genentech Whole Genome Sequencing of a Non-Small-Cell Lung CarcinomaExternal Web Site Policy Lung Cancer Individual Level Data Whole Genome Genotyping, Whole Genome Sequencing Somatic Both
10. phs000573 Genetic Heterogeneity of Diffuse Large B-cell LymphomaExternal Web Site Policy Diffuse Large B-cell Lymphoma Individual Level Data Whole Exome Sequencing Somatic Both
11. phs000562 The Genetic Landscape of Mutations in Burkitt LymphomaExternal Web Site Policy Lymphoma Individual Level Data Exome Sequencing, Gene Expression Both Both
12. phs000341 Genome-Wide Analysis of Hypodiploid Acute Lymphoblastic LeukemiaExternal Web Site Policy Acute Lymphoblastic Leukemia Individual Level Data Whole Genome Sequencing Somatic Both
13. phs000502 Genome-Wide Analysis of Splenic Marginal Zone LymphomaExternal Web Site Policy Lymphoma Individual Level Data Whole Genome Genotyping, Whole Exome Sequencing Both Tumor
14. phs000621 Genome Wide Association Studies in ECOG 2997 TrialExternal Web Site Policy Chronic Lymphocytic Leukemia Individual Level Data Whole Genome Genotyping Germline N/A
15. phs000383 Genome-Wide Association Study of Breast Cancer in the African Diaspora - the ROOT studyExternal Web Site Policy Breast Cancer Individual Level Data Whole Genome Genotyping Germline N/A
16. phs000550 Genome-Wide Characterization of Pancreatic Adenocarcinoma Patients Using Next Generation SequencingExternal Web Site Policy Pancreatic Cancer Individual Level Data RNA Sequencing, Whole Genome Sequencing Both Tumor
17. phs000409 Genomic Analysis of MedulloblastomaExternal Web Site Policy Medulloblastoma Individual Level Data Whole Genome Genotyping Somatic Both
18. phs000552 Genomic Characterization of MeningiomasExternal Web Site Policy Meningioma Individual Level Data Whole Genome Sequencing, Whole Exome Sequencing Somatic Both
19. phs000340 Genomic Complexity of Early T-Cell Progenitor Acute Lymphoblastic LeukemiaExternal Web Site Policy T-lineage Acute Lymphoblastic Leukemia Individual Level Data Whole Genome Sequencing Somatic Both
20. phs000352 Genomic Complexity of Sporadic and Inherited Retinoblastoma: Matched Orthotopic XenograftExternal Web Site Policy Retinoblastoma Individual Level Data Whole Genome Sequencing Somatic Both
21. phs000508 Genomic Sequencing of Pediatric Rhabdoid CancersExternal Web Site Policy Rhabdoid Cancers Individual Level Data Whole Exome Sequencing, Whole Genome Genotyping Somatic Both
22. phs000568 Genomic Sequencing of Solitary Fibrous TumorsExternal Web Site Policy Solitary Fibrous Tumors Individual Level Data Whole Exome Sequencing Somatic Tumor
23. phs000563 The Genomics of Pilocytic Astrocytoma Formation in Neurofibromatosis Type 1External Web Site Policy Pilocytic Astrocytomas Individual Level Data Whole Genome Sequencing, Methylation Both Tumor
24. phs000364 High Density Copy Number Analysis and Whole Exome Sequencing of Chronic Lympocytic LeukemiaExternal Web Site Policy Chronic Lymphocytic Leukemia Individual Level Data Whole Genome Genotyping, Whole Exome Sequencing, High-density SNP Exome Array Germline N/A
25. phs000328 High Density Copy Number Analysis and Whole Exome Sequencing of Diffuse Large B-cell LymphomaExternal Web Site Policy Diffuse Large B-cell Lymphoma Individual Level Data Whole Exome Sequencing, Genome-wide High-density SNP Array Analysis Somatic Tumor
26. phs000187 High Densigy SNP Association Analysis of Melanoma: Case-Control and Outcomes InvestigationExternal Web Site Policy Melanoma Individual Level Data Whole Genome Genotyping Germline N/A
27. phs000522 Hyperdiploid Acute Lymphoblastic Leukemia RNA-SeqExternal Web Site Policy Acute Lymphoblastic Leukemia Both RNA Sequencing Somatic Both
28. phs000567 Identification of Recurrent NAB2-STAT6 Gene Fusions in Solitary Fibrous Tumor by Integrative SequencingExternal Web Site Policy Solitary Fibrous Tumors Individual Level Data RNA Sequencing Somatic Tumor
29. phs000602 Identification of Targetable FGFR Gene Fusions in Diverse CancersExternal Web Site Policy Multiple Cancers Individual Level Data Exome Sequencing, RNAseq Both Tumor
30. phs000306 A Multiethnic Genome-Wide Scan of Prostate CancerExternal Web Site Policy Prostate Cancer Individual Level Data Whole Genome Genotyping Germline N/A
31. phs000235 National Cancer Institute Cancer Genome Characterization Initiative (CGCI)External Web Site Policy Multiple Cancers Individual Level Data Whole Genome Sequencing, Transcriptome Sequencing Somatic Both
32. phs000218 National Cancer Institute (NCI) Therapeutically Applicable Research to Generate Effective Treatments (TARGET)External Web Site Policy Multiple Childhood Cancers Individual Level Data Genomic Sequencing, Transcriptomic Sequencing Somatic Both
33. phs000124 Neuroblastoma Genome-Wide Association Study (NBL-GWAS)External Web Site Policy Neuroblastoma Both Whole Genome Genotyping Germline N/A
34. phs000513 Rearrangements of the MAST Kinase and Notch Gene Families in Breast CancerExternal Web Site Policy Breast Cancer Individual Level Data RNA Sequencing Both Both
35. phs000369 Sequence Analysis of Mutations and Translocations Across Breast Cancer SubtypesExternal Web Site Policy Breast Cancer Individual Level Data Whole Genome Sequencing, Whole Exome Sequencing Somatic Both
36. phs000418 Temporal Dissection of Tumorigenesis in Primary CancersExternal Web Site Policy Cutaneous Squamous Cell Carcinomas and Ovarian Adenocarcinomas Individual Level Data Exome Sequencing Somatic Both
37. phs000348 Toward a Genomic Understanding of MyelomaExternal Web Site Policy Multiple Myeloma Individual Level Data Whole Genome Sequencing, Whole Exome Sequencing Somatic Both
38. phs000443 Transciptome Sequencing Across a Prostate Cancer Cohort Identifies PCAT-1, An Unannotated lincRNA Implicated in Disease ProgressionExternal Web Site Policy Prostate Cancer Individual Level Data RNA Sequencing Somatic Both
39. phs000491 Whole Genome and Exome Sequencing in Clear-Cell Renal Cell CarcinomaExternal Web Site Policy Renal Cell Carcinoma Individual Level Data Whole Genome Sequencing, Whole Genome Genotyping, Exome Sequencing Somatic Both
40. phs000535 Whole Genome and Exon Capture Sequencing of Bladder CancersExternal Web Site Policy Bladder Cancer Individual Level Data Whole Genome Sequencing, Custom Target Sequencing Both Both
41. phs000414 Whole Genome Sequencing of Core-Binding Factor LeukemiaExternal Web Site Policy Leukemia Individual Level Data Whole Genome Sequencing Somatic Both
42. phs000413 Whole Genome Sequencing of Pediatric Acute Megakaryoblastic LeukemiaExternal Web Site Policy Leukemia Individual Level Data Whole Genome Sequencing Somatic Both
43. phs000579 Small Intestine Neuroendocrine Tumors (Carcinoid Tumors)External Web Site Policy Neuroendocrine Tumors Individual Level Data Whole Exome Sequencing, Whole Genome Sequencing Somatic Both
44. phs000601 Filtering and Annotation of Variants that are Rare (FAVR)External Web Site Policy Breast Cancer Individual Level Data Whole Exome Sequencing Germline N/A
45. phs000604 Mapping Genes for Mammographic DensityExternal Web Site Policy Breast Cancer Aggregate Level Data Whole Genome Genotyping Germline N/A
46. phs000614 Genomic Analysis of Pediatric Low Grade GliomasExternal Web Site Policy Glioma Individual Level Data Whole Genome Sequencing Somatic Both
47. phs000644 The Effect of the Menstrual Cycle on the Normal Human BreastExternal Web Site Policy Breast Cancer Individual Level Data Transcriptome Sequencing Somatic Normal
48. phs000646 Breakpoint Detection Using Long Insert Whole Genome SequencingExternal Web Site Policy Multiple Cancers Individual Level Data Whole Genome Sequencing Both Tumor
There are no results for your selection.

Return to Top