April 2019 Cancer Epidemiology Matters E-News
Cancer Epidemiology Matters E-News
NCI & NIH Resources for Data Analysis and More
What’s New at NCI
Resources for Researchers is a directory of NCI-supported tools and services for cancer researchers. The types of resources available include data analysis tools, datasets, databases, and more. The research areas encompassed by these tools are broad – examples include
- causes of cancer,
- cancer omics,
- cancer prevention,
- cancer and public health,
- cancer screening and detection, and
- cancer statistics.
These resources are either developed and maintained by NCI scientists, including staff from the Epidemiology and Genomics Research Program and others in the Division of Cancer Control and Population Sciences, or created with grant funding. Most resources are free and available to all.
On April 15, 2019, NCI’s Surveillance Research Program released the Surveillance, Epidemiology, and End Results (SEER) Cancer Statistics Review, 1975-2016. The updated Cancer Statistics Review presents the most recent cancer incidence, mortality, survival, and prevalence statistics.
Useful NIH Tools
The database of Genotypes and Phenotypes (dbGaP) is one of many NIH data sharing repositories. It was developed to archive and distribute data and results from studies that investigate the associations between genotypes and phenotypes in humans. One useful but less well-known tool in dbGaP is the dbGaP Data Browser, which allows approved users to find and view discrete regions of a dbGaP genomic sequence dataset at nucleotide level resolution without having to download the datasets to local computers. Approved users may also view summary information on variants and individual genotypes, if available. To request use of the dbGaP Data Browser, researchers should log in to the dbGaP portal as they would for controlled-access data and select the “Data Browser” tab. Requests are made through a simplified controlled-access application.
A listing other NIH data sharing repositories is available through the National Library of Medicine’s website. Examples of NIH-supported resources which may help facilitate epidemiologic research include:
- Biologic Specimen and Data Repository Information Coordinating Center (bioLINCC) facilitates access to the National Heart, Lung and Blood Institute (NHLBI) Biorepository and Data Repository and promotes the use of other NHLBI-funded population-based biospecimen and data resources.
- Gabriella Miller Kids First Pediatric Research Program, through dbGaP and through the Kids First Data Resource Portal, offers clinical and genetic sequence data from more than 5,000 individuals with a structural birth defect or pediatric cancer.
- Genotype – Tissue Expression (GTEx) project, an ongoing effort to build a comprehensive public resource to study tissue-specific gene expression and regulation, provides open access to data including gene expression, QTLs, and histology images.
- Metabolomics Workbench is a resource sponsored by the Common Fund Metabolomics Program to serve as a national and international repository for metabolomics data and metadata and provides analysis tools and access to metabolite standards, protocols, and more.
- National Institute of Environmental Health Sciences (NIEHS) EpiShare is a web-based platform for sharing biospecimens and/or datasets from studies conducted by NIEHS’ Epidemiology Branch. NIEHS also offers several other databases as a resource to the scientific community, such as the Environmental Genome Project, and more.