The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data among researchers, was launched on June 6 with a visit from Vice President Joe Biden to the operations center at the University of Chicago. An initiative of the National Cancer Institute (NCI), the GDC will be a core component of the National Cancer Moonshot and the President’s Precision Medicine Initiative (PMI) and benefits from $70 million allocated to NCI to lead efforts in cancer genomics as part of PMI for Oncology. The GDC will centralize, standardize, and make accessible data from large-scale NCI programs such as The Cancer Genome Atlas (TCGA) and its pediatric equivalent, Therapeutically Applicable Research to Generate Effective Treatments (TARGET).
Together, TCGA and TARGET represent some of the largest and most comprehensive cancer genomics data sets in the world, comprising more than two petabytes of data (one petabyte is equivalent to 223,000 DVDs filled to capacity with data). In addition, the GDC will accept submissions of cancer genomic and clinical data from researchers around the world who wish to share their data broadly. In so doing, researchers will be able to use the state-of-the-art analytic methods of the GDC, allowing them to compare their findings with other data in the GDC.
Data in the GDC, representing thousands of cancer patients and tumors, will be harmonized using standardized software algorithms so that they are accessible and broadly useful to any cancer researcher. The storage of raw genomic data in the GDC will also allow reanalysis, as computational methods and genome annotations improve. The GDC has important safeguards to ensure secure data storage and downloading.
Douglas Lowy, MD
“With the GDC, NCI has made a major commitment to maintaining long-term storage of cancer genomic data and providing researchers with free access to these data,” said NCI Acting Director Douglas Lowy, MD. “Importantly, the explanatory power of data in the GDC will grow over time as data from more patients are included, and ultimately the GDC will accelerate our efforts in precision medicine.”
“Of particular significance, the GDC will also house data from a number of newer NCI programs that will sequence the DNA of patients enrolled in NCI clinical trials,” said
Louis M. Staudt, MD, PhD
Louis M. Staudt, MD, PhD, of the NCI. “These data sets will lead to a much deeper understanding of which therapies are most effective for individual cancer patients. With each new addition, the GDC will evolve into a smarter, more comprehensive knowledge system that will foster important discoveries in cancer research and increase the success of cancer treatment for patients.”
GDC researchers will be able to integrate genetic and clinical data, such as cancer imaging and histologic data, with information on the molecular profiles of tumors, as well as treatment response. From this perspective, the GDC would become an important resource for generating potentially actionable and life-changing information that ultimately could be used by doctors and their patients.
The GDC is being built and managed by the University of Chicago Center for Data Intensive Science in collaboration with the Ontario Institute for Cancer Research, all under an NCI contract with Leidos Biomedical Research, based in Frederick, Maryland.
For more information about the GDC, visit gdc.nci.nih.gov. ■