Populus Trichocarpa Genome-Wide Association Study (GWAS) Population SNP Dataset Released
10.13139/OLCF/1411410This dataset includes genetic variations found in 882 poplar trees, and provides useful information to scientists studying plants as well as researchers more generally in the fields of biofuels, materials science, and secondary plant compounds. For nearly 10 years, researchers with DOE's BioEnergy Science Center (BESC), a multi-institutional organization headquartered at ORNL, have studied the genome of Populus - a fast-growing perennial tree recognized for its economic potential in biofuels production. This Genome-Wide Association Study (GWAS) dataset includes more than 28 million single nucleotide polymorphisms, or SNPs that have been derived from 17 trillion bases of sequence data generated from 882 undomesticated Populus genotypes. Each SNP represents a variation in a single DNA nucleotide, or building block, that can act as a biological marker and/or causal allele within a protein sequence, helping scientists locate genes associated with certain characteristics, conditions or diseases. The results of this analysis have been used, among other things, to 1) seek genetic control of cell-wall recalcitrance - a natural characteristic of plant cell walls that prevent the release of sugars under microbial conversion and restricts biofuels production and 2) identify the molecular mechanisms controlling deposition of lignin in plant structures. Lignin is a polyphenolic polymer that strengthens plant cell walls and acts as a barrier to microbial access to cellulose during saccharfication - the process of breaking cellulose down into simple sugars for fermentation. Although the dataset's most immediate applications are in fundamental plant sciences, ORNL researchers plan to use the GWAS data to inform applied work in areas such as cleaner, sustainable transportation biofuels, carbon fiber for lightweight vehicles and alternatives to conventional plastics and building insulation materials.
Published: 2017-12-06 11:25:28 Download DatasetDataset Properties
Field | Value |
---|---|
Authors |
|
Project Identifier | BIF102 |
Dataset Type | GD Genome/Genetics Data |
Keywords |
|
Originating Organizations | Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States) |
Sponsoring Organizations | Office of Science (SC), Biological and Environmental Research (BER) (SC-23);Center for Bioenergy Innovation (CBI) |
Other Contributing Organizations | Joint Genome Institute |
DOE Contract | DE-AC05-00OR22725 |
Acknowledgements
Papers using this dataset are requested to include the following text in their acknowledgements:
*Support for 10.13139/OLCF/1411410 is provided by the U.S. Department of Energy, project BIF102 under Contract DE-AC05-00OR22725. This research used resources of the Oak Ridge Leadership Computing Facility, which is a DOE Office of Science User Facility.