ORNL_AISD_DL-HLgap
- Yoo, Pilsun | Oak Ridge National Laboratory
- Irle, Stephan | Oak Ridge National Laboratory
- Lupo Pasini, Massimiliano | Oak Ridge National Laboratory
- Mehta, Kshitij | Oak Ridge National Laboratory
Overview
Description
This dataset provides supplementary molecular dataset of Deep Learning Workflow for the Inverse Design of Molecules with Specific Optoelectronic Properties. The dataset comprises three main directories such as GDB-9_dataset, Low_HL_Gap_dataset, and High_HL_Gap_dataset which individually has csv files, smiles_txt files, pdb files and xyz files containing information of molecular structures, properties and coordinates generated from deep learning workflow using generative model, surrogate model and DFTB calculation results. GDB-9_dataset contains the molecular data extracted from the original GDB-9 dataset with additional data of DFTB HL gap, surrogate HL gap and molecular property analysis. (the number of atoms, aromaticity and double bond equivalent) Low_HL_Gap_dataset and High_HL_Gap_dataset contains series of dataset for different generations with further split to train and test dataset that were obtained from the iterative workflow described in the manuscript. Additional directory Chemiscope_visualization in Low_HL_Gap_dataset directory contains compressed json files to visualize molecules using chemiscope.org page or application to help readers examine generated molecules.
Funding resources
DOE contract number
DE-AC05-00OR22725Originating research organization
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)Sponsoring organization
Office of Science (SC)Details
DOI
10.13139/ORNLNCCS/1996925Release date
September 1, 2023Dataset
Dataset type
ND Numeric DataSoftware
Python, Chemiscope, RDKitAcknowledgements
Users should acknowledge the OLCF in all publications and presentations that speak to work performed on OLCF resources:
This work was carried out [in part] at Oak Ridge National Laboratory, managed by UT-Battelle, LLC for the U.S. Department of Energy under contract DE-AC05-00OR22725.
Category
- 37 INORGANIC, ORGANIC, PHYSICAL, AND ANALYTICAL CHEMISTRY
Keywords
- Optical Property of Molecules,
- Inverse Design,
- Density-Functional Tight-Binding