ORNL_AISD_DL-HLgap

  • Yoo, Pilsun | Oak Ridge National Laboratory
  • Irle, Stephan | Oak Ridge National Laboratory
  • Lupo Pasini, Massimiliano | Oak Ridge National Laboratory
  • Mehta, Kshitij | Oak Ridge National Laboratory
Overview

Description

This dataset provides the supplementary molecular data for "Deep Learning Workflow for the Inverse Design of Molecules with Specific Optoelectronic Properties." It comprises three main directories: GDB-9_dataset, Low_HL_Gap_dataset, and High_HL_Gap_dataset. Each directory contains csv files, smiles_txt files, pdb files, and xyz files that record the molecular structures, properties, and coordinates generated by the deep learning workflow, which combines a generative model, a surrogate model, and DFTB calculations. GDB-9_dataset contains the molecular data extracted from the original GDB-9 dataset, supplemented with the DFTB HOMO-LUMO (HL) gap, the surrogate HL gap, and a molecular property analysis (number of atoms, aromaticity, and double bond equivalent). Low_HL_Gap_dataset and High_HL_Gap_dataset each contain a series of datasets, one per generation of the iterative workflow described in the manuscript, and each of these is further split into train and test sets. An additional directory, Chemiscope_visualization, inside Low_HL_Gap_dataset contains compressed json files that can be loaded on the chemiscope.org page or in the Chemiscope application to help readers examine the generated molecules.
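
As an illustration of two of the properties named above (number of atoms and double bond equivalent), the following minimal sketch derives both from the text of an xyz file. It assumes the files follow the plain XYZ convention (an atom-count line, a comment line, then one "element x y z" line per atom) and that only the elements found in GDB-9 (C, H, N, O, F) occur. The function names, the sample ethylene geometry, and the use of the standard valence formula are assumptions made for this example; the record does not state how the authors computed these values.

```python
from collections import Counter

# Illustrative ethylene geometry in plain XYZ format (approximate coordinates,
# not taken from the dataset).
ETHYLENE_XYZ = """6
ethylene (illustrative example)
C   0.000   0.000   0.667
C   0.000   0.000  -0.667
H   0.000   0.923   1.238
H   0.000  -0.923   1.238
H   0.000   0.923  -1.238
H   0.000  -0.923  -1.238
"""


def parse_xyz_elements(xyz_text):
    """Return the element symbols listed in plain XYZ text.

    Assumes line 1 holds the atom count, line 2 is a comment, and each
    following line starts with an element symbol.
    """
    lines = xyz_text.strip().splitlines()
    n_atoms = int(lines[0].split()[0])
    symbols = [line.split()[0] for line in lines[2:2 + n_atoms]]
    if len(symbols) != n_atoms:
        raise ValueError("atom count does not match the number of atom lines")
    return symbols


def double_bond_equivalent(symbols):
    """Standard degree-of-unsaturation formula: (2C + 2 + N - H - X) / 2.

    Oxygen does not change the result; fluorine is counted as the halogen X.
    Only C, H, N, O, and F are handled (assumption: GDB-9 element set).
    """
    counts = Counter(symbols)
    unsupported = set(counts) - {"C", "H", "N", "O", "F"}
    if unsupported:
        raise ValueError(f"unsupported elements: {sorted(unsupported)}")
    return (2 * counts["C"] + 2 + counts["N"] - counts["H"] - counts["F"]) / 2


symbols = parse_xyz_elements(ETHYLENE_XYZ)
print(len(symbols))                     # number of atoms: 6
print(double_bond_equivalent(symbols))  # one C=C double bond: 1.0
```

The xyz files in the dataset may carry extra columns or header fields beyond the plain format, in which case the parser would need adjusting after inspecting a sample file.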

Funding resources

DOE contract number

DE-AC05-00OR22725

Originating research organization

Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)

Sponsoring organization

Office of Science (SC)

Details

DOI

10.13139/ORNLNCCS/1996925

Release date

September 1, 2023

Dataset

Dataset type

ND Numeric Data

Software

Python, Chemiscope, RDKit

Acknowledgements

Users should acknowledge the OLCF in all publications and presentations that speak to work performed on OLCF resources:

This work was carried out [in part] at Oak Ridge National Laboratory, managed by UT-Battelle, LLC for the U.S. Department of Energy under contract DE-AC05-00OR22725.

Category

  • 37 INORGANIC, ORGANIC, PHYSICAL, AND ANALYTICAL CHEMISTRY

Keywords

  • Optical Property of Molecules
  • Inverse Design
  • Density-Functional Tight-Binding