Skip to main content

Data Depositor Guide

Welcome! Constellation is a service of the Oak Ridge Leadership Computing Facility (OLCF), available as a long term public data repository for OLCF users as well as Oak Ridge National Laboratory staff. Datasets published in Constellation are assigned a DOI by the DOE Office of Scientific and Technical Information and reported for inclusion in the DOE Data Explorer.

Data submitted to this repository may be:

1) the results of research conducted using a project allocation on OLCF resources including Frontier, Summit, Titan, Slate, Andes, etc., or

2) funded wholly or in part by the US Department of Energy under its contract with UT-Battelle, LLC (DE-AC05-00OR22725) for the management of ORNL.

For questions and help with data deposits, email doi_support@ornl.gov.

Steps to publish a dataset in Constellation

There are seven steps involved in publishing your dataset on Constellation:

  1. Obtain a Globus ID
  2. Create your Constellation user account
  3. Reserve a DOI for your dataset
  4. Add metadata and create a README
  5. Add data files and submit your dataset for review
  6. Address any questions or concerns about your dataset received from ORNL curators
  7. Receive notification that your dataset has been published

Staff members of Oak Ridge National Laboratory should also follow relevant SBMS procedures to Review and Release Scientific and Technical Information. A submission workflow for datasets is available in RESolution, where submitters will be asked to provide the DOI that has been reserved in Constellation.

Detailed submission instructions

 

1. Obtain a Globus ID

a. If you don't already have a Globus ID, go to https://globusid.org/create and fill in the form to create a Globus ID. 

b. You will be asked to verify your email address.

c. Once your email address is verified, your Globus ID can be found under Settings > Account > Identity. It should be in the form of [globus_username]@globusid.org

 

2. Create your Constellation user account

a. Click on the 'Log In' button on the Constellation website. Register using your ORNL UCAMS/XCAMS or ORNL credentials.

b. Enter your Globus ID and review and accept the User Agreement.

c. You will be taken to your account dashboard.

 

 3. Reserve a DOI for your dataset

a. Start the process of adding your dataset by clicking on the "Reserve new DOI" button in your user dashboard.

b. Enter a draft Title for your dataset.

c. Indicate whether the data you are submitting was created using resources of the Oak Ridge Leadership Computing Facility (OLCF). OLCF resources include Frontier, Summit, Titan, Slate (Onyx, Marble).

d. Click Save. Your dataset and reserved DOI will now be visible in your dashboard under "Draft datasets."

NOTE: This step does not actually create a DOI - rather, it creates a request record with OSTI that will be finalized when the dataset is published. Prior to submitting a DOI for review, you can update the DOI metadata and data files as many times as needed.

 

 4. Add metadata and create a README

In your user dashboard, find your dataset in the "Draft datasets" table and click the "add" button under "Metadata." This will open a form that enables entry of metadata for associated with the new DOI. The fields of this form are described below:


Title (required) A short descriptive title that will help users understand what the dataset contains.

Authors (required) You can add multiple authors to the DOI, and you must have at least one author. Fill out all required fields for each author including: First Name, Last Name, Affiliation, and E-mail.

DOE Contract Nos (required) The Department of Energy contract number under which the work was funded. This field is validated against the OSTI contract authority. Multiple DOE contract numbers may be entered, separated by a semicolon. Invalid and non-DOE contract numbers should instead be entered in the 'Other Contract Nos' field.

Other Contract Nos (optional) Other contract numbers that do not fit elsewhere in the form, including ORNL and non-DOE funding identifiers. Multiple contract numbers may be entered in this text field.

Originating Organization (required) The name of the organization that performed the research or issued the dataset.

Sponsor Orgs. (required) The name of the organization that sponsored (provided funding) for the dataset.

Contributing Orgs (optional) Any organizations that contributed to the dataset through significant review, site management, data collection, etc.

Dataset Type (required) The data's main or most important content type.

Subject(s) (optional) The subject identifies the scientific discipline of the DOI based on a list provided by OSTI. Multiple subjects may be selected.

Keywords (optional) Terms that describe the content of the DOI's dataset and help users discover the data. More than one term may be entered in this text field.

Description (required) Long description of the DOI being created, similar to an abstract.

Software Needed (optional) Any software needed to access the dataset contents. 

OLCF Project Identifier (required) The Oak Ridge Leadership Computing Facility project ID assigned to the research associated with the DOI. Project identifiers are usually six alphanumeric digits, such as 'ABC001'. If the project was not funded by OLCF, do not complete this field.

Product Nos. (optional) Product Numbers identifying the dataset that have been assigned by either the originating/submitting organization or by the organization currently hosting the data.

Other Identifying Nos. (optional) Any other identifying numbers that do not fit anywhere else in this form.

Related Identifiers (optional) The DOI being created may be related to other DOIs or web content. Select whether the related identifier is a DOI or URL, provide the identifier link, and select a phrase that communicates the relationship of the Constellation dataset to the related item.

c. When all required fields are complete, click the “Save” button to save the DOI request as a draft. If any required field is omitted, you will see a “Data Entry Error” alert indicating the error. To resolve the issue, enter the missing information in the form and click the "Save" button again.

d. A README file is recommended for inclusion with all data deposits. A README provides additional context and citation information for your dataset, and should help future researchers understand and reuse your data. If you do not have an existing README file, a Constellation curator can share our template via email (doi_support@ornl.gov).

 

5. Add data files and submit your dataset for review

a. You can access the Globus endpoint for a DOI reservation by clicking the "upload" link in your user dashboard table of draft DOIs. This will open the Globus web interface to your assigned directory in the OLCF DOI-UPLOADS collection.

Files may be transferred from another Globus collection (ex. an OLCF storage system) or uploaded from your local machine. To add data files over 1 GB in size from your laptop or desktop, install Globus Connect Personal to create your own Globus collection.

b. Add your README file or other dataset documentation to the same directory.

c. To send your dataset for curator review and publication, return to your Constellation dashboard. Open the metadata form, change the dropdown status at bottom of page to "Needs Approval" and click "Save."

 

6. Address any questions or concerns about your dataset received from ORNL curators

a. DOIs that are created using the OLCF Constellation Portal are reviewed prior to publication by a member of the Constellation Data Curation team. Curators make sure data does not contain PII, check metadata and documentation for completion, and make suggestions to improve data discoverability and reuse. You will be contacted if the curator has any concerns or is requesting changes to the dataset.

b. Datasets may be approved, rejected, or encounter a failure in the issue process. You will be contacted by a curator if your dataset is rejected or the issue process has failed.

c. Your account dashboard will display a list of datasets that are in the process of being uploaded, reviewed and published. Each dataset will indicate its current status:

        i. Draft (you are working on it)

        ii. Submitted for approval (dataset has been submitted for review and approval)

        iii. Approved (dataset has been approved and is being moved to the data repository)

        iv. Published (available for download)
 

 

7. Receive notification that your dataset has been published

Once your dataset has been approved and it has been moved to the ORNL data archive, its status will change to "Published" and it will be made available for download by anyone. You will receive an email once the DOI has been transmitted to OSTI for activation.