Pseudonymized User-Perspective Summit Login Node Data for 2020 and 2021


This dataset contains hourly snapshot data from each of the 5 login nodes of the Summit supercomputer at the Oak Ridge Leadership Computing Facility (OLCF) over a period of 2 years, from January 2020 through December 2021. The snapshots include lists of currently logged-in users, CPU and memory usage, status of users' batch jobs, and disk usage statistics. Usernames, project identifiers, and file paths have been pseudonymized to allow studies of user behavior without divulging Personally Identifiable Information (PII).
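The description above does not say how the pseudonymization was performed. As a rough illustration of why pseudonymized identifiers still support behavioral studies, the sketch below uses a keyed hash (HMAC-SHA256) so that the same username always maps to the same pseudonym across snapshots. The key, the `pseudonymize` helper, the record fields, and the sample values are all hypothetical; none of them is taken from the dataset or from the authors' method.

```python
import hashlib
import hmac

# Hypothetical secret key (an assumption for this sketch); a real
# pseudonymization process would keep its key private.
SECRET_KEY = b"example-key-not-from-the-dataset"


def pseudonymize(value: str, prefix: str, key: bytes = SECRET_KEY) -> str:
    """Map an identifier to a stable pseudonym using a keyed hash."""
    digest = hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()
    # Keep only the first 12 hex characters so the pseudonym stays readable.
    return f"{prefix}_{digest[:12]}"


# Made-up snapshot records; the field names are assumptions,
# not the dataset's actual schema.
records = [
    {"hour": "2020-01-01T00:00", "node": "login1", "user": "alice"},
    {"hour": "2020-01-01T01:00", "node": "login3", "user": "alice"},
    {"hour": "2020-01-01T01:00", "node": "login3", "user": "bob"},
]

# Replace each username with its pseudonym, leaving other fields untouched.
pseudonymized = [{**r, "user": pseudonymize(r["user"], "user")} for r in records]
```

Because the mapping is deterministic, activity by one user can still be followed across hours and login nodes, while the original username does not appear in the output.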

Published: 2022-05-05 17:22:33

Dataset Properties

Field                       Value
Creators
  • Maheshwari, Ketan (Oak Ridge National Laboratory)
  • Wilkinson, Sean (Oak Ridge National Laboratory)
  • Ferreira da Silva, Rafael (Oak Ridge National Laboratory)
Project Identifier          DLSW
Dataset Type                SM (Specialized Mix)
Keywords
  • Summit
  • User Behavior
Originating Organizations   Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organizations    Office of Science (SC)
DOE Contract                DE-AC05-00OR22725


Papers using this dataset are requested to include the following text in their acknowledgements:

Support for 10.13139/OLCF/1866372 is provided by the U.S. Department of Energy, project DLSW under Contract DE-AC05-00OR22725. This research used resources of the Oak Ridge Leadership Computing Facility, which is a DOE Office of Science User Facility.