Skip to main content

Dataset for Top Model Decision Tree: Selecting Segmentation Models for Reliable Quantitative Analysis in Low- and Ultralow-Dose CryoEM

    Lynnicia N Massenburg | Oak Ridge National Laboratory
    Sita S Madugula | Oak Ridge National Laboratory
    Spenser R Brown | Oak Ridge National Laboratory
    Amber N Bible | Oak Ridge National Laboratory
    Chanda R Harris | Oak Ridge National Laboratory
Download Dataset on Globus

Description

Motivation Multiple deep learning model architectures can be used to segment bacterial membranes in cryoEM images. However, an AI-based tool advancement is often presented with only a single segmentation model for broad use, and this single model may show inconsistent results across datasets from different users. Here, we present the Top Model Decision Tree, a model screening framework to screen for the best model to generate bacterial inner and outer membrane masks based on user priorities. We use pre-trained segmentation models from YOLOv11, YOLO26, U-Net, Detectron2 and SAM3 fine-tuned on bacterial inner and outer membranes imaged with cryoEM. Run the Framework This notebook must be opened in Google Colab. Mount Google Drive and run with a GPU-based runtime. Open the notebook and follow steps to git clone in folders and files within this repository. There will be a repeating top_model_decision_tree.ipynb (notebook clone) that will not be used. Save your .png binary mask files and .csv table outputs within your Google Drive or download before closing the notebook. The models and all analysis/training scripts are available at [GitHub: https://github.com/Lynnicia/CryoEM_membranes_top_model_decision_tree and https://github.com/Sireesiru/Semantic-Segmentation-of-bacterial-cell-envelope-using-U-Nets.

Funding Information

DOE Contract Number

AC05-00OR22725

Originating Research Organization

Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)

Sponsoring Organization

Office of Science (SC)

Related Works

Details

Release Date

June 10, 2026

Subject

60 APPLIED LIFE SCIENCES, 59 BASIC BIOLOGICAL SCIENCES

Keywords

machine learning, computer vision, artificial Intelligence, bacteria

Dataset

Dataset Type

SM Specialized Mix

Software

Ultralytics (YOLOv11, YOLO26), U-Net, Detectron2, SAM3

Cite This Dataset:

Massenburg, L., Madugula, S., Brown, S., Bible, A., Harris, C., Zhang, L., Parker, K., Retterer, S., Morrell-Falvey, J., Vasudevan, R., Williams, A. (2026). Dataset for Top Model Decision Tree: Selecting Segmentation Models for Reliable Quantitative Analysis in Low- and Ultralow-Dose CryoEM. Oak Ridge National Laboratory. https://doi.org/10.13139/ORNLNCCS/3025228.

Acknowledgements

This work was carried out [in part] at Oak Ridge National Laboratory, managed by UT-Battelle, LLC for the U.S. Department of Energy under contract DE-AC05-00OR22725.