View markdown source on GitHub

Galaxy Tabular Learner: Building a Model using Chowell clinical data

Contributors

Questions

Objectives

last_modification Published: May 5, 2025
last_modification Last Updated: Jan 10, 2026

What you will do

Speaker Notes

This tutorial treats the published LORIS LLR6 model as a benchmark baseline. The main goal is to understand what changes (and what does not) when the model is rebuilt under a standardized Galaxy workflow.


Tool availability


Use case: LORIS LLR6 (Chang et al., 2024)

schema of the whole process of training model and test.


Dataset and predictors

Predictors (LLR6):

Target:


Model selection idea

Tabular Learner can:

  1. Compare multiple candidate classifiers and pick the best under its evaluation protocol.
  2. Restrict candidates to a specific family (e.g., logistic regression only).

In this tutorial, we train all candidate models and check whether logistic regression remains among the best performers.


Data upload

Import the preprocessed TSV files from Zenodo:

Ensure the datatype is tabular, and optionally tag datasets for traceability.

Speaker Notes

The tutorial focuses on the preprocessed tables; a separate notebook/script performs the truncation and encoding steps.


Run 1: Train and select a best model

Tabular Learner parameters:

Run the tool to produce the Best Model and the HTML report.


Run 2: Re-evaluate at a selected threshold

Rerun Tabular Learner with:

Speaker Notes

Threshold-dependent metrics (accuracy, precision, recall, F1, MCC) change with the cutoff. This second run makes threshold choice explicit for comparisons.


Outputs to inspect


Report structure

The report has four tabs:

report tabs


Benchmarking approach

Separate metrics into:

Use the report to check:


Key numbers

Model Threshold Accuracy ROC-AUC PR-AUC F1
LLR6 (reference) 0.30 0.68 0.72 0.53 0.53
Tabular Learner Run 1 0.50 0.80 0.76 0.55 0.42
Tabular Learner Run 2 0.25 0.67 0.76 0.55 0.52

Interpreting the differences

Bar chart comparing LLR6 metrics vs. Tabular Learner (threshold = 0.25)


Conclusion


Galaxy Training Resources

GTN stats


Thank you!

This material is the result of a collaborative work. Thanks to the Galaxy Training Network and all the contributors! Galaxy Training Network Tutorial Content is licensed under Creative Commons Attribution 4.0 International License.