A Disease Progression Modelling (DPM) implementation of the Goldilocks framework for data-driven model configuration.
Goldilocks (conference paper: Oxtoby, AAIC 2024) is a framework for helping users ensure that their data-driven model of choice is configured “just right” for the available data. Conceptually, Goldilocks informs on feature selection and hyperparameter tuning, with respect to signal in the data. This complements work in the field of explainable AI.

goldilocks-dpm workflowSee goldilocks-pysustain.py for a worked example using ZScoreSustain.
## 1. Prepare your data
## 2. Create a Goldilocks DPM object and run the framework
## ZScoreSuStaIn
from goldilocks_dpm import goldilocks_ZscoreSustain
gdpm = goldilocks_ZscoreSustain(
classes = y,
dpmData = X,
output_folder = "path/to/output_folder",
robust_zscores = False,
case_label = 1,
ctrl_label = 0,
direction_abnormal = direction_abnormal,
biomarker_labels = biomarkers
)
gdpm.run_goldilocks()
## 3. Interrogate the resulting output.
print(gdpm.Z_vals)
print(gdpm.Z_max)
MIT.