Benchmarking¶
Public Exports¶
uncertainty_flow.benchmarking exports:
BenchmarkConfigBenchmarkResultBenchmarkRunnerAVAILABLE_DATASETSCHRONOS_DATASETSlist_datasetslist_datasets_by_domainload_datasetdownload_datasetTuningResultauto_tune
Module Structure¶
flow.py:BenchmarkFloworchestration moduleproviders.py: provider interface seam and stable built-in model providersconfigs.py: benchmark/build configuration contractsresults.py: benchmark result contractssinks.py: output serialization policy viaResultSinkrunner.py: public compatibility adapter over flow/providers/sinks
Stable Model-Name Contract¶
Built-in benchmark names remain stable:
quantile-forestconformal-regressorconformal-forecaster
Lifecycle¶
BenchmarkFlow lifecycle:
loadsplittune-per-run-contextfit/predictevaluatesink