DMT-Eval: Universal Validation Framework
Data, Models, Tests — validation as structured scientific argumentation. DMT-Eval decouples analyses from models through formal adapter interfaces, producing structured scientific reports (LabReports) from any (model, data) pair. The architectural insight was proven over seven years at the Blue Brain Project (EPFL, 2017–2024) and is now rebuilt for any domain where computational models need systematic evaluation. Live Demo bench.mayalucia.dev — run evaluations in real time. Weather prediction, drug efficacy, and Brain-Score NeuroAI benchmarks, all producing structured LabReports through the same pipeline. ...