Support a directory of manual dataset definitions
We currently "harvest" manual datasets from a single file, but it should be trivial to support a directory of multiple files which should allow us to structure these files, e.g. with one file per data provider.