This presentation by Jun Zhao, University of Oxford – but missed the start as I presented just now, and getting myself sorted…
Want to use some of the systems/expertise of libraries (esp. digital preservation) to preserve workflows – to make experiments repeatable at any time in the future. Project is ‘Workflow4Ever‘ – http://www.wf4ever-project.org/
Need to be able re-run experiments over data sets as data sets grow – does finding remain true as data grows.
Biology Use case – ‘reuse’:
- Search for existing experiments from myExperiment (http://myexperiment.org)
- Challenge – understand the workflow
- Perform test runs with test data and his/her own data
- Read others’ logs
- Read annotations to workflows
- Reuse scripts from colleagues and perform test that his/her colleagues are familiar with
Provenance Challenges:
- Identity
- Context
- Storage
- Retrieval