Provenance in the Dynamic, Collaborative New Science

This presentation was by Jun Zhao, University of Oxford. I missed the start, as I had just presented myself and was still getting sorted…

The aim is to use some of the systems and expertise of libraries (esp. digital preservation) to preserve workflows, so that experiments are repeatable at any time in the future. The project is ‘Workflow4Ever’ – http://www.wf4ever-project.org/

Need to be able to re-run experiments over data sets as those data sets grow: does a finding remain true as the data grows?
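As a rough illustration of that question (my own sketch, not something shown in the talk): re-run the same check over successively larger snapshots of a data set and see whether the result still holds. The "finding" here (mean above a threshold), the function names and the numbers are all made up for the example.

```python
from statistics import mean


def finding_holds(values, threshold=0.5):
    """Hypothetical 'finding': the mean measurement exceeds a threshold."""
    return mean(values) > threshold


def rerun_over_growth(snapshots, threshold=0.5):
    """Re-run the same check on each successively larger snapshot of the data."""
    return [(len(s), finding_holds(s, threshold)) for s in snapshots]


# Made-up measurements; later values pull the mean down.
data = [0.9, 0.8, 0.7, 0.2, 0.1, 0.1]

# Simulate the data set growing from 2 to 4 to 6 measurements.
snapshots = [data[:n] for n in (2, 4, 6)]

results = rerun_over_growth(snapshots)
for size, holds in results:
    print(f"{size} measurements: finding holds = {holds}")
```

With these invented numbers the finding holds for the first two snapshots and fails on the full set, which is exactly the situation a preserved, re-runnable workflow would let you detect.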

Biology Use case – ‘reuse’:

  • Search for existing experiments from myExperiment (http://myexperiment.org)
    • Challenge – understand the workflow
    • Perform test runs with test data and with the researcher’s own data
    • Read others’ logs
    • Read annotations to workflows
  • Reuse scripts from colleagues and perform tests that those colleagues are familiar with

Provenance Challenges:

  • Identity
  • Context
  • Storage
  • Retrieval
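My notes only captured the four headings, so the following is one possible reading of them rather than anything presented in the talk: a toy provenance record in which identity is a content hash, context is free-form metadata, storage is an in-memory dict standing in for a real archive, and retrieval is a lookup by a context field. Every name here is hypothetical.

```python
import hashlib
import json


def record_id(workflow_text, inputs):
    """Identity: derive a stable ID from the workflow text and its inputs."""
    payload = json.dumps(
        {"workflow": workflow_text, "inputs": inputs}, sort_keys=True
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def make_record(workflow_text, inputs, context):
    """Context: keep who/why metadata alongside the identified run."""
    return {
        "id": record_id(workflow_text, inputs),
        "workflow": workflow_text,
        "inputs": inputs,
        "context": context,
    }


# Storage: a plain dict is an assumption standing in for a real archive.
store = {}


def save(record):
    """Store the record under its ID and return that ID."""
    store[record["id"]] = record
    return record["id"]


def find_by_context(key, value):
    """Retrieval: return all stored records whose context matches key=value."""
    return [r for r in store.values() if r["context"].get(key) == value]


rid = save(make_record("align -> filter -> plot", ["sample_a.csv"],
                       {"author": "alice", "purpose": "test run"}))
matches = find_by_context("author", "alice")
```

Hashing the workflow together with its inputs means the same run always gets the same ID, while a changed input produces a different one.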
