CORE (COnnecting REpositories) (Presented by Petr Knoth from the Open University)
Working with content and metadata from Open Access institutional repositories (approx. 167 repositories in the UK). Mainly interested in full-text items (approx. 10% of metadata records in repositories have full-text items attached).
Will use OAI-PMH to harvest metadata, and then use that metadata to grab the PDF (or other full-text) representation of each resource. Will then analyse the content, find ‘similarities’ between items, and express these as RDF. The RDF will then be made available via a triple store.
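As a rough illustration of the harvesting step (not CORE's actual code), the sketch below builds an OAI-PMH `ListRecords` request URL and pulls candidate full-text links out of a response. The repository URL and sample record are made up, and it assumes full-text links appear as `dc:identifier` values ending in `.pdf`, which will not hold for every repository.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Standard namespaces for OAI-PMH 2.0 responses and unqualified Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build an OAI-PMH ListRecords request URL."""
    if resumption_token:
        # resumptionToken is an exclusive argument: it replaces metadataPrefix.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def full_text_links(response_xml):
    """Map each record identifier to its candidate PDF URLs.

    Assumption: a full-text link is a dc:identifier ending in '.pdf'.
    Records without such a link (metadata only) are skipped.
    """
    root = ET.fromstring(response_xml)
    links = {}
    for record in root.iterfind(".//oai:record", NS):
        oai_id = record.findtext("oai:header/oai:identifier", namespaces=NS)
        urls = [
            el.text.strip()
            for el in record.iterfind(".//dc:identifier", NS)
            if el.text and el.text.strip().lower().endswith(".pdf")
        ]
        if urls:
            links[oai_id] = urls
    return links


# Hypothetical response: one record with a PDF attached, one metadata-only.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:repo.example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A paper with full text</dc:title>
          <dc:identifier>http://repo.example.org/1/</dc:identifier>
          <dc:identifier>http://repo.example.org/1/paper.pdf</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header><identifier>oai:repo.example.org:2</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Metadata only</dc:title>
          <dc:identifier>http://repo.example.org/2/</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

if __name__ == "__main__":
    print(list_records_url("http://repo.example.org/oai"))
    print(full_text_links(SAMPLE))
```

A real harvester would fetch each URL, follow any `resumptionToken` returned until the list is exhausted, and then download the linked files for analysis.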
Have started working with the Open University repository (ORO) – finding that about 30% of records have full text. Will focus on extracting relationships – specifically ‘semantic similarity’ based on content… (rather than on metadata)
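The talk did not say how the similarity is computed. Purely as a stand-in, the sketch below uses cosine similarity over raw term counts and writes each sufficiently similar pair as an N-Triples line. The `similarTo` predicate, the document URIs and the 0.5 threshold are all hypothetical.

```python
import math
import re
from collections import Counter
from itertools import combinations

# Hypothetical predicate; the notes do not name the vocabulary CORE will use.
SIMILAR_TO = "http://example.org/vocab#similarTo"


def term_vector(text):
    """Bag-of-words term counts over lowercase alphabetic tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def similarity_triples(docs, threshold=0.5):
    """Return one N-Triples line per document pair scoring at or above threshold.

    docs maps a document URI to its extracted full text.
    """
    vectors = {uri: term_vector(text) for uri, text in docs.items()}
    triples = []
    for u, v in combinations(sorted(vectors), 2):
        if cosine(vectors[u], vectors[v]) >= threshold:
            triples.append(f"<{u}> <{SIMILAR_TO}> <{v}> .")
    return triples


# Made-up documents standing in for text extracted from harvested PDFs.
DOCS = {
    "http://repo.example.org/1": "open access repositories harvest metadata",
    "http://repo.example.org/2": "harvest metadata from open access repositories",
    "http://repo.example.org/3": "protein folding in yeast cells",
}

if __name__ == "__main__":
    for line in similarity_triples(DOCS):
        print(line)
```

Because the comparison runs on the extracted text rather than on titles or keywords, it matches the content-based focus described above. A production system would likely use a weighted scheme such as TF-IDF and avoid comparing every pair directly.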
Use cases – a demonstrator client that can be integrated into any repository and will provide links to papers in other repositories based on the similarity relationships. It will be open to any institution to use.