intute repository search project a showcase for uk research output sophia jones sherpa...
TRANSCRIPT
Intute Repository Search Project
A showcase for UK research output
http://www.intute.ac.uk/irs/
Sophia JonesSHERPA
October 2008
Introduction
• Funded by JISC (Joint Information Systems Committee) - www.jisc.ac.uk
• a Mimas/ Intute project (www.mimas.ac.uk), in partnership with UKOLN and SHERPA.
• Three year project – current project finishes in May 2009
High Level Aims
• Identify, develop and support high-value research knowledge communities within search and discovery process;
• Encourage the embedding of Repository search in familiar and day-to-day research desktop environments;
• Develop metrics to estimate the volume of content being cross-searched and support the future development of metrics to aid the research appraisal process;
• Showcase the collective and collaborative UK Research output and perspectives in Institutional Repositories.
High Level Aims (cont.)
• Deliver free targeted search and discovery facilities derived from UK HEI Institutional Repositories to support researchers and academics;
• Harvesting and aggregation of repositories of academic and research papers from HE and other relevant open access sources across the UK;
• Provide improved services to individuals including ability to personalise information based on user profile, directed browse and dynamic navigation;
• Help to develop meaningful synergies between research orientated and learning object repositories;
• Investigate opportunities to include other relevant information sources outside the UK as guided by the academic community;
• Provide a richer more meaningful contextual search facility including full-text searching, text mining and other techniques;
Scope
• The scope of the IRS Project is intended to initially cover both a UK national and where appropriate, global dimension and support the following domains:
– Research Lifecycle (discovery, development, collaboration,
dissemination);
– Teaching & Learning (resources, pedagogic activity / processes,
resource-based learning);
– Research Administration (deposition, repository support and
exposure)
The Challenge & Complexities
• Knowledge Management Context for Researchers, Teachers and Students
Knowledge Context
• Where can I find…?
• What can help me?
• Who can help me?
• What do we know?
• What do I / we don’t know?
Simple Search Full-text Search Subject Classification Automated aggregation Concept matching Personalisation
Search paradigm Discovery Paradigm Meaning-based computing
Content Context
Development Paths
• Simple metadata search
• Full-text indexing of documents
• Text-mining full-text documents
• Consider approaches to automatic subject classification, clustering of results, browsing/visualisation
Metadata search
• Metadata harvested from institutional repositories
• Cross repository search, aggregation and retrieval from all HE and relevant UK repositories, including the Depot
• Searches over 90 academic repositories (taken from the OpenDOAR directory of Open Access repositories) http://www.opendoar.org/
http://www.intute.ac.uk/irs/
Advanced discovery and retrieval
• Parallel approaches:– NaCTEM (National Centre for Text Mining) -
www.nactem.ac.uk – Autonomy IDOL software
• Advanced browsing and searching –– Automated document clustering and classification based
on terminology– Personalisation of searching– Concept visualisation from automated clustering
Broad Range of Requirements
Ongoing requirements focusing on:
• Capture, analysis and management of scenario-based requirements (stakeholders and end-users);
Researcher (PI, Assistant, Post Grad)
Teaching & Learning Community
Higher Education Academy - LSCAcademic & Research Deans
Knowledge / Info specialists (HE/FE)
Cross-sector shared service developments
Other National Repository Aggregators
Common Repository Interface WG - JISC
Research Support Departments
Developers
JISC
UK Research Councils
JISC Repositories & PreservationInstitutional Repository Managers
JISC IE & IEMSR
Librarians
Commercial Technology Stakeholders
Standards Communities
Interworking during project lifecycle
Suggested areas:
• Links with international projects/initiatives - joint information gathering and setting up strategic alliances – broader research & teaching knowledge domain
• Advocacy – e.g. RSP including sharing plans and approaches and findings, also joint events, conference papers, publicity materials i.e. very practical efficiency savings
• Repository landscape - making sense of it together in order to prioritize strategically and identify quick wins (e.g. prioritizing search targets);
• Sharing links and experiences of related work e.g. UK PubMed, DRIVER and sharing experience outputs relevant to other projects
Key activity towards end of project
• Integration of user group requirements into development iterations
• Report on global repository interactions
• Learning object scoping study