untitled i: challenges ahead

Herbert Van de Sompel, Research Library, Los Alamos National Laboratory, USA. Olybris 2005, Monday April 18th, Kos, Greece.

Page 1: Untitled I: Challenges ahead

Untitled I: challenges ahead

Herbert Van de Sompel
Research Library, Los Alamos National Laboratory, USA

Olybris 2005, Monday April 18th, Kos, Greece

Page 2: Untitled I: Challenges ahead

About?

• Original intent: talk about technical work at LANL

• But LANL is sooooo different:
  - Local storage of terabytes of content
  - Local creation of services over that content

⇒ Whatever LANL does doesn’t apply to other libraries

Page 3: Untitled I: Challenges ahead

About?

• In this keynote, I will:
  • Show that many libraries will soon be in a quite similar situation
  • Explore the characteristics and consequences of that situation
  • Focus on fundamental infrastructure

• Structure:
  • Slides that make the major arguments
  • Sidebars that illustrate (related) thoughts

Page 4: Untitled I: Challenges ahead

Sidebar

Page 5: Untitled I: Challenges ahead

A brief history of digital library collections

[Table: rows are catalogue, A&I, and full content; columns are Storage and Service, each split into Local and Remote. Catalogue and full content are each marked in two cells; A&I is marked in all four.]

• 2 considerations:
  o Minimal locally hosted collection
  o Storage and Service are tied together

• Both will change

Page 6: Untitled I: Challenges ahead

the repository model

Page 7: Untitled I: Challenges ahead

The repository model

"Pattern Recognition: The 2003 OCLC Environmental Scan"
http://www.oclc.org/membership/escan/toc.htm

Page 8: Untitled I: Challenges ahead

The repository model

Different repository types:
• scholarly communication (preprint, postprint),
• dataset repositories,
• cultural heritage collections,
• cultural event collections,
• learning object repositories,
• teaching object repositories,
• digitized book repositories,
• …

Can be institution-based, discipline-based, …

Page 9: Untitled I: Challenges ahead

http://www.arl.org/newsltr/226/ir.html

Page 10: Untitled I: Challenges ahead

The repository model

Before they know it, institutions will be swamped with digital information of all kinds

Libraries seem to be the natural parties to take care of this

Vast growth of digital collection:

• Local repository(ies)
• Thousands of remote repositories

Page 11: Untitled I: Challenges ahead

The repository model

Explore (some of) the characteristics & consequences of this model:

• Value chains starting in repositories
• Local capacity
• Archiving
• Rights
• Interoperability
• Standards

Page 12: Untitled I: Challenges ahead

The repository model

Explore (some of) the characteristics & consequences of this model:

• Value chains starting in repositories
• Local capacity
• Archiving
• Rights
• Interoperability
• Standards

Page 13: Untitled I: Challenges ahead

Value chains starting in repositories

• New knowledge is really created when we allow for unanticipated uses of stuff.

• These repositories are not about creating services for local users (only)

• These repositories are not about creating a service (user interface) for all users

• These repositories are about facilitating the use of materials in many contexts

• These repositories are the starting point of value chains

Page 14: Untitled I: Challenges ahead

http://www.technorati.com

• Value chains emerging from RSS feeds

Page 15: Untitled I: Challenges ahead

Example: scholarly communication value chains

• Journal system is just one possible, vertically integrated value chain

• In a networked world, the functions it performs can/will be handled in a deconstructed/distributed manner:

  o registration in repository
  o validation by different nodes/parties
  o archiving by different nodes/parties
  o awareness by different nodes/parties

Page 16: Untitled I: Challenges ahead

Example: scholarly communication value chains

[Figure labels: registration, validation, awareness, archiving]

Page 17: Untitled I: Challenges ahead

http://dx.doi.org/10.1045/september2004-vandesompel

Page 18: Untitled I: Challenges ahead

Value chains starting in repositories

• Lesson learned:

To allow value chains to emerge on the basis of materials in repositories, those repositories need a clear/clean machine interface that allows downstream applications to consume materials, aggregate them, build services, …

⇒ Disconnection of repository content and service: allows for creation of both local and remote services

⇒ On-Web: Protocol-oriented interface

⇒ These value chains are about the real stuff, not (only) about metadata
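As an illustration of what a protocol-oriented machine interface can look like to a downstream application, here is a minimal sketch that only builds an OAI-PMH ListRecords request URL. The verb and argument names (verb, metadataPrefix, from, set) come from the OAI-PMH 2.0 protocol; the endpoint address and the helper name are assumptions made for this example, and no request is actually sent.

```python
from urllib.parse import urlencode

# Hypothetical repository endpoint, used for illustration only.
BASE_URL = "https://repository.example.org/oai"


def list_records_url(base_url, metadata_prefix, from_date=None, set_spec=None):
    """Build an OAI-PMH ListRecords request URL (no network access)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date:
        params["from"] = from_date  # selective harvesting by datestamp
    if set_spec:
        params["set"] = set_spec  # selective harvesting by set
    return base_url + "?" + urlencode(params)


url = list_records_url(BASE_URL, "oai_dc", from_date="2005-04-01")
print(url)
```

A harvester interested in the materials themselves rather than descriptive metadata would presumably ask for a complex-object format instead of oai_dc, provided the repository exposes one.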

Page 19: Untitled I: Challenges ahead

http://dx.doi.org/10.1045/december2004-vandesompel

• LANL aDORe

• APS/LANL

• DSpace plugin

• mod_oai

Page 20: Untitled I: Challenges ahead

The repository model

Explore (some of) the characteristics & consequences of this model:

• Value chains starting in repositories
• Local capacity
• Archiving
• Rights
• Interoperability
• Standards

Page 21: Untitled I: Challenges ahead

Local capacity

• Need basic infrastructure to be able to deal with digital materials of all kinds

• Infrastructure has the real stuff, not metadata, at its core
• DSpace, eprints.org, Fedora, …

• Doctypes?
• Vertical application vs basic plumbing?
• Service-orientation?
• On-Web?
• Multiple repositories?
• Scale?

Page 22: Untitled I: Challenges ahead

[Figure: aDORe architecture. Labels: A&I, publisher, LANL TechReport, FTXT; Ingest; DIDs; ARC; OpenURL; baseURL(1) … baseURL(x); OAI-PMH request; DID, METS, IMS-CP, ...; Repository Index; Identifier Locator; Content-id or Package-id; baseURL(n) & Package-id; OAI-PMH Federator; OpenURL gateway; DIM Inserter (DID, DID + DIM); Profile/Behavior Registry; Registry of transformations; MPEG-21 DIP Engine; transformed content.]

Page 23: Untitled I: Challenges ahead

http://arXiv.org/abs/cs.DL/0502028

• not a product

Page 24: Untitled I: Challenges ahead

The repository model

Explore (some of) the characteristics & consequences of this model:

• Value chains starting in repositories
• Local capacity
• Archiving
• Rights
• Interoperability
• Standards

Page 25: Untitled I: Challenges ahead

Archiving

• Very early days
• Current strategies:
  • Deal with materials in a way that supports their preservation:
    • Be certain of what you store / Record datastream-related metadata
  • Risk-detection tools
  • Mirroring
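To make "record datastream-related metadata" concrete, the following is a minimal sketch that captures a few technical facts about a datastream at ingest and re-checks them later. The field names and both helper functions are assumptions made for this example. The MIME type is only guessed from the file name, which is far weaker than the format validation a dedicated tool performs.

```python
import hashlib
import mimetypes
from datetime import datetime, timezone


def describe_datastream(name, data):
    """Return basic technical metadata for one datastream (bytes)."""
    mime, _ = mimetypes.guess_type(name)  # guess from the file name only
    return {
        "name": name,
        "size": len(data),
        "sha256": hashlib.sha256(data).hexdigest(),
        "mime_guess": mime or "application/octet-stream",
        "recorded": datetime.now(timezone.utc).isoformat(),
    }


def is_unchanged(record, data):
    """Check stored bytes against the size and digest recorded at ingest."""
    return (
        record["size"] == len(data)
        and record["sha256"] == hashlib.sha256(data).hexdigest()
    )


data = b"%PDF-1.4 example bytes"
record = describe_datastream("paper.pdf", data)
print(record["size"], record["sha256"][:12], record["mime_guess"])
print(is_unchanged(record, data))
```

Running the same check periodically is one simple way to notice that stored bytes no longer match what was ingested.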

Page 26: Untitled I: Challenges ahead

http://hul.harvard.edu/jhove/

Page 27: Untitled I: Challenges ahead

http://metadata.net/panic/

Page 28: Untitled I: Challenges ahead

[Figure: XMLtape and ARC file storage. Labels: XMLtape holding DIDs; XMLtape Index with entries "DID-id n, DID-created n" and "(Byte offset n, Byte Count n), DID-id n"; ARC file holding resources; ARC Index with entries "ARC pointer n, arc id n"; pointers are OpenURLs.]
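The index labels in this figure (an identifier paired with a byte offset and a byte count) suggest a simple access pattern: append XML documents to one large file and look them up by position. The sketch below illustrates that general idea only. It is not the actual XMLtape format, and the function names, identifiers, and markup are invented for the example.

```python
import io


def write_tape(records):
    """Concatenate XML documents into one byte string and index them.

    records: iterable of (identifier, xml_string) pairs.
    Returns (tape_bytes, index), where index maps identifier -> (offset, count).
    """
    tape = io.BytesIO()
    index = {}
    for identifier, xml in records:
        payload = xml.encode("utf-8")
        index[identifier] = (tape.tell(), len(payload))
        tape.write(payload)
        tape.write(b"\n")  # separator, not counted in the byte count
    return tape.getvalue(), index


def read_record(tape_bytes, index, identifier):
    """Fetch one document by seeking to its offset and reading its byte count."""
    offset, count = index[identifier]
    stream = io.BytesIO(tape_bytes)
    stream.seek(offset)
    return stream.read(count).decode("utf-8")


# Placeholder markup, not real MPEG-21 DIDL.
records = [
    ("did-1", "<didl id='did-1'>first object</didl>"),
    ("did-2", "<didl id='did-2'>second object</didl>"),
]
tape, index = write_tape(records)
print(read_record(tape, index, "did-2"))
```

Because a lookup needs only the index and one seek, the documents can stay in a plain file that remains readable without any particular database software.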

Page 29: Untitled I: Challenges ahead

http://arXiv.org/abs/cs.DL/0503016

Page 30: Untitled I: Challenges ahead

Paper in June 2005 D-Lib

• APS/LANL mirroring:
  • Mirrors objects, not applications, not filesystems
  • Complex object format for XML-based object representation
  • OAI-PMH ~ syncing
  • XML Signatures ~ accuracy of data transfer
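The two roles named above, datestamp-based syncing and a check on the accuracy of the transfer, can be sketched in a few lines. This is a simplified illustration, not the APS/LANL implementation: an in-memory dictionary stands in for the source repository, a plain SHA-256 digest stands in for an XML Signature, and all names and sample data are invented.

```python
import hashlib


def digest(content):
    return hashlib.sha256(content).hexdigest()


# Source repository: identifier -> (datestamp, content). Invented sample data.
source = {
    "obj-1": ("2005-03-01", b"<object>one</object>"),
    "obj-2": ("2005-04-10", b"<object>two</object>"),
    "obj-3": ("2005-04-15", b"<object>three</object>"),
}


def changed_since(repo, last_sync):
    """Yield (id, datestamp, content, digest) for objects stamped on or after last_sync."""
    for identifier, (stamp, content) in sorted(repo.items()):
        if stamp >= last_sync:  # ISO dates compare correctly as strings
            yield identifier, stamp, content, digest(content)


def sync(mirror, repo, last_sync):
    """Copy changed objects into the mirror, verifying each digest on arrival."""
    copied = []
    for identifier, stamp, content, sent_digest in changed_since(repo, last_sync):
        if digest(content) != sent_digest:
            raise ValueError(f"transfer corrupted for {identifier}")
        mirror[identifier] = (stamp, content)
        copied.append(identifier)
    return copied


mirror = {"obj-1": source["obj-1"]}
print(sync(mirror, source, "2005-04-01"))  # ['obj-2', 'obj-3']
```

In a real transfer the digest or signature would be produced at the source and verified at the destination, so a mismatch would reveal damage in transit.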

Page 31: Untitled I: Challenges ahead

The repository model

Explore (some of) the characteristics & consequences of this model:

• Value chains starting in repositories
• Local capacity
• Archiving
• Rights
• Interoperability
• Standards

Page 32: Untitled I: Challenges ahead

Rights

• When facilitating the (re)use of materials (not just metadata), IP concerns increase significantly:
  • Data authenticity
  • Data integrity
  • Usage rights

• Need machine-readable rights expressions:
  • Robots are the next generation readers
  • Even when materials are “free”
  • Object-level expressions
  • The world of CC, MPEG-21 REL, ODRL, XrML
    o NISO meeting to explore needs of scholarly community in this realm
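A small sketch of why object-level, machine-readable rights matter to a robot reader: the robot can only decide to reuse an object if the object itself carries a statement it understands. The acceptance policy, the record layout, and the function name are assumptions made for this example. The license URIs follow the Creative Commons 2.0 pattern and are used purely as sample values.

```python
# License URIs a hypothetical harvesting robot has been configured to accept.
ACCEPTED = {
    "http://creativecommons.org/licenses/by/2.0/",
    "http://creativecommons.org/licenses/by-sa/2.0/",
}

# Invented sample objects, each with an optional object-level rights statement.
objects = [
    {"id": "obj-1", "rights": "http://creativecommons.org/licenses/by/2.0/"},
    {"id": "obj-2", "rights": None},  # "free", but no machine-readable statement
    {"id": "obj-3", "rights": "http://creativecommons.org/licenses/by-nc-nd/2.0/"},
]


def reusable(objs, accepted):
    """Return ids of objects whose rights statement is on the accepted list."""
    return [o["id"] for o in objs if o.get("rights") in accepted]


print(reusable(objects, ACCEPTED))  # ['obj-1']
```

Note that obj-2 is skipped even though it may well be free to use: without a statement attached to the object, the robot has nothing to act on.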

Page 33: Untitled I: Challenges ahead

Rights

• Urgent need for an environment in which scholarly assets behave in a manner that matches the “gift exchange” spirit of scholarship.

• James Boyle: Think about what we lose by sticking with the current paradigm!

  o enormous constraints on ability to use scholarly assets: process to extract knowledge, attach knowledge, mine, evolve, build upon: robots are the next generation readers

Page 34: Untitled I: Challenges ahead

http://creativecommons.org

Page 35: Untitled I: Challenges ahead

http://science.creativecommons.org

Page 36: Untitled I: Challenges ahead

http://www.openarchives.org/OAI/2.0/guidelines-rights.htm

Page 37: Untitled I: Challenges ahead

The repository model

Explore (some of) the characteristics & consequences of this model:

• Value chains starting in repositories
• Local capacity
• Archiving
• Rights
• Interoperability
• Standards

Page 38: Untitled I: Challenges ahead

Interoperability

• Use and re-use of materials in global context
  o Clean/clear machine interface is not enough.
  o Need cross-repository content-level interoperability
  o Interoperable, global federation of repositories

Page 39: Untitled I: Challenges ahead

Interoperability

• Architectural issues include:
  o Object representation (MPEG-21 DIDL, IMS/CP, METS, …)
  o Object identification
  o Object harvesting
  o Object disseminations
  o Object relationships
  o Discovery of object repositories
  o …

Page 40: Untitled I: Challenges ahead

http://cordra.net

Page 41: Untitled I: Challenges ahead

[Figure: aDORe architecture. Labels: A&I, publisher, LANL TechReport, FTXT; Ingest; DIDs; ARC; OpenURL; baseURL(1) … baseURL(x); OAI-PMH request; DID, METS, IMS-CP, ...; Repository Index; Identifier Locator; Content-id or Package-id; baseURL(n) & Package-id; OAI-PMH Federator; OpenURL gateway; DIM Inserter (DID, DID + DIM); Profile/Behavior Registry; Registry of transformations; MPEG-21 DIP Engine; transformed content.]

Page 42: Untitled I: Challenges ahead

The repository model

Explore (some of) the characteristics & consequences of this model:

• Value chains starting in repositories
• Local capacity
• Archiving
• Rights
• Interoperability
• Standards

Page 43: Untitled I: Challenges ahead

Standards

• Standards are the glue that holds the networked information environment together.

• Standards are crucial to facilitate the emergence of improved and integrated services across repositories.

• As the information environment becomes more complex, and as we move towards new levels of services, we will need more, not fewer, standards.

Page 44: Untitled I: Challenges ahead

Standards

• Standardization efforts/bodies in our community are seriously challenged:

  o Many standards defined outside our community.
  o Lack of impact on major standardization bodies of the networked world (W3C, IETF, IANA, …)

Page 45: Untitled I: Challenges ahead

http://info-uri.info

Page 46: Untitled I: Challenges ahead

Standards/Interoperability context

• Standardization efforts/bodies in our community are seriously challenged:

  o Many standards defined outside our community.
  o Lack of impact on major standardization bodies of the networked world (W3C, IETF, IANA, …)
  o Problems interconnecting within and amongst related efforts in our community: digital library, grid computing, e-learning, library automation, …
  o Operational models/processes not adequately adapted to the realities of the networked world (cf. patent challenges OpenURL, MetaSearch)
  o Funding for standardization efforts and related infrastructure is very hard to find (cf. OAI, CIMI, info URI Registry, OpenURL Registry, …)

Page 47: Untitled I: Challenges ahead

http://www.loc.gov/rr/program/lectures/moen.html

Page 48: Untitled I: Challenges ahead

http://www.sis.pitt.edu/~dlwkshop/paper_sompel.html

Page 49: Untitled I: Challenges ahead

http://opensearch.a9.com/spec/opensearchquerysyntax/1.0/

• there is something about simplicity

Page 50: Untitled I: Challenges ahead

http://opensearch.a9.com/spec/opensearchdescription/1.0/

• there is another page: more complex than you thought!

Page 51: Untitled I: Challenges ahead

The repository model

Explore (some of) the characteristics & consequences of this model:

• Value chains starting in repositories
• Local capacity
• Archiving
• Rights
• Interoperability
• Standards

Page 52: Untitled I: Challenges ahead

so, what can we conclude?

Page 53: Untitled I: Challenges ahead

The future of digital library collections?

[Table: rows are catalogue, A&I, full content, and repositories; columns are Storage and Service, each split into Local and Remote. Catalogue and full content are each marked in two cells; A&I and repositories are marked in all four.]

• Important locally hosted collection
• Storage and Service disconnected
• Important challenges

Page 54: Untitled I: Challenges ahead

A content-node & service-node ecology?

• Content nodes:
  o Libraries become content-nodes, capturing the intellectual output of their parent institutions and “exposing” it.
  o Vision: A network of federated repositories that makes available the collective intellectual output of faculty and researchers of the world's research institutions
  o Ongoing with the Institutional Repository movement
  o Libraries must act in this realm

Page 55: Untitled I: Challenges ahead

A content-node & service-node ecology?

• Service nodes:
  o Need services (value chains) to emerge on top of that content
  o “If the content is on-Web, the services will bloom”
  o Cannot solely rely on … euh … Google Scholar
• Service node tasks include:
  - indexing, searching, recommendation, linking, data-mining, visualization, … nodes
  - annotation, certification, metric-collecting, rewarding, … nodes
  - archiving, normalization/transformation, … nodes
• Vision: A federation of networked services - in which Libraries take on specific service tasks - that turns into a global scholarly value chain

Page 56: Untitled I: Challenges ahead

The repository model

Physical libraries:
• Local storage of content originating with 3rd parties
• Facilitate use of that content by local user base

Current libraries:
• Remote storage of content originating with 3rd parties
• Facilitate use of that content by local user base

Repository model libraries:
• Local storage of content that originates in-house
• Facilitating its use by remote and local users by facilitating the emergence of services

Emergence of a quite fundamental new library model

Page 57: Untitled I: Challenges ahead

but really, dude, how?

Page 58: Untitled I: Challenges ahead

let’s call upon
