linked library data

43
Linked Library Data Tuning Library Metadata for the [Semantic] Web Presented 2011-03-16 ALCTS RDA Webinar Series Corey A Harper

Upload: caleb-mcfadden

Post on 31-Dec-2015

45 views

Category:

Documents


1 download

DESCRIPTION

Linked Library Data. Tuning Library Metadata for the [Semantic] Web. Presented 2011-03-16 ALCTS RDA Webinar Series   Corey A Harper. Topical Overview. Semantic Web & RDF Intro Linked Open Data [Linked] Library Data Resource Description and Access (RDA) Beyond MARC As RDF Vocabularies - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Linked Library Data

Linked Library Data

Tuning Library Metadata for the [Semantic] Web

Presented 2011-03-16ALCTS RDA Webinar Series   Corey A Harper

Page 2: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 22011-03-16

Topical Overview

Semantic Web & RDF Intro Linked Open Data [Linked] Library Data Resource Description and Access (RDA)

Beyond MARC As RDF Vocabularies

Broader Interoperability Small steps forward…

Page 3: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 32011-03-16

Semantic Web

TBL’s original vision“Weaving the Web” – 1999

Then: Focus on Machine Reasoning Scientific American Article

Now: Focus on things & linksReasoning & Inferencing less central

Page 4: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 42011-03-16

Semantic Web

Originally:Metadata standard built on XMLMetadata about “Web” things (documents)

Eventually:Metadata about all sorts of thingsAnd about relationships between things

What are the “things”?

Page 5: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 52011-03-16

Semantic Web Terminology

Resource: Any “thing” Class: Abstraction of a type of thing Individual: An instance of a class Property: An attribute of an individual Statement/Triple:

A Resource (subject) A Property (predicate / verb) A Value (object) - Nodes

Graph: Visual Representation of statements Ontology: A domain specific collection of classes and

properties

Page 6: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 62011-03-16

Semantic Web Terminology

Nodes: The Subjects and Objects in a Graph Arcs: The Predicates in a Graph Domains and Ranges: Constraints on Nodes

Domain: What things can be subjects Range: What things (or strings) can be objects

Literals: Values as strings rather than things Named Graphs: Graphs with URIs treated as

nodes.

Page 7: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 72011-03-16

Page 8: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 82011-03-16

Linked Open Data

Use URIs as names for things Use HTTP URIs so that people can look

up those names. When someone looks up a URI, provide

useful information. Include links to other URIs. so that they

can discover more things. http://www.w3.org/DesignIssues/LinkedData.html

Page 9: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 92011-03-16

Page 10: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 102011-03-16

Page 11: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 112011-03-16

Page 12: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 122011-03-16

Data in the Cloud

Hubs in the May 2008 Version: FOAF DBPedia

Myriad Sources coming online: Thompson Reuters New York Times British Broadcasting Corporation Government Data (UK, US and more) Google and Facebook More and More Library, Archive and Museum Data

Geonames MusicBrains

Page 13: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 132011-03-16

DBpedia

Structured Wikipedia Data Genres, Influences, External Links Multi-lingual / Multi-script labels Rich Semantics Many linkages to other datasets

Page 14: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 142011-03-16

DBpedia Model

Partial basis in data entry conventions InfoBox’s, and InfoBox Templates Metadata Entry Format Partial source of Ontology

Class StructureVocabulary Design

Page 15: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 152011-03-16

DBpedia

3.4 Million “things” described Ontology based on “infoboxes”

1.5 million things classifiedhttp://wiki.dbpedia.org/Ontology

Approx. 50,000 “Properties”Approx. 1,200 defined in ontology

Page 16: Linked Library Data
Page 17: Linked Library Data
Page 18: Linked Library Data
Page 19: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 192011-03-16

What *things* are in our data???

Page 20: Linked Library Data

…Librarydata is extremelycomplicated

Page 21: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 212011-03-16

Library Metadata

Rich stores of MARC, MODS, &c. Robust Controlled Vocabularies

Subject Heading listsCode listsThesauri

Emerging data model in FR*

Page 22: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 222011-03-16

Bibliographic Vocabs

Bibliographic Ontology (Bibo)Zotero, Omeka, EPrints and Others

FRBR – unofficialAnd now Official (Thank you IFLA!)

ISBD Resource Description and Access (RDA)

Page 23: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 232011-03-16

Linked Library [Archive, Museum] Data LIBRIS (Swedish Union Catalog) Library of Congress (LCSH, OSI) German National Library Hungarian National Library British Library Europeana Archives Hub & LOCAH

Page 24: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 242011-03-16

Library Authority Data

“Include links to other URIs. so that they can discover more things.”

Short of providing and linking to URIs, this *is* authority data.

This is what our authority files are for.

Page 25: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 252011-03-16

Library Controlled Vocabularies: Benefits Reputation - Trusted Tradition Mature - Time tested and carefully

developed General & Comprehensive - Cover large

knowledge spaces

Page 26: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 262011-03-16

SKOS

Simple Knowledge Organization System Properties and Classes for describing

Controlled Vocabulary Heavily used in Linked Library Data

id.loc.govVirtual International Authority File (VIAF)

bibo:bookskos:primaryTopic

skos:subject

Page 27: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 272011-03-16

Other Vocabularies

Thesaurus for Economics French Subject Headings Swedish Subject Headings IconClass (not on web yet) OCLC Terminology Services Dewey Decimal Classification Virtual International Authority File Metadata Authority Description Schema (MADS)

Page 28: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 282011-03-16

Resource Description and Access Current focus on MARC

Much criticismWithin MARC, not a tremendous changeDifferent problems outside of MARC

Possible focus outside of MARCRDA as realization of FRBRRDA as Metadata VocabulariesRDA as related to Bibo

Page 29: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 292011-03-16

RDA as Metadata Vocabularies

RDA elements, roles and vocabularies have been provisionally registered

IFLA FRBRer and ISBD elements and vocabularies have been officially registered

Discussions about long term maintenance of both RDA and the vocabularies

Effort to create multi-language RDA Vocabularies

Slid

e A

dapte

d fro

m D

iane

Hillm

ann

Page 30: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 302011-03-16

Metadata Registries

Formerly NSDL Registry Now “Open Metadata Registry” Managing Vocabularies Providing Vocabulary Services

RDA – Now adding translations IFLA Work

FRBR, FRAD, FRSAD, ISBD

Page 31: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 312011-03-16

Page 32: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 322011-03-16

Page 33: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 332011-03-16

Page 34: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 342011-03-16

Page 35: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 352011-03-16

RDA as realization of FRBR

What will this look like? Probably *won’t* be stored in MARC Overly constrained by FRBR?

Properties have FRBR domains & rangesUnofficial “Generalized” properties

Non-FRBR metadata Similar to DCMI’s range constraints…

Page 36: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 362011-03-16

Support Free Range Metadata!

Photo Credit: http://www.flickr.com/photos/ciwf/3217378769/

Page 37: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 372011-03-16

BIBO and RDAVocab

Open question re: alignment Simplified view of Bib Data is useful

Interlinking with more general data Interlinking with non-library domain data

FRBR as internal model for library domain Examples

Page 38: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 382011-03-16

Why Does This Matter?Our descriptions no longer stand alone!Connect our data with the rest of the WEBAllow others to reuse more easily

FOAF, Geonames DBPedia MusicBrains New York Times, Thomson Reuters Government Data - data.gov British Broadcasting Corporation Other Library, Archive and Museum Data

Page 39: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 392011-03-16

Conclusions

Distributed bibliographic control environment Linking Data Focus on identification over description

“In short, by treating values as non-literal resources and assigning URIs to them we give ourselves (and others) the hooks on which to hang further descriptions.” - Andy Powell

Page 40: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 402011-03-16

Future Work

“Records” in Linked Library Data Vocabulary Alignment and Interoperability

DCMI planning in this space

General Metadata Interoperability Application Profiles?

Archival Data for *context* - (EAC-CPF)

Page 41: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 412011-03-16

W3C Linked Library Data Incubator Collecting, Curating and Clustering over

50 Use Cases Mining use cases for functional

requirements and design patterns Recommendations to W3C

Should lead to Working Groups http://www.w3.org/2005/Incubator/lld/

Page 42: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 422011-03-16

Other Activities

ALCTS/LITA Linked Library Data IG IFLA Semantic Web IG

https://wiki.d-nb.de/x/vA10Ag Open Knowledge Foundation

http://okfn.org/ CKAN Linked Library Data Group:

http://ckan.net/group/lld

Page 43: Linked Library Data

Harper - Linked Library Data - RDA Webinar SeriesHosted by the Association for Library Collections and Technical Services 432011-03-16

Thanks!

[email protected]

212.998.2479

@chrpr

Questions?