cbsdl 2015 koch_wg_v01

33
CloudBased Services in Digital Libraries workshop Vocabulary Web Services, Business Intelligence and Digital Curation Walter Koch Gerda Koch AITAngewandte Informationstechnik ForschungsGmbH, Graz Austria LoCloud is funded by the European Commission's ICT Policy Support Programme

Upload: walter-koch

Post on 16-Apr-2017

130 views

Category:

Software


1 download

TRANSCRIPT

Page 1: Cbsdl 2015 koch_wg_v01

Cloud‐Based Services in Digital Libraries workshopVocabulary Web Services, Business Intelligence and Digital Curation

Walter Koch ‐ Gerda KochAIT‐Angewandte Informationstechnik Forschungs‐GmbH, Graz ‐ Austria

LoCloud is funded by the European Commission's ICT Policy Support Programme

Page 2: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 2

locoud.eu

About:– LoCloud is a Best Practice Network co‐funded under the CIP ICT‐PSP programme of the European Commission.

– LoCloud is supporting small and medium‐sized institutions in making their content and metadata available to Europeana.

– A cloud‐based technology infrastructure will enable the aggregation of local content, and

– a number of micro‐services will help to reduce technical, semantic and skills barriers and to render the content more discoverable and interoperable. 

Page 3: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 3

Relevant Work Packages

Specify, modify, test, implement core infrastructure components: MINT, MoRe, Lightweight digital library – all build on existing work  Specify, modify, test, implement core infrastructure components: MINT, MoRe, Lightweight digital library – all build on existing work  

WP 2WP 2Design and implementation of aggregation Design and implementation of aggregation infrastructure 

Collaborative  cloud‐based testlab, involving partners and users; develop and implement a suite of SaaS: geolocation enrichment; metadata enrichment; multilingual vocabularies; historic place‐names; Wikimedia and crowdsourcing

Collaborative  cloud‐based testlab, involving partners and users; develop and implement a suite of SaaS: geolocation enrichment; metadata enrichment; multilingual vocabularies; historic place‐names; Wikimedia and crowdsourcing

WP 3WP 3Micro servicesMicro services for small and medium institutions

Page 4: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 4

The LoCloudAggregation Structure

EnrichmentServices

Europe

ana

Europe

ana

Clou

dTestlab

Page 5: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 5

Reference Model

Micro-services(Vocabulary,...)

Operating Environment(Linux, Windows, MAC)

Virtual Engines(KVM, XEN, VMDK, ...)

Micro-services(Vocabulary,...)

Outcome / Value

Process / Procedure

Page 6: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 6

Testlab Implementation(opennebula.org)

Private Cloud

Private Cloud

Public Cloud

Page 7: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 7

OpenNebula ‐ Frontend

http://test186.ait.co.at

Vocabulary WebServices

Page 8: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 8

Evolution of Architecturespwc: Technology Forecast: Rethinking Integration, Issue 1, 2014, page 3

Page 9: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 9

Microservice

Wikipedia: https://en.wikipedia.org/wiki/Microservices (2015‐09‐15)

[...] microservices is a • software architecture style in which • complex applications are composed

of small, independent processes• communicating with each other using language‐agnostic APIs.

• These services are small, highly decoupled and focus on doing a small task,facilitating a modular approach to system‐building.

Page 10: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 10

http://vocabulary.locloud.eu/

• Vocabulary Microservices • This is the entry page for the LoCloud vocabulary application. 

• LoCloud uses a variety of standard vocabularies during the metadata enrichment process.These vocabularies have been established with the TemaTres tool and are available in the LoCloud cloud testlab.

• In addition, it is possible to feed the tool with your own vocabularies, establish new vocabularies or add languages to existing vocabularies.

Page 11: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 11

Vocabulary WebService20+ Functions, API

Parameters

Example

Page 12: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 12

Example: SearchUNESCO‐Thesaurus

http://test113.ait.co.at/tematres/unesco/services.php?task=search&arg=peace

Page 13: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 13

SPARQL‐ExampleDISMARC‐Genres

• Query: "Find all Broader Term for 'Acid Jazz‘“• SPARQL End Point: http://test113.ait.co.at/tematres/vocab/sparql.php

QueryResult

ThesaurusEntry

Page 14: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 14

Service Call („fetchTopTerms“)embedded in a test application

Micro-Service

Vocabulary Web Service

Page 15: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 15

Test Application DevelopmentDesigner View

Development of a Drop Down List

Development of a Drop Down List

Page 16: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 16

The Test Application:Part of Data Entry

Path to the Thesaurus Item

„Candidate Term“ Crowdsourcing activity

Next Slide(Vocabulary Management)

Page 17: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 17

Vocabulary ManagementBPMN ‐ Diagramme

„CandidateTerm“

Crowdsrc.Activity

CandidateNew Item

Page 18: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 18

Business Intelligence

Page 19: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 19

Business Intelligence

Wikipedia: https://en.wikipedia.org/wiki/Business_intelligence (2015‐09‐15)• Business intelligence (BI) is the set of techniques and tools for the transformation of raw data into meaningful and useful information for business analysis purposes. 

• From Business Analysis to Cultural Analyticshttps://en.wikipedia.org/wiki/Cultural_analytics (2015‐09‐15)

• One Tool: ETL – „Extract Transform – Load“https://en.wikipedia.org/wiki/Extract,_transform,_load (2015‐09‐15)

• Example: Pentaho – Data Integration („Kettle“)http://www.pentaho.com/product/data‐integration

Page 20: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 20

OpenUp! ‐ Natural HistoryAggregator for Europeana

Micro Service:Common Name

Service

Micro Service:Common Name

Service

Page 21: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 21

OpenUp!Overall Workflow

Transformation Process

Page 22: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 22

OpenUp!Transformation Workflow

Invoke Micro Service:

Common Name Service

Page 23: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 23

OpenUp!Transformation Parameters

Micro Service:Common Name Service

Service End Point (Cache)

Page 24: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 24

Result: OpenUp!Enriched Meta Data Set

Latin Name

Common Names

Page 25: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 25

Pentaho‐PDISelected Tools

... and many more !

Page 26: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 26

Digital Curation

Page 27: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 27

Digital CurationState‐of‐the‐Art Examples (2010)

• Archivematica– Linux VM based toolset allows producers and archivists to implement a small scale workflow

• UC3 / CDL Merritt– Micro‐Services specifications – implement building blocks for the Digital Curation Lifecycle

Page 28: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 28

ArchivmaticaMicro Services

Page 29: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 29

California Digital LibraryUC3 Merritt

• California Digital Library UC Curation Center• Independant, Interoperable Micro‐Services• Complex curation function through combination of atomistic services

Knowledge Enhancement

Annotation

Page 30: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 30

Micro‐Services(Merritt vs. Archivematica)

CDL/UC3-MERRITT ARCHIVEMATICA

Page 31: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 31

High‐Level Workflow

• OAIS activities providea sound basis

• Archivematica Use Cases

• Merritt specifications• Micro‐Services are thebuilding blocks

Page 32: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 32

CDLCuration Micro Services

1) „An Emergent Micro‐Services Approach to Digital Curation Infrastructure“, IPRES 2009https://escholarship.org/uc/item/5313h6k9

2) UC Curation Center / California Digital Library, merritt: An Emergent Micro‐services Approach to Digital Curation Infrastructure; Rev. 0.6, 2010‐03‐25

1) Annotation service 2)

The Annotation service provides a mechanism by which the consumers of managed digital content can enrich that content through additional description, association, or related content. The service supports the following methods:

• Annotate

• Submit.

Page 33: Cbsdl 2015 koch_wg_v01

2015‐09‐17 Gerda & Walter Koch (AIT) 33

Contact

Gerda Koch, [email protected] Koch, [email protected]

http://www.ait.co.at