전문가토크릴레이 2탄 open data and linked data (김학래 박사)

37
Open Data and Linked Data Making Emergent Creativity 김학래, 코리아데이터허브, 2012

Upload: saltlux-zinyus

Post on 10-Dec-2014

1.897 views

Category:

Documents


2 download

DESCRIPTION

전문가 토크릴레이 2탄 Open data and linked data : 웹사이언스 워크그룹 김학래 박사

TRANSCRIPT

Page 1: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Open����������� ������������������  Data����������� ������������������  and����������� ������������������  Linked����������� ������������������  Data����������� ������������������  Making Emergent Creativity

김학래,����������� ������������������  코리아데이터허브,����������� ������������������  2012����������� ������������������  

Page 2: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Data.gov  WikiLeaks  

“This  led  to  changes  in  the  cons6tu6on  and  the  establishment    of  a  more  open  government”  –  WikiLeaks  

Let’s  Think  

Page 3: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Open Data starts with making available the data that you already have, in whatever format.

•  Equal access for all •  Licensing, legal issues •  Transparency •  Changing the way government works

Open Data vs Linked Data Quick Summary

Open Data

Linked Data •  URIs •  HTTPs •  RDF vocabularies •  Standards

3

Page 4: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Introduction Open Data and Open Government

Data

The Semantic Web & Linked Data What We Will Do

This Presentation ..... Today

4

Page 5: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Web in Evolution “a steady progression from a document-centric Web to one that is data-centric, including the mediation of semantics”

Let’s Start

5

(Source: Mike, 2007)

Page 6: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

What is the Semantic Web for? Question

6

Search

Inference

Intelligence

Standards

Page 7: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Google’s Semantic Search Case Studies

People should be able to ask questions and we should understand their meaning, or they should be able to talk about things at a conceptual level. ... A lot of people will turn to things like the semantic Web as a possible answer to that.“ - Google Vice President of Search Products & User Experience Marissa Mayer

7

an initiative launched on 2 June 2011 by Bing, Google and Yahoo! to "create and support a common set of schemas for structured data markup on web pages."

http://schema.org/docs/full.html

The Knowledge Graph is a collection of information sources that help discern a user’s specified intent with each individual query. The graph is actually an encyclopedia with structured information obtained from the web. (currently, 200 million entities)

Freebase is an open, Creative Commons licensed repository of structured data of almost 22 million entities. An entity is a single person, place, or thing connected by a graph.

Page 8: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Apple’s Siri Case Studies

Ask Siri how Apple recorded the best quarter in history for a tech company, and her answer should be: "Me."

8

Siri (Speech Interpretation and Recognition Interface) is an intelligent personal assistant and knowledge navigator which works as an application for Apple's iOS. A Brief History - In December 2007 Siri, Inc. was formed by Dag Kittlaus (CEO), Adam Cheyer (VP Engineering), and Tom Gruber (CTO/VP Design). - Siri Inc. went after funding and by November 2009 it had secured $15.5 million investment, resulted in the creation of the first Siri application, which debuted on the iPhone 3GS in February 2010. - Siri acquired by Apple; iPhone becomes the Virtual Personal Assistant

Knowledge Navigator (1987) a concept described by former Apple Computer CEO John Sculley in his 1987 book, Odyssey.

(Source: http://www.youtube.com/watch?v=QRH8eimU_20)

Page 9: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

9

Active Ontology Case Studies

A processing formalism where distinct processing elements are arranged according to ontology notions; an execution environment.

Basic concepts * Ontology : A data structure - Formal representation for domain knowledge - Classes, attributes, relations * Active Ontology : A processing environment - Processing elements arranged according to ontology

notions - Communication channels movie

genre actor rating P P P

P

rule set

rule

condition

action

rule

condition

action

rule condition

action

(Baur et al., 2007)

Page 10: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Introduction Open Data and Open Government

Data

The Semantic Web & Linked Data What We Will Do

This Presentation ..... Today

10

Page 11: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Big Data “data that becomes large enough that it cannot be processed using conventional methods”

Let’s Start

11

“Big Data is like Sex in High School–Lots of people are talking about it, but few are having it.” -Eric Hansen, SiteSpect founder and CEO

Page 12: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

London 2012: Open Data Olympics Best Practices

12

Page 13: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

OpenStreetMap - Project Haiti

Page 14: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

“Open” material (data) is open if it can be freely used, reused and redistributed by anyone

“Government data” data and information produced or commissioned by government or government controlled entities.

Source: Open Knowledge Foundation, 2010

14

What is Open (Government) Data? Definition

Page 15: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

•  Transparency •  Participation •  Collaboration

“My administration is committed to creating an unprecedented level of openness in Government.” – Barack Obama

“Memorandum for the Heads of Executive Departments and Agencies – Transparency and Open Government” Jan 2009

Page 16: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Data.gov  

•  The  first  phase  of  Data.gov  features  downloadable  federal  data  sets  organized  by  category  and  federal  organiza6on.  

•  Data  sets  are  available  for  download  in  XML,  CSV,  and  shape  file  formats.  

Launched  on  May  21,  2009,  Data.gov  allows  ci;zens  to  par;cipate  by  leveraging  federal  data  sets  to  build  applica;ons,  conduct  analysis,  and  perform  research.  

16  

Page 17: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Data.gov.uk  

Establishment  of  the  Public  Sector  Transparency  Board  chaired  by  Francis  Maude,  Minister  for  the  Cabinet  Office    The  Board  will  be  responsible  for  seRng  open  data  standards  across  the  public  sector,  publishing  further  datasets  on  the  basis  of  public  demand  

Prime  Minister,  David  Cameron,  writes  to  all  government  departments,  31  May  2010:  instruc;ng  them  to  free  up  more  datasets  as  part  of  Transparency  Agenda  

17  

Page 18: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

hTp://www.prac6calpar6cipa6on.co.uk/odi/wp-­‐content/uploads/2010/06/Open-­‐Data-­‐Impacts-­‐Timeline-­‐Dra[-­‐0.1.png  18  

Page 19: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Postcode Newspaper Where Does My Money Go World Events Visualiser EU Public Data

Applications Case Studies

19

Page 20: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Source: http://tinyurl.com/44rub56

The State of Open Government Data Public Sector Dataset

20

Page 21: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

“The application of the four types of instruments by the five countries is depicted – the larger the circle the more instruments are applied” – Huijboom & Van den Broek, 2011.

Open data instruments Open Data Strategies

21

DK DK

DK DK

US

ES ES

ES

AU

UK

UK ES

AU

US

UK US

AU

AU

UK

US

Education and training

Economic instruments

Voluntary approaches

Legislation and control

Page 22: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Drivers and barries of open data policy implementation Critical factors

22

Strategies and experience in front runner countries 1

2

3

4

5

6

7

8

9

10

Political leadership

Regional initiatives

Citizen initiatives

Market initiatives

Emerging technologies

European legislation

Thought leaders

Possibility of monitoring government

Budgets cuts

Closed government culture

Privacy legislation

Limited quality of data

Limited user-friendliness/information overload

Lack of standardization of open data policy

Security threats

Existing charging models

Uncertain economic impact

Digital divide

Network overload

Source:  Huijboom  and  Van  den  Broek,  2011  

Page 23: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Makes it easy to publish, share, and find dataset. Integrated data storage, processing, viewing and visualization

CKAN – Open Source Data Portal Open Data Portals

23

Page 24: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Introduction Open Data and Open Government

Data

The Semantic Web & Linked Data What We Will Do

This Presentation ..... Today

24

Page 25: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

.. a system of interlinked hypertext documents accessed via the Internet

The Web as a Global Data Platform Let’s Start

25

Page 26: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

HTTP

World Wide Web

URI HTML

26

Page 27: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

All data including documents, services, people ...

DATA DATA links

The Semantic Web is not about links between web pages.

27

Page 28: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

“The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data. With linked data, when you have some of it, you can find other, related, data” - TBL.

Linked Data & The Semantic Web Overview

28

5 Stars Open linked data

Make your stuff available on the Web Make it avaiable as structured data Use open, standard formats (instead of

excel) Use a open data format – URLs,

descriptions Link your data to other people’s data

★★

★★★

★★★★

★★★★★

Page 29: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

… Linked Data provides the means to reach the goal of the Semantic Web – “the emergence of a Web of Data”

29

Growth of Interlinks Overview

2007-05-01 2007-10-08 2007-11-10 2008-02-28 2008-03-31

2008-09-18 2009-03-05 2009-03-27 2009-07-14 2010-09-22

Page 30: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

30  October, 2011 295 interlinked datasets, approximately 31 billions triples

DBpedia

Structured Wikipedia

BBC

Best Buy UK Gov

Multimedia Content

Commercial Product Government Data

Page 31: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Linked Data and Open Government Data Why

31

Page 32: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Applications Case Studies

32

DBPedia BBC New York Times thedatahub

Page 33: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Introduction Open Data and Open Government

Data

The Semantic Web & Linked Data What We Will Do

This Presentation ..... Today

33

Page 34: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

34

Roadmap of linked open government data Conceptual Architecture

“the combination of machine power and human power and deliver higher-quality data to a wide range of data consumers via visualization, mashups, and more.”

(Ding et al., 2012)

Page 35: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

Rebuild Fireout

“We won’t get there tomorrow, but maybe the day after” – Rufus Pollock

How to Start

Low-hanging fruit, Less conversational data and quick wins.

Expand, with more….. Data Services Efficiency Costs saving Transparency Participation Inclusion

35  

Page 36: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

- Charles Baur, Adam Cheyer, Didier Guzzoni, Active, a platform for building intelligent software - Noor Huijboom and Tijs Van den Broek, Open Data: an international comparison of strategies, European journal of ePractices, March/April 2011 - Li Ding, Vassilios Peristeras, and Michael Hausenblas, Linked Open Government Data, IEEE Intelligent Systems, May/June 2012 -  Page 1: http://www.w3.org/DesignIssues/diagrams/websci/Marius%20Watz%20-%20Web%20Science%20artwork.png -  Page 4: http://www.go-gulf.com/60seconds.jpg -  Page 9: http://cloud.frontpagemag.com/wp-content/uploads/2012/03/obama11.jpg -  Page 27: http://www.patentlyapple.com/.a/6a0120a5580826970c0168e5ccdd81970c-800wi -  Page 29: http://programminggeeks.com/wp-content/uploads/2010/05/Programming-Geeks-Web-Science.jpg -  Page 29: http://3.bp.blogspot.com/-C0Kyck90Djo/T4KZTg3k1XI/AAAAAAAAAsE/RUp165S0FCQ/s1600/Commitment.jpeg

Page 2 Case Studies -  http://www.guardian.co.uk/commentisfree/2012/aug/03/london-2012-olympics-open-data -  http://www.bbc.co.uk/news/uk-19050139 -  http://london2012.nytimes.com/results -  http://www.guardian.co.uk/sport/interactive/2012/jul/23/could-you-be-a-medallist -  http://www.guardian.co.uk/sport/datablog/2012/aug/13/olympics-2012-data-journalism -  http://www.guardian.co.uk/sport/datablog/interactive/2012/jul/26/london-2012-price-olympic-games-visualised

References

36

Page 37: 전문가토크릴레이 2탄 Open data and linked data (김학래 박사)

For more information contact Haklae Kim via [email protected] Twitter: haklaekim Or see more activities at: http://blogweb.co.kr http://thedatahub.kr http://getthedata.kr