Expert Talk Relay, Part 2: Open Data and Linked Data
Dr. Haklae Kim, Web Science Workgroup
Open Data and Linked Data Making Emergent Creativity
Haklae Kim, Korea Data Hub, 2012
Data.gov WikiLeaks
“This led to changes in the constitution and the establishment of a more open government” – WikiLeaks
Let’s Think
Open Data starts with making available the data that you already have, in whatever format.
• Equal access for all • Licensing, legal issues • Transparency • Changing the way government works
Open Data vs Linked Data Quick Summary
Open Data
Linked Data • URIs • HTTP • RDF vocabularies • Standards
3
This Presentation ..... Today
- Introduction
- Open Data and Open Government Data
- The Semantic Web & Linked Data
- What We Will Do
4
Web in Evolution “a steady progression from a document-centric Web to one that is data-centric, including the mediation of semantics”
Let’s Start
5
(Source: Mike, 2007)
What is the Semantic Web for? Question
6
Search
Inference
Intelligence
Standards
Google’s Semantic Search Case Studies
“People should be able to ask questions and we should understand their meaning, or they should be able to talk about things at a conceptual level. ... A lot of people will turn to things like the semantic Web as a possible answer to that.” - Google Vice President of Search Products & User Experience Marissa Mayer
7
schema.org: an initiative launched on 2 June 2011 by Bing, Google and Yahoo! to "create and support a common set of schemas for structured data markup on web pages."
http://schema.org/docs/full.html
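As a rough illustration of the structured data markup that schema.org describes, the sketch below builds a small description using schema.org's `Person` type and serializes it as JSON-LD. The choice of JSON-LD is an assumption (the slide does not name a syntax), and the helper `person_markup` and all example values are hypothetical.

```python
import json

def person_markup(name, job_title, url):
    """Return a schema.org Person description as a JSON-LD dictionary."""
    return {
        "@context": "https://schema.org",
        "@type": "Person",        # a type defined by schema.org
        "name": name,             # properties of Person in schema.org
        "jobTitle": job_title,
        "url": url,
    }

# Hypothetical example values, not taken from the presentation.
markup = person_markup("Jane Example", "Data Librarian", "http://example.org/jane")

# On a web page this string would sit inside a
# <script type="application/ld+json"> element.
json_ld = json.dumps(markup, indent=2)
print(json_ld)
```

Because the markup names a shared type and shared property names, a search engine can read "Jane Example" as a person's name rather than as an arbitrary string.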
The Knowledge Graph is a collection of information sources that help discern a user’s specified intent with each individual query. The graph is actually an encyclopedia with structured information obtained from the web. (currently, 200 million entities)
Freebase is an open, Creative Commons licensed repository of structured data of almost 22 million entities. An entity is a single person, place, or thing connected by a graph.
Apple’s Siri Case Studies
Ask Siri how Apple recorded the best quarter in history for a tech company, and her answer should be: "Me."
8
Siri (Speech Interpretation and Recognition Interface) is an intelligent personal assistant and knowledge navigator that works as an application for Apple's iOS.
A Brief History
- In December 2007, Siri, Inc. was formed by Dag Kittlaus (CEO), Adam Cheyer (VP Engineering), and Tom Gruber (CTO/VP Design).
- Siri, Inc. sought funding and by November 2009 had secured a $15.5 million investment, which resulted in the first Siri application, debuting on the iPhone 3GS in February 2010.
- Siri was acquired by Apple; the iPhone becomes the Virtual Personal Assistant.
Knowledge Navigator (1987): a concept described by former Apple Computer CEO John Sculley in his 1987 book, Odyssey.
(Source: http://www.youtube.com/watch?v=QRH8eimU_20)
9
Active Ontology Case Studies
A processing formalism where distinct processing elements are arranged according to ontology notions; an execution environment.
Basic concepts
* Ontology: a data structure
  - Formal representation for domain knowledge
  - Classes, attributes, relations
* Active Ontology: a processing environment
  - Processing elements arranged according to ontology notions
  - Communication channels
[Figure: a movie ontology with genre, actor, and rating nodes, each with a processing element (P) holding a rule set of condition/action rules]
(Baur et al., 2007)
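The idea of processing elements arranged along an ontology, each holding condition/action rules, can be sketched roughly as follows. This is a loose illustration under stated assumptions, not the Active platform's API: the class names `Rule` and `ProcessingElement`, the helper `word_spotter`, and the shared `facts` dictionary standing in for the communication channels are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Facts = Dict[str, str]  # shared store standing in for communication channels

@dataclass
class Rule:
    """A condition/action pair, as in the rule sets on the slide."""
    condition: Callable[[Facts], bool]
    action: Callable[[Facts], None]

@dataclass
class ProcessingElement:
    """A processing element attached to one ontology concept (e.g. 'genre')."""
    concept: str
    rules: List[Rule] = field(default_factory=list)
    children: List["ProcessingElement"] = field(default_factory=list)

    def process(self, facts: Facts) -> None:
        # Children run first so a parent can react to what they found.
        for child in self.children:
            child.process(facts)
        for rule in self.rules:
            if rule.condition(facts):
                rule.action(facts)

def word_spotter(concept: str, vocabulary: List[str]) -> ProcessingElement:
    """Build a leaf element that records the first known word in the utterance."""
    def condition(facts: Facts) -> bool:
        words = facts["utterance"].lower().split()
        return any(w in words for w in vocabulary)

    def action(facts: Facts) -> None:
        words = facts["utterance"].lower().split()
        facts[concept] = next(w for w in vocabulary if w in words)

    return ProcessingElement(concept, [Rule(condition, action)])

# A movie concept with genre and rating children, mirroring the slide's figure.
genre = word_spotter("genre", ["comedy", "drama", "thriller"])
rating = word_spotter("rating", ["pg", "r"])
movie = ProcessingElement(
    "movie",
    rules=[Rule(lambda f: "genre" in f,
                lambda f: f.update(movie=f"find {f['genre']} movies"))],
    children=[genre, rating],
)

facts = {"utterance": "Show me a comedy tonight"}
movie.process(facts)
print(facts)
```

The ontology here does double duty: it describes the domain (a movie has a genre and a rating) and it fixes the order in which the rules fire.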
Big Data “data that becomes large enough that it cannot be processed using conventional methods”
Let’s Start
11
“Big Data is like Sex in High School–Lots of people are talking about it, but few are having it.” -Eric Hansen, SiteSpect founder and CEO
London 2012: Open Data Olympics Best Practices
12
OpenStreetMap - Project Haiti
“Open”: material (data) is open if it can be freely used, reused and redistributed by anyone.
“Government data”: data and information produced or commissioned by government or government-controlled entities.
Source: Open Knowledge Foundation, 2010
14
What is Open (Government) Data? Definition
• Transparency • Participation • Collaboration
“My administration is committed to creating an unprecedented level of openness in Government.” – Barack Obama
“Memorandum for the Heads of Executive Departments and Agencies – Transparency and Open Government” Jan 2009
Data.gov
• The first phase of Data.gov features downloadable federal data sets organized by category and federal organization.
• Data sets are available for download in XML, CSV, and shape file formats.
Launched on May 21, 2009, Data.gov allows citizens to participate by leveraging federal data sets to build applications, conduct analysis, and perform research.
16
Data.gov.uk
Establishment of the Public Sector Transparency Board, chaired by Francis Maude, Minister for the Cabinet Office. The Board will be responsible for setting open data standards across the public sector and publishing further datasets on the basis of public demand.
Prime Minister David Cameron writes to all government departments, 31 May 2010, instructing them to free up more datasets as part of the Transparency Agenda.
17
http://www.practicalparticipation.co.uk/odi/wp-content/uploads/2010/06/Open-Data-Impacts-Timeline-Draft-0.1.png
18
Postcode Newspaper Where Does My Money Go World Events Visualiser EU Public Data
Applications Case Studies
19
Source: http://tinyurl.com/44rub56
The State of Open Government Data Public Sector Dataset
20
“The application of the four types of instruments by the five countries is depicted – the larger the circle the more instruments are applied” – Huijboom & Van den Broek, 2011.
Open data instruments Open Data Strategies
21
[Figure: application of four types of instruments (education and training, economic instruments, voluntary approaches, legislation and control) by five countries (AU, DK, ES, UK, US)]
Drivers and barriers of open data policy implementation Critical factors
22
Strategies and experience in front runner countries
[Figure: chart of the factors listed below; axis ticks 1 to 10]
Political leadership
Regional initiatives
Citizen initiatives
Market initiatives
Emerging technologies
European legislation
Thought leaders
Possibility of monitoring government
Budget cuts
Closed government culture
Privacy legislation
Limited quality of data
Limited user-friendliness/information overload
Lack of standardization of open data policy
Security threats
Existing charging models
Uncertain economic impact
Digital divide
Network overload
Source: Huijboom and Van den Broek, 2011
Makes it easy to publish, share, and find datasets. Integrated data storage, processing, viewing and visualization.
CKAN – Open Source Data Portal Open Data Portals
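To make "find datasets" concrete, here is a minimal sketch of how a client might query a CKAN portal through its Action API, which exposes a `package_search` action under `/api/3/action/`. The portal address `https://example.org` and the hand-written sample response are hypothetical, and the helper names are illustrative. No network request is made; the sample only mirrors the general shape of a CKAN search response.

```python
from urllib.parse import urlencode

def package_search_url(portal, query, rows=5):
    """Build a URL for CKAN's package_search action on a given portal."""
    params = urlencode({"q": query, "rows": rows})
    return f"{portal.rstrip('/')}/api/3/action/package_search?{params}"

def dataset_summaries(response):
    """Extract (title, formats) pairs from a decoded package_search response."""
    if not response.get("success"):
        return []
    summaries = []
    for dataset in response["result"]["results"]:
        # Collect the distinct, non-empty resource formats of this dataset.
        formats = sorted(
            {r.get("format", "") for r in dataset.get("resources", [])} - {""}
        )
        summaries.append((dataset.get("title", dataset["name"]), formats))
    return summaries

# Hypothetical, hand-written response (not real portal data).
sample_response = {
    "success": True,
    "result": {
        "count": 1,
        "results": [
            {
                "name": "example-budget",
                "title": "Example Budget",
                "resources": [{"format": "CSV"}, {"format": "XML"}],
            }
        ],
    },
}

url = package_search_url("https://example.org", "budget")
print(url)
print(dataset_summaries(sample_response))
```

In practice the URL would be fetched over HTTP and the JSON body decoded before being passed to `dataset_summaries`.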
23
.. a system of interlinked hypertext documents accessed via the Internet
The Web as a Global Data Platform Let’s Start
25
HTTP
World Wide Web
URI HTML
26
All data including documents, services, people ...
DATA DATA links
The Semantic Web is not about links between web pages.
27
“The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data. With linked data, when you have some of it, you can find other, related, data” - TBL.
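The point of the quote, that having some data lets you find other related data, can be sketched with a toy in-memory set of triples. The two datasets and every `http://example.org/...` URI are hypothetical; only the `owl:sameAs` and `rdfs:label` property URIs are real vocabulary terms. A real client would dereference each URI over HTTP rather than search a local list.

```python
from collections import namedtuple

Triple = namedtuple("Triple", ["subject", "predicate", "object"])

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"

# Two hypothetical datasets published by different parties.
dataset_a = [
    Triple("http://example.org/a/seoul", RDFS_LABEL, "Seoul"),
    # The link: dataset A says its resource is the same thing as one in B.
    Triple("http://example.org/a/seoul", OWL_SAME_AS, "http://example.org/b/city/11"),
]
dataset_b = [
    Triple("http://example.org/b/city/11",
           "http://example.org/b/vocab/country",
           "http://example.org/b/country/kr"),
]

def describe(uri, triples):
    """Return every triple whose subject is the given URI."""
    return [t for t in triples if t.subject == uri]

def follow_links(uri, triples):
    """Collect every URI reachable from `uri` through outgoing owl:sameAs links."""
    seen, frontier = {uri}, [uri]
    while frontier:
        current = frontier.pop()
        for t in describe(current, triples):
            if t.predicate == OWL_SAME_AS and t.object not in seen:
                seen.add(t.object)
                frontier.append(t.object)
    return seen

web_of_data = dataset_a + dataset_b
related = follow_links("http://example.org/a/seoul", web_of_data)
facts = [t for uri in sorted(related) for t in describe(uri, web_of_data)]
for t in facts:
    print(t.subject, t.predicate, t.object)
```

Starting from one URI in dataset A, the single `owl:sameAs` link is enough to pull in what dataset B knows about the same city.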
Linked Data & The Semantic Web Overview
28
5 Stars Open linked data
★ Make your stuff available on the Web
★★ Make it available as structured data
★★★ Use open, standard formats (instead of Excel)
★★★★ Use an open data format – URLs, descriptions
★★★★★ Link your data to other people’s data
… Linked Data provides the means to reach the goal of the Semantic Web – “the emergence of a Web of Data”
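The five-star scheme is cumulative: each star presupposes the ones before it. A minimal sketch of that rule follows, assuming a dataset is described by five yes/no answers. The function `star_rating` and the example datasets are hypothetical illustrations, not an official rating tool.

```python
def star_rating(on_web, structured, open_format, uses_uris, linked):
    """Return the number of stars (0 to 5) a dataset earns.

    The criteria are checked in order, and counting stops at the first one
    that is not met, because each star builds on the previous ones.
    """
    stars = 0
    for criterion in (on_web, structured, open_format, uses_uris, linked):
        if not criterion:
            break
        stars += 1
    return stars

# Hypothetical datasets, one per level of the scheme.
examples = {
    "scanned PDF report": star_rating(True, False, False, False, False),
    "Excel spreadsheet": star_rating(True, True, False, False, False),
    "CSV file": star_rating(True, True, True, False, False),
    "RDF with URIs": star_rating(True, True, True, True, False),
    "RDF linked to other datasets": star_rating(True, True, True, True, True),
}
for name, stars in examples.items():
    print(f"{name}: {stars} star(s)")
```

Note that a dataset with links but no open format still scores two stars at most under this reading, since the later criteria are never reached.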
29
Growth of Interlinks Overview
[Figure: Linked Open Data cloud snapshots dated 2007-05-01, 2007-10-08, 2007-11-10, 2008-02-28, 2008-03-31, 2008-09-18, 2009-03-05, 2009-03-27, 2009-07-14, and 2010-09-22]
30 October, 2011: 295 interlinked datasets, approximately 31 billion triples
DBpedia: Structured Wikipedia
BBC: Multimedia Content
Best Buy: Commercial Product
UK Gov: Government Data
Linked Data and Open Government Data Why
31
Applications Case Studies
32
DBPedia BBC New York Times thedatahub
34
Roadmap of linked open government data Conceptual Architecture
“the combination of machine power and human power and deliver higher-quality data to a wide range of data consumers via visualization, mashups, and more.”
(Ding et al., 2012)
Rebuild Fireout
“We won’t get there tomorrow, but maybe the day after” – Rufus Pollock
How to Start
Low-hanging fruit, Less conversational data and quick wins.
Expand, with more….. Data Services Efficiency Costs saving Transparency Participation Inclusion
35
- Charles Baur, Adam Cheyer, Didier Guzzoni, Active, a platform for building intelligent software
- Noor Huijboom and Tijs Van den Broek, Open Data: an international comparison of strategies, European Journal of ePractice, March/April 2011
- Li Ding, Vassilios Peristeras, and Michael Hausenblas, Linked Open Government Data, IEEE Intelligent Systems, May/June 2012
- Page 1: http://www.w3.org/DesignIssues/diagrams/websci/Marius%20Watz%20-%20Web%20Science%20artwork.png
- Page 4: http://www.go-gulf.com/60seconds.jpg
- Page 9: http://cloud.frontpagemag.com/wp-content/uploads/2012/03/obama11.jpg
- Page 27: http://www.patentlyapple.com/.a/6a0120a5580826970c0168e5ccdd81970c-800wi
- Page 29: http://programminggeeks.com/wp-content/uploads/2010/05/Programming-Geeks-Web-Science.jpg
- Page 29: http://3.bp.blogspot.com/-C0Kyck90Djo/T4KZTg3k1XI/AAAAAAAAAsE/RUp165S0FCQ/s1600/Commitment.jpeg
Page 2 Case Studies
- http://www.guardian.co.uk/commentisfree/2012/aug/03/london-2012-olympics-open-data
- http://www.bbc.co.uk/news/uk-19050139
- http://london2012.nytimes.com/results
- http://www.guardian.co.uk/sport/interactive/2012/jul/23/could-you-be-a-medallist
- http://www.guardian.co.uk/sport/datablog/2012/aug/13/olympics-2012-data-journalism
- http://www.guardian.co.uk/sport/datablog/interactive/2012/jul/26/london-2012-price-olympic-games-visualised
References
36
For more information, contact Haklae Kim via [email protected] or Twitter: haklaekim
See more activities at: http://blogweb.co.kr, http://thedatahub.kr, http://getthedata.kr