semantic wikis - social semantic web in action

Post on 11-May-2015


DESCRIPTION

My invited talk at a technology seminar as part of Tsinghua University's centennial celebration in Seattle, WA on semantic wikis.

TRANSCRIPT

Semantic Wikis: Social Semantic Web In Action

2011-03-25

Specially prepared for Tsinghua University alumni in the greater Seattle area for the centennial celebration

About Me: Jesse Wang 王嘉欣

[Timeline figure; year labels: 1988, 1993, 1996, 1997, 1998, 2005]

Who is Vulcan

What does Vulcan do

Vulcan Inc. was established in 1986 by investor and philanthropist Paul G. Allen, co-founder of Microsoft, to manage his business and philanthropic efforts. Allen is chairman of Vulcan and his sister, Jody Allen, is president and CEO.

It all began with a vision…

Now the Vision Continues as Project Halo

Project Halo is a staged, long-range research effort by Vulcan Inc. towards the development of a "Digital Aristotle": a reasoning system capable of answering novel questions and solving advanced problems in a broad range of scientific disciplines and related human affairs. The project focuses on creating two primary functions: a tutor capable of instructing and assessing students in those subjects, and a research assistant with broad, interdisciplinary skills to help scientists and others in their work.

Automatic Question Answering System

Project Halo’s Focus Areas

AURA
• Automated User-Centered Reasoning and Acquisition System
• Textbook you can talk to

SILK
• Semantic Inference with Large Knowledge-base
• Non-monotonic rule system / RIF

SMW+
• Semantic MediaWiki+
• Knowledge authoring with SMEs

Plus other related semantic technologies and commercial efforts

Knowledge Acquisition

Crowdsourcing for Better Knowledge Acquisition

Success of Wikis

One of humanity’s greatest inventions

A Key Feature of Wiki

Consensus

This distinguishes wikis from other publication tools

Consensus in Wikis Comes from

Collaboration
– ~17 edits/page on average in Wikipedia (with high variance)
– Wikipedia’s Neutral Point of View

Convention
– Users follow customs and conventions to engage with articles effectively

Software Support Makes Wikis Successful

• Trivial to edit by anyone
• Tracking of all changes, one-step rollback
• Every article has a “Talk” page for discussion
• Notification facility allows anyone to “watch” an article
• Sufficient security on pages, logins can be required
• A hierarchy of administrators, gardeners, and editors
• Software bots recognize certain kinds of vandalism and auto-revert, or recognize articles that need work, and flag them for editors

Finding Deeper Info

Wikipedia has articles about…
• … all cities, with info on their populations, locations, skyscrapers, etc.
• … all German cars, with engine size, acceleration data…

Can you find:
• Skyscrapers with 50+ floors, built after 2000, in Shanghai (or in Chinese cities with 1,000,000+ people)?
• German (Porsche) cars that accelerate from 0-100 km/h in 5 seconds?

How Wikipedia Answers – List!

http://en.wikipedia.org/wiki/List_of_fastest_cars_by_acceleration

Going Deeper

http://en.wikipedia.org/wiki/List_of_German_cars

Deeper…

And Deeper…

And Now…

Static List, Tables, …, Not Usable Enough

http://en.wikipedia.org/wiki/List_of_lists_about_Oregon

We need structured data with clear and consistent semantics

Semantics Come To Rescue

To find answers like:
• All Porsche vehicles made in Germany that accelerate from 0-100 km/h in less than 4 seconds
• Sci-Fi movies made after year 2000 that cost less than $10M and gross more than $30M
• A map showing where all Mercedes-Benz vehicles are manufactured
• All skyscrapers in China (Japan, Thailand, …) of 50 (40/60/70) floors or more, built in year 2000 (2001/2002) or after, sorted by year built, floors…, grouped by cities, regions…
• And many more

What is a Semantic Wiki

A wiki that has an underlying model of the knowledge described in its pages.

• Allows users to make their knowledge explicit and formal
• Semantic Web compatible

Two Perspectives

Wikis for Metadata

Metadata for Wikis

List of Semantic Wikis

• AceWiki
• ArtificialMemory
• Wagn - Ruby on Rails-based
• KiWi – Knowledge in a Wiki
• Knoodl – Semantic Collaboration tool and application platform
• Metaweb - the software that powers Freebase
• OntoWiki
• OpenRecord
• PhpWiki
• Semantic MediaWiki - an extension to MediaWiki that turns it into a semantic wiki
• Swirrl - a spreadsheet-based semantic wiki application
• TaOPis - has a semantic wiki subsystem based on Frame logic
• TikiWiki CMS/Groupware integrates Semantic links as a core feature
• zAgile Wikidsmart - semantically enables Confluence

Basics of Semantic Wikis

Still a wiki, with regular wiki features
– Category/Tags, Namespaces, Title, Versioning, ...

Typed Content (built-ins + user created, e.g. categories)
– Page/Card, Date, Number, URL/Email, String, …

Typed Links (e.g. properties)
– “capital_of”, “contains”, “born_in”…

Querying Interface Support
– E.g. “[[Category:Member]] [[Age::<30]]” (in SMW)

Semantic MediaWiki (SMW) Markup Syntax

[[Property::Value | Display]]

Tsinghua is a university located in [[Has location::Beijing]], with [[Has population::27000|about 27 thousand]] students.

In page "Property:Has location":

[[Has type::Page]]

In page "Property:Has population":

[[Has type::number]]
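
To make the markup concrete, here is a minimal Python sketch of how `[[Property::Value|Display]]` annotations could be pulled out of page text as (page, property, value) triples, and how the same text would read with only the display text left in. The regular expression and the function names `extract_annotations` and `render_text` are illustrative assumptions, not SMW's actual parser, which handles many more cases.

```python
import re

# Matches [[Property::Value]] and [[Property::Value|Display]].
ANNOTATION = re.compile(r"\[\[([^\[\]|:]+)::([^\[\]|]+)(?:\|([^\[\]]*))?\]\]")


def extract_annotations(page_title, wikitext):
    """Return (subject, property, value) triples found in the wikitext."""
    return [
        (page_title, m.group(1).strip(), m.group(2).strip())
        for m in ANNOTATION.finditer(wikitext)
    ]


def render_text(wikitext):
    """Replace each annotation with its display text, or with its value."""
    def shown(m):
        display = m.group(3)
        return display.strip() if display is not None else m.group(2).strip()
    return ANNOTATION.sub(shown, wikitext)


text = ("Tsinghua is a university located in [[Has location::Beijing]], with "
        "[[Has population::27000|about 27 thousand]] students.")
triples = extract_annotations("Tsinghua", text)
rendered = render_text(text)
```

The reader sees ordinary prose, while the wiki stores one fact per annotation; the property pages then say how each value should be typed.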

Define Classes

Beijing is a city in [[Has country::China]], with population [[Has population::2,200,000]].

[[Category:Cities]]

Categories are used to define classes because they are better for class inheritance.

The Jin Mao Tower (金茂大厦 ) is an 88-story landmark supertall skyscraper in …

[[Categories: 1998 architecture | Skyscrapers in Shanghai | Hotels in Shanghai | Skyscrapers over 350 meters | Visitor attractions in Shanghai | Landmarks in Shanghai | Skidmore, Owings and Merrill buildings]]

Category:Skyscrapers in China Category: Skyscrapers by country
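
The inheritance idea can be sketched in Python as a walk up the subcategory links: a page tagged with a narrow category also belongs to every ancestor category. The links in `SUBCATEGORY_OF` are hypothetical and only loosely modeled on the category names above; the function name `all_categories` is likewise an assumption for illustration.

```python
# Hypothetical subcategory links: child category -> parent categories.
SUBCATEGORY_OF = {
    "Skyscrapers in Shanghai": ["Skyscrapers in China"],
    "Skyscrapers in China": ["Skyscrapers by country"],
    "Skyscrapers by country": ["Skyscrapers"],
}


def all_categories(direct_categories):
    """Return the direct categories plus every ancestor category."""
    seen = set()
    stack = list(direct_categories)
    while stack:
        category = stack.pop()
        if category in seen:
            continue  # already visited; also guards against cycles
        seen.add(category)
        stack.extend(SUBCATEGORY_OF.get(category, []))
    return seen


jin_mao = all_categories(["Skyscrapers in Shanghai", "Hotels in Shanghai"])
```

With these assumed links, a query over "Skyscrapers" would also find a page that is only tagged "Skyscrapers in Shanghai".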

Database-style Query over Wiki Data

{{#ask:[[Category:Skyscrapers]][[Located in::China]][[Floor count::>50]][[Year built::<2000]] …

}}

Example: Skyscrapers in China higher than 50 stories, built before 2000
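
What such a query does can be approximated in a few lines of Python: keep the pages in a category whose property values pass every condition. This is only a sketch over hypothetical in-memory data (the "Tower" pages and their values are invented for illustration), and the function `ask` is not SMW's query engine. It uses strict comparisons; how SMW itself treats `>` and `<` may depend on configuration.

```python
# Hypothetical pages with illustrative property values (not real data).
PAGES = [
    {"title": "Tower A", "categories": {"Skyscrapers"},
     "Located in": "China", "Floor count": 88, "Year built": 1998},
    {"title": "Tower B", "categories": {"Skyscrapers"},
     "Located in": "China", "Floor count": 101, "Year built": 2008},
    {"title": "Tower C", "categories": {"Skyscrapers"},
     "Located in": "Japan", "Floor count": 70, "Year built": 1993},
    {"title": "Tower D", "categories": {"Skyscrapers"},
     "Located in": "China", "Floor count": 45, "Year built": 1990},
]


def ask(pages, category, conditions):
    """Return titles of pages in `category` that satisfy every condition.

    `conditions` maps a property name to a predicate on its value.
    """
    results = []
    for page in pages:
        if category not in page["categories"]:
            continue
        if all(prop in page and test(page[prop])
               for prop, test in conditions.items()):
            results.append(page["title"])
    return results


matches = ask(PAGES, "Skyscrapers", {
    "Located in": lambda v: v == "China",
    "Floor count": lambda v: v > 50,
    "Year built": lambda v: v < 2000,
})
```

Because the conditions run over typed property values rather than free text, changing a threshold or adding a sort order is a one-line edit to the query, not a new hand-maintained list page.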

ASK/SPARQL query target

Data via DBpedia

What is the Promise of Semantic Wikis?

Semantic Wikis promise Consensus over Data

Combine low-expressivity data authorship with the best features of traditional wikis

User-governed, user-maintained, user-defined

Easy to use as an extension of text authoring

The ultimate data aggregator

One Key Helpful Feature of Semantic Wikis

Semantic Wikis are “Schema-Last”

Databases require DBAs and schema design;

Semantic Wikis develop and maintain the schema in the wiki
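
A toy Python sketch of the schema-last idea: facts are accepted before any type is declared, and a property's type can be added later without migrating the data. The class `SchemaLastStore` and its methods are hypothetical names chosen for illustration; they do not mirror SMW's storage layer.

```python
from collections import defaultdict


class SchemaLastStore:
    """Toy store: facts come first, property types are declared later."""

    def __init__(self):
        self.facts = defaultdict(dict)  # page -> {property: raw value}
        self.types = {}                 # property -> converter function

    def annotate(self, page, prop, raw_value):
        # Any property name is accepted; no schema has to exist yet.
        self.facts[page][prop] = raw_value

    def declare_type(self, prop, converter):
        # Later, someone edits the property page to give it a type.
        self.types[prop] = converter

    def value(self, page, prop):
        raw = self.facts[page][prop]
        converter = self.types.get(prop, str)  # untyped values stay text
        return converter(raw)


store = SchemaLastStore()
store.annotate("Tsinghua", "Has population", "27000")
before = store.value("Tsinghua", "Has population")
store.declare_type("Has population", int)
after = store.value("Tsinghua", "Has population")
```

Here `before` is the text `"27000"` and `after` is the number `27000`: the stored fact never changed, only the schema caught up with it.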

Semantic MediaWiki Community

• Open source (GPL)
• Well documented
• Active mailing list
• Commercial support available
• World-wide community
• Regular conferences
– Next SMWCon 4/28-30, 2011 Arlington, VA

http://semantic-mediawiki.org/

Very stable SMW core

Mature while still growing, slowly but steadily

SMW Extensions – Help Build Great Things

Data I/O
• Halo Extensions, Semantic Forms, Semantic Notification, …

Query and Browsing
• Semantic Toolbar, Semantic Drilldown, Enhanced Retrieval, Search…

Visualization
• Semantic Result Printers, Tree View, Exhibit, Flash charts…

Other useful extensions
• HaloACL, Deployment, Triplestore Connector, Simple Rules…
• Semantic WikiTags and Subversion Integration extensions
• Upcoming Linked Data Extension, with R2R and SILK from F.U.Berlin

Example: Ultrapedia – Semantic Wikipedia

Ultrapedia: An SMW demo built to explore general knowledge acquisition in a wiki

• Wikipedia merged with the power of a database
• Help readers and writers be more productive

An Analytical Encyclopedia

Better Views of the Wiki Data

http://wiking.vulcan.com/up/index.php/Porsche_996

Dynamic Views of the Acceleration Data

Graph Views of the Acceleration Data

Dynamic Mapping and Charting

Information Discovery via Visualization

The Inspiration

We started with a wiki site

We could have a web application

Video: Semantic Wikis for A New Problem

Increasing technical complexity → ← Increasing user participation

Social tag-based characterization
• Keyword search over tag data
• Inconsistent semantics
• Easy to engineer

Algorithm-based object characterization
• Database-style search
• Consistent semantics
• Extremely difficult to engineer

Social database-style characterization (Semantic Entertainment Wiki)
• Database search + wiki text search
• Semantic consistency via wiki mechanisms
• Easy to engineer

Semantic Seahawks Football Wiki

Semantic Entertainment: Query Result Highlight Reel

Commercial Look/Feel

Play-by-play video search

Highlight reel generation

Search on crowd-defined patterns (“touchdowns with big hits”)

Tree-based navigation widget

Very favorable economics

Demo

Case Study and Demo: Project Management with SMW+

• Automatically populate tables
• Just the data you want, at the level you want
• Calendars and timelines
• Workflows
• Personal menus
• Form-oriented inputs
• Notifications via email/RSS
• MS Office integration
• SVN integration

Vulcan Project Management Wiki (Story)

Template and style sheet customizations

Related content automatically included

Vulcan Project Management Wiki (Task)

Color codes to indicate types and status

SVN integration automatically marks a task “Completed” and relates it to the repository

Vulcan Project Management Wiki (Visualizations)

Demo

Screenshot of a Sprint page

http://wiking.vulcan.com/dev/index.php/Sprint_101020

Data automatically generated via template queries on page

Requirements for Wiki “Developers”

One need not
– Write code like a hardcore programmer
– Design or set up an RDBMS, or make frequent schema changes
– Possess the knowledge of a senior system admin

Instead, one needs to
– Configure the wiki with desired extensions
– Design and evolve the data model (schema)
– Design content
• Customize templates, forms, styles, skin, etc.

Conclusions: Semantic MediaWiki is a Powerful Tool

Semantic MediaWiki+ (http://smwforum.ontoprise.com)
– Open-source, growing semantic wiki software system
– Wiki-style text + semantic markups
– Collaborative, user-governed subject models and data curation
– Simple and extensible data models with easy import/export

SMW+ has many government and industry users
– People have built applications with it

Knowledge management via crowds can work
– A way to leverage and exploit web-collected data
– A lightweight collaborative knowledge management tool

A new platform for lightweight web application development

[Chart: KB effort (cost, people, …) versus KB size (number of assertions, complexity, …); labels: Vulcan, Now, Future]

Acknowledgement

Paul Allen

Mark Greaves

Karlsruhe Institute of Technology

Tommy Lu

Ontoprise GmbH

William Smith

TeamMersion LLC

Jesse Wang

Thank you!

Backup slides start here

(End of Slides)

Case Study: Battle-space Luminary System

Discover when New Information represents a change in understanding of entities
– Discovery of explicit entity links, implicit relationships

Large Volumes of Data in various formats
– Unstructured news articles
– Tactical Reports, Field Intelligence
– Structured Database Information

Use Wiki Pages to represent current knowledge about an entity – “what we know”

Domain Ontology to represent domain of information – “what we want to know”

Issue Alerts when Significant Events occur
– New information according to category
– Changing information on topics of interest
– Need to send information to various devices – cell phones, email, etc.

System Design

Wiki Configuration
– Semantic MediaWiki: Large developer community, active development, open source. Wikipedia uses MediaWiki, so scalability and performance are important.
– Semantic Results Format: Provides various rich media displays of semantic information, including graphs, timelines, maps
– Semantic Forms: Provides convenient user interface for entering semantic data into wiki, avoiding cumbersome wikitext
– Semantic Notifications: Enables sending of notifications when results of semantic query change.

Domain Ontology
– Created OWL Ontology for Terrorism

Semantic Parsing, Extraction, Reasoning
– Java Process using various Open-Source Toolkits
– Rapid plugin of new technologies
– Multiple Data Sources supported
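
The notification behavior described above (send a message when the results of a semantic query change) can be sketched in Python as a comparison between two runs of the same query. The function names `diff_results` and `notify_if_changed` and the "Entity" page titles are hypothetical; this is not the Semantic Notifications extension's code, only the underlying idea.

```python
def diff_results(previous, current):
    """Return (added, removed) page titles between two query runs."""
    previous, current = set(previous), set(current)
    return sorted(current - previous), sorted(previous - current)


def notify_if_changed(previous, current, send):
    """Call `send(message)` once if the result set changed."""
    added, removed = diff_results(previous, current)
    if not added and not removed:
        return False
    send(f"Query results changed: added={added}, removed={removed}")
    return True


outbox = []  # stands in for an email or SMS gateway
changed = notify_if_changed(["Entity A", "Entity B"],
                            ["Entity B", "Entity C"], outbox.append)
unchanged = notify_if_changed(["Entity B"], ["Entity B"], outbox.append)
```

Passing the delivery function in as `send` keeps the change detection separate from the channel, which fits the stated need to reach several kinds of devices.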

Sample Content Page

Wiki Content Design

Use Templates to Ensure Consistent Look-and-Feel
– Templates Correspond to Ontology Classes
– Fields within Templates correspond to Properties within Ontology
– Rich Content Visualizations derived in consistent way

Hierarchical Categories match Class Hierarchy within Ontology
– Ensures Validity for Properties
– Category included on each Template page to ensure consistency

Forms Provide ability for users to enter data directly into wiki without knowing Wiki Text
– Each form corresponds to a Template
– Fields within forms correspond to the fields/properties within the Template
– GUI can include auto-completion
– Created Page immediately linked semantically to rest of Wiki
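
The form-to-template-to-property chain can be sketched in Python as two small string builders: one turns form field values into a MediaWiki template call, the other produces a template body that annotates each field as a property. The template name "City", the field names, and both function names are assumptions for illustration; real deployments would define these with Semantic Forms and template pages in the wiki.

```python
def template_call(template, fields):
    """Render a MediaWiki template call from form field values."""
    lines = ["{{" + template]
    for name, value in fields.items():
        lines.append(f"|{name}={value}")
    lines.append("}}")
    return "\n".join(lines)


def template_body(field_to_property):
    """Render a template body that annotates each field as a property."""
    return "\n".join(
        "[[" + prop + "::{{{" + field + "|}}}]]"
        for field, prop in field_to_property.items()
    )


# What a saved form would write into the page (hypothetical "City" template).
page_text = template_call("City", {"Country": "China", "Population": "2,200,000"})

# What the template itself would contain, mapping fields to properties.
body = template_body({"Country": "Has country", "Population": "Has population"})
```

The user only fills in form fields; the template decides which property each field feeds, so every page built from the same form is annotated the same way.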

Sample Visualizations

Visualizations automatically created w/o user edit (tables, timelines, maps, social networks…)

UI enables notifications based on results of query – message sent when visualization changes

Wikipedia for Porsches (Acceleration Data Example)

Information Need: All Porsche models that accelerate 0-100kph in under 5, 6, and 7 seconds

More Porsche Acceleration Data in Wikipedia

Ultrapedia Main Page

Tree View Control

Abstract/Summary quick preview

Semantics for Improved Wiki Navigation

The Porsche 996 Acceleration Table In Ultrapedia

Same Table as a Query

Dynamically-Generated Tables for Queries

Which Porsches accelerate fast?

Information Need: All Porsche models that accelerate 0-100kph in under 5, 6, and 7 seconds

Graph Views of the Acceleration Data

External Data via a Live Ebay Query

Linking to External Ebay Data

Photos in Wiki Articles as Data

Mercedes-Benz E-class W212 Gallery Section

Timelines from Data

Volkswagen Production Timeline View

Dynamic Mapping and Charting

Editing Wiki Data In Place
