sssw2007 social web baumann

72
Social Web! ... and now? Stephan Baumann Competence Center Computational Culture German Research Center for AI The Fifth Summer School on Ontological Engineering and the Semantic Web (SSSW'07)

Upload: stephan-baumann

Post on 06-Dec-2014

5.410 views

Category:

Technology


2 download

DESCRIPTION

invited talk about the options to improve social software by semantic web techniques and vice versa, strong focus on the social edge of technology

TRANSCRIPT

Page 1: SSSW2007 Social Web Baumann

Social Web! ... and now?

Stephan BaumannCompetence Center Computational Culture

German Research Center for AI

The Fifth Summer School on Ontological Engineering and the Semantic Web (SSSW'07)

Page 2: SSSW2007 Social Web Baumann

Prologue

Page 3: SSSW2007 Social Web Baumann
Page 4: SSSW2007 Social Web Baumann

• 2007

• overwhelming

• too many copycats

• Semantic Web Ph.Ds MUST help!

Page 5: SSSW2007 Social Web Baumann

Setting the scene (1)• Devices

• More computing power

• More storage capacity

• Less size

• Connectivity, always-on, ad-hoc networks

• Software

• Open Source, freeware, even frameworks

• 1-click-away, even advanced academic stuff

Page 6: SSSW2007 Social Web Baumann

Setting the scene (2)• People

• Digital Natives

• Free-time programmers

• Coding as hobby

• Coding as art

• Powerful communities (sourceforge, processing, generator-x, etc.)

• APIs of the powerhouses (Google, Ebay, etc.)

Page 7: SSSW2007 Social Web Baumann

• ... and as a result?!

• ripping

• sharing

• remixing

• mashing

• always

• everywhere

• transmedia: from evil ringtones to Web2.0 mash-ups

Setting the scene (3)

MUSICMOVIES

GAMESPHOTOS

BLOGS

Page 9: SSSW2007 Social Web Baumann

cultural impact

[S. Baumann: Artificial Listening Systems, Ph.D Thesis, University of Kaiserslautern, March 2005.]

Page 10: SSSW2007 Social Web Baumann

social impact

Page 11: SSSW2007 Social Web Baumann

so what?!

Page 12: SSSW2007 Social Web Baumann

Exciting area for scientific engagement :-)

Page 13: SSSW2007 Social Web Baumann

Great!Are theresimilarsongs ?!?!

What is he listening to ?

He shares my musical taste ... handsome guy

[S. Baumann and J. Halloran:An ecological approach to multimodal subjective music similarity perception. (CIM2004), Graz, Austria, April 2004.]

Page 14: SSSW2007 Social Web Baumann

bluetuna.opendfki.de

[S. Baumann, A. Bassoli, B.Jung, M. Wisniowski: BluetunA: Let your neighbor know what music you like(CHI2007), San Jose, US, April 2007.]

• Freeware supporting common mobile phones

• Sources at Sourceforge (join in!)

• Import of Last.fm profiles

• Last.fm „scrobbles“ 15 mio times daily

• CBS acquired Last.fm for $280 million

Page 15: SSSW2007 Social Web Baumann

Person, Context, Artifact, Relation,

Network

Page 16: SSSW2007 Social Web Baumann

I. People

Page 17: SSSW2007 Social Web Baumann

User models for music

<musicpreferences>

<genre>classical</genre>

<genre>blues</genre>

<genre>rock/pop</genre>

<composer>Mozart</composer>

<artist>Beatles</artist>

<sample>

<title>Yesterday</title>

<artist>Beatles</artist>

</sample>

</musicpreferences>

<habit>

<context>I’m happy

<tempo>very fast</tempo>

<genre>pop</genre>

</context>

<pfeature>romantic

<tempo>very slow</tempo>

<softness>very soft<softness>

<title>*love*</title>

</pfeature>

<context>bedtime

<pfeature>romantic</pfeature>

</context>

</habit>

<user><generalbackground>

<name>John White </name> <education>MS</education> <citizenship>US</citizenship> <birthdate>9/7/1974</birthdate> <sex>male</sex> <occupation>student</occupation>

</generalbackground></user>

<musicbackground>

<education>none</education>

<instrument>piano</instrument>

<instrument>vocal</instrument>

</musicbackground>

<generalpreferences>

<color>blue</color>

<animal>dog</animal>

</generalpreferences>

[Chai,Vercoe: ISMIR2000]

Page 18: SSSW2007 Social Web Baumann

FOAF• http://www.foaf-project.org

• FOAF documents describe the characteristics and relationships amongst friends of friends, and their friends, and the stories they tell.

• „So FOAF is quite pluralistic in its approach to representing relationships between people. FOAF is built on top of a general purpose machine language for representing relationships (ie. RDF), so is quite capable of representing any kinds of relationship we care to add. The problems are generally social rather than technical; deciding on appropriate ways of describing these interconnections is a subtle art.“

• „Perhaps the most important use of foaf:knows is, alongside the rdfs:seeAlso property, to connect FOAF files together. Taken alone, a FOAF file is somewhat dull. But linked in with 1000s of other FOAF files it becomes more interesting, with each FOAF file saying a little more about people, places, documents, things... people they mention, and so on.“

[Dan Brickley, Libby Miller: foaf-project.org, 2000-2005]

Page 19: SSSW2007 Social Web Baumann

FOAF-related applications

• FOAFme (http://foafme.opendfki.de)

• FOAFNaut

• Gnowsis

[Sauermann, Lee: foafcamp.asemantics.org 2004]

Page 20: SSSW2007 Social Web Baumann

Users & Context

[Kofod-Petersen&Cassens: MRC2005, FLAIRS2006]

Page 21: SSSW2007 Social Web Baumann

www.ubisworld.org

• GUMO - General User Model Ontology (OWL)

• UbisOntology (RDF-S)

88 CHAPTER 5. USER MODEL ONTOLOGY & UBISWORLD

acquisition and inference steps, again others are entered into the system by self-reports bythe users. In [Kobsa et al., 2001] an analysis about the input data for personalized systemsis presented which discusses the different kinds of data that user-adaptive systems may needto consider. The classifying terms user data and usage data are introduced. However, it isalso shown that there are potential overlaps between these two categories. The user data isfurther distinguished into demographic data, user knowledge, user skills and capabilities,user interests and preferences and user goals and plans.

The categorization in the general user model ontology GUMO approach, as shown in fig-ure 5.6, is based on Jameson’s and Kobsa’s work; however, it is different, in that the modelingof the dimensions can now be split into auxiliaries, predicates and ranges whichleads to a more powerful interplay between the SITUATIONALSTATEMENTS and the ontol-ogy, as argued in the sections above. Furthermore, no top level user model ontology for thesemantic web has been proposed so far, apart from [Heckmann, 2003d]. Figure 5.6 classifiesseveral groups of user dimensions that can be identified. The complete list of the modeledbasic user dimensions together with their UbisIds are presented in the appendix table B.2 onpage 223.

Figure 5.6: Groups of basic user dimensions as shown in the Ubis Ontology Browser

Since there is no consensus in the nature or meaning of affect, existing theories andmodels of personality, mood, emotion and facial expressions are discussed individually inthe following subsections without classifying them under the term “affect” in the ontology.

5.2.2.1 Contact Information and Demographics

Contact information is available to most commercial systems, since the user has to regis-ter their record data. Some of them are mostly compulsory to be entered while others areoptional. Even these basic bits of user information like given name, postal code orcountry as shown in figure 5.7(a) are good for user-adaptation. There exist commercial

5.2. GUMO - THE GENERAL USER MODEL ONTOLOGY 89

databases that map the postal code together with the street and sometimes even thehouse number to the assumed wealth and living standard of the inhabitants. Knowing thecity of living makes it possible to offer regional awareness. The country refers to theofficial spoken languages. Even the assumption, that old-fashioned given names couldrefer to elderly people is used in some commercial applications.

(a) (b)

Figure 5.7: Contact information and demographics as shown in the Ubis Ontology Browser

User demographics like gender and age as shown in figure 5.7(b) are very im-portant user model dimensions that are for example inferred from speaking behav-ior, see [Muller et al., 2001]. Second languages, family status, highesteducation level, employment and salary also belong to the demographics group.Today’s personalized websites often operate on the basis of these data and according to[Kobsa et al., 2001] the value can be high when combined with high-quality statistical data,such as about the purchase behavior of different customer groups.

5.2.2.2 Personality and Characteristics

Personality and Characteristics describe permanent or very slow changing pat-terns that are associated with an individual. They might be difficult to detect by a user adap-tive system but once they are known they can be permanently used. There exist severalmodels of personality in the psychological literature. Not all of them can be considered forthe basic user model ontology. However, since the ontology should not stick to one modelonly, several models have been analyzed and are discussed in the following. This section isbased on [Klesen, 2002], [Oberlander, 2004], [Picard, 1997] and [Andre et al., 2000].

A predominant model for personality is according to [Klesen, 2002] the Five-

5.2. GUMO - THE GENERAL USER MODEL ONTOLOGY 89

databases that map the postal code together with the street and sometimes even thehouse number to the assumed wealth and living standard of the inhabitants. Knowing thecity of living makes it possible to offer regional awareness. The country refers to theofficial spoken languages. Even the assumption, that old-fashioned given names couldrefer to elderly people is used in some commercial applications.

(a) (b)

Figure 5.7: Contact information and demographics as shown in the Ubis Ontology Browser

User demographics like gender and age as shown in figure 5.7(b) are very im-portant user model dimensions that are for example inferred from speaking behav-ior, see [Muller et al., 2001]. Second languages, family status, highesteducation level, employment and salary also belong to the demographics group.Today’s personalized websites often operate on the basis of these data and according to[Kobsa et al., 2001] the value can be high when combined with high-quality statistical data,such as about the purchase behavior of different customer groups.

5.2.2.2 Personality and Characteristics

Personality and Characteristics describe permanent or very slow changing pat-terns that are associated with an individual. They might be difficult to detect by a user adap-tive system but once they are known they can be permanently used. There exist severalmodels of personality in the psychological literature. Not all of them can be considered forthe basic user model ontology. However, since the ontology should not stick to one modelonly, several models have been analyzed and are discussed in the following. This section isbased on [Klesen, 2002], [Oberlander, 2004], [Picard, 1997] and [Andre et al., 2000].

A predominant model for personality is according to [Klesen, 2002] the Five-

92 CHAPTER 5. USER MODEL ONTOLOGY & UBISWORLD

(a) (b)

Figure 5.8: (a) Personality and (b) Characteristics in the general user model ontology GUMO

5.2.2.3 Mood, Emotion and Facial Expression

Mood and emotion differ from personality and characteristics in respect to the user modeldimensions in terms of duration. Emotions tend to be closely associated with a specificevent or object and have a short duration of a few minutes up to an hour, whereas moodis more diffuse and of longer duration between a few hours and a few days. The problemof representing and reasoning with affective states and personality traits is analyzed in theresearch about embodied animated agents, see e.g. [Andre et al., 2000], [Klesen, 2002] and[Prendinger and Ishizuka, 2004] as well as in the research of human performance simulation,see [Silverman et al., 2003]. Frameworks for the generation of emotions in life-like agentsare presented in [Lee, 1999], [Gebhard et al., 2004] and [Schmitt, 2005]. Emotional statesare for example happiness, anxiety, fear, love, hate and pride. The list ofcurrently modeled emotional states in the general user model ontology GUMO can be found

[Heckmann: MRC2005, Ph.D Thesis DFKI]

Page 22: SSSW2007 Social Web Baumann

II. People–Networks

Page 23: SSSW2007 Social Web Baumann

Friendster and Vizster

• 2003: Friendster, very popular social software

• descriptive personal profiles

• + explicit friendship links

• attracted millions of users -> massive network

• Hard to navigate user-interface

• Boyd & Heer decided to design a visualization system for online social networks [Heer,Boyd: Vizster, InfoVis2005]

• Danah Boyd: well-known ethnographer, UC Berkeley

• Jeffrey Heer: user-interface design expert, UC Berkeley

Page 24: SSSW2007 Social Web Baumann

Friendster and Vizster

• Data feed

• crawl of 1.5 million members

• storage: mySQL database

• Lucene for keyword search

• [Newman‘s2004] community identification algorithm

• Custom visualization of diverse network characteristics based on prefuse [http://prefuse.sourceforge.net]

• Several visual interaction modes based on ethnographic findings

Page 25: SSSW2007 Social Web Baumann

!"#$%& '"%& ()*+,#'-& +.& /%+/$%& 01%2& 3,#%421'%,& .+,& 1+5#)$&

%6/$+,)'#+4& )42& /$)-7& 80%1'#+4)9$%& /,)5'#5%1& :%,%& )$1+& /,%1%4';&

<+(%&01%,1&01%2&3,#%421'%,& '+&2#1',#90'%&2,0=1&:"#$%&+'"%,1&01%2&

'"%& 1%,>#5%& '+& 5+41',05'& .,)020$%4'& /,+.#$%1& .+,& /09$#5& ,+)1'#4=&

'",+0="& '%1'#(+4#)$1;& ?4+'"%,& /,+9$%(& '")'& %(%,=%2& :)1& '"%&

5+$$#2#4=& +.& 4%':+,@1& 4+,()$$-& @%/'& )/),'& A& 4+'& %>%,-+4%& :)1&

/,%/),%2&'+&%6/+1%&'"%#,&4%':+,@1&'+&9+'"&.,#%421&)42&5+$$%)=0%1;&&

!! "#$%&'#(&)#*+,-.$#/+,

B0,& =+)$& :#'"& C#D1'%,& :)1& '+& 90#$2& )& >#10)$#D)'#+4& 1-1'%(& '")'&

%42E01%,1& +.& 1+5#)$& 4%':+,@#4=& 1%,>#5%1& 5+0$2& 01%& '+& .)5#$#')'%&

2#15+>%,-&)42&#45,%)1%2&):),%4%11&+.&'"%#,&+4$#4%&5+((04#'-;&!%&

:)4'%2& '+& 10//+,'& '"%& %6/$+,)'+,-& )42& /$)-.0$& )1/%5'1& +.&

3,#%421'%,& :"#$%& )$1+& =#>#4=& 01%,1& %)1#%,& )55%11& '+& 1%),5"& )42&

=,+0/& /)''%,41;& !"#$%& 01%,1& ,%=0$),$-& %6/$+,%2& '"%& 4%':+,@& +4&

3,#%421'%,7&'"%&$#4%),&.+,()'&$#(#'%2&105"&%6/$+,)'#+41;&F"#1&$%2&01&

'+& 2%>%$+/& ,#5"%,& 4%':+,@& >#%:1& )42& %6/$+,)'+,-& '++$17& :"#$%&

()#4')#4#4=& )& $+5)$& +,#%4')'#+4;&!%& )$1+& $%),4%2& '")'& '"%& 01%& +.&

#()=%,-&:)1&#42#1/%41)9$%&.+,&#2%4'#.-#4=&/%+/$%&)42&%1')9$#1"#4=&

)& /,%1%4')'#+4& +.& 1%$.7& )42& 1+& (01'& /$)-& )& 5%4',)$& ,+$%& #4& '"%&

>#10)$#D)'#+4;&G4&)22#'#+4&'+&"%$/#4=&10//+,'&'"%&50,,%4'&/,)5'#5%17&

:%&:)4'%2& '+&()@%& 10,%& '")'&C#D1'%,& 2#2& 4+'& %$#(#4)'%& '"%& 2)')&

'")'&"%$/%2&01%,1&=%'&)&1%41%&+.&/%+/$%&'",+0="&'"%#,&/,+.#$%1;&B4%&

%6)(/$%&#1&'"%&01%&+.&,%E)//,+/,#)'%2&/,+.#$%&.#%$21&H!"#"7&#4>%,'#4=&

)=%1&'+&#2%4'#.-&'%%4)=%,1I&.+,&5+2%2&5+((04#5)'#+4&:#'"#4&)&109E

/+/0$)'#+4;& 3+,& '"#1& ,%)1+47& :%& ,%)$#D%2& '")'& :%& (01'& ()@%&

1%),5")9$%& /,+.#$%& 2)')& >%,-& /,%1%4'& )42& )55%11#9$%& #4& '"%&

>#10)$#D)'#+4;& F"%1%& =+)$1& /+1#'#+4& C#D1'%,& 2#..%,%4'$-& .,+(&

',)2#'#+4)$&1+5#)$&4%':+,@&>#10)$#D)'#+41&01%2&)1&)4)$-1#1&'++$1&9-&

1+5#)$&15#%45%&,%1%),5"%,1;&F"%&.+$$+:#4=&2%15,#/'#+4&#45$02%1&'"%&

#(/$#5)'#+41&'"#1&)//,+)5"&")1&")2&.+,&+0,&2%1#=4&2%5#1#+417&9+'"&

#4& '%,(1&+.& /,%1%4')'#+4& )42& '"%& $%>%$& +.& '%5"4#5)$& 1+/"#1'#5)'#+4&

%6/+1%2&9-&'"%&>#10)$#D)'#+4;&

C#D1'%,& /,%1%4'1& 1+5#)$& 4%':+,@1& 01#4=& )& .)(#$#),& 4+2%E$#4@&

,%/,%1%4')'#+47&:"%,%&4+2%1&,%/,%1%4'&(%(9%,1&+.&'"%&1-1'%(&)42&

$#4@1& ,%/,%1%4'& '"%& ),'#50$)'%2& J.,#%421"#/K& $#4@1& 9%':%%4& '"%(&

H3#=0,%1&L7MI;&G4&'"#1&>#%:7&4%':+,@&(%(9%,1&),%&/,%1%4'%2&01#4=&

9+'"& '"%#,& 1%$.E/,+>#2%2& 4)(%& )427& #.& )>)#$)9$%7& )& ,%/,%1%4')'#>%&

/"+'+=,)/"& +,& #()=%;& F"%& 4%':+,@1& ),%& /,%1%4'%2& )1& %=+5%4',#5&

4%':+,@1N& 4%':+,@1& 5+41#1'#4=& +.& )4& #42#>#20)$& )42& '"%#,&

#((%2#)'%& .,#%421;& O1%,1& 5)4& %6/)42& '"%& 2#1/$)-& 9-& 1%$%5'#4=&

4+2%1& '+&()@%& >#1#9$%& +'"%,1P& #((%2#)'%& .,#%421& )1&:%$$;&F+& '"%&

,#="'& +.& '"%& 4%':+,@& 2#1/$)-& #1& )& /)4%$& /,%1%4'#4=& )& /%,1+4P1&

/,+.#$%;&?1& 2#15011%2& $)'%,7& '"%&/,+.#$%& /)4%$& )$1+& /,+>#2%1& 2#,%5'&

()4#/0$)'#+4&1%),5"%1&+>%,&/,+.#$%&'%6';&

G4& /0,10#4=& '"#1& 2%1#=47& :%& 5"+1%& '+& >#+$)'%& <"4%#2%,()4P1&

()4',)& +.& J+>%,>#%:& .#,1'7& D++(& )42& .#$'%,7& '"%4& 2%')#$1E+4E

2%()42K&QLRS7&#41'%)2&+/'#4=&.+,&)&/"#$+1+/"-&+.&J1'),'&:#'"&:")'&

-+0& @4+:7& '"%4& =,+:;K& ?4& +>%,>#%:& +.& '"%& .0$$& 4%':+,@& #1&

#4)//,+/,#)'%&#4&'"#1&/%,1+4)$&5+4'%6'7&)1&'"%&1"%%,&15)$%&+9150,%1&

01%.0$& $)42(),@1;& O1%,1& +.& '"#1& 1-1'%(& ),%& .)(#$#),& :#'"& '"%#,&

.,#%4217&1+(%&.,#%421&+.&.,#%4217&)42&>),#+01&J5%$%9,#'#%1;K&T#>%4&)&

$)5@& +.& $% &'()'(& @4+:$%2=%& +.& '"%& 01%,P1& .)(#$#),#'-& :#'"& '"%#,&

%6'%42%2& 4%':+,@7& 1'),'#4=& .,+(& )4& %=+5%4',#5& /%,1/%5'#>%& 4+'&

+4$-& 5),,#%1& $%11& /%,5%/'0)$& )42& 5+(/0')'#+4)$& 90,2%47& 90'&

=0),)4'%%1& '"%& /,%1%45%& +.& ,%)2#$-& #2%4'#.#)9$%& $)42(),@1& .+,&

&

"#$%&'!()!*+&'',!-./0!/1!0.'!2#3-0'&!4#-%56#350#/,!-7-0'89!:.'!6'10!-#;'!<&'-',0-!5!,'0=/&>!;#-<657!=#0.!+/,0&/6-!1/&!+/88%,#07!5,567-#-!5,;!>'7=/&;!-'5&+.9!:.'!&#$.0!-#;'!+/,-#-0-!/1!5!<5,'6!;#-<657#,$!5!-'6'+0';!8'8?'&@-!<&/1#6'!#,1/&850#/,9!A/&;-!#,!0.'!<&/1#6'!<5,'6!0.50!/++%&!#,!8/&'!0.5,!/,'!<&/1#6'!=#66!.#$.6#$.0!/,!8/%-'B/4'&C!+6#+>#,$!0.'-'!=/&;-!=#66!#,#0#50'!-'5&+.'-!1/&!0./-'!0'&8-9!:.'!+.'+>?/D'-!#,!0.'!<&/1#6'!<5,'6!=#66!#,#0#50'!5,!!

EFB&57G!4#'=!/1!0.50!<5&0#+%65&!<&/1#6'!;#8',-#/,!H-''!"#$%&'-!IBJK9!

Page 26: SSSW2007 Social Web Baumann

• Source: Top100 Blogs in Germany

• Analysis of blogroll structure

• Means for trust, quality?

• Blogsono Student Project: Visualization and Sonification

Analysis and Visualization of Popular Networks

Page 27: SSSW2007 Social Web Baumann

Blogsono

[Laininger, DFKI/ FH-ZW 2006]

Page 28: SSSW2007 Social Web Baumann

• Information Architecture Lab, FH Kaiserslautern, Germany [Prof.Speck et al]

• Several Student Projects

• Vinex: Visualization

• Nepotist Analyzer: Economic Networks Analyzer

• spamCAN: Spam Analyzer

• PeerLo: Bittorrent Network Analyzer

• B.A.S.S: MySpace MusicAnalysis

Analysis and Visualization of Popular Networks

Page 29: SSSW2007 Social Web Baumann

!"#$%&'()*"+,&-.(/, 0&1"2*2"+/&!3+4+..&53+(4(67895&:;;<

-=>?&(%@%&A B(>(+))CDE+F("&@("(+)&E="&1G"*("=)F&*("&-=/3>?H/3+)()A5(/3)#4#F+(&=)*&*(H&$"(+()&I+HH()HE=F?)FH&(%@%

[Speck, Hendrik and Frédéric Philipp Thiele. Social Network Analysis. Business models, Explanations, Attributes and Features of Social Networks. CeBIT. WeblogCamp@CeBIT. March 18th, 2007, Hanover (Germany)]

!"#$%&'()*"+,&-.(/, 0&1"2*2"+/&!3+4+..&53+(4(67895&:;;<

-=>?&(%@%&A B(>(+))CDE+F("&@("(+)&E="&1G"*("=)F&*("&-=/3>?H/3+)()A5(/3)#4#F+(&=)*&*(H&$"(+()&I+HH()HE=F?)FH&(%@%

J+/()H(&9)$#">?D+#)%&K#=&?"(&$"((&D#&H3?"(&LD#&/#.MN&*+HD"+O=D(&?)*&D"?)H>+D&D3(&P#",Q&?)*&D#&"(>+R&LD#&?*?.D&D3(&P#",Q&=)*("&D3(&$#44#P+)F&/#)*+D+#)HS&TDD"+O=D+#)%&LK#=&>=HD&?DD"+O=D(&D3(&P#",&+)&D3(&>?))("&H.(/+$+(*&OM&D3(&?=D3#"&#"&4+/()H#"&O=D&)#D&+)&?)M&P?M&D3?D&H=FF(HDH&D3?D&D3(M&()*#"H(&M#=&#"&M#="&=H(&#$&D3(&P#",Q&-3?"(&T4+,(%&L9$&M#=&?4D("N&D"?)H$#">N&#"&O=+4*&=.#)&D3+H&P#",N&M#=&>?M&*+HD"+O=D(&D3(&"(H=4D+)F&P#",&#)4M&=)*("&D3(&

H?>(N&H+>+4?"&#"&?&/#>.?D+O4(&4+/()H(%Q

6#)/4=H+#)TDD"+O=D+#)A-3?"(T4+,(&U%;&V).#"D(*%&J+/()H(&9)$#">?D+#)

Page 30: SSSW2007 Social Web Baumann

!"#$%&'()*"+,&-.(/, 0&1"2*2"+/&!3+4+..&53+(4(67895&:;;<

-=>?&(%@%&A B(>(+))CDE+F("&@("(+)&E="&1G"*("=)F&*("&-=/3>?H/3+)()A5(/3)#4#F+(&=)*&*(H&$"(+()&I+HH()HE=F?)FH&(%@%

J(.#D+HDK)?4LH("%&7/#)#>+/&J(DM#",&K)?4LH("%

-#/+?4&J(DM#",&K)?4LH+H

-#="/(N&63?"4()(&8(?O("HP&Q?+)("&!$+HD("P&!"#$%&'()*"+,&-.(/,%&1?/33#/3H/3=4(&R?+H("H4?=D(")

[Speck, Hendrik and Frédéric Philipp Thiele. Social Network Analysis. Business models, Explanations, Attributes and Features of Social Networks. CeBIT. WeblogCamp@CeBIT. March 18th, 2007, Hanover (Germany)]

!"#$%&'()*"+,&-.(/, 0&1"2*2"+/&!3+4+..&53+(4(67895&:;;<

-=>?&(%@%&A B(>(+))CDE+F("&@("(+)&E="&1G"*("=)F&*("&-=/3>?H/3+)()A5(/3)#4#F+(&=)*&*(H&$"(+()&I+HH()HE=F?)FH&(%@%

J+/()H(&9)$#">?D+#)%&K#=&?"(&$"((&D#&H3?"(&LD#&/#.MN&*+HD"+O=D(&?)*&D"?)H>+D&D3(&P#",Q&?)*&D#&"(>+R&LD#&?*?.D&D3(&P#",Q&=)*("&D3(&$#44#P+)F&/#)*+D+#)HS&TDD"+O=D+#)%&LK#=&>=HD&?DD"+O=D(&D3(&P#",&+)&D3(&>?))("&H.(/+$+(*&OM&D3(&?=D3#"&#"&4+/()H#"&O=D&)#D&+)&?)M&P?M&D3?D&H=FF(HDH&D3?D&D3(M&()*#"H(&M#=&#"&M#="&=H(&#$&D3(&P#",Q&-3?"(&T4+,(%&L9$&M#=&?4D("N&D"?)H$#">N&#"&O=+4*&=.#)&D3+H&P#",N&M#=&>?M&*+HD"+O=D(&D3(&"(H=4D+)F&P#",&#)4M&=)*("&D3(&

H?>(N&H+>+4?"&#"&?&/#>.?D+O4(&4+/()H(%Q

6#)/4=H+#)TDD"+O=D+#)A-3?"(T4+,(&U%;&V).#"D(*%&J+/()H(&9)$#">?D+#)

Page 31: SSSW2007 Social Web Baumann

III. People-Artifacts-Networks

Page 32: SSSW2007 Social Web Baumann

Enronic

• Tuesday, June 7, 2005

• Social network analyzer for Enron's emails• When Enron was hit with lawsuits, the prosecution required the company to turn over

the incriminating emails it had in its database through a legal process called discovery. Usually a company undergoing this kind of discovery carefully combs through its email and publishes only those messages that are relevant to the case, eliminating its employees' private messages and confidential material otherwise not related to the case.

• However, Enron opted to save money by just dumping all of its email online, employee love-letters and all. This was bad for employee privacy, but good for social network researchers.

• Jeffrey Heer at UC Berkeley has produced "enronic," a system for visualizing the relationships between Enron employees based on who emailed whom and how often. It's a Java applet that you can download and mess around with -- you can even take a turn at sleuthing out hidden participants in the scam by trying your own social network analysis.

http://www.boingboing.net/2005/06/07/social_network_analy.html

Page 33: SSSW2007 Social Web Baumann

Enronic• Mining the social network from the data

• e.g by using Applied Natural Language Processing

• Email is straightforward

• sender, recipient

• direct edge in case of email

• + existing categorization of email body into a given taxomony (business, strategy, etc.)

• further aspects of email body could be used for clustering

• Interactive visualization of these properties

Page 34: SSSW2007 Social Web Baumann

http://jheer.org/enron/

Page 35: SSSW2007 Social Web Baumann

• 1st prize Semantic Web Challenge 2004

• [Peter Mika: Semantic Web Technology for the Extraction and Analysis of Social Networks, Journal of Web Semantics, July 2005]

• Data feeds:

• Webpages

• Emails (8185)

• Publications (5147)

• FOAF profiles

• Extraction of underlying social network + generation of an ontology + visualization

• Co-occurence analysis, inferences, identity reasoning, etc.

• Social Network Analysis (SNA)

Flink: The Who is Who of the Semantic Web

Page 36: SSSW2007 Social Web Baumann

Flink: Onlineare research topics, while the associations between the topics are based onthe number of researchers who have an interest in the given pair of topics(see Figure 3). An interesting feature of this ontology is that the associationscreated are specific to the community of researchers whose names are used inthe experiment. This means that unlike similar lightweight ontologies createdfrom a statistical analysis of generic web content, this ontology reflects thespecific conceptualizations of the community that was used in the extractionprocess (see the following Section). Also, the ontology naturally evolves as therelationships between research topics changes (e.g. as certain fields of researchmove closer to each other). For a further discussion on the relation betweensociability and semantics, we refer the reader to [17].

Fig. 3. The ontology of research topics.

The visitor of the website can also view some basic statistics of the social net-work. Degree, closeness and betweenness are common measures of importanceor influence in Social Network Analysis, while the degree distribution atteststo a general characteristic of the network itself (see Figure 5). Geographicvisualizations of the Semantic Web o!er another overview of the network byshowing the places where researchers are located and the connections betweenthem (see Figure 1).

3 System design

Similarly to the design of most Semantic Web applications, the architectureof Flink can be divided in three layers concerned with metadata acquisition,

6

Page 37: SSSW2007 Social Web Baumann

!"#$%&'()*"+,&-.(/, 0&1"2*2"+/&!3+4+..&53+(4(67895&:;;<

-=>?&(%@%&A B(>(+))CDE+F("&@("(+)&E="&1G"*("=)F&*("&-=/3>?H/3+)()A5(/3)#4#F+(&=)*&*(H&$"(+()&I+HH()HE=F?)FH&(%@%

-#/+?4&J(DK#",&L)?4MH+HH.?>6LJ%&-.?>&L)?4MH("%

-#="/(N&O?4$&I?F)("P&Q?D3?"+)?&B("3?"*DP&RS#))(&-/3>+DD4("P&!"#$%&'()*"+,&-.(/,%&1?/33#/3H/3=4(&Q?+H("H4?=D(")[Speck, Hendrik and Frédéric Philipp Thiele. Social Network Analysis. Business models, Explanations, Attributes and Features of Social Networks. CeBIT. WeblogCamp@CeBIT. March 18th, 2007, Hanover (Germany)]

Spam Analyzer

!"#$%&'()*"+,&-.(/, 0&1"2*2"+/&!3+4+..&53+(4(67895&:;;<

-=>?&(%@%&A B(>(+))CDE+F("&@("(+)&E="&1G"*("=)F&*("&-=/3>?H/3+)()A5(/3)#4#F+(&=)*&*(H&$"(+()&I+HH()HE=F?)FH&(%@%

J+/()H(&9)$#">?D+#)%&K#=&?"(&$"((&D#&H3?"(&LD#&/#.MN&*+HD"+O=D(&?)*&D"?)H>+D&D3(&P#",Q&?)*&D#&"(>+R&LD#&?*?.D&D3(&P#",Q&=)*("&D3(&$#44#P+)F&/#)*+D+#)HS&TDD"+O=D+#)%&LK#=&>=HD&?DD"+O=D(&D3(&P#",&+)&D3(&>?))("&H.(/+$+(*&OM&D3(&?=D3#"&#"&4+/()H#"&O=D&)#D&+)&?)M&P?M&D3?D&H=FF(HDH&D3?D&D3(M&()*#"H(&M#=&#"&M#="&=H(&#$&D3(&P#",Q&-3?"(&T4+,(%&L9$&M#=&?4D("N&D"?)H$#">N&#"&O=+4*&=.#)&D3+H&P#",N&M#=&>?M&*+HD"+O=D(&D3(&"(H=4D+)F&P#",&#)4M&=)*("&D3(&

H?>(N&H+>+4?"&#"&?&/#>.?D+O4(&4+/()H(%Q

6#)/4=H+#)TDD"+O=D+#)A-3?"(T4+,(&U%;&V).#"D(*%&J+/()H(&9)$#">?D+#)

Page 38: SSSW2007 Social Web Baumann

IV. Tools

Page 39: SSSW2007 Social Web Baumann

Social Network Analysis• Specialization of network analysis

• Well-known field (e.g. sociologists, consultants, etc.)

• The network

• Nodes (here people or groups)

• Relations (directed or undirected)

• Properties of relations (name, type, weight, etc.)

• SNA deals with properties of the network

• Performance of subsets

• Location of actors (implies possibilities and constraints for their actions)

Page 40: SSSW2007 Social Web Baumann

Social Network Analysis• Most popular

measures

• Degrees

• Betweeness

• Closeness

• Further properties

• Boundary Spanners

• Peripheral Players

• Network Centralization

Page 41: SSSW2007 Social Web Baumann

Social Network Analysis• Network analysis packages

• JUNG [http://jung.sourceforge.net/]• Pajek [http://vlado.fmf.uni-lj.si/pub/networks/pajek/]• UCINET [http://www.analytictech.com]

Page 42: SSSW2007 Social Web Baumann

• Unsupervised, self-organized processing of data inspired by cortical maps in the human brain

• Non-linear projection of high dimensional data to low dimensional grid (usually 2D)

• Preservation of input space topology: data points close in input space are close on the map

• In contrast to • MultiDimensionalScaling (MDS)• PrincipalComponentAnalysis (PCA)• the original data space distances can be

shown.• entangled clusters can be separated.• projection and clustering are provided

Self Organizing Map (SOM)

Page 43: SSSW2007 Social Web Baumann

SOM ... Folksonomies

Artists: 398Input: 398 * 3313SOM: 49 units (7*7)

Map unit Labels Artists

1 Rap-Metal, Funk-Metal

Limp Bizkit, Linkin Park

2 Reggae, Rap Bob Marley, Shaggy, Police, 311

3 Metal, Heavy Metallica, Nirvana, Smashing Pumpkins

4 Techno, Remix Alice Deejay, Prodigy, Moby, Chemical Brothers

5 Spanish, Latin Ricky Martin, Enrique Iglesias

6 Rock, Roll, Blues Dire Straits, Queen, Eric Clapton, Bob Dylan

Page 44: SSSW2007 Social Web Baumann

Learning from music-related Networks

• Large-scale experiment: AMAZON reviews as feeds

• 33879 artist reviews

• 33879 x 242 (dim. reduction PCA)

• SOM: 3000 units (60 x 50) representing genre/style concepts

• Interactive music station for Netaudio: archive.org MP3 + XMLfeeds

• (optional audio analysis)

• 1125 artist reviews

• 1125 x 1139 features

• SOM: 180 units (15 x 12) representing genre/style concepts

Page 45: SSSW2007 Social Web Baumann

ANLP + Visualization• „Decoding the (textual) artifacts“

• JUDGE

• Just another Java-based Text Retrieval & Machine Learning Framework

• automatic feature extraction• feature preprocessing and attribute selection methods• document classification with several classification methods• document clustering with several clustering methods• full support of almost all WEKA Algorithms• easy usage as a webservice with a special interface• http://www3.dfki.uni-kl.de/judge

• DYNAQ: interactive, dynamic queries

• Visualization: processing.org

Page 46: SSSW2007 Social Web Baumann

Semantic Media: Tagging

SmartMedia ontology

Instance browser

Media instance

Media player

List of annotations

Page 47: SSSW2007 Social Web Baumann

Bottom-up meets top-down

[S. Vembu, M. Kiesel, M. Sintek, S. Baumann:Towards bridging the semantic gap in multimedia annotation and retrieval.1st Int.Workshop on Semantic Web Annotations for Multimedia, SWAMM 2006 at WWW 2006.]

Page 48: SSSW2007 Social Web Baumann

V. Networks-Interaction(„Games, Chats, Tags ...“)

Page 49: SSSW2007 Social Web Baumann

• S. Ahern, D. Eckles, N. Good, S. King, M. Naaman, R. Nair, Yahoo! Research Berkeley "Over-Exposed? Privacy Patterns and Considerations in Online and Mobile Photo Sharing"

• flickr upload study: private vs. public photos

• 81 users, 37k photos

• privacy determination by location and content (tags)

• demand for privacy policies!

• no privacy concerns by users yet

[Obradovic, Darko: State of the Art in SNA & CHI / preliminary work for Ph.D, June2007, DFKI Kaiserslautern, Germany]

Related work at CHI2007

Page 50: SSSW2007 Social Web Baumann

• C. Lampe, Michigan State University, E. Johnston, University of Michigan "Follow the Reader: Filtering Comments on Slashdot"

• slashdot comment rating

• critical mass vs. information overload

• analysed 2.4M page hits by 90k users

• interviewed 8k users

• ratings considered useful!

• filter customization is done rarely

• importance of defaults!

[Obradovic, Darko: State of the Art in SNA & CHI / preliminary work for Ph.D, June2007, DFKI Kaiserslautern, Germany]

Related work at CHI2007

Page 51: SSSW2007 Social Web Baumann

• N. Ducheneaut, E. Nickell, R. Moore, PARC, N. Yee, Stanford University "The Life and Death of Online Gaming Communities: A Look at Guilds in World of Warcraft"

• max. size for creative & cooperative social groups: 45 (Dunbar number: 150, current WoW guilds: <35)

• social factors for group growth & death

• monitored 5k guilds on 5 WoW servers

• "small is beautiful"

• importance of social group management

[Obradovic, Darko: State of the Art in SNA & CHI / preliminary work for Ph.D, June2007, DFKI Kaiserslautern, Germany]

Related work at CHI2007

Page 52: SSSW2007 Social Web Baumann

• M. Ames, Stanford University, M. Namaan, Yahoo! Research Berkeley "Why We Tag: Motivations for Annotation in Mobile and Online Media"

• evaluation with flickr/zonetag (direct upload from cameraphone) with 172 users

• taxonomy of tagging motivations

• tagging for social purposes is more motivating

• tagging at creation time is favoured, won't be done later

• tag suggestions help a lot

[Obradovic, Darko: State of the Art in SNA & CHI / preliminary work for Ph.D, June2007, DFKI Kaiserslautern, Germany]

Related work at CHI2007

Page 53: SSSW2007 Social Web Baumann

• A. W. Rivadeneira, University of Maryland, D. M. Gruen, M. J. Muller, D. R. Millen, IBM Research "Getting Our Head in the Clouds: Toward Evaluation Studies of Tagclouds"

• cognitive research

• spatial algorithms (size)

• clustering algorithms (similar, occur together)

• tested: font size, quadrant and proximity-to-largest-word

• experiment with 13 subjects

• tagcloud designers should focus on layout

• simple lists by frequency are ok!

[Obradovic, Darko: State of the Art in SNA & CHI / preliminary work for Ph.D, June2007, DFKI Kaiserslautern, Germany]

Related work at CHI2007

Page 54: SSSW2007 Social Web Baumann

[A. W. Rivadeneira, University of Maryland, D. M. Gruen, M. J. Muller, D. R. Millen, IBM Research "Getting Our Head in the Clouds: Toward Evaluation Studies of Tagclouds"]

Page 55: SSSW2007 Social Web Baumann

• J. D. Weisz, S. Kiesler, H. Zhang, Y. Ren, R. E. Kraut, Carnegie Mellon University, J. A. Konstan, University of Minnesota "Watching Together: Integrating Text Chat with Video"

• interactive, more social television experience

• experiment: cartoon watching with simultaneous chat (image attached)

• settings: real-time, chat breaks, chat after cartoon

• is fun, has social benefits, but distracts

• use intermissions for appropriate content

• future work: evaluate different content

[Obradovic, Darko: State of the Art in SNA & CHI / preliminary work for Ph.D, June2007, DFKI Kaiserslautern, Germany]

Related work at CHI2007

Page 56: SSSW2007 Social Web Baumann

[J. D. Weisz, S. Kiesler, H. Zhang, Y. Ren, R. E. Kraut, Carnegie Mellon University, J. A. Konstan, University of Minnesota "Watching Together: Integrating Text Chat with Video]

Page 57: SSSW2007 Social Web Baumann

• K. O'Hara, A. S. Mitchell, A. Vorbau, Hewlett Packard Labs "Consuming Video on Mobile Devices"

• the "mobile factor" changes video experience, especially in the social context

• behaviour study with 28 participants

• mobile video is more than passive consumption due to boredom

• social importance of owning content and getting credit for recommending content

• solitary activity has social underpinnings (watch video at a time it won't annoy your family)

[Obradovic, Darko: State of the Art in SNA & CHI / preliminary work for Ph.D, June2007, DFKI Kaiserslautern, Germany]

Related work at CHI2007

Page 58: SSSW2007 Social Web Baumann

The End

Page 59: SSSW2007 Social Web Baumann

Hot topics (2006 talk!)

• SPACE: Locative media

• TIME: Long-term (lifelong?) changes

• Development of „taste“ (with respect to objects)

• Liu,Maes&Davenport [Unraveling the Taste Fabric of Social Networks, IJSWIS2005]

• Dependencies on the development of social network preferences?

Page 60: SSSW2007 Social Web Baumann
Page 61: SSSW2007 Social Web Baumann
Page 62: SSSW2007 Social Web Baumann

• DIGITAL IDENTITY

• Sherry Turkle [The Second Self, 2005] [Life on the Screen, 1995]

• Danah Boyd [Identity Production in a Networked Culture, AAAS2006]

• Liberty Alliance [projectliberty.org]

• ID 2.0 [Dick Hardt: OSCON 2005 Keynote]

Hot topics (2006 talk!)

Page 63: SSSW2007 Social Web Baumann

• Privacy vs. Impression Management (=Exhibitionism?)

• OpenID as a supporting open standard!?

• Research issues

• Aggregation of „FACETED IDENTITY“

• Measures for TRUST

• Political implications

• e.g „Vorratsdatenspeicherung“ in Germany

• Projects with CCTV and RFID technology

Identity matters!

Page 64: SSSW2007 Social Web Baumann
Page 65: SSSW2007 Social Web Baumann
Page 66: SSSW2007 Social Web Baumann

Hot topics (2006 talk!)• ?!

• Bridging the gap between Web2.0 and SemanticWeb?

• GRDDL (Gleaning Resource Descriptions from Dialects of Languages)

• 2007: From a commercial / startup perspective

• Semantic social networks are „hot“, some rely on microformats

• yowhassup.com

Page 67: SSSW2007 Social Web Baumann
Page 68: SSSW2007 Social Web Baumann

Problems

• 100 onliners =

• 1 creator +

• 10 interactions +

• 89 viewers

• Slots of attention decrease

• Cultural differences

Page 69: SSSW2007 Social Web Baumann

Future trends?• „Precision over Recall!“

• StumbleUpon.com

• „Semi-closed and closed“ networks

• Quality, trust matters!

Page 70: SSSW2007 Social Web Baumann

Future trends?

• „Generation Mobile“ (e.g. Twitter)

• „Generation Game“

• I still wait for efficient reasoning :)

• large-scale RDF stores

• full-fledged „semantified“ media

Page 71: SSSW2007 Social Web Baumann

Mind the gap(s)!• Try to work on an application you are missing

personally! (intrinsic motivation)

• I will start to work on Music Information Retrieval again!

• Work on real-world data!

• Show it to your friends!

• Talk about it!

Page 72: SSSW2007 Social Web Baumann

thanks for listening to my few bits ...send emails(!) to:

[email protected] our blog

www.computationalculture.org!"#$%&'()*"+,&-.(/, 0&1"2*2"+/&!3+4+..&53+(4(67895&:;;<

-=>?&(%@%&A B(>(+))CDE+F("&@("(+)&E="&1G"*("=)F&*("&-=/3>?H/3+)()A5(/3)#4#F+(&=)*&*(H&$"(+()&I+HH()HE=F?)FH&(%@%

J+/()H(&9)$#">?D+#)%&K#=&?"(&$"((&D#&H3?"(&LD#&/#.MN&*+HD"+O=D(&?)*&D"?)H>+D&D3(&P#",Q&?)*&D#&"(>+R&LD#&?*?.D&D3(&P#",Q&=)*("&D3(&$#44#P+)F&/#)*+D+#)HS&TDD"+O=D+#)%&LK#=&>=HD&?DD"+O=D(&D3(&P#",&+)&D3(&>?))("&H.(/+$+(*&OM&D3(&?=D3#"&#"&4+/()H#"&O=D&)#D&+)&?)M&P?M&D3?D&H=FF(HDH&D3?D&D3(M&()*#"H(&M#=&#"&M#="&=H(&#$&D3(&P#",Q&-3?"(&T4+,(%&L9$&M#=&?4D("N&D"?)H$#">N&#"&O=+4*&=.#)&D3+H&P#",N&M#=&>?M&*+HD"+O=D(&D3(&"(H=4D+)F&P#",&#)4M&=)*("&D3(&

H?>(N&H+>+4?"&#"&?&/#>.?D+O4(&4+/()H(%Q

6#)/4=H+#)TDD"+O=D+#)A-3?"(T4+,(&U%;&V).#"D(*%&J+/()H(&9)$#">?D+#)