surfacing real-world event content on twitter
DESCRIPTION
Talk given at Google NYC on October 15th, 2010.TRANSCRIPT
SURFACING REAL-WORLD
EVENT CONTENT ON TWITTER
Hila Becker, Luis Gravano Mor Naaman Columbia University Rutgers University
Event Content in Social Media
Event Content in Social Media
Smaller events, without traditional
news coverage
Popular, widely known events
Event Content in Social Media
Discovery
Detect events using features of social media content (e.g., term statistics)
Mining content from known event sources (e.g., user-contributed event databases)
Organization
Associating social media content with events
Identifying similar content within and across sites
Presentation
Selecting what content to display to a user
Providing interfaces that summarize and aggregate the content along different dimensions
Event Content in Social Media
Discovery
Detect events using features of social media content (e.g., term statistics)
Mining content from known event sources (e.g., user-contributed event databases)
Organization
Associating social media content with events
Identifying similar content within and across sites
Presentation
Selecting what content to display to a user
Providing interfaces that summarize and aggregate the content along different dimensions
Event Content in Social Media
Discovery
Detect events using features of social media content (e.g., term statistics)
Mining content from known event sources (e.g., user-contributed event databases)
Organization
Associating social media content with events
Identifying similar content within and across sites
Presentation
Selecting what content to display to a user
Providing interfaces that summarize and aggregate the content along different dimensions
Identifying Events in Social Media
Timeliness
Real-time
Retrospective
(Prospective)
Content discovery
Known properties
Event databases (e.g., Upcoming, Eventful)
Keyword triggers (e.g, “earthquake”)
Shared calendars
Unknown properties
Identifying Events in Social Media
Timeliness
Real-time
Retrospective
(Prospective)
Content discovery
Known properties
Event databases (e.g., Upcoming, Eventful)
Keyword triggers (e.g, “earthquake”)
Shared calendars
Unknown properties
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Twitter new event detection [Petrović et al. NAACL’10]
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Twitter new event detection [Petrović et al. NAACL’10]
Event detection on Flickr [Chen and Roy CIKM’09]
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Earthquake prediction
using Twitter [Sakaki et al.
WWW’10]
Twitter new event detection [Petrović et al. NAACL’10]
Event detection on Flickr [Chen and Roy CIKM’09]
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Earthquake prediction
using Twitter [Sakaki et al.
WWW’10]
Twitter new event detection [Petrović et al. NAACL’10]
Event detection on Flickr [Chen and Roy CIKM’09]
Organization of YouTube
concert videos [Kennedy and
Naaman WWW’09]
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Surfacing events on
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Learning similarity metrics
for event identification on
Flickr [Becker et al. WSDM’10]
Surfacing events on
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Learning similarity metrics
for event identification on
Flickr [Becker et al. WSDM’10]
Surfacing events on
Identifying Twitter content
for planned events
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Learning similarity metrics
for event identification on
Flickr [Becker et al. WSDM’10]
Surfacing events on
Identifying Twitter content
for planned events
Connecting events across
sites (e.g., YouTube,
Picasa)
Twitter Content
Streams of textual
messages
Brief content (140
characters)
Communicated to network
of followers
Twitter Trending Topics
Twitter trending topics, September 24, 2010 7:00am
Twitter Trending Topics
Twitter trending topics, September 24, 2010 7:00am
Recurring
Twitter-centric
Confusing
Real-World
Events?
Twitter Trending Topics
Twitter trending topics, September 24, 2010 7:00am
Recurring
Twitter-centric
Confusing
Real-World
Events?
Twitter Trending Topics
Twitter trending topics, September 24, 2010 7:00am
Recurring
Twitter-centric
Confusing
Real-World
Events?
Twitter Trending Topics
Twitter trending topics, September 24, 2010 7:00am
Recurring
Twitter-centric
Confusing
Real-World
Events?
Twitter Trending Topics
Twitter trending topics, September 24, 2010 7:00am
Recurring
Twitter-centric
Confusing
Real-World
Events?
Identifying Events on Twitter
Challenges:
Wide variety of topics, not all related to events (e.g.,
morning greetings, “thank you” messages)
Low quality text: abbreviations, unconventional language,
riddled with typos, grammatically incorrect
Opportunities:
Content generated in real-time as events happen
Time and location information
Identifying Events on Twitter
Challenges:
Wide variety of topics, not all related to events (e.g.,
morning greetings, “thank you” messages)
Low quality text: abbreviations, unconventional language,
riddled with typos, grammatically incorrect
Opportunities:
Content generated in real-time as events happen
Time and location information
Events on Twitter
Types of events on Twitter
Exogenous: Real-world occurrences (e.g., Superbowl, “Lost” finale)
Endogenous: Specific to the Twitter-verse (e.g., #thingsyoushouldntsay meme, RT statement by Lady Gaga)
Event:
One or more terms and a time period
Volume of messages posted for the terms in the time period exceeds some expected level of activity
Events on Twitter
Types of events on Twitter
Exogenous: Real-world occurrences (e.g., Superbowl, “Lost” finale)
Endogenous: Specific to the Twitter-verse (e.g., #thingsyoushouldntsay meme, RT statement by Lady Gaga)
Event:
One or more terms and a time period
Volume of messages posted for the terms in the time period exceeds some expected level of activity
Real-Time Unsupervised Event
Identification on Twitter
Organization
Content representation: text, time, location
Group similar content via clustering
Discovery
Extract discriminating features of clusters
Build an event classifier
Presentation
Select content for each event
Evaluate the quality, relevance, and usefulness
Real-Time Unsupervised Event
Identification on Twitter
Organization
Content representation: text, time, location
Group similar content via clustering
Discovery
Extract discriminating features of clusters
Build an event classifier
Presentation
Select content for each event
Evaluate the quality, relevance, and usefulness
Real-Time Unsupervised Event
Identification on Twitter
Organization
Content representation: text, time, location
Group similar content via clustering
Discovery
Extract discriminating features of clusters
Build an event classifier
Presentation
Select content for each event
Evaluate the quality, relevance, and usefulness
Real-Time Unsupervised Event
Identification on Twitter
Organization
Content representation: text, time, location
Group similar content via clustering
Discovery
Extract discriminating features of clusters
Build an event classifier
Presentation
Select content for each event
Evaluate the quality, relevance, and usefulness
Surfacing Event Content on Twitter
Tweets
Surfacing Event Content on Twitter
Tweets
Surfacing Event Content on Twitter
Tweet Clusters
Tweets
Surfacing Event Content on Twitter
Tweet Clusters
Tweets
Surfacing Event Content on Twitter
Tweet Clusters
Tweets Event Clusters
Surfacing Event Content on Twitter
Tweet Clusters
Tweets Event Clusters
Surfacing Event Content on Twitter
Tweet Clusters
Tweets Event Clusters
Surfacing Event Content on Twitter
Tweet Clusters
Tweets Event Clusters Selected Tweets
Organizing Tweets in Real-Time
Order tweets by post time
Use TF-IDF vector representation of textual content
Stop word elimination
Stemming
Enhanced weight for hashtags (#tag)
IDF computed over past data
Separate tweets by location
Focus on tweets from NYC
Different locations can be processed in parallel
Organizing Tweets in Real-Time
Order tweets by post time
Use TF-IDF vector representation of textual content
Stop word elimination
Stemming
Enhanced weight for hashtags (#tag)
IDF computed over past data
Separate tweets by location
Focus on tweets from NYC
Different locations can be processed in parallel
Organizing Tweets in Real-Time
Order tweets by post time
Use TF-IDF vector representation of textual content
Stop word elimination
Stemming
Enhanced weight for hashtags (#tag)
IDF computed over past data
Separate tweets by location
Focus on tweets from NYC
Different locations can be processed in parallel
Clustering Algorithm
Many alternatives possible! [Berkhin 2002]
Single-pass incremental clustering algorithm
Scalable, online solution
Used effectively for
Event identification in textual news [Allan et al. 1998]
News event detection on Twitter [Sankaranarayanan et al. 2009]
Does not require a priori knowledge of number of
clusters
Known fragmentation issue, often solved with a
periodic second pass
Clustering Algorithm
Many alternatives possible! [Berkhin 2002]
Single-pass incremental clustering algorithm
Scalable, online solution
Used effectively for
Event identification in textual news [Allan et al. 1998]
News event detection on Twitter [Sankaranarayanan et al. 2009]
Does not require a priori knowledge of number of
clusters
Known fragmentation issue, often solved with a
periodic second pass
Overview of Cluster-based Approach
Group similar tweets via online clustering
Compute statistics of cluster content
Top terms (e.g., [earthquake, haiti])
Number of documents per hour
…
Use cluster-level features to identify event clusters
Single feature with threshold (e.g., increase in volume
over time-window)
Trained classification model
Overview of Cluster-based Approach
Group similar tweets via online clustering
Compute statistics of cluster content
Top terms (e.g., [earthquake, haiti])
Number of documents per hour
…
Use cluster-level features to identify event clusters
Single feature with threshold (e.g., increase in volume
over time-window)
Trained classification model
Overview of Cluster-based Approach
Group similar tweets via online clustering
Compute statistics of cluster content
Top terms (e.g., [earthquake, haiti])
Number of documents per hour
…
Use cluster-level features to identify event clusters
Single feature with threshold (e.g., increase in volume
over time-window)
Trained classification model
Real-Time Unsupervised Event
Identification on Twitter
Organization
Content representation: text, time, location
Group similar content via clustering
Discovery
Extract discriminating features of clusters
Build an event classifier
Presentation
Select content for each event
Evaluate the quality, relevance, and usefulness
Social Interaction Features
Retweets
RT @username
Often characterize Twitter-specific events
Replies
Tweet starts with @username
Possible indication of non-event content
Mentions
@username anywhere in the tweet
Reference to twitter users that might be part of an event
Social Interaction Features
Retweets
RT @username
Often characterize Twitter-specific events
Replies
Tweet starts with @username
Possible indication of non-event content
Mentions
@username anywhere in the tweet
Reference to twitter users that might be part of an event
Social Interaction Features
Retweets
RT @username
Often characterize Twitter-specific events
Replies
Tweet starts with @username
Possible indication of non-event content
Mentions
@username anywhere in the tweet
Reference to twitter users that might be part of an event
Topic Coherence
Intuition: clusters with strong inter-document similarity
may contain event information
Class
Today
Early
Work
Sleep
Start
I’m gonna do my best to go
sleep during all my classes
today =)
Starting work early today.
Looking fwd to cooking class
tonight!
Today starts the rest of my
life…
Katie
Couric
President
Obama
Interview
CBS
Katie Couric Interview With
President Obama
http://bit.ly/bRsGPo
The Katie Couric-President
Obama interview has now
begun on CBS
Katie Couric interviews
President Obama during CBS'
Super Bowl pregame coverage
Trending Behavior
Trending
characteristics of
top terms in
cluster:
Exponential fit
Deviation from
expected
volume
Volume over time for the term “valentine”
time
docu
ment
s
time (hours)
Twitter-Centric Event Features
Tagging behavior
Multi-word tags (e.g., #myhomelesssignwouldsay)
Percentage of tagged tweets
Top term is a tag
…
Retweeting
Percentage of messages with RT @
Percentage of messages from top RTed tweet
…
Twitter-Centric Event Features
Tagging behavior
Multi-word tags (e.g., #myhomelesssignwouldsay)
Percentage of tagged tweets
Top term is a tag
…
Retweeting
Percentage of messages with RT @
Percentage of messages from top RTed tweet
…
Event Classifier
Use features to build a classifier
Human-annotated training data
SVM model (selected during training phase)
Alternative classification modes:
RW-Event: real-world event vs. rest
TC-Event: event (real-world or Twitter-centric) vs. non-
event
Real-Time Unsupervised Event
Identification on Twitter
Organization
Content representation: text, time, location
Group similar content via clustering
Discovery
Extract discriminating features of clusters
Build an event classifier
Presentation
Select content for each event
Evaluate the quality, relevance, and usefulness
Real-Time Unsupervised Event
Identification on Twitter
Organization
Content representation: text, time, location
Group similar content via clustering
Discovery
Extract discriminating features of clusters
Build an event classifier
Presentation
Select content for each event
Evaluate the quality, relevance, and usefulness
Real-Time Unsupervised Event
Identification on Twitter
Organization
Content representation: text, time, location
Group similar content via clustering
Discovery
Extract discriminating features of clusters
Build an event classifier
Presentation
Select content for each event
Evaluate the quality, relevance, and usefulness
Event Content Selection
Tiger
Woods
Apology
Event Content Selection
Tiger
Woods
Apology
Tiger Woods to make a
public apology Friday and
talk about his future in golf.
Tiger Woods Returns To
Golf - Public Apology
http://bit.ly/9Ui5jx
Tiger woods y'all,tiger
woods y'all,ah tiger woods
y'all
Tiger Woods Hugs:
http://tinyurl.com/yhf4
uzw
Wedge wars upstage
Watson v Woods: BBC
Sport (blog)
Event Content Selection
Tiger
Woods
Apology
Tiger Woods to make a
public apology Friday and
talk about his future in golf.
Tiger Woods Returns To
Golf - Public Apology
http://bit.ly/9Ui5jx
Tiger woods y'all,tiger
woods y'all,ah tiger woods
y'all
Tiger Woods Hugs:
http://tinyurl.com/yhf4
uzw
Wedge wars upstage
Watson v Woods: BBC
Sport (blog)
Event Content Selection
Challenges:
Clusters contain noise
Relevant tweets might have poor quality text
Relevant, high quality tweets might not be interesting
For each tweet and a given event evaluate
Quality
Relevance
Usefulness
Event Content Selection
Challenges:
Clusters contain noise
Relevant tweets might have poor quality text
Relevant, high quality tweets might not be interesting
For each tweet and a given event evaluate
Quality
Relevance
Usefulness
Centrality Based Tweet Selection
Centroid
Cosine similarity of each tweet to cluster centroid
Degree
Tweets are nodes
Tweets are connected if their similarity is above a threshold
Compute degree centrality of each node
LexRank [Erkan and Radev 2004]
Same graph structure as Degree method
Central tweets are similar to other central tweets
Centrality Based Tweet Selection
Centroid
Cosine similarity of each tweet to cluster centroid
Degree
Tweets are nodes
Tweets are connected if their similarity is above a threshold
Compute degree centrality of each node
LexRank [Erkan and Radev 2004]
Same graph structure as Degree method
Central tweets are similar to other central tweets
Centrality Based Tweet Selection
Centroid
Cosine similarity of each tweet to cluster centroid
Degree
Tweets are nodes
Tweets are connected if their similarity is above a threshold
Compute degree centrality of each node
LexRank [Erkan and Radev 2004]
Same graph structure as Degree method
Central tweets are similar to other central tweets
Experimental Setup: Data
>2,600,000 tweets, collected via Twitter API
Location: New York City area
Indicated on user profile
Time: February 2010
First week used to calibrate statistics
Second week used for training/validation
Third and fourth weeks used for testing
Experimental Setup: Data
>2,600,000 tweets, collected via Twitter API
Location: New York City area
Indicated on user profile
Time: February 2010
First week used to calibrate statistics
Second week used for training/validation
Third and fourth weeks used for testing
Experimental Setup: Data
>2,600,000 tweets, collected via Twitter API
Location: New York City area
Indicated on user profile
Time: February 2010
First week used to calibrate statistics
Second week used for training/validation
Third and fourth weeks used for testing
Experimental Setup: Training
Data:
504 clusters
Fastest growing clusters/hour in second week of February
2010
Labels:
Real-world event (e.g., [superbowl,colts,saints,sb44])
Twitter-specific event (e.g., [uknowubrokewhen,money,job])
Non-event (e.g., [happy,love,lol])
Ambiguous cluster (e.g., [south,park,west,sxsw,cartman])
Experimental Setup: Training
Data:
504 clusters
Fastest growing clusters/hour in second week of February
2010
Labels:
Real-world event (e.g., [superbowl,colts,saints,sb44])
Twitter-specific event (e.g., [uknowubrokewhen,money,job])
Non-event (e.g., [happy,love,lol])
Ambiguous cluster (e.g., [south,park,west,sxsw,cartman])
Experimental Setup: Testing
Baselines:
Naïve Bayes text classification (NB-Text)
Fastest-growing clusters per hour
Classifiers:
RW-Event
TC-Event
400 clusters
5 hours
Top 20 clusters per hour according to RW-Event, TC-Event, Fastest-growing, random
Experimental Setup: Testing
Baselines:
Naïve Bayes text classification (NB-Text)
Fastest-growing clusters per hour
Classifiers:
RW-Event
TC-Event
400 clusters
5 hours
Top 20 clusters per hour according to RW-Event, TC-Event, Fastest-growing, random
Experimental Setup: Testing
Baselines:
Naïve Bayes text classification (NB-Text)
Fastest-growing clusters per hour
Classifiers:
RW-Event
TC-Event
400 clusters
5 hours
Top 20 clusters per hour according to RW-Event, TC-Event, Fastest-growing, random
Experimental Methodology: Event
Classification
Classification accuracy
10-fold cross validation
Separate test set of randomly chosen tweets
Event surfacing
Top events per hour for each technique
Evaluation:
Precision@K
NDCG@K
Experimental Methodology: Event
Classification
Classification accuracy
10-fold cross validation
Separate test set of randomly chosen tweets
Event surfacing
Top events per hour for each technique
Evaluation:
Precision@K
NDCG@K
Identified Events
Description Keywords
Senator Evan Bayh's Retirement bayh, evan, senate, congress, retire
Westminster Dog Show westminster, dog, show, club, kennel
Obama’s Meeting with the Dalai Lama lama, dalai, meet, obama, china
NYC Toy Fair toyfairny, starwars, hasbro, lego, toy
Marc Jacobs Fashion Show jacobs, marc, nyfw, show, fashion
A sample of events identified by our classifiers on the test set
Classification Performance (F-measure)
RW-Event classifier is more effective at
discriminating between real-world events and rest
of Twitter data
Classifier Validation Test
NB-Text 0.785 0.702
RW-Event 0.849 0.837
TC-Event 0.875 0.789
Precision@K Evaluation
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
5 10 15 20
Pre
cisi
on
Number of Clusters (K)
RW-Event
TC-Event
Fastest
Random
NDCG@K Evaluation
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
5 10 15 20
ND
CG
Number of Clusters (K)
RW-Event
TC-Event
Fastest
Random
Experimental Methodology:
Content Selection
50 event clusters
Randomly selected from test set
5 top tweets per event for each: Centroid, Degree,
LexRank
Labeled on a 1-4 scale
Quality: excellent (4) poor (1)
Relevance: clearly relevant (4) not relevant (1)
Usefulness: clearly useful (4) not useful (1)
Selected Tweets: Example
Method Tweet
Centroid
Video: Tiger regretful; unsure about return to golf - Main Line ...:
(AP) Tiger Woods publicly apologized Friday...
http://bit.ly/dAO41N
Degree
Watson: Woods needs to show humility upon return (AP): Tom
Watson says Tiger Woods needs to "show some humility to...
http://bit.ly/cHVH7x
LexRank RT @EricStangel: Tiger Woods statement: And now for Elin's
repsonse....
A sample of tweets selected by different centrality methods
Content Selection Results
Average scores over all events
High quality and relevance (>3) for both Degree
and Centroid
Centroid only method with high usefulness
Method Quality Relevance Usefulness
LexRank 3.444 2.984 2.608
Degree 3.536 3.156 2.802
Centroid 3.636 3.694 3.474
Preferred Method per Event
Centroid is the preferred method across all metrics
For usefulness, Centroid tweets preferred more than 2:1
compared to Degree, 4:1 compared to LexRank
Method Quality Relevance Usefulness
LexRank 22.66% 16.33% 12%
Degree 31.66% 25.33% 28%
Centroid 45.66% 58.33% 60%
Conclusions
Techniques for discovering, organizing, and presenting social media from real-world events
Event classifiers
Important to capture features of Twitter-specific events in order to reveal the real-world events
Effectively surfaced real-world events in an unsupervised setting
Content selection
Similarity to centroid technique better at selecting event content
There is relevant and useful event content on Twitter!
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Learning similarity metrics
for event identification on
Flickr [Becker et al. WSDM’10]
Surfacing events on
Identifying Twitter content
for planned events
Connecting events across
sites (e.g., YouTube,
Picasa)
Learning Similarity Metrics for Event
Identification in Social Media (WSDM ’10)
Ctitle
Ctags
Ctime
Combine
similarities
Learning Similarity Metrics for Event
Identification in Social Media (WSDM ’10)
Wtitle
Wtags
Wtime
f(C,W)
Ctitle
Ctags
Ctime
Learned in a
training step
Combine
similarities
Learning Similarity Metrics for Event
Identification in Social Media (WSDM ’10)
Wtitle
Wtags
Wtime
f(C,W)
Ctitle
Ctags
Ctime
Final
clustering
solution
Learned in a
training step
Identifying Tweets for Known Events
Identifying Tweets for Known Events
Identifying Events in Social Media
Timeliness
Cont
ent
Dis
cove
ry
Real-time Retrospective
Kno
wn
Unk
now
n
Learning similarity metrics
for event identification on
Flickr [Becker et al. WSDM’10]
Surfacing events on
Identifying Twitter content
for planned events
Connecting events across
sites (e.g., YouTube,
Picasa)
Thank you!
Pablo Barrio
David Elson
Dan Iter
Yves Petinot
Sara Rosenthal
Gonçalo Simões
Matt Solomon
Kapil Thadani