2009fs-01

Upload: shravan-kumar

Post on 06-Apr-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/3/2019 2009FS-01

    1/32

    Ranking and Suggesting Popular Items

    IDB & IDS Lab. Seminar

    Fall 2009

    Minsuk [email protected]

    September 18th, 2009

    M. Vojnovic, J. Cruise, D. Gunawardena, & P. Marbach

    IEEE Trans. on Knowledge & Data Eng. (TKDE)Vol. 21, No. 8, August 2009

    Center for E-Business TechnologySeoul National UniversitySeoul, Korea Intelligent Database Systems Lab.

  • 8/3/2019 2009FS-01

    2/32

    Copyright 2009 by CEBT

    Abstract

    Ranking and Suggesting Popular Items

    Items are suggested to users to aid tasks such as browsing of the content.

    Users tend to select suggested items more frequently (popularity bias).

    Propose algorithms for suggesting popular items,

    which enables to learn users true preference over items.

    2

    Suggestion set

  • 8/3/2019 2009FS-01

    3/32

    Contents

    Introduction

    Problem Formulation

    Algorithms

    Additional Material

    Analytical & Numerical Results

    Related Work

    Conclusion

    3

  • 8/3/2019 2009FS-01

    4/32

    Copyright 2009 by CEBT

    Introduction

    Item Suggestion

    Items are suggested to users to aid tasks

    such as browsing or tagging of the content.

    Items could be search query keywords, documents, tags, etc.

    4

  • 8/3/2019 2009FS-01

    5/32

    Copyright 2009 by CEBT

    Introduction

    Motivation

    Learning of item popularity is complicated

    Users tend to select suggested items more frequently.

    least effort, bandwagon effect

    Such popularity bias makes problem.

    5

    Just Click from Suggestion Set?

    Think..., and type new one?

  • 8/3/2019 2009FS-01

    6/32

    Copyright 2009 by CEBT

    Introduction

    Motivating Example

    6

  • 8/3/2019 2009FS-01

    7/32

    Copyright 2009 by CEBT

    Introduction

    Motivating Example

    7

  • 8/3/2019 2009FS-01

    8/32

    Copyright 2009 by CEBT

    Introduction

    Goal

    Propose algorithms and analyze performance for suggesting popular items

    A way that enables to learn users true preference over items

    8

  • 8/3/2019 2009FS-01

    9/32

    Contents

    Introduction

    Problem Formulation

    Algorithms

    Additional Material

    Analytical & Numerical Results

    Related Work

    Conclusion

    9

  • 8/3/2019 2009FS-01

    10/32

    Copyright 2009 by CEBT

    Problem Formulation

    Ranking and Suggesting Items

    Problem

    Users select items from a given set C

    ris Users true preference over the set of items C

    rcalled true popularity rank scores

    ri is the portion of users that would select item i if suggestions were not made

    Assumption

    ri is strictly positive

    Items are enumerated

    r is normalized such that it is a probability distribution.

    10

    ri 0.45 0.21 0.13 0.10 0.06 0.05

  • 8/3/2019 2009FS-01

    11/32

    Copyright 2009 by CEBT

    Problem Formulation

    Algorithms

    Ranking rule

    Specifies how to update the ranking scores of items

    Suggestion rule

    Specifies what subset of items to suggest to a user

    Assumption

    size of suggestion set is fixed

    Do not account for suggestion order

    interested in small suggestion sets

    allow for randomized order

    11

  • 8/3/2019 2009FS-01

    12/32

    Copyright 2009 by CEBT

    Problem Formulation

    Design objective

    Learn the true popularity ranking of items

    For items, i and j, rirj implies i (t)j(t), for sufficiently large time t

    r is the true popularity rank scores

    is the ranking scores generated by ranking rule algorithm

    Suggest true popular items and identify quickly the true popular items

    Evaluation metric

    Precision of a set of items

    12

  • 8/3/2019 2009FS-01

    13/32

    Copyright 2009 by CEBT

    entire item set

    Problem Formulation

    Users Choice Model

    How users choose items when he is presented a set S of suggested items

    User selects an item from the entire set of items by sampling,

    using the true item popularity distribution r.

    Otherwise, user does the same but confines his choice to items in the suggest set.

    13

    Suggestion set

  • 8/3/2019 2009FS-01

    14/32

    Contents

    Introduction

    Problem Formulation Algorithms

    A Nave Algorithm

    Ranking rules

    Suggestion rules

    Additional Material Analytical & Numerical Results

    Related Work

    Conclusion

    14

  • 8/3/2019 2009FS-01

    15/32

    Copyright 2009 by CEBT

    A B C D E F

    Algorithms

    A Nave Algorithm

    TOP (Top popular)

    Simple algorithm (baseline)

    Rank score of an item equals number of selections of this item.

    15

    Vi 3 2 1 2 1 1

  • 8/3/2019 2009FS-01

    16/32

    Copyright 2009 by CEBT

    Top Popular

    Algorithms

    Proposed algorithms

    16

    nave

    not biased

    ranking suggesting

    Rank rule 2

    Rank rule 1

    PROP

    M2S

    FM2S

    top-N rank score(baseline)

  • 8/3/2019 2009FS-01

    17/32

    Copyright 2009 by CEBT

    A B C D E F

    pi

    count

    Algorithms

    Ranking rules

    Rank rule 1

    Simple ranking rule

    Rank score for item i increases whenever a user selects this item.

    17

    i 0.33 0.17 0.17 0.08 0.08 0.08

    count 3 2 2 1 1 1

  • 8/3/2019 2009FS-01

    18/32

    Copyright 2009 by CEBT

    Algorithms

    Ranking rules

    Rank rule 2

    Rank rule 1 may fail to discover true popularity order.

    Here, rank score updated only for an item that was not suggested.

    Slow rate of convergence

    18

    A B C D E F

    pi

    count

    i 0.33 0.17 0.11 0.03 0.04 0.04

    count 12 13 1 1 1 2

  • 8/3/2019 2009FS-01

    19/32

    Copyright 2009 by CEBT

    Algorithms

    Suggestion rules

    PROP (Frequency Proportional)

    randomized algorithm

    Suggestion set is sampled with probability proportional to current rank score.

    More robust to imitation than TOP

    19

    A B C D E F

    Suggestion set

  • 8/3/2019 2009FS-01

    20/32

    Copyright 2009 by CEBT

    Algorithms

    Suggestion rules

    M2S (Move-to-set)

    Suggest the last used item

    Suggestion set updated only when a user selects an item that is not in S

    Random iterative update rule of suggestion set

    computationally very simple

    20

    Suggestion set

  • 8/3/2019 2009FS-01

    21/32

    Copyright 2009 by CEBT

    Algorithms

    Suggestion rules

    FM2S (Frequency move-to-set)

    can go to suggestion set only if sufficiently popular, w.r.t. true popularity

    compared to M2S (previous page)

    Not update counter for an item that were in suggested set

    different from TOP (first simple alg.)

    21

  • 8/3/2019 2009FS-01

    22/32

    Copyright 2009 by CEBT

    Suggestion set

    A B C D E F G H

    Algorithms

    Suggestion rules

    FM2S (Frequency move-to-set)

    can go to suggestion set only if sufficiently popular, w.r.t. true popularity

    Not update counter for an item that were in suggested set

    22

    +3 +2A B C D E F G H

    Wi 7 7 6 5 5 4 2 1

  • 8/3/2019 2009FS-01

    23/32

    Contents

    Introduction

    Problem Formulation

    Algorithms

    Additional Material

    Analytical & Numerical Results

    Related Work

    Conclusion

    23

  • 8/3/2019 2009FS-01

    24/32

    Copyright 2009 by CEBT

    Additional Material

    Experiment done by me

    TOP vs. FM2S

    TOP fail to catch true distribution

    100 times of item selection sampled from true preference distribution

    when imitation probability (probability for selecting from suggestion set) is 0.5

    24

    true 25 20 15 11 10 7 6 6

    count 23 86 30 13 12 9 24 3

    FM2S 12 12 13 11 9 9 12 3

  • 8/3/2019 2009FS-01

    25/32

    Contents

    Introduction

    Related Work

    Problem Formulation

    Algorithms

    Additional Material

    Analytical & Numerical Results Analytical Results

    Numerical Results

    Conclusion

    25

  • 8/3/2019 2009FS-01

    26/32

    Copyright 2009 by CEBT

    Analytical Results

    Goal

    present analytical results on the ranking and suggesting algorithms

    Results

    some Theorems with proof

    Top Popular

    There exists a threshold on the imitation probability belowwhich the algorithm guarantees to learn true popularity rank.

    The threshold is a function of suggestion set size and true popularity rank scores.

    Frequency Proportional

    Move-to-Set

    Frequency Move-to-Set

    Show that this algorithm tends to suggest only a subset of

    sufficiently true popular items

    26

  • 8/3/2019 2009FS-01

    27/32

    Copyright 2009 by CEBT

    Numerical Results

    Goal

    Evaluate performance of algorithms

    using samples of real-world distributions for item popularity scores

    Data set

    Tagging histories for bookmarks from del.icio.us

    Study

    Learning the true popularity ranking

    compute threshold imitation probability for each bookmark

    Undesired lockdown may happen at small level of imitation (about 0.1)

    Precision of suggestions evaluate mean precision of suggestions made by algorithms PROP, M2S, FM2S

    Mean precision under FM2S is the best

    27

  • 8/3/2019 2009FS-01

    28/32

    Contents

    Introduction

    Problem Formulation

    Algorithms

    Additional Material

    Analytical & Numerical Results

    Related Work

    Conclusion

    28

  • 8/3/2019 2009FS-01

    29/32

    Copyright 2009 by CEBT

    Related Work

    Related Area

    Recommendation systems

    learn which items preferred based on users selection

    We consider a system with feedback

    Voting systems

    an instance of approval voting

    Feedback may bias voting results

    bandwagon and underdog effects induced by pre-election polls

    Use click-through data for Web search

    search engine result sets lock down to a set of popular URLs

    Our problem is different because user can select items that is not presented. Social Tagging

    The effect of the tag suggestions

    Tags applied by users are affected by tag suggestions

    29

  • 8/3/2019 2009FS-01

    30/32

    Contents

    Introduction

    Related Work

    Problem Formulation

    Algorithms

    Analytical & Numerical Results

    Conclusion

    30

  • 8/3/2019 2009FS-01

    31/32

    Copyright 2009 by CEBT

    Suggestion set

    entire item set

    Conclusion

    Ranking and Suggesting Popular Items

    propose randomized algorithms for ranking and suggesting popular items

    designed to account for popularity bias.

    M2S and FM2S learn true popularity ranking that are lightweight

    self-tuning in that they do not require any special configuration parameters

    FM2S confines to displaying only sufficiently popular items

    31

    nave

    notbiased

    ranking suggesting

    Rank rule2

    Rank rule1

    FM2S

    top-N rank score(baseline)

    Top Popular

  • 8/3/2019 2009FS-01

    32/32

    32

    Thank you~