part 1: introduction importance of geolocation finding compromised accounts (prevent security...

25
Find Me If You Can: Improving Geographical Prediction with Social and Spatial Proximity

Upload: antony-neal

Post on 29-Dec-2015

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

Find Me If You Can: Improving Geographical Prediction

with Social and Spatial Proximity

Page 2: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

Part 1: Introduction

Page 3: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

Importance of geolocation

Finding compromised accounts (prevent security breaches).

Personalization of information based on location.

Page 4: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

MotivationIP address typically provide accuracy at

the city level.results are inconsistent.Geo-IP databases require constant

maintenance.Other geolocation strategies are non

transparent.

Page 5: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

Part 3: use of observations in a predictive model

Part 2: relationships

Part 1: Introduction

Page 6: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

Paper outlinestudy the relationship between geography

and friendship.they use the Facebook social network in

order to study the relationship.Some users provide their addresses.we get 30.6 million edges between

individuals with known location.

Page 7: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

Factors that affect relationshipsSocial Norms.Distance.Communication technologies.

Page 8: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

• Males are significantly more likely to share their address information than females.

• users that share their addresses tend to have many more friends.

Supplying addresses on Facebook

Page 9: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

No bias problem.

Page 10: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

low density: power-law with exponent -1.37.

high density: power-law with exponent -3.07.

96% of people live in areas before the transition point on exponent -1.37 .

Page 11: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

we see that the curves increase linearly only for a small distance.

we increase the radius and expect to find an increase in the population.

on the other hand, we move further away from urban centers to rural areas.

Page 12: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

• we can get a good fit to a curve of the form . The exponent very close to c = −1.

Page 13: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

for short distances the probability is higher in lower density areas

at about 50 miles the three curves converge. at long distances, people in high density areas

being more likely to be friends.

Page 14: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

ranku(v) := |{w : d(u,w) < d(u, v)}|.we do see a nice smooth curve, again with an

exponent of close to −1.

Page 15: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

All the curves with exponent about −1.higher at low ranks for people in less dense areas, and

higher at high ranks for people in more dense areas (cosmopolitan effect).

Page 16: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

Part 3: use of observations in a predictive model

Part 2: relationships

Part 1: Introduction

Page 17: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

)

. = 0.0019 (taken from slide 12).

Page 18: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location
Page 19: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

attempt to recover addresses of 75 % of individuals.iteratively using the newly guessed locations as

input as well as the locations provided by users.

Page 20: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

Prediction performance as a function of friend count.

Page 21: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

A good trade-off is 5+blend.

Page 22: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

BenefitsInfo about relationships with greater

accuracy and in greater depth.The new algorithm.

Page 23: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

Part 3: use of observations in a predictive model

Part 2: relationships

Part 1: Introduction

Page 24: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

Future workFuture work can improve even more the

accuracy.Using social gathering.

Page 25: Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security breaches). Personalization of information based on location

attaching time stamps to data.More weight to new friendships than old

ones.