Affordable Access

deepdyve-link
Publisher Website

#Santiago is not #Chile, or is it? A Model to Normalize Social Media Impact

Authors
  • Graells-Garrido, Eduardo
  • Poblete, Barbara
Type
Preprint
Publication Date
Sep 06, 2013
Submission Date
Sep 06, 2013
Identifiers
DOI: 10.1145/2535597.2535611
Source
arXiv
License
Unknown
External links

Abstract

Online social networks are known to be demographically biased. Currently there are questions about what degree of representativity of the physical population they have, and how population biases impact user-generated content. In this paper we focus on centralism, a problem affecting Chile. Assuming that local differences exist in a country, in terms of vocabulary, we built a methodology based on the vector space model to find distinctive content from different locations, and use it to create classifiers to predict whether the content of a micro-post is related to a particular location, having in mind a geographically diverse selection of micro-posts. We evaluate them in a case study where we analyze the virtual population of Chile that participated in the Twitter social network during an event of national relevance: the municipal (local governments) elections held in 2012. We observe that the participating virtual population is spatially representative of the physical population, implying that there is centralism in Twitter. Our classifiers out-perform a non geographically-diverse baseline at the regional level, and have the same accuracy at a provincial level. However, our approach makes assumptions that need to be tested in multi-thematic and more general datasets. We leave this for future work.

Report this publication

Statistics

Seen <100 times