Ebrahimi, M.; ShafieiBavani, E.; Wong, R.; Chen, F.: Twitter user geolocation by filtering of highly mentioned users (2018)
- Abstract
- Geolocated social media data provide a powerful source of information about places and regional human behavior. Because only a small fraction of social media data have been geolocation-annotated, inference techniques play a substantial role in increasing the volume of annotated data. Conventional research in this area has been based on either the text content of a given user's posts or the user's social network, with some recent crossovers between the text- and network-based approaches. This paper proposes a novel approach that categorizes highly mentioned users (celebrities) into Local and Global types and then uses Local celebrities as location indicators. A label propagation algorithm is then applied over the refined social network for geolocation inference. Finally, we propose a hybrid approach that merges a text-based method into our network-based approach as a back-off strategy. Empirical experiments over three standard Twitter benchmark data sets demonstrate that our approach outperforms state-of-the-art user geolocation methods.
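
The network-based part of the abstract (drop Global celebrities, keep Local ones, then propagate locations over the refined graph) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names `filter_celebrities` and `propagate`, the `min_mentions` and `max_spread` thresholds, the bounding-box test for "Local", and the coordinate-wise median update are all hypothetical choices.

```python
from statistics import median


def filter_celebrities(mentions, known, min_mentions=3, max_spread=1.0):
    """Return highly mentioned users judged 'Global' (hypothetical criterion).

    mentions: dict mapping user -> set of users they mention
    known:    dict mapping user -> (lat, lon) for location-labeled users
    """
    # Invert the mention graph: who mentions each user?
    mentioned_by = {}
    for user, targets in mentions.items():
        for target in targets:
            mentioned_by.setdefault(target, set()).add(user)

    global_celebs = set()
    for celeb, fans in mentioned_by.items():
        if len(fans) < min_mentions:
            continue  # not highly mentioned, so not a celebrity
        coords = [known[f] for f in fans if f in known]
        if not coords:
            global_celebs.add(celeb)  # no evidence of locality
            continue
        # Assumed locality test: bounding box of labeled mentioners, in degrees.
        lat_spread = max(c[0] for c in coords) - min(c[0] for c in coords)
        lon_spread = max(c[1] for c in coords) - min(c[1] for c in coords)
        if max(lat_spread, lon_spread) > max_spread:
            global_celebs.add(celeb)
    return global_celebs


def propagate(mentions, known, removed, iterations=5):
    """Label propagation: unlabeled users take the median of their neighbors."""
    # Build an undirected graph that excludes the removed (Global) nodes.
    neighbors = {}
    for user, targets in mentions.items():
        for target in targets:
            if user in removed or target in removed or user == target:
                continue
            neighbors.setdefault(user, set()).add(target)
            neighbors.setdefault(target, set()).add(user)

    estimates = dict(known)
    for _ in range(iterations):
        updated = dict(estimates)
        for user, nbrs in neighbors.items():
            if user in known:
                continue  # labeled users keep their ground-truth location
            coords = [estimates[n] for n in nbrs if n in estimates]
            if coords:
                updated[user] = (median(c[0] for c in coords),
                                 median(c[1] for c in coords))
        estimates = updated
    return estimates


# Toy data (invented): two users near New York, one in Los Angeles, one in London.
known = {"a": (40.0, -74.0), "b": (40.2, -74.1),
         "c": (34.0, -118.0), "d": (51.5, -0.1)}
mentions = {
    "a": {"local_star", "world_star", "u"},
    "b": {"local_star", "world_star"},
    "c": {"world_star"},
    "d": {"world_star"},
    "u": {"local_star", "world_star"},
}

global_celebs = filter_celebrities(mentions, known)
estimates = propagate(mentions, known, removed=global_celebs)
```

In this toy graph, `world_star` is mentioned from three distant regions and is filtered out, while `local_star` is kept and helps pull the unlabeled user `u` toward the New York cluster. The text-based back-off described in the abstract would apply to users who remain without an estimate after propagation; it is not sketched here.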