Ebrahimi, M.; ShafieiBavani, E.; Wong, R.; Chen, F.: Twitter user geolocation by filtering of highly mentioned users (2018)
- Abstract
- Geolocated social media data provide a powerful source of information about places and regional human behavior. Because only a small amount of social media data has been geolocation-annotated, inference techniques play a substantial role in increasing the volume of annotated data. Conventional research in this area has been based on the text content of posts from a given user or on the social network of the user, with some recent crossovers between the text- and network-based approaches. This paper proposes a novel approach that categorizes highly mentioned users (celebrities) into Local and Global types and then uses Local celebrities as location indicators. A label propagation algorithm is then run over the refined social network for geolocation inference. Finally, we propose a hybrid approach that merges a text-based method into our network-based approach as a back-off strategy. Empirical experiments over three standard Twitter benchmark data sets demonstrate that our approach outperforms state-of-the-art user geolocation methods.
- Source
- Journal of the Association for Information Science and Technology. 69(2018) no.7, pp.879-889
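
The network-based step described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes discrete region labels rather than coordinates, an undirected mention graph, and a simple majority vote for propagation. The function names, the `min_degree` and `local_ratio` thresholds, and the toy data are all hypothetical choices made for illustration.

```python
from collections import Counter, defaultdict


def build_graph(mentions):
    """Build an undirected mention graph from {user: [mentioned users]}."""
    graph = defaultdict(set)
    for user, targets in mentions.items():
        for target in targets:
            if target != user:
                graph[user].add(target)
                graph[target].add(user)
    return graph


def majority(labels):
    """Most frequent label; ties are broken alphabetically for determinism."""
    if not labels:
        return None
    counts = Counter(labels)
    best = max(counts.values())
    return min(label for label, count in counts.items() if count == best)


def global_celebrities(graph, known, min_degree=4, local_ratio=0.7):
    """Flag highly mentioned nodes whose labeled neighbors are spread out.

    Assumption: a celebrity is "Local" when at least `local_ratio` of its
    labeled neighbors share one location, and "Global" otherwise.
    """
    flagged = set()
    for node, neighbors in graph.items():
        if len(neighbors) < min_degree:
            continue  # not highly mentioned, so not treated as a celebrity
        labels = [known[n] for n in neighbors if n in known]
        if not labels:
            flagged.add(node)  # no location evidence at all
            continue
        top_count = Counter(labels).most_common(1)[0][1]
        if top_count / len(labels) < local_ratio:
            flagged.add(node)
    return flagged


def propagate(graph, known, removed, max_iters=10):
    """Majority-vote label propagation over the graph without `removed` nodes."""
    labels = dict(known)  # labels of known users stay fixed
    for _ in range(max_iters):
        changed = False
        for node in sorted(graph):
            if node in known or node in removed:
                continue
            neighbor_labels = [
                labels[n] for n in graph[node]
                if n not in removed and n in labels
            ]
            new_label = majority(neighbor_labels)
            if new_label is not None and labels.get(node) != new_label:
                labels[node] = new_label
                changed = True
        if not changed:
            break
    return labels


# Toy data (hypothetical): "u" has no known location.
mentions = {
    "a1": ["local_star", "global_star"],
    "a2": ["local_star", "global_star"],
    "a3": ["local_star"],
    "b1": ["global_star"],
    "b2": ["global_star"],
    "u": ["local_star", "global_star"],
}
known = {"a1": "sydney", "a2": "sydney", "a3": "sydney",
         "b1": "london", "b2": "london"}

graph = build_graph(mentions)
removed = global_celebrities(graph, known)
labels = propagate(graph, known, removed)
print(removed, labels.get("u"))
```

In this toy graph, `global_star` is mentioned by users from two regions and is filtered out, while `local_star` is kept and passes its followers' location on to `u`. Users that propagation leaves unlabeled are the ones the abstract's hybrid approach would hand to a text-based method; that back-off is not shown here.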