Document (#30153)

Author
Watters, C.
Amoudi, A.
Title
GeoSearcher : location-based ranking of search engine results
Source
Journal of the American Society for Information Science and Technology. 54(2003) no.2, pp.140-151
Year
2003
Abstract
Watters and Amoudi describe GeoSearcher, a prototype ranking program that arranges search engine results along a geo-spatial dimension without requiring geo-spatial meta-tags or geo-spatial feature extraction. GeoSearcher uses URL analysis, IptoLL, Whois, and the Getty Thesaurus of Geographic Names to determine site location. It accepts the first 200 sites returned by a search engine, identifies their coordinates, calculates each site's distance from a reference point, and ranks the sites in ascending order of that distance. For each retrieved site, the system first checks whether it has already been located in the current session; if not, it sends the domain name to Whois, which returns a two-letter country code and an area code. If that fails, the name is stripped by one level and resent. If this also fails, the top-level domain is tested for being a country code. Any names still unmatched go to IptoLL. Distance is calculated between the center point of the geographic area and a provided reference location. A test run on a set of 100 URLs from a search located 90 sites. Eighty-three pages could be found manually, and 68 had sufficient information to verify the location determination. Of these, 65 (95%) had been assigned reasonably correct geographic locations. A random set of URLs, used instead of a search result, yielded 80% success.
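The lookup cascade and distance ranking summarized in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say which distance formula GeoSearcher uses, so the haversine great-circle formula is swapped in here, and the resolver callables (`whois`, `tld_lookup`, `ip_lookup`) are hypothetical stand-ins for the Whois, country-code, and IptoLL steps.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius (assumption of this sketch)


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) pairs in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def strip_one_level(host):
    """Drop the leftmost label, e.g. 'www.example.ca' -> 'example.ca'."""
    return host.split(".", 1)[1] if "." in host else host


def locate(host, whois, tld_lookup, ip_lookup, cache):
    """Try each step in the order the abstract gives; return (lat, lon) or None.

    whois, tld_lookup and ip_lookup are hypothetical callables that return
    a (lat, lon) pair on success and None on failure.
    """
    if host in cache:  # already located in the current session
        return cache[host]
    attempts = (
        lambda: whois(host),                          # full domain name
        lambda: whois(strip_one_level(host)),         # name stripped one level
        lambda: tld_lookup(host.rsplit(".", 1)[-1]),  # country-code TLD test
        lambda: ip_lookup(host),                      # last resort
    )
    for attempt in attempts:
        coords = attempt()
        if coords is not None:
            cache[host] = coords
            return coords
    return None


def rank_by_distance(located, reference):
    """Sort (url, (lat, lon)) pairs by ascending distance from reference."""
    return sorted(located, key=lambda item: haversine_km(item[1], reference))
```

Passing a plain dict as `cache` gives the per-session memoization the abstract mentions; sites for which `locate` returns None would simply be left out of the list handed to `rank_by_distance`.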
Theme
Search engines
Retrieval algorithms
Object
GeoSearcher

Similar documents (author)

  1. Watters, C.: Extending the multimedia class hierarchy for hypermedia applications (1996) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:watters in 605) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 605, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=605)
    
  2. Watters, C.: Information retrieval and the virtual document (1999) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:watters in 4319) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 4319, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=4319)
    
  3. Watters, C.; Shepherd, M.A.: Shifting the information paradigm from data-centered to user-centered (1994) 4.49
    4.4944186 = sum of:
      4.4944186 = weight(author_txt:watters in 7290) [ClassicSimilarity], result of:
        4.4944186 = fieldWeight in 7290, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.5 = fieldNorm(doc=7290)
    
  4. Carrick, C.; Watters, C.: Automatic association of news items (1997) 4.49
    4.4944186 = sum of:
      4.4944186 = weight(author_txt:watters in 1549) [ClassicSimilarity], result of:
        4.4944186 = fieldWeight in 1549, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.5 = fieldNorm(doc=1549)
    
  5. Watters, C.; Wang, H.: Rating new documents for similarity (2000) 4.49
    4.4944186 = sum of:
      4.4944186 = weight(author_txt:watters in 4856) [ClassicSimilarity], result of:
        4.4944186 = fieldWeight in 4856, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.5 = fieldNorm(doc=4856)
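
The author-match scores above can be reproduced by hand as tf × idf × fieldNorm. The sketch below assumes the classic TF-IDF forms tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); these are inferred from the fact that they reproduce the listed values, and the function names are illustrative only.

```python
import math


def classic_tf(freq):
    # Term-frequency factor: square root of the raw term count.
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    # Inverse document frequency in the form that matches the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from entry 1 of the listing (doc=605).
weight_605 = field_weight(freq=1.0, doc_freq=14, max_docs=44218, field_norm=0.625)
# Values copied from entry 3 (doc=7290); only fieldNorm differs.
weight_7290 = field_weight(freq=1.0, doc_freq=14, max_docs=44218, field_norm=0.5)
```

Under these assumptions `weight_605` and `weight_7290` should agree with the listed 5.6180234 and 4.4944186 up to single-precision rounding, which shows that the gap between the two score groups comes entirely from fieldNorm.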
    

Similar documents (content)

  1. Hill, L.L.; Frew, J.; Zheng, Q.: Geographic names : the implementation of a gazetteer in a georeferenced digital library (1999) 0.19
    0.19419435 = sum of:
      0.19419435 = product of:
        0.8091431 = sum of:
          0.018014124 = weight(abstract_txt:reference in 1240) [ClassicSimilarity], result of:
            0.018014124 = score(doc=1240,freq=1.0), product of:
              0.086113885 = queryWeight, product of:
                4.46271 = idf(docFreq=1385, maxDocs=44218)
                0.019296322 = queryNorm
              0.20918953 = fieldWeight in 1240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.46271 = idf(docFreq=1385, maxDocs=44218)
                0.046875 = fieldNorm(doc=1240)
          0.08599131 = weight(abstract_txt:name in 1240) [ClassicSimilarity], result of:
            0.08599131 = score(doc=1240,freq=5.0), product of:
              0.14277236 = queryWeight, product of:
                1.2876134 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.019296322 = queryNorm
              0.6022966 = fieldWeight in 1240, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.046875 = fieldNorm(doc=1240)
          0.1334279 = weight(abstract_txt:names in 1240) [ClassicSimilarity], result of:
            0.1334279 = score(doc=1240,freq=11.0), product of:
              0.1471289 = queryWeight, product of:
                1.3071108 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019296322 = queryNorm
              0.90687764 = fieldWeight in 1240, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.046875 = fieldNorm(doc=1240)
          0.33256897 = weight(abstract_txt:geographic in 1240) [ClassicSimilarity], result of:
            0.33256897 = score(doc=1240,freq=15.0), product of:
              0.27920225 = queryWeight, product of:
                2.2053041 = boost
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.019296322 = queryNorm
              1.19114 = fieldWeight in 1240, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.046875 = fieldNorm(doc=1240)
          0.096190274 = weight(abstract_txt:spatial in 1240) [ClassicSimilarity], result of:
            0.096190274 = score(doc=1240,freq=1.0), product of:
              0.3011496 = queryWeight, product of:
                2.2903411 = boost
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.019296322 = queryNorm
              0.31941026 = fieldWeight in 1240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.046875 = fieldNorm(doc=1240)
          0.14295055 = weight(abstract_txt:location in 1240) [ClassicSimilarity], result of:
            0.14295055 = score(doc=1240,freq=2.0), product of:
              0.34260076 = queryWeight, product of:
                2.8208017 = boost
                6.29421 = idf(docFreq=221, maxDocs=44218)
                0.019296322 = queryNorm
              0.4172511 = fieldWeight in 1240, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.29421 = idf(docFreq=221, maxDocs=44218)
                0.046875 = fieldNorm(doc=1240)
        0.24 = coord(6/25)
    
  2. Hill, L.L.: Geographic indexing for bibliographic databases (1989) 0.17
    0.16774154 = sum of:
      0.16774154 = product of:
        0.8387077 = sum of:
          0.036028247 = weight(abstract_txt:reference in 2716) [ClassicSimilarity], result of:
            0.036028247 = score(doc=2716,freq=1.0), product of:
              0.086113885 = queryWeight, product of:
                4.46271 = idf(docFreq=1385, maxDocs=44218)
                0.019296322 = queryNorm
              0.41837907 = fieldWeight in 2716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.46271 = idf(docFreq=1385, maxDocs=44218)
                0.09375 = fieldNorm(doc=2716)
          0.110677525 = weight(abstract_txt:country in 2716) [ClassicSimilarity], result of:
            0.110677525 = score(doc=2716,freq=1.0), product of:
              0.18197738 = queryWeight, product of:
                1.453691 = boost
                6.487401 = idf(docFreq=182, maxDocs=44218)
                0.019296322 = queryNorm
              0.6081939 = fieldWeight in 2716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.487401 = idf(docFreq=182, maxDocs=44218)
                0.09375 = fieldNorm(doc=2716)
          0.2974587 = weight(abstract_txt:geographic in 2716) [ClassicSimilarity], result of:
            0.2974587 = score(doc=2716,freq=3.0), product of:
              0.27920225 = queryWeight, product of:
                2.2053041 = boost
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.019296322 = queryNorm
              1.065388 = fieldWeight in 2716, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.09375 = fieldNorm(doc=2716)
          0.19238055 = weight(abstract_txt:spatial in 2716) [ClassicSimilarity], result of:
            0.19238055 = score(doc=2716,freq=1.0), product of:
              0.3011496 = queryWeight, product of:
                2.2903411 = boost
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.019296322 = queryNorm
              0.6388205 = fieldWeight in 2716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.09375 = fieldNorm(doc=2716)
          0.20216261 = weight(abstract_txt:location in 2716) [ClassicSimilarity], result of:
            0.20216261 = score(doc=2716,freq=1.0), product of:
              0.34260076 = queryWeight, product of:
                2.8208017 = boost
                6.29421 = idf(docFreq=221, maxDocs=44218)
                0.019296322 = queryNorm
              0.59008217 = fieldWeight in 2716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.29421 = idf(docFreq=221, maxDocs=44218)
                0.09375 = fieldNorm(doc=2716)
        0.2 = coord(5/25)
    
  3. Koehler, W.C.: Internet search note : specialized retrieval and Web search engines (1997) 0.13
    0.13363151 = sum of:
      0.13363151 = product of:
        0.556798 = sum of:
          0.074541844 = weight(abstract_txt:level in 769) [ClassicSimilarity], result of:
            0.074541844 = score(doc=769,freq=3.0), product of:
              0.087479495 = queryWeight, product of:
                1.0078979 = boost
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.019296322 = queryNorm
              0.8521065 = fieldWeight in 769, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.109375 = fieldNorm(doc=769)
          0.050224617 = weight(abstract_txt:domain in 769) [ClassicSimilarity], result of:
            0.050224617 = score(doc=769,freq=1.0), product of:
              0.096967086 = queryWeight, product of:
                1.0611471 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.019296322 = queryNorm
              0.5179553 = fieldWeight in 769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.109375 = fieldNorm(doc=769)
          0.060092658 = weight(abstract_txt:area in 769) [ClassicSimilarity], result of:
            0.060092658 = score(doc=769,freq=1.0), product of:
              0.10928508 = queryWeight, product of:
                1.1265328 = boost
                5.027389 = idf(docFreq=787, maxDocs=44218)
                0.019296322 = queryNorm
              0.54987067 = fieldWeight in 769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.027389 = idf(docFreq=787, maxDocs=44218)
                0.109375 = fieldNorm(doc=769)
          0.08973179 = weight(abstract_txt:name in 769) [ClassicSimilarity], result of:
            0.08973179 = score(doc=769,freq=1.0), product of:
              0.14277236 = queryWeight, product of:
                1.2876134 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.019296322 = queryNorm
              0.6284955 = fieldWeight in 769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.109375 = fieldNorm(doc=769)
          0.08184624 = weight(abstract_txt:search in 769) [ClassicSimilarity], result of:
            0.08184624 = score(doc=769,freq=2.0), product of:
              0.1446491 = queryWeight, product of:
                2.0492327 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.019296322 = queryNorm
              0.5658261 = fieldWeight in 769, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.109375 = fieldNorm(doc=769)
          0.20036086 = weight(abstract_txt:geographic in 769) [ClassicSimilarity], result of:
            0.20036086 = score(doc=769,freq=1.0), product of:
              0.27920225 = queryWeight, product of:
                2.2053041 = boost
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.019296322 = queryNorm
              0.71761906 = fieldWeight in 769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.109375 = fieldNorm(doc=769)
        0.24 = coord(6/25)
    
  4. Wisniewski, J.: Authority work, Internet resources, and a cataloguer's home page (1998) 0.13
    0.1317772 = sum of:
      0.1317772 = product of:
        0.65888596 = sum of:
          0.120345145 = weight(abstract_txt:sites in 2534) [ClassicSimilarity], result of:
            0.120345145 = score(doc=2534,freq=2.0), product of:
              0.12607463 = queryWeight, product of:
                1.2099774 = boost
                5.399778 = idf(docFreq=542, maxDocs=44218)
                0.019296322 = queryNorm
              0.95455486 = fieldWeight in 2534, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.399778 = idf(docFreq=542, maxDocs=44218)
                0.125 = fieldNorm(doc=2534)
          0.09972625 = weight(abstract_txt:site in 2534) [ClassicSimilarity], result of:
            0.09972625 = score(doc=2534,freq=1.0), product of:
              0.14013876 = queryWeight, product of:
                1.2756823 = boost
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.019296322 = queryNorm
              0.71162504 = fieldWeight in 2534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.125 = fieldNorm(doc=2534)
          0.10255062 = weight(abstract_txt:name in 2534) [ClassicSimilarity], result of:
            0.10255062 = score(doc=2534,freq=1.0), product of:
              0.14277236 = queryWeight, product of:
                1.2876134 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.019296322 = queryNorm
              0.7182806 = fieldWeight in 2534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.125 = fieldNorm(doc=2534)
          0.10728007 = weight(abstract_txt:names in 2534) [ClassicSimilarity], result of:
            0.10728007 = score(doc=2534,freq=1.0), product of:
              0.1471289 = queryWeight, product of:
                1.3071108 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019296322 = queryNorm
              0.72915703 = fieldWeight in 2534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.125 = fieldNorm(doc=2534)
          0.22898383 = weight(abstract_txt:geographic in 2534) [ClassicSimilarity], result of:
            0.22898383 = score(doc=2534,freq=1.0), product of:
              0.27920225 = queryWeight, product of:
                2.2053041 = boost
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.019296322 = queryNorm
              0.8201361 = fieldWeight in 2534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.125 = fieldNorm(doc=2534)
        0.2 = coord(5/25)
    
  5. Schaefer, M.T.: Project Aristotle & Cyberstacks : automating the virtual Internet library (1998) 0.12
    0.11772267 = sum of:
      0.11772267 = product of:
        0.58861333 = sum of:
          0.08726047 = weight(abstract_txt:site in 337) [ClassicSimilarity], result of:
            0.08726047 = score(doc=337,freq=1.0), product of:
              0.14013876 = queryWeight, product of:
                1.2756823 = boost
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.019296322 = queryNorm
              0.6226719 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.109375 = fieldNorm(doc=337)
          0.08783295 = weight(abstract_txt:success in 337) [ClassicSimilarity], result of:
            0.08783295 = score(doc=337,freq=1.0), product of:
              0.14075102 = queryWeight, product of:
                1.278466 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.019296322 = queryNorm
              0.62403065 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.109375 = fieldNorm(doc=337)
          0.1197895 = weight(abstract_txt:engine in 337) [ClassicSimilarity], result of:
            0.1197895 = score(doc=337,freq=1.0), product of:
              0.19814791 = queryWeight, product of:
                1.8578206 = boost
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.019296322 = queryNorm
              0.6045459 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5272765 = idf(docFreq=477, maxDocs=44218)
                0.109375 = fieldNorm(doc=337)
          0.05787403 = weight(abstract_txt:search in 337) [ClassicSimilarity], result of:
            0.05787403 = score(doc=337,freq=1.0), product of:
              0.1446491 = queryWeight, product of:
                2.0492327 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.019296322 = queryNorm
              0.4000995 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.109375 = fieldNorm(doc=337)
          0.23585638 = weight(abstract_txt:location in 337) [ClassicSimilarity], result of:
            0.23585638 = score(doc=337,freq=1.0), product of:
              0.34260076 = queryWeight, product of:
                2.8208017 = boost
                6.29421 = idf(docFreq=221, maxDocs=44218)
                0.019296322 = queryNorm
              0.68842924 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.29421 = idf(docFreq=221, maxDocs=44218)
                0.109375 = fieldNorm(doc=337)
        0.2 = coord(5/25)
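
The content-similarity scores follow the same pattern with two extra factors: each term score is queryWeight × fieldWeight, and the sum over matching terms is scaled by coord. As a rough check, the sketch below recomputes the score of entry 1 (doc=1240) from the freq, docFreq, boost, queryNorm, fieldNorm and coord values shown in the listing, again assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The helper names are illustrative, not part of any library.

```python
import math

MAX_DOCS = 44218          # from the listing
QUERY_NORM = 0.019296322  # from the listing


def idf(doc_freq):
    # Inverse document frequency in the form that matches the listing.
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, boost=1.0):
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) for doc=1240, copied from the listing.
TERMS_1240 = [
    ("reference", 1.0, 1385, 1.0),
    ("name", 5.0, 383, 1.2876134),
    ("names", 11.0, 351, 1.3071108),
    ("geographic", 15.0, 169, 2.2053041),
    ("spatial", 1.0, 131, 2.2903411),
    ("location", 2.0, 221, 2.8208017),
]

FIELD_NORM_1240 = 0.046875
COORD_1240 = 6 / 25  # 6 of 25 query terms matched

term_sum = sum(term_score(f, df, FIELD_NORM_1240, b) for _, f, df, b in TERMS_1240)
score_1240 = term_sum * COORD_1240
```

If the assumptions hold, `term_sum` and `score_1240` should come out close to the listed 0.8091431 and 0.19419435; small differences in the last digits are expected because the listing was computed in single precision.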