Document (#30154)

Author
Watters, C.
Amoudi, A.
Title
Geosearcher : location-based ranking of search engine results
Source
Journal of the American Society for Information Science and Technology. 54(2003) no.2, pp.140-151
Year
2003
Abstract
Watters and Amoudi describe GeoSearcher, a prototype ranking program that arranges search engine results along a geo-spatial dimension without requiring geo-spatial meta-tags or geo-spatial feature extraction. GeoSearcher uses URL analysis, IptoLL, Whois, and the Getty Thesaurus of Geographic Names to determine site location. It accepts the first 200 sites returned by a search engine, identifies their coordinates, calculates the distance of each from a reference point, and ranks the sites in ascending order of that distance. For each retrieved site the system first checks whether the site has already been located in the current session; if not, it sends the domain name to Whois, which returns a two-letter country code and an area code. If the lookup fails, the name is stripped by one level and resent. If that also fails, the top-level domain is tested for being a country code. Any names still unmatched go to IptoLL. Distance is calculated between the center point of the geographic area and a provided reference location. A test run on a set of 100 URLs from a search located 90 sites. Eighty-three pages could be found manually, and 68 had sufficient information to verify the location determination. Of these, 65 (95%) had been assigned reasonably correct geographic locations. A random set of URLs, used instead of a search result, yielded 80% success.
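The lookup cascade and distance ranking described in the abstract can be sketched roughly as follows. This is an illustration only, not the authors' implementation: the resolver callables (`whois`, `ip_lookup`), the `country_tlds` table, and the function names are hypothetical stand-ins that return coordinates directly (the abstract does not say how country and area codes are turned into coordinates), "stripping one level" is assumed to mean dropping the leftmost label, the haversine formula is an assumed distance metric, and sites that cannot be located are assumed to be left out of the ranking.

```python
import math
from typing import Callable, Dict, List, Optional, Tuple

Coord = Tuple[float, float]  # (latitude, longitude) in degrees
Resolver = Callable[[str], Optional[Coord]]


def haversine_km(a: Coord, b: Coord) -> float:
    """Great-circle distance in km (assumed metric; the abstract names none)."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))


def locate(host: str, whois: Resolver, country_tlds: Dict[str, Coord],
           ip_lookup: Resolver, cache: Dict[str, Optional[Coord]]) -> Optional[Coord]:
    """Follow the fallback order from the abstract; resolvers are hypothetical."""
    if host in cache:                      # already located in this session
        return cache[host]
    labels = host.split(".")
    coord = whois(host)                    # 1. Whois on the full name
    if coord is None and len(labels) > 2:  # 2. strip one level and resend
        coord = whois(".".join(labels[1:]))
    if coord is None:                      # 3. country-code top-level domain
        coord = country_tlds.get(labels[-1])
    if coord is None:                      # 4. last resort: IP-based lookup
        coord = ip_lookup(host)
    cache[host] = coord
    return coord


def rank_by_distance(hosts: List[str], reference: Coord,
                     locate_fn: Resolver) -> List[str]:
    """Sort located hosts by ascending distance; unlocated hosts are dropped."""
    located = []
    for host in hosts:
        coord = locate_fn(host)
        if coord is not None:
            located.append((haversine_km(reference, coord), host))
    return [host for _, host in sorted(located)]


# Toy data, invented for the example.
whois_table = {"example.ca": (44.65, -63.57)}
tld_table = {"de": (51.0, 10.0)}
session_cache: Dict[str, Optional[Coord]] = {}
ranked = rank_by_distance(
    ["www.uni.de", "www.example.ca", "unknown.org"],
    reference=(44.65, -63.57),
    locate_fn=lambda h: locate(h, whois_table.get, tld_table,
                               lambda _h: None, session_cache),
)
print(ranked)
```

With this toy data the Canadian host resolves on the second Whois attempt, the German host falls through to the country-code table, and the unresolvable host is omitted.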
Theme
Search engines
Retrieval algorithms
Object
GeoSearcher

Similar documents (author)

  1. Watters, C.: Extending the multimedia class hierarchy for hypermedia applications (1996) 5.59
    5.5903964 = sum of:
      5.5903964 = weight(author_txt:watters in 606) [ClassicSimilarity], result of:
        5.5903964 = fieldWeight in 606, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.944634 = idf(docFreq=14, maxDocs=42306)
          0.625 = fieldNorm(doc=606)
    
  2. Watters, C.: Information retrieval and the virtual document (1999) 5.59
    5.5903964 = sum of:
      5.5903964 = weight(author_txt:watters in 5320) [ClassicSimilarity], result of:
        5.5903964 = fieldWeight in 5320, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.944634 = idf(docFreq=14, maxDocs=42306)
          0.625 = fieldNorm(doc=5320)
    
  3. Watters, C.; Shepherd, M.A.: Shifting the information paradigm from data-centered to user-centered (1994) 4.47
    4.472317 = sum of:
      4.472317 = weight(author_txt:watters in 7290) [ClassicSimilarity], result of:
        4.472317 = fieldWeight in 7290, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.944634 = idf(docFreq=14, maxDocs=42306)
          0.5 = fieldNorm(doc=7290)
    
  4. Carrick, C.; Watters, C.: Automatic association of news items (1997) 4.47
    4.472317 = sum of:
      4.472317 = weight(author_txt:watters in 2550) [ClassicSimilarity], result of:
        4.472317 = fieldWeight in 2550, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.944634 = idf(docFreq=14, maxDocs=42306)
          0.5 = fieldNorm(doc=2550)
    
  5. Watters, C.; Wang, H.: Rating new documents for similarity (2000) 4.47
    4.472317 = sum of:
      4.472317 = weight(author_txt:watters in 5857) [ClassicSimilarity], result of:
        4.472317 = fieldWeight in 5857, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.944634 = idf(docFreq=14, maxDocs=42306)
          0.5 = fieldNorm(doc=5857)
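
The author-match scores above can be reproduced from the three factors each tree lists. The sketch below assumes the classic Lucene TF-IDF definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which match the numbers shown here; the function names are made up for the illustration, and fieldNorm is simply copied from the output rather than recomputed.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed formula; it reproduces idf(docFreq=14, maxDocs=42306) = 8.944634.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq).
    return math.sqrt(freq) * classic_idf(doc_freq, max_docs) * field_norm


# fieldNorm values are taken from the explain output above.
single_author = field_weight(1.0, 14, 42306, 0.625)  # entries 1 and 2
co_author = field_weight(1.0, 14, 42306, 0.5)        # entries 3 to 5
print(round(single_author, 4), round(co_author, 4))
```

The only factor that differs between the two groups is fieldNorm, presumably because a co-authored record has a longer author field.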
    

Similar documents (content)

  1. Hill, L.L.; Frew, J.; Zheng, Q.: Geographic names : the implementation of a gazetteer in a georeferenced digital library (1999) 0.19
    0.19475706 = sum of:
      0.19475706 = product of:
        0.81148773 = sum of:
          0.017793866 = weight(abstract_txt:reference in 3241) [ClassicSimilarity], result of:
            0.017793866 = score(doc=3241,freq=1.0), product of:
              0.08536053 = queryWeight, product of:
                4.447049 = idf(docFreq=1346, maxDocs=42306)
                0.01919487 = queryNorm
              0.20845543 = fieldWeight in 3241, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.447049 = idf(docFreq=1346, maxDocs=42306)
                0.046875 = fieldNorm(doc=3241)
          0.0865052 = weight(abstract_txt:name in 3241) [ClassicSimilarity], result of:
            0.0865052 = score(doc=3241,freq=5.0), product of:
              0.14325672 = queryWeight, product of:
                1.2954748 = boost
                5.76104 = idf(docFreq=361, maxDocs=42306)
                0.01919487 = queryNorm
              0.6038474 = fieldWeight in 3241, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.76104 = idf(docFreq=361, maxDocs=42306)
                0.046875 = fieldNorm(doc=3241)
          0.13417532 = weight(abstract_txt:names in 3241) [ClassicSimilarity], result of:
            0.13417532 = score(doc=3241,freq=11.0), product of:
              0.1475914 = queryWeight, product of:
                1.314928 = boost
                5.8475494 = idf(docFreq=331, maxDocs=42306)
                0.01919487 = queryNorm
              0.9090998 = fieldWeight in 3241, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.8475494 = idf(docFreq=331, maxDocs=42306)
                0.046875 = fieldNorm(doc=3241)
          0.33073163 = weight(abstract_txt:geographic in 3241) [ClassicSimilarity], result of:
            0.33073163 = score(doc=3241,freq=15.0), product of:
              0.27801016 = queryWeight, product of:
                2.2102807 = boost
                6.552818 = idf(docFreq=163, maxDocs=42306)
                0.01919487 = queryNorm
              1.1896386 = fieldWeight in 3241, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                6.552818 = idf(docFreq=163, maxDocs=42306)
                0.046875 = fieldNorm(doc=3241)
          0.09749071 = weight(abstract_txt:spatial in 3241) [ClassicSimilarity], result of:
            0.09749071 = score(doc=3241,freq=1.0), product of:
              0.30367997 = queryWeight, product of:
                2.31007 = boost
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.01919487 = queryNorm
              0.3210311 = fieldWeight in 3241, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.046875 = fieldNorm(doc=3241)
          0.144791 = weight(abstract_txt:location in 3241) [ClassicSimilarity], result of:
            0.144791 = score(doc=3241,freq=2.0), product of:
              0.3453329 = queryWeight, product of:
                2.8444967 = boost
                6.324808 = idf(docFreq=205, maxDocs=42306)
                0.01919487 = queryNorm
              0.41927952 = fieldWeight in 3241, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.324808 = idf(docFreq=205, maxDocs=42306)
                0.046875 = fieldNorm(doc=3241)
        0.24 = coord(6/25)
    
  2. Hill, L.L.: Geographic indexing for bibliographic databases (1989) 0.17
    0.16869019 = sum of:
      0.16869019 = product of:
        0.84345096 = sum of:
          0.03558773 = weight(abstract_txt:reference in 3717) [ClassicSimilarity], result of:
            0.03558773 = score(doc=3717,freq=1.0), product of:
              0.08536053 = queryWeight, product of:
                4.447049 = idf(docFreq=1346, maxDocs=42306)
                0.01919487 = queryNorm
              0.41691086 = fieldWeight in 3717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.447049 = idf(docFreq=1346, maxDocs=42306)
                0.09375 = fieldNorm(doc=3717)
          0.11230106 = weight(abstract_txt:country in 3717) [ClassicSimilarity], result of:
            0.11230106 = score(doc=3717,freq=1.0), product of:
              0.18364514 = queryWeight, product of:
                1.4667672 = boost
                6.5227857 = idf(docFreq=168, maxDocs=42306)
                0.01919487 = queryNorm
              0.6115112 = fieldWeight in 3717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5227857 = idf(docFreq=168, maxDocs=42306)
                0.09375 = fieldNorm(doc=3717)
          0.29581532 = weight(abstract_txt:geographic in 3717) [ClassicSimilarity], result of:
            0.29581532 = score(doc=3717,freq=3.0), product of:
              0.27801016 = queryWeight, product of:
                2.2102807 = boost
                6.552818 = idf(docFreq=163, maxDocs=42306)
                0.01919487 = queryNorm
              1.064045 = fieldWeight in 3717, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.552818 = idf(docFreq=163, maxDocs=42306)
                0.09375 = fieldNorm(doc=3717)
          0.19498143 = weight(abstract_txt:spatial in 3717) [ClassicSimilarity], result of:
            0.19498143 = score(doc=3717,freq=1.0), product of:
              0.30367997 = queryWeight, product of:
                2.31007 = boost
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.01919487 = queryNorm
              0.6420622 = fieldWeight in 3717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.09375 = fieldNorm(doc=3717)
          0.2047654 = weight(abstract_txt:location in 3717) [ClassicSimilarity], result of:
            0.2047654 = score(doc=3717,freq=1.0), product of:
              0.3453329 = queryWeight, product of:
                2.8444967 = boost
                6.324808 = idf(docFreq=205, maxDocs=42306)
                0.01919487 = queryNorm
              0.59295076 = fieldWeight in 3717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.324808 = idf(docFreq=205, maxDocs=42306)
                0.09375 = fieldNorm(doc=3717)
        0.2 = coord(5/25)
    
  3. Koehler, W.C.: Internet search note : specialized retrieval and Web search engines (1997) 0.13
    0.13417779 = sum of:
      0.13417779 = product of:
        0.55907416 = sum of:
          0.076452985 = weight(abstract_txt:level in 1770) [ClassicSimilarity], result of:
            0.076452985 = score(doc=1770,freq=3.0), product of:
              0.08891635 = queryWeight, product of:
                1.0206157 = boost
                4.538728 = idf(docFreq=1228, maxDocs=42306)
                0.01919487 = queryNorm
              0.8598305 = fieldWeight in 1770, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.538728 = idf(docFreq=1228, maxDocs=42306)
                0.109375 = fieldNorm(doc=1770)
          0.05171281 = weight(abstract_txt:domain in 1770) [ClassicSimilarity], result of:
            0.05171281 = score(doc=1770,freq=1.0), product of:
              0.09881536 = queryWeight, product of:
                1.0759292 = boost
                4.78471 = idf(docFreq=960, maxDocs=42306)
                0.01919487 = queryNorm
              0.52332765 = fieldWeight in 1770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.78471 = idf(docFreq=960, maxDocs=42306)
                0.109375 = fieldNorm(doc=1770)
          0.059841726 = weight(abstract_txt:area in 1770) [ClassicSimilarity], result of:
            0.059841726 = score(doc=1770,freq=1.0), product of:
              0.108916864 = queryWeight, product of:
                1.1295853 = boost
                5.023321 = idf(docFreq=756, maxDocs=42306)
                0.01919487 = queryNorm
              0.5494257 = fieldWeight in 1770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.023321 = idf(docFreq=756, maxDocs=42306)
                0.109375 = fieldNorm(doc=1770)
          0.09026804 = weight(abstract_txt:name in 1770) [ClassicSimilarity], result of:
            0.09026804 = score(doc=1770,freq=1.0), product of:
              0.14325672 = queryWeight, product of:
                1.2954748 = boost
                5.76104 = idf(docFreq=361, maxDocs=42306)
                0.01919487 = queryNorm
              0.6301138 = fieldWeight in 1770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.76104 = idf(docFreq=361, maxDocs=42306)
                0.109375 = fieldNorm(doc=1770)
          0.0815447 = weight(abstract_txt:search in 1770) [ClassicSimilarity], result of:
            0.0815447 = score(doc=1770,freq=2.0), product of:
              0.14420916 = queryWeight, product of:
                2.0551233 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.01919487 = queryNorm
              0.5654613 = fieldWeight in 1770, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.109375 = fieldNorm(doc=1770)
          0.1992539 = weight(abstract_txt:geographic in 1770) [ClassicSimilarity], result of:
            0.1992539 = score(doc=1770,freq=1.0), product of:
              0.27801016 = queryWeight, product of:
                2.2102807 = boost
                6.552818 = idf(docFreq=163, maxDocs=42306)
                0.01919487 = queryNorm
              0.71671444 = fieldWeight in 1770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.552818 = idf(docFreq=163, maxDocs=42306)
                0.109375 = fieldNorm(doc=1770)
        0.24 = coord(6/25)
    
  4. Wisniewski, J.: Authority work, Internet resources, and a cataloguer's home page (1998) 0.13
    0.13179098 = sum of:
      0.13179098 = product of:
        0.65895486 = sum of:
          0.120978974 = weight(abstract_txt:sites in 3535) [ClassicSimilarity], result of:
            0.120978974 = score(doc=3535,freq=2.0), product of:
              0.12644286 = queryWeight, product of:
                1.2170786 = boost
                5.4124084 = idf(docFreq=512, maxDocs=42306)
                0.01919487 = queryNorm
              0.95678765 = fieldWeight in 3535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4124084 = idf(docFreq=512, maxDocs=42306)
                0.125 = fieldNorm(doc=3535)
          0.09921262 = weight(abstract_txt:site in 3535) [ClassicSimilarity], result of:
            0.09921262 = score(doc=3535,freq=1.0), product of:
              0.13957544 = queryWeight, product of:
                1.2787215 = boost
                5.6865373 = idf(docFreq=389, maxDocs=42306)
                0.01919487 = queryNorm
              0.71081716 = fieldWeight in 3535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6865373 = idf(docFreq=389, maxDocs=42306)
                0.125 = fieldNorm(doc=3535)
          0.103163466 = weight(abstract_txt:name in 3535) [ClassicSimilarity], result of:
            0.103163466 = score(doc=3535,freq=1.0), product of:
              0.14325672 = queryWeight, product of:
                1.2954748 = boost
                5.76104 = idf(docFreq=361, maxDocs=42306)
                0.01919487 = queryNorm
              0.72013 = fieldWeight in 3535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.76104 = idf(docFreq=361, maxDocs=42306)
                0.125 = fieldNorm(doc=3535)
          0.107881 = weight(abstract_txt:names in 3535) [ClassicSimilarity], result of:
            0.107881 = score(doc=3535,freq=1.0), product of:
              0.1475914 = queryWeight, product of:
                1.314928 = boost
                5.8475494 = idf(docFreq=331, maxDocs=42306)
                0.01919487 = queryNorm
              0.7309437 = fieldWeight in 3535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8475494 = idf(docFreq=331, maxDocs=42306)
                0.125 = fieldNorm(doc=3535)
          0.22771874 = weight(abstract_txt:geographic in 3535) [ClassicSimilarity], result of:
            0.22771874 = score(doc=3535,freq=1.0), product of:
              0.27801016 = queryWeight, product of:
                2.2102807 = boost
                6.552818 = idf(docFreq=163, maxDocs=42306)
                0.01919487 = queryNorm
              0.8191022 = fieldWeight in 3535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.552818 = idf(docFreq=163, maxDocs=42306)
                0.125 = fieldNorm(doc=3535)
        0.2 = coord(5/25)
    
  5. Schaefer, M.T.: Project Aristotle & Cyberstacks : automating the virtual Internet library (1998) 0.12
    0.118090324 = sum of:
      0.118090324 = product of:
        0.5904516 = sum of:
          0.086811036 = weight(abstract_txt:site in 1338) [ClassicSimilarity], result of:
            0.086811036 = score(doc=1338,freq=1.0), product of:
              0.13957544 = queryWeight, product of:
                1.2787215 = boost
                5.6865373 = idf(docFreq=389, maxDocs=42306)
                0.01919487 = queryNorm
              0.621965 = fieldWeight in 1338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6865373 = idf(docFreq=389, maxDocs=42306)
                0.109375 = fieldNorm(doc=1338)
          0.08788471 = weight(abstract_txt:success in 1338) [ClassicSimilarity], result of:
            0.08788471 = score(doc=1338,freq=1.0), product of:
              0.14072391 = queryWeight, product of:
                1.2839715 = boost
                5.7098846 = idf(docFreq=380, maxDocs=42306)
                0.01919487 = queryNorm
              0.62451863 = fieldWeight in 1338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7098846 = idf(docFreq=380, maxDocs=42306)
                0.109375 = fieldNorm(doc=1338)
          0.11920208 = weight(abstract_txt:engine in 1338) [ClassicSimilarity], result of:
            0.11920208 = score(doc=1338,freq=1.0), product of:
              0.19738403 = queryWeight, product of:
                1.8624005 = boost
                5.5214577 = idf(docFreq=459, maxDocs=42306)
                0.01919487 = queryNorm
              0.60390943 = fieldWeight in 1338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5214577 = idf(docFreq=459, maxDocs=42306)
                0.109375 = fieldNorm(doc=1338)
          0.05766081 = weight(abstract_txt:search in 1338) [ClassicSimilarity], result of:
            0.05766081 = score(doc=1338,freq=1.0), product of:
              0.14420916 = queryWeight, product of:
                2.0551233 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.01919487 = queryNorm
              0.39984152 = fieldWeight in 1338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.109375 = fieldNorm(doc=1338)
          0.23889297 = weight(abstract_txt:location in 1338) [ClassicSimilarity], result of:
            0.23889297 = score(doc=1338,freq=1.0), product of:
              0.3453329 = queryWeight, product of:
                2.8444967 = boost
                6.324808 = idf(docFreq=205, maxDocs=42306)
                0.01919487 = queryNorm
              0.6917759 = fieldWeight in 1338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.324808 = idf(docFreq=205, maxDocs=42306)
                0.109375 = fieldNorm(doc=1338)
        0.2 = coord(5/25)
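
For the content matches, each term contributes queryWeight * fieldWeight, and the sum is scaled by the coord factor. As a check, the following sketch recomputes the score of entry 2 (doc 3717) from the values in its tree. It assumes the same classic TF-IDF formulas as the trees suggest; `query_norm`, the boosts, and fieldNorm are copied from the output, a term with no boost line is treated as boost 1.0, and the helper names are hypothetical.

```python
import math

MAX_DOCS = 42306
QUERY_NORM = 0.01919487  # copied from the explain output


def classic_idf(doc_freq: int) -> float:
    # Assumed formula: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(freq: float, boost: float, doc_freq: int, field_norm: float) -> float:
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# (term, freq, boost, docFreq) for doc 3717, as listed in its tree.
terms = [
    ("reference", 1.0, 1.0, 1346),
    ("country", 1.0, 1.4667672, 168),
    ("geographic", 3.0, 2.2102807, 163),
    ("spatial", 1.0, 2.31007, 121),
    ("location", 1.0, 2.8444967, 205),
]
field_norm = 0.09375
coord = 5 / 25  # 5 of the 25 query terms matched

score = coord * sum(term_score(f, b, df, field_norm) for _, f, b, df in terms)
print(round(score, 4))
```

The result agrees with the 0.17 shown for that entry to the displayed precision.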