Document (#27065)

Author
Reitsma, R.F.
Thabane, L.
MacLeod, J.M.B.
Title
Spatialization of Web Sites Using a Weighted Frequency Model of Navigation Data
Source
Journal of the American Society for Information Science and Technology. 55(2004) no.1, pp.13-22
Year
2004
Abstract
A common problem in the spatialization of information systems is the determination of geometry; i.e., dimensionality and metric. Such geometry is either chosen a priori or is inferred a posteriori from secondary data. Recent work emphasizes the use of geometric information latent in a system's navigational record. Resolving this information from its noisy background, however, requires an unambiguous criterion of selection. In this paper we use a previously published statistical method for resolving a Web-based information system's geometry from navigational data. However, because of the method's (theoretical) sensitivity to data selection, a weighted frequency correction based on empirical probability distributions is applied. The effect of this correction on the Web-space geometry is investigated. Results indicate that the inferred geometry is robust; i.e., it does not significantly change under this probabilistic correction.

Similar documents (author)

  1. MacLeod, I.A.: Text retrieval and the relational model (1991) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:macleod in 1111) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 1111, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=1111)
    
  2. Macleod, I.A.: Extending the command language interface to handle marked-up documents (1990) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:macleod in 4896) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 4896, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=4896)
    
  3. MacLeod, I.A.: Storage and retrieval of structured documents (1990) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:macleod in 3530) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 3530, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=3530)
    
  4. MacLeod, D.: ¬The Internet guide for the legal researcher : a how-to guide to locating and retrieving free and fee-based information on the Internet (1995) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:macleod in 4240) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 4240, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=4240)
    
  5. MacLeod, D.: ¬The Internet, LEXIS, and WESTLAW : a comparison of resources for the legal researcher (1996) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:macleod in 4720) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 4720, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=4720)
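
The author scores above all follow the same arithmetic: fieldWeight = tf * idf * fieldNorm. The following is a minimal Python sketch of that calculation, not the search engine's own code. The function names are hypothetical, and the idf formula (1 + ln(maxDocs / (docFreq + 1))) is an assumption inferred from the values shown in the listing; tf is taken as the square root of the term frequency, which agrees with the tf lines in the listings on this page.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed idf form; it reproduces idf(docFreq=15, maxDocs=44218) = 8.924298.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    # tf is the square root of the raw term frequency (tf(freq=1.0) = 1.0).
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the listing for the term "macleod" in author_txt.
idf_value = classic_idf(doc_freq=15, max_docs=44218)
score = field_weight(freq=1.0, doc_freq=15, max_docs=44218, field_norm=0.625)
print(f"idf={idf_value:.6f} score={score:.6f}")
```

Because every hit matches the single term "macleod" once and has the same fieldNorm, all five author matches receive the same score, about 5.5777. Small differences in the last digits are expected, since the listing was computed in single precision.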
    

Similar documents (content)

  1. Cornelius, I.: Quantum leaps in information retrieval (2009) 0.10
    0.10441045 = sum of:
      0.10441045 = product of:
        0.6525653 = sum of:
          0.014076117 = weight(abstract_txt:from in 2963) [ClassicSimilarity], result of:
            0.014076117 = score(doc=2963,freq=2.0), product of:
              0.03841289 = queryWeight, product of:
                1.1335347 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.012260907 = queryNorm
              0.36644253 = fieldWeight in 2963, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.09375 = fieldNorm(doc=2963)
          0.008831394 = weight(abstract_txt:this in 2963) [ClassicSimilarity], result of:
            0.008831394 = score(doc=2963,freq=1.0), product of:
              0.039038893 = queryWeight, product of:
                1.3195152 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.012260907 = queryNorm
              0.2262204 = fieldWeight in 2963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.09375 = fieldNorm(doc=2963)
          0.015447704 = weight(abstract_txt:information in 2963) [ClassicSimilarity], result of:
            0.015447704 = score(doc=2963,freq=3.0), product of:
              0.039295867 = queryWeight, product of:
                1.323851 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.012260907 = queryNorm
              0.3931127 = fieldWeight in 2963, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=2963)
          0.61421007 = weight(abstract_txt:geometry in 2963) [ClassicSimilarity], result of:
            0.61421007 = score(doc=2963,freq=1.0), product of:
              0.7112014 = queryWeight, product of:
                6.2967577 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.012260907 = queryNorm
              0.8636232 = fieldWeight in 2963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.09375 = fieldNorm(doc=2963)
        0.16 = coord(4/25)
    
  2. Ledesma, L.D.: ¬A computational approach to George Boole's discovery of mathematical logic (1997) 0.09
    0.088048436 = sum of:
      0.088048436 = product of:
        0.733737 = sum of:
          0.008831394 = weight(abstract_txt:this in 463) [ClassicSimilarity], result of:
            0.008831394 = score(doc=463,freq=1.0), product of:
              0.039038893 = queryWeight, product of:
                1.3195152 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.012260907 = queryNorm
              0.2262204 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.09375 = fieldNorm(doc=463)
          0.11069553 = weight(abstract_txt:system's in 463) [ClassicSimilarity], result of:
            0.11069553 = score(doc=463,freq=1.0), product of:
              0.16719428 = queryWeight, product of:
                1.930907 = boost
                7.062158 = idf(docFreq=102, maxDocs=44218)
                0.012260907 = queryNorm
              0.6620773 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.062158 = idf(docFreq=102, maxDocs=44218)
                0.09375 = fieldNorm(doc=463)
          0.61421007 = weight(abstract_txt:geometry in 463) [ClassicSimilarity], result of:
            0.61421007 = score(doc=463,freq=1.0), product of:
              0.7112014 = queryWeight, product of:
                6.2967577 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.012260907 = queryNorm
              0.8636232 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.09375 = fieldNorm(doc=463)
        0.12 = coord(3/25)
    
  3. Tononi, G.: Integrated information theory of consciousness : an updated account (2012) 0.08
    0.07855188 = sum of:
      0.07855188 = product of:
        0.39275938 = sum of:
          0.008619826 = weight(abstract_txt:from in 534) [ClassicSimilarity], result of:
            0.008619826 = score(doc=534,freq=3.0), product of:
              0.03841289 = queryWeight, product of:
                1.1335347 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.012260907 = queryNorm
              0.2243993 = fieldWeight in 534, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.046875 = fieldNorm(doc=534)
          0.004415697 = weight(abstract_txt:this in 534) [ClassicSimilarity], result of:
            0.004415697 = score(doc=534,freq=1.0), product of:
              0.039038893 = queryWeight, product of:
                1.3195152 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.012260907 = queryNorm
              0.1131102 = fieldWeight in 534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.046875 = fieldNorm(doc=534)
          0.01727106 = weight(abstract_txt:information in 534) [ClassicSimilarity], result of:
            0.01727106 = score(doc=534,freq=15.0), product of:
              0.039295867 = queryWeight, product of:
                1.323851 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.012260907 = queryNorm
              0.4395134 = fieldWeight in 534, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.046875 = fieldNorm(doc=534)
          0.055347767 = weight(abstract_txt:system's in 534) [ClassicSimilarity], result of:
            0.055347767 = score(doc=534,freq=1.0), product of:
              0.16719428 = queryWeight, product of:
                1.930907 = boost
                7.062158 = idf(docFreq=102, maxDocs=44218)
                0.012260907 = queryNorm
              0.33103865 = fieldWeight in 534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.062158 = idf(docFreq=102, maxDocs=44218)
                0.046875 = fieldNorm(doc=534)
          0.30710503 = weight(abstract_txt:geometry in 534) [ClassicSimilarity], result of:
            0.30710503 = score(doc=534,freq=1.0), product of:
              0.7112014 = queryWeight, product of:
                6.2967577 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.012260907 = queryNorm
              0.4318116 = fieldWeight in 534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.046875 = fieldNorm(doc=534)
        0.2 = coord(5/25)
    
  4. Bakar, Z.A.; Sembok, T.M.T.; Yusoff, M.: ¬An evaluation of retrieval effectiveness using spelling-correction and string-similarity matching methods on Malay texts (2000) 0.06
    0.063719 = sum of:
      0.063719 = product of:
        0.39824373 = sum of:
          0.011730098 = weight(abstract_txt:from in 4804) [ClassicSimilarity], result of:
            0.011730098 = score(doc=4804,freq=2.0), product of:
              0.03841289 = queryWeight, product of:
                1.1335347 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.012260907 = queryNorm
              0.30536878 = fieldWeight in 4804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.0073594945 = weight(abstract_txt:this in 4804) [ClassicSimilarity], result of:
            0.0073594945 = score(doc=4804,freq=1.0), product of:
              0.039038893 = queryWeight, product of:
                1.3195152 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.012260907 = queryNorm
              0.18851699 = fieldWeight in 4804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.088326946 = weight(abstract_txt:weighted in 4804) [ClassicSimilarity], result of:
            0.088326946 = score(doc=4804,freq=1.0), product of:
              0.16242428 = queryWeight, product of:
                1.9031637 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.012260907 = queryNorm
              0.5438038 = fieldWeight in 4804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.29082718 = weight(abstract_txt:correction in 4804) [ClassicSimilarity], result of:
            0.29082718 = score(doc=4804,freq=2.0), product of:
              0.32661232 = queryWeight, product of:
                3.3053129 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.012260907 = queryNorm
              0.89043546 = fieldWeight in 4804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
        0.16 = coord(4/25)
    
  5. Ottaviani, J.S.: ¬The fractal nature of relevance : a hypothesis (1994) 0.06
    0.06329947 = sum of:
      0.06329947 = product of:
        0.5274956 = sum of:
          0.0082944315 = weight(abstract_txt:from in 7154) [ClassicSimilarity], result of:
            0.0082944315 = score(doc=7154,freq=1.0), product of:
              0.03841289 = queryWeight, product of:
                1.1335347 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.012260907 = queryNorm
              0.21592833 = fieldWeight in 7154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=7154)
          0.0073594945 = weight(abstract_txt:this in 7154) [ClassicSimilarity], result of:
            0.0073594945 = score(doc=7154,freq=1.0), product of:
              0.039038893 = queryWeight, product of:
                1.3195152 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.012260907 = queryNorm
              0.18851699 = fieldWeight in 7154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=7154)
          0.5118417 = weight(abstract_txt:geometry in 7154) [ClassicSimilarity], result of:
            0.5118417 = score(doc=7154,freq=1.0), product of:
              0.7112014 = queryWeight, product of:
                6.2967577 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.012260907 = queryNorm
              0.71968603 = fieldWeight in 7154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.078125 = fieldNorm(doc=7154)
        0.12 = coord(3/25)
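
The content scores above combine several matched terms: each term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is multiplied by coord (matched terms divided by the number of query terms). The following is a minimal Python sketch that recomputes the first hit (doc=2963) from the numbers in its listing. The names are hypothetical, the boost and queryNorm values are simply copied from the listing rather than derived, and the idf formula is an assumption inferred from the printed values.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.012260907   # copied from the listing
FIELD_NORM = 0.09375       # fieldNorm(doc=2963)
QUERY_TERMS = 25           # denominator of coord(4/25)

# (term, boost, termFreq in doc 2963, docFreq), copied from the listing.
MATCHED = [
    ("from", 1.1335347, 2.0, 7577),
    ("this", 1.3195152, 1.0, 10762),
    ("information", 1.323851, 3.0, 10677),
    ("geometry", 6.2967577, 1.0, 11),
]


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed idf form: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = sqrt(termFreq) * idf * fieldNorm
    field_weight = math.sqrt(freq) * idf(doc_freq) * FIELD_NORM
    return query_weight * field_weight


raw_sum = sum(term_score(b, f, df) for _, b, f, df in MATCHED)
coord = len(MATCHED) / QUERY_TERMS
total = raw_sum * coord
print(f"sum={raw_sum:.5f} coord={coord:.2f} total={total:.5f}")
```

The sketch shows why the rare term "geometry" (docFreq=11) dominates each content score: its idf enters both queryWeight and fieldWeight, so it outweighs common terms such as "this" and "from" by well over an order of magnitude.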