Document (#21500)

Author
Faloutsos, C.
Title
Signature files
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes and R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
pp. 44-65
Abstract
Presents a survey and discussion of signature-based text retrieval methods. Describes the main idea behind the signature approach and its advantages over other text retrieval methods; classifies the signature methods that have appeared in the literature; describes the main representatives of each class, together with their relative advantages and drawbacks; and lists applications as well as commercial or university prototypes that use the signature approach.
Theme
Retrieval algorithms

Similar documents (content)

  1. Lam, W.; Wong, K.-F.; Wong, C.-Y.: Chinese document indexing based on new partitioned signature file : model and evaluation (2001) 0.39
    0.3895283 = sum of:
      0.3895283 = product of:
        1.3911725 = sum of:
          0.00404313 = weight(abstract_txt:that in 303) [ClassicSimilarity], result of:
            0.00404313 = score(doc=303,freq=1.0), product of:
              0.02730144 = queryWeight, product of:
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.011522147 = queryNorm
              0.1480922 = fieldWeight in 303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.0402306 = weight(abstract_txt:files in 303) [ClassicSimilarity], result of:
            0.0402306 = score(doc=303,freq=2.0), product of:
              0.07956549 = queryWeight, product of:
                1.2071315 = boost
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.011522147 = queryNorm
              0.50562876 = fieldWeight in 303, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.022092178 = weight(abstract_txt:retrieval in 303) [ClassicSimilarity], result of:
            0.022092178 = score(doc=303,freq=3.0), product of:
              0.05872536 = queryWeight, product of:
                1.4666283 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011522147 = queryNorm
              0.37619486 = fieldWeight in 303, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.015967276 = weight(abstract_txt:approach in 303) [ClassicSimilarity], result of:
            0.015967276 = score(doc=303,freq=1.0), product of:
              0.068212025 = queryWeight, product of:
                1.5806572 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.011522147 = queryNorm
              0.234083 = fieldWeight in 303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.02009795 = weight(abstract_txt:text in 303) [ClassicSimilarity], result of:
            0.02009795 = score(doc=303,freq=1.0), product of:
              0.079519734 = queryWeight, product of:
                1.7066509 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.011522147 = queryNorm
              0.25274166 = fieldWeight in 303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.045971822 = weight(abstract_txt:methods in 303) [ClassicSimilarity], result of:
            0.045971822 = score(doc=303,freq=2.0), product of:
              0.1254263 = queryWeight, product of:
                2.625108 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.011522147 = queryNorm
              0.36652455 = fieldWeight in 303, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          1.2427696 = weight(abstract_txt:signature in 303) [ClassicSimilarity], result of:
            1.2427696 = score(doc=303,freq=7.0), product of:
              0.8822293 = queryWeight, product of:
                8.988102 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.011522147 = queryNorm
              1.4086696 = fieldWeight in 303, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
        0.28 = coord(7/25)
    
  2. Kelledy, F.; Smeaton, A.F.: Signature files and beyond (1996) 0.36
    0.3569525 = sum of:
      0.3569525 = product of:
        1.4873021 = sum of:
          0.0071473117 = weight(abstract_txt:that in 6973) [ClassicSimilarity], result of:
            0.0071473117 = score(doc=6973,freq=2.0), product of:
              0.02730144 = queryWeight, product of:
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.011522147 = queryNorm
              0.26179248 = fieldWeight in 6973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=6973)
          0.07111832 = weight(abstract_txt:files in 6973) [ClassicSimilarity], result of:
            0.07111832 = score(doc=6973,freq=4.0), product of:
              0.07956549 = queryWeight, product of:
                1.2071315 = boost
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.011522147 = queryNorm
              0.89383376 = fieldWeight in 6973, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.078125 = fieldNorm(doc=6973)
          0.019959092 = weight(abstract_txt:approach in 6973) [ClassicSimilarity], result of:
            0.019959092 = score(doc=6973,freq=1.0), product of:
              0.068212025 = queryWeight, product of:
                1.5806572 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.011522147 = queryNorm
              0.29260373 = fieldWeight in 6973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=6973)
          0.035528492 = weight(abstract_txt:text in 6973) [ClassicSimilarity], result of:
            0.035528492 = score(doc=6973,freq=2.0), product of:
              0.079519734 = queryWeight, product of:
                1.7066509 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.011522147 = queryNorm
              0.44678837 = fieldWeight in 6973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=6973)
          0.04063373 = weight(abstract_txt:methods in 6973) [ClassicSimilarity], result of:
            0.04063373 = score(doc=6973,freq=1.0), product of:
              0.1254263 = queryWeight, product of:
                2.625108 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.011522147 = queryNorm
              0.32396498 = fieldWeight in 6973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.078125 = fieldNorm(doc=6973)
          1.3129151 = weight(abstract_txt:signature in 6973) [ClassicSimilarity], result of:
            1.3129151 = score(doc=6973,freq=5.0), product of:
              0.8822293 = queryWeight, product of:
                8.988102 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.011522147 = queryNorm
              1.488179 = fieldWeight in 6973, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.078125 = fieldNorm(doc=6973)
        0.24 = coord(6/25)
    
  3. Lee, D.L.; Ren, L.: Document ranking on weight-partitioned signature files (1996) 0.34
    0.3352864 = sum of:
      0.3352864 = product of:
        1.676432 = sum of:
          0.0060646953 = weight(abstract_txt:that in 2417) [ClassicSimilarity], result of:
            0.0060646953 = score(doc=2417,freq=1.0), product of:
              0.02730144 = queryWeight, product of:
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.011522147 = queryNorm
              0.22213829 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
          0.033065777 = weight(abstract_txt:together in 2417) [ClassicSimilarity], result of:
            0.033065777 = score(doc=2417,freq=1.0), product of:
              0.067125686 = queryWeight, product of:
                1.1087575 = boost
                5.254347 = idf(docFreq=627, maxDocs=44218)
                0.011522147 = queryNorm
              0.49259502 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.254347 = idf(docFreq=627, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
          0.042670995 = weight(abstract_txt:files in 2417) [ClassicSimilarity], result of:
            0.042670995 = score(doc=2417,freq=1.0), product of:
              0.07956549 = queryWeight, product of:
                1.2071315 = boost
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.011522147 = queryNorm
              0.5363003 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
          0.019132389 = weight(abstract_txt:retrieval in 2417) [ClassicSimilarity], result of:
            0.019132389 = score(doc=2417,freq=1.0), product of:
              0.05872536 = queryWeight, product of:
                1.4666283 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011522147 = queryNorm
              0.3257943 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
          1.5754981 = weight(abstract_txt:signature in 2417) [ClassicSimilarity], result of:
            1.5754981 = score(doc=2417,freq=5.0), product of:
              0.8822293 = queryWeight, product of:
                8.988102 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.011522147 = queryNorm
              1.7858148 = fieldWeight in 2417, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
        0.2 = coord(5/25)
    
  4. Robertson, A.M.; Willett, P.: Applications of n-grams in textual information systems (1998) 0.24
    0.23503086 = sum of:
      0.23503086 = product of:
        0.97929525 = sum of:
          0.0070754783 = weight(abstract_txt:that in 4715) [ClassicSimilarity], result of:
            0.0070754783 = score(doc=4715,freq=1.0), product of:
              0.02730144 = queryWeight, product of:
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.011522147 = queryNorm
              0.25916135 = fieldWeight in 4715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.109375 = fieldNorm(doc=4715)
          0.028361045 = weight(abstract_txt:applications in 4715) [ClassicSimilarity], result of:
            0.028361045 = score(doc=4715,freq=1.0), product of:
              0.05467891 = queryWeight, product of:
                1.000696 = boost
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.011522147 = queryNorm
              0.51868343 = fieldWeight in 4715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.109375 = fieldNorm(doc=4715)
          0.049782827 = weight(abstract_txt:files in 4715) [ClassicSimilarity], result of:
            0.049782827 = score(doc=4715,freq=1.0), product of:
              0.07956549 = queryWeight, product of:
                1.2071315 = boost
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.011522147 = queryNorm
              0.62568367 = fieldWeight in 4715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.109375 = fieldNorm(doc=4715)
          0.02232112 = weight(abstract_txt:retrieval in 4715) [ClassicSimilarity], result of:
            0.02232112 = score(doc=4715,freq=1.0), product of:
              0.05872536 = queryWeight, product of:
                1.4666283 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011522147 = queryNorm
              0.38009337 = fieldWeight in 4715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=4715)
          0.04973989 = weight(abstract_txt:text in 4715) [ClassicSimilarity], result of:
            0.04973989 = score(doc=4715,freq=2.0), product of:
              0.079519734 = queryWeight, product of:
                1.7066509 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.011522147 = queryNorm
              0.6255037 = fieldWeight in 4715, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.109375 = fieldNorm(doc=4715)
          0.82201487 = weight(abstract_txt:signature in 4715) [ClassicSimilarity], result of:
            0.82201487 = score(doc=4715,freq=1.0), product of:
              0.8822293 = queryWeight, product of:
                8.988102 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.011522147 = queryNorm
              0.9317474 = fieldWeight in 4715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.109375 = fieldNorm(doc=4715)
        0.24 = coord(6/25)
    
  5. Carterette, B.; Can, F.: Comparing inverted files and signature files for searching a large lexicon (2005) 0.18
    0.1844521 = sum of:
      0.1844521 = product of:
        1.5371009 = sum of:
          0.061590273 = weight(abstract_txt:files in 1029) [ClassicSimilarity], result of:
            0.061590273 = score(doc=1029,freq=3.0), product of:
              0.07956549 = queryWeight, product of:
                1.2071315 = boost
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.011522147 = queryNorm
              0.7740828 = fieldWeight in 1029, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.078125 = fieldNorm(doc=1029)
          0.037284292 = weight(abstract_txt:main in 1029) [ClassicSimilarity], result of:
            0.037284292 = score(doc=1029,freq=1.0), product of:
              0.10346282 = queryWeight, product of:
                1.9467015 = boost
                4.612661 = idf(docFreq=1192, maxDocs=44218)
                0.011522147 = queryNorm
              0.36036414 = fieldWeight in 1029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.612661 = idf(docFreq=1192, maxDocs=44218)
                0.078125 = fieldNorm(doc=1029)
          1.4382263 = weight(abstract_txt:signature in 1029) [ClassicSimilarity], result of:
            1.4382263 = score(doc=1029,freq=6.0), product of:
              0.8822293 = queryWeight, product of:
                8.988102 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.011522147 = queryNorm
              1.6302183 = fieldWeight in 1029, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.078125 = fieldNorm(doc=1029)
        0.12 = coord(3/25)
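
The score breakdowns above all follow the same pattern: each matching term contributes queryWeight times fieldWeight, the contributions are summed, and the sum is scaled by the coord factor (matching terms out of 25 query terms). The following is a minimal sketch that recomputes the score of the first hit (doc 303) from the numbers printed in its tree. The function and variable names are illustrative assumptions, not Lucene API calls; the idf formula 1 + ln(maxDocs / (docFreq + 1)) is the one that reproduces the idf values shown, and a term listed without a boost line ("that") is assumed to have boost 1.0.

```python
import math

# Constants copied from the explain output above.
MAX_DOCS = 44218
QUERY_NORM = 0.011522147
QUERY_TERMS = 25  # denominator of the coord factor


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as it appears in the trees above."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, boost=1.0):
    """One 'weight(...)' node: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


def document_score(terms, field_norm, query_terms=QUERY_TERMS):
    """Sum the per-term scores and scale by coord = matched / query terms."""
    total = sum(
        term_score(freq, doc_freq, field_norm, boost)
        for boost, doc_freq, freq in terms
    )
    coord = len(terms) / query_terms
    return coord * total


# (boost, docFreq, freq) for each matching term of doc 303, read off the tree.
DOC_303_TERMS = [
    (1.0, 11241, 1),       # that (no boost line shown; assumed 1.0)
    (1.2071315, 393, 2),   # files
    (1.4666283, 3720, 3),  # retrieval
    (1.5806572, 2839, 1),  # approach
    (1.7066509, 2106, 1),  # text
    (2.625108, 1900, 2),   # methods
    (8.988102, 23, 7),     # signature
]

score_303 = document_score(DOC_303_TERMS, field_norm=0.0625)
print(f"doc 303 score: {score_303:.7f}")
```

The printed value should agree with the 0.3895283 reported for doc 303 up to floating-point rounding, since the engine works in single precision. The sketch also makes the ranking easy to read: the rare, heavily boosted term "signature" (docFreq=23, boost 8.988102) accounts for almost the entire sum in every hit, while the coord factor penalizes documents that match fewer of the 25 query terms.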