Document (#21501)

Author
Gonnet, G.H.
Snider, T.
Baeza-Yates, R.A.
Title
New indices for text : PAT trees and PAT arrays
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes u. R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
S.66-82
Abstract
We survey new indices for text, with emphasis on PAT arrays (also called suffic arrays). A PAT array is an index based on a new model of text that does not use the concept of word and does not need to know the structure of text
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Baeza-Yates, R.A.: Introduction to data structures and algorithms related to information retrieval (1992) 6.32
    6.316001 = sum of:
      6.316001 = sum of:
        3.0247912 = weight(author_txt:yates in 3082) [ClassicSimilarity], result of:
          3.0247912 = score(doc=3082,freq=1.0), product of:
            0.68694395 = queryWeight, product of:
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.078004055 = queryNorm
            4.403258 = fieldWeight in 3082, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.5 = fieldNorm(doc=3082)
        3.2912095 = weight(author_txt:baeza in 3082) [ClassicSimilarity], result of:
          3.2912095 = score(doc=3082,freq=1.0), product of:
            0.7267104 = queryWeight, product of:
              1.0285373 = boost
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.078004055 = queryNorm
            4.528915 = fieldWeight in 3082, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.5 = fieldNorm(doc=3082)
    
  2. Baeza-Yates, R.A.: String searching algorithms (1992) 6.32
    6.316001 = sum of:
      6.316001 = sum of:
        3.0247912 = weight(author_txt:yates in 3505) [ClassicSimilarity], result of:
          3.0247912 = score(doc=3505,freq=1.0), product of:
            0.68694395 = queryWeight, product of:
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.078004055 = queryNorm
            4.403258 = fieldWeight in 3505, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.5 = fieldNorm(doc=3505)
        3.2912095 = weight(author_txt:baeza in 3505) [ClassicSimilarity], result of:
          3.2912095 = score(doc=3505,freq=1.0), product of:
            0.7267104 = queryWeight, product of:
              1.0285373 = boost
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.078004055 = queryNorm
            4.528915 = fieldWeight in 3505, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.5 = fieldNorm(doc=3505)
    
  3. Baeza-Yates, R.; Navarro, G.: Block addressing indices for approximate text retrieval (2000) 5.53
    5.5265007 = sum of:
      5.5265007 = sum of:
        2.6466925 = weight(author_txt:yates in 4295) [ClassicSimilarity], result of:
          2.6466925 = score(doc=4295,freq=1.0), product of:
            0.68694395 = queryWeight, product of:
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.078004055 = queryNorm
            3.8528507 = fieldWeight in 4295, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.4375 = fieldNorm(doc=4295)
        2.8798082 = weight(author_txt:baeza in 4295) [ClassicSimilarity], result of:
          2.8798082 = score(doc=4295,freq=1.0), product of:
            0.7267104 = queryWeight, product of:
              1.0285373 = boost
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.078004055 = queryNorm
            3.9628005 = fieldWeight in 4295, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.4375 = fieldNorm(doc=4295)
    
  4. Baeza-Yates, R.; Navarro, G.: XQL and proximal nodes (2002) 5.53
    5.5265007 = sum of:
      5.5265007 = sum of:
        2.6466925 = weight(author_txt:yates in 454) [ClassicSimilarity], result of:
          2.6466925 = score(doc=454,freq=1.0), product of:
            0.68694395 = queryWeight, product of:
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.078004055 = queryNorm
            3.8528507 = fieldWeight in 454, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.4375 = fieldNorm(doc=454)
        2.8798082 = weight(author_txt:baeza in 454) [ClassicSimilarity], result of:
          2.8798082 = score(doc=454,freq=1.0), product of:
            0.7267104 = queryWeight, product of:
              1.0285373 = boost
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.078004055 = queryNorm
            3.9628005 = fieldWeight in 454, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.4375 = fieldNorm(doc=454)
    
  5. Castillo, C.; Baeza-Yates, R.: Web retrieval and mining (2009) 5.53
    5.5265007 = sum of:
      5.5265007 = sum of:
        2.6466925 = weight(author_txt:yates in 3904) [ClassicSimilarity], result of:
          2.6466925 = score(doc=3904,freq=1.0), product of:
            0.68694395 = queryWeight, product of:
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.078004055 = queryNorm
            3.8528507 = fieldWeight in 3904, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.4375 = fieldNorm(doc=3904)
        2.8798082 = weight(author_txt:baeza in 3904) [ClassicSimilarity], result of:
          2.8798082 = score(doc=3904,freq=1.0), product of:
            0.7267104 = queryWeight, product of:
              1.0285373 = boost
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.078004055 = queryNorm
            3.9628005 = fieldWeight in 3904, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.05783 = idf(docFreq=13, maxDocs=44218)
              0.4375 = fieldNorm(doc=3904)
    

Similar documents (content)

  1. Will, L.: ¬The ISO 25964 data model for the structure of an information retrieval thesaurus (2012) 0.23
    0.2274935 = sum of:
      0.2274935 = product of:
        0.7583116 = sum of:
          0.004164469 = weight(abstract_txt:that in 862) [ClassicSimilarity], result of:
            0.004164469 = score(doc=862,freq=1.0), product of:
              0.018747192 = queryWeight, product of:
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.00791196 = queryNorm
              0.22213829 = fieldWeight in 862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=862)
          0.012285889 = weight(abstract_txt:also in 862) [ClassicSimilarity], result of:
            0.012285889 = score(doc=862,freq=1.0), product of:
              0.038562708 = queryWeight, product of:
                1.4342196 = boost
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.00791196 = queryNorm
              0.31859508 = fieldWeight in 862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.09375 = fieldNorm(doc=862)
          0.019828577 = weight(abstract_txt:model in 862) [ClassicSimilarity], result of:
            0.019828577 = score(doc=862,freq=1.0), product of:
              0.053058807 = queryWeight, product of:
                1.6823279 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.00791196 = queryNorm
              0.37370944 = fieldWeight in 862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.09375 = fieldNorm(doc=862)
          0.025909835 = weight(abstract_txt:structure in 862) [ClassicSimilarity], result of:
            0.025909835 = score(doc=862,freq=1.0), product of:
              0.06341708 = queryWeight, product of:
                1.8392256 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.00791196 = queryNorm
              0.40856242 = fieldWeight in 862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.09375 = fieldNorm(doc=862)
          0.028629908 = weight(abstract_txt:concept in 862) [ClassicSimilarity], result of:
            0.028629908 = score(doc=862,freq=1.0), product of:
              0.06778128 = queryWeight, product of:
                1.9014581 = boost
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.00791196 = queryNorm
              0.42238668 = fieldWeight in 862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.09375 = fieldNorm(doc=862)
          0.667493 = weight(abstract_txt:arrays in 862) [ClassicSimilarity], result of:
            0.667493 = score(doc=862,freq=1.0), product of:
              0.7978134 = queryWeight, product of:
                11.299083 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.00791196 = queryNorm
              0.836653 = fieldWeight in 862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.09375 = fieldNorm(doc=862)
        0.3 = coord(6/20)
    
  2. Harman, D.; Fox, E.; Baeza-Yates, R.; Lee, W.: Inverted files (1992) 0.19
    0.1910511 = sum of:
      0.1910511 = product of:
        0.95525545 = sum of:
          0.007852598 = weight(abstract_txt:that in 3497) [ClassicSimilarity], result of:
            0.007852598 = score(doc=3497,freq=2.0), product of:
              0.018747192 = queryWeight, product of:
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.00791196 = queryNorm
              0.41886798 = fieldWeight in 3497, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.125 = fieldNorm(doc=3497)
          0.006519631 = weight(abstract_txt:with in 3497) [ClassicSimilarity], result of:
            0.006519631 = score(doc=3497,freq=1.0), product of:
              0.020865044 = queryWeight, product of:
                1.0549735 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.00791196 = queryNorm
              0.31246668 = fieldWeight in 3497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.125 = fieldNorm(doc=3497)
          0.050892584 = weight(abstract_txt:survey in 3497) [ClassicSimilarity], result of:
            0.050892584 = score(doc=3497,freq=1.0), product of:
              0.08210576 = queryWeight, product of:
                2.0927565 = boost
                4.9587345 = idf(docFreq=843, maxDocs=44218)
                0.00791196 = queryNorm
              0.6198418 = fieldWeight in 3497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9587345 = idf(docFreq=843, maxDocs=44218)
                0.125 = fieldNorm(doc=3497)
          0.8899906 = weight(abstract_txt:arrays in 3497) [ClassicSimilarity], result of:
            0.8899906 = score(doc=3497,freq=1.0), product of:
              0.7978134 = queryWeight, product of:
                11.299083 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.00791196 = queryNorm
              1.1155373 = fieldWeight in 3497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.125 = fieldNorm(doc=3497)
        0.2 = coord(4/20)
    
  3. Hartman, J.H.; Proebsting, T.A.; Sundaram, R.: Index-based hyperlinks (1997) 0.18
    0.17735195 = sum of:
      0.17735195 = product of:
        0.7094078 = sum of:
          0.004858548 = weight(abstract_txt:that in 2723) [ClassicSimilarity], result of:
            0.004858548 = score(doc=2723,freq=1.0), product of:
              0.018747192 = queryWeight, product of:
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.00791196 = queryNorm
              0.25916135 = fieldWeight in 2723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.109375 = fieldNorm(doc=2723)
          0.011832469 = weight(abstract_txt:based in 2723) [ClassicSimilarity], result of:
            0.011832469 = score(doc=2723,freq=1.0), product of:
              0.033935077 = queryWeight, product of:
                1.3454151 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.00791196 = queryNorm
              0.3486796 = fieldWeight in 2723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.109375 = fieldNorm(doc=2723)
          0.014333538 = weight(abstract_txt:also in 2723) [ClassicSimilarity], result of:
            0.014333538 = score(doc=2723,freq=1.0), product of:
              0.038562708 = queryWeight, product of:
                1.4342196 = boost
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.00791196 = queryNorm
              0.37169427 = fieldWeight in 2723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.109375 = fieldNorm(doc=2723)
          0.039114945 = weight(abstract_txt:index in 2723) [ClassicSimilarity], result of:
            0.039114945 = score(doc=2723,freq=1.0), product of:
              0.075305566 = queryWeight, product of:
                2.0042202 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.00791196 = queryNorm
              0.5194164 = fieldWeight in 2723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.109375 = fieldNorm(doc=2723)
          0.63926834 = weight(abstract_txt:indices in 2723) [ClassicSimilarity], result of:
            0.63926834 = score(doc=2723,freq=5.0), product of:
              0.35733378 = queryWeight, product of:
                6.174246 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.00791196 = queryNorm
              1.788995 = fieldWeight in 2723, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.109375 = fieldNorm(doc=2723)
        0.25 = coord(5/20)
    
  4. Ibáñez, A.; Armañanzas, R.; Bielza, C.; Larrañaga, P.: Genetic algorithms and Gaussian Bayesian networks to uncover the predictive core set of bibliometric indices (2016) 0.15
    0.15054466 = sum of:
      0.15054466 = product of:
        0.5018155 = sum of:
          0.006208024 = weight(abstract_txt:that in 3041) [ClassicSimilarity], result of:
            0.006208024 = score(doc=3041,freq=5.0), product of:
              0.018747192 = queryWeight, product of:
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.00791196 = queryNorm
              0.3311442 = fieldWeight in 3041, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3041)
          0.0032598155 = weight(abstract_txt:with in 3041) [ClassicSimilarity], result of:
            0.0032598155 = score(doc=3041,freq=1.0), product of:
              0.020865044 = queryWeight, product of:
                1.0549735 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.00791196 = queryNorm
              0.15623334 = fieldWeight in 3041, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=3041)
          0.008190593 = weight(abstract_txt:also in 3041) [ClassicSimilarity], result of:
            0.008190593 = score(doc=3041,freq=1.0), product of:
              0.038562708 = queryWeight, product of:
                1.4342196 = boost
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.00791196 = queryNorm
              0.21239673 = fieldWeight in 3041, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.0625 = fieldNorm(doc=3041)
          0.013219051 = weight(abstract_txt:model in 3041) [ClassicSimilarity], result of:
            0.013219051 = score(doc=3041,freq=1.0), product of:
              0.053058807 = queryWeight, product of:
                1.6823279 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.00791196 = queryNorm
              0.24913962 = fieldWeight in 3041, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=3041)
          0.038713757 = weight(abstract_txt:index in 3041) [ClassicSimilarity], result of:
            0.038713757 = score(doc=3041,freq=3.0), product of:
              0.075305566 = queryWeight, product of:
                2.0042202 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.00791196 = queryNorm
              0.5140889 = fieldWeight in 3041, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0625 = fieldNorm(doc=3041)
          0.43222427 = weight(abstract_txt:indices in 3041) [ClassicSimilarity], result of:
            0.43222427 = score(doc=3041,freq=7.0), product of:
              0.35733378 = queryWeight, product of:
                6.174246 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.00791196 = queryNorm
              1.2095814 = fieldWeight in 3041, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.0625 = fieldNorm(doc=3041)
        0.3 = coord(6/20)
    
  5. Rousseau, R.; Jin, B.: ¬The age-dependent h-type AR**2-index : basic properties and a case study (2008) 0.15
    0.14678976 = sum of:
      0.14678976 = product of:
        0.48929918 = sum of:
          0.004907874 = weight(abstract_txt:that in 2638) [ClassicSimilarity], result of:
            0.004907874 = score(doc=2638,freq=2.0), product of:
              0.018747192 = queryWeight, product of:
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.00791196 = queryNorm
              0.26179248 = fieldWeight in 2638, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2638)
          0.004074769 = weight(abstract_txt:with in 2638) [ClassicSimilarity], result of:
            0.004074769 = score(doc=2638,freq=1.0), product of:
              0.020865044 = queryWeight, product of:
                1.0549735 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.00791196 = queryNorm
              0.19529167 = fieldWeight in 2638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=2638)
          0.06247406 = weight(abstract_txt:index in 2638) [ClassicSimilarity], result of:
            0.06247406 = score(doc=2638,freq=5.0), product of:
              0.075305566 = queryWeight, product of:
                2.0042202 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.00791196 = queryNorm
              0.8296075 = fieldWeight in 2638, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.078125 = fieldNorm(doc=2638)
          0.054974865 = weight(abstract_txt:called in 2638) [ClassicSimilarity], result of:
            0.054974865 = score(doc=2638,freq=2.0), product of:
              0.093853414 = queryWeight, product of:
                2.2374685 = boost
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.00791196 = queryNorm
              0.5857524 = fieldWeight in 2638, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.078125 = fieldNorm(doc=2638)
          0.0740756 = weight(abstract_txt:does in 2638) [ClassicSimilarity], result of:
            0.0740756 = score(doc=2638,freq=1.0), product of:
              0.18175125 = queryWeight, product of:
                4.403374 = boost
                5.2168427 = idf(docFreq=651, maxDocs=44218)
                0.00791196 = queryNorm
              0.40756583 = fieldWeight in 2638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2168427 = idf(docFreq=651, maxDocs=44218)
                0.078125 = fieldNorm(doc=2638)
          0.288792 = weight(abstract_txt:indices in 2638) [ClassicSimilarity], result of:
            0.288792 = score(doc=2638,freq=2.0), product of:
              0.35733378 = queryWeight, product of:
                6.174246 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.00791196 = queryNorm
              0.8081856 = fieldWeight in 2638, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.078125 = fieldNorm(doc=2638)
        0.3 = coord(6/20)