Document (#21502)

Author
Gonnet, G.H.
Snider, T.
Baeza-Yates, R.A.
Title
New indices for text : PAT trees and PAT arrays
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes u. R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
S.66-82
Abstract
We survey new indices for text, with emphasis on PAT arrays (also called suffic arrays). A PAT array is an index based on a new model of text that does not use the concept of word and does not need to know the structure of text
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Baeza-Yates, R.A.: Introduction to data structures and algorithms related to information retrieval (1992) 6.29
    6.2919617 = sum of:
      6.2919617 = sum of:
        3.0127723 = weight(author_txt:yates in 4083) [ClassicSimilarity], result of:
          3.0127723 = score(doc=4083,freq=1.0), product of:
            0.6868659 = queryWeight, product of:
              8.772519 = idf(docFreq=17, maxDocs=42740)
              0.07829746 = queryNorm
            4.3862596 = fieldWeight in 4083, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.772519 = idf(docFreq=17, maxDocs=42740)
              0.5 = fieldNorm(doc=4083)
        3.2791896 = weight(author_txt:baeza in 4083) [ClassicSimilarity], result of:
          3.2791896 = score(doc=4083,freq=1.0), product of:
            0.72678417 = queryWeight, product of:
              1.0286479 = boost
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.07829746 = queryNorm
            4.5119166 = fieldWeight in 4083, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.5 = fieldNorm(doc=4083)
    
  2. Baeza-Yates, R.A.: String searching algorithms (1992) 6.29
    6.2919617 = sum of:
      6.2919617 = sum of:
        3.0127723 = weight(author_txt:yates in 4506) [ClassicSimilarity], result of:
          3.0127723 = score(doc=4506,freq=1.0), product of:
            0.6868659 = queryWeight, product of:
              8.772519 = idf(docFreq=17, maxDocs=42740)
              0.07829746 = queryNorm
            4.3862596 = fieldWeight in 4506, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.772519 = idf(docFreq=17, maxDocs=42740)
              0.5 = fieldNorm(doc=4506)
        3.2791896 = weight(author_txt:baeza in 4506) [ClassicSimilarity], result of:
          3.2791896 = score(doc=4506,freq=1.0), product of:
            0.72678417 = queryWeight, product of:
              1.0286479 = boost
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.07829746 = queryNorm
            4.5119166 = fieldWeight in 4506, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.5 = fieldNorm(doc=4506)
    
  3. Baeza-Yates, R.; Navarro, G.: Block addressing indices for approximate text retrieval (2000) 5.51
    5.5054665 = sum of:
      5.5054665 = sum of:
        2.6361756 = weight(author_txt:yates in 5296) [ClassicSimilarity], result of:
          2.6361756 = score(doc=5296,freq=1.0), product of:
            0.6868659 = queryWeight, product of:
              8.772519 = idf(docFreq=17, maxDocs=42740)
              0.07829746 = queryNorm
            3.8379772 = fieldWeight in 5296, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.772519 = idf(docFreq=17, maxDocs=42740)
              0.4375 = fieldNorm(doc=5296)
        2.8692908 = weight(author_txt:baeza in 5296) [ClassicSimilarity], result of:
          2.8692908 = score(doc=5296,freq=1.0), product of:
            0.72678417 = queryWeight, product of:
              1.0286479 = boost
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.07829746 = queryNorm
            3.947927 = fieldWeight in 5296, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.4375 = fieldNorm(doc=5296)
    
  4. Baeza-Yates, R.; Navarro, G.: XQL and proximal nodes (2002) 5.51
    5.5054665 = sum of:
      5.5054665 = sum of:
        2.6361756 = weight(author_txt:yates in 1455) [ClassicSimilarity], result of:
          2.6361756 = score(doc=1455,freq=1.0), product of:
            0.6868659 = queryWeight, product of:
              8.772519 = idf(docFreq=17, maxDocs=42740)
              0.07829746 = queryNorm
            3.8379772 = fieldWeight in 1455, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.772519 = idf(docFreq=17, maxDocs=42740)
              0.4375 = fieldNorm(doc=1455)
        2.8692908 = weight(author_txt:baeza in 1455) [ClassicSimilarity], result of:
          2.8692908 = score(doc=1455,freq=1.0), product of:
            0.72678417 = queryWeight, product of:
              1.0286479 = boost
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.07829746 = queryNorm
            3.947927 = fieldWeight in 1455, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.4375 = fieldNorm(doc=1455)
    
  5. Castillo, C.; Baeza-Yates, R.: Web retrieval and mining (2009) 5.51
    5.5054665 = sum of:
      5.5054665 = sum of:
        2.6361756 = weight(author_txt:yates in 905) [ClassicSimilarity], result of:
          2.6361756 = score(doc=905,freq=1.0), product of:
            0.6868659 = queryWeight, product of:
              8.772519 = idf(docFreq=17, maxDocs=42740)
              0.07829746 = queryNorm
            3.8379772 = fieldWeight in 905, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.772519 = idf(docFreq=17, maxDocs=42740)
              0.4375 = fieldNorm(doc=905)
        2.8692908 = weight(author_txt:baeza in 905) [ClassicSimilarity], result of:
          2.8692908 = score(doc=905,freq=1.0), product of:
            0.72678417 = queryWeight, product of:
              1.0286479 = boost
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.07829746 = queryNorm
            3.947927 = fieldWeight in 905, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.023833 = idf(docFreq=13, maxDocs=42740)
              0.4375 = fieldNorm(doc=905)
    

Similar documents (content)

  1. Will, L.: ¬The ISO 25964 data model for the structure of an information retrieval thesaurus (2012) 0.23
    0.22648433 = sum of:
      0.22648433 = product of:
        0.7549477 = sum of:
          0.004311988 = weight(abstract_txt:that in 2863) [ClassicSimilarity], result of:
            0.004311988 = score(doc=2863,freq=1.0), product of:
              0.019207139 = queryWeight, product of:
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.008020826 = queryNorm
              0.22449924 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.09375 = fieldNorm(doc=2863)
          0.012688324 = weight(abstract_txt:also in 2863) [ClassicSimilarity], result of:
            0.012688324 = score(doc=2863,freq=1.0), product of:
              0.03944093 = queryWeight, product of:
                1.432987 = boost
                3.4315145 = idf(docFreq=3756, maxDocs=42740)
                0.008020826 = queryNorm
              0.32170448 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4315145 = idf(docFreq=3756, maxDocs=42740)
                0.09375 = fieldNorm(doc=2863)
          0.020434586 = weight(abstract_txt:model in 2863) [ClassicSimilarity], result of:
            0.020434586 = score(doc=2863,freq=1.0), product of:
              0.0541903 = queryWeight, product of:
                1.6796912 = boost
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.008020826 = queryNorm
              0.37708938 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.09375 = fieldNorm(doc=2863)
          0.026252175 = weight(abstract_txt:structure in 2863) [ClassicSimilarity], result of:
            0.026252175 = score(doc=2863,freq=1.0), product of:
              0.06404047 = queryWeight, product of:
                1.8259796 = boost
                4.3725977 = idf(docFreq=1465, maxDocs=42740)
                0.008020826 = queryNorm
              0.40993103 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3725977 = idf(docFreq=1465, maxDocs=42740)
                0.09375 = fieldNorm(doc=2863)
          0.029322205 = weight(abstract_txt:concept in 2863) [ClassicSimilarity], result of:
            0.029322205 = score(doc=2863,freq=1.0), product of:
              0.068940654 = queryWeight, product of:
                1.8945512 = boost
                4.5368032 = idf(docFreq=1243, maxDocs=42740)
                0.008020826 = queryNorm
              0.4253253 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5368032 = idf(docFreq=1243, maxDocs=42740)
                0.09375 = fieldNorm(doc=2863)
          0.6619384 = weight(abstract_txt:arrays in 2863) [ClassicSimilarity], result of:
            0.6619384 = score(doc=2863,freq=1.0), product of:
              0.7941998 = queryWeight, product of:
                11.137666 = boost
                8.890302 = idf(docFreq=15, maxDocs=42740)
                0.008020826 = queryNorm
              0.8334658 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.890302 = idf(docFreq=15, maxDocs=42740)
                0.09375 = fieldNorm(doc=2863)
        0.3 = coord(6/20)
    
  2. Harman, D.; Fox, E.; Baeza-Yates, R.; Lee, W.: Inverted files (1992) 0.19
    0.18993686 = sum of:
      0.18993686 = product of:
        0.94968426 = sum of:
          0.008130763 = weight(abstract_txt:that in 4498) [ClassicSimilarity], result of:
            0.008130763 = score(doc=4498,freq=2.0), product of:
              0.019207139 = queryWeight, product of:
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.008020826 = queryNorm
              0.42331982 = fieldWeight in 4498, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.125 = fieldNorm(doc=4498)
          0.0066812416 = weight(abstract_txt:with in 4498) [ClassicSimilarity], result of:
            0.0066812416 = score(doc=4498,freq=1.0), product of:
              0.021230323 = queryWeight, product of:
                1.0513492 = boost
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.008020826 = queryNorm
              0.31470278 = fieldWeight in 4498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.125 = fieldNorm(doc=4498)
          0.05228772 = weight(abstract_txt:survey in 4498) [ClassicSimilarity], result of:
            0.05228772 = score(doc=4498,freq=1.0), product of:
              0.08368577 = queryWeight, product of:
                2.087346 = boost
                4.9984813 = idf(docFreq=783, maxDocs=42740)
                0.008020826 = queryNorm
              0.62481016 = fieldWeight in 4498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9984813 = idf(docFreq=783, maxDocs=42740)
                0.125 = fieldNorm(doc=4498)
          0.8825845 = weight(abstract_txt:arrays in 4498) [ClassicSimilarity], result of:
            0.8825845 = score(doc=4498,freq=1.0), product of:
              0.7941998 = queryWeight, product of:
                11.137666 = boost
                8.890302 = idf(docFreq=15, maxDocs=42740)
                0.008020826 = queryNorm
              1.1112877 = fieldWeight in 4498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.890302 = idf(docFreq=15, maxDocs=42740)
                0.125 = fieldNorm(doc=4498)
        0.2 = coord(4/20)
    
  3. Hartman, J.H.; Proebsting, T.A.; Sundaram, R.: Index-based hyperlinks (1997) 0.18
    0.17829958 = sum of:
      0.17829958 = product of:
        0.7131983 = sum of:
          0.0050306525 = weight(abstract_txt:that in 3724) [ClassicSimilarity], result of:
            0.0050306525 = score(doc=3724,freq=1.0), product of:
              0.019207139 = queryWeight, product of:
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.008020826 = queryNorm
              0.26191577 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.109375 = fieldNorm(doc=3724)
          0.01210436 = weight(abstract_txt:based in 3724) [ClassicSimilarity], result of:
            0.01210436 = score(doc=3724,freq=1.0), product of:
              0.034488503 = queryWeight, product of:
                1.3400033 = boost
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.008020826 = queryNorm
              0.35096797 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.109375 = fieldNorm(doc=3724)
          0.0148030445 = weight(abstract_txt:also in 3724) [ClassicSimilarity], result of:
            0.0148030445 = score(doc=3724,freq=1.0), product of:
              0.03944093 = queryWeight, product of:
                1.432987 = boost
                3.4315145 = idf(docFreq=3756, maxDocs=42740)
                0.008020826 = queryNorm
              0.3753219 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4315145 = idf(docFreq=3756, maxDocs=42740)
                0.109375 = fieldNorm(doc=3724)
          0.038899586 = weight(abstract_txt:index in 3724) [ClassicSimilarity], result of:
            0.038899586 = score(doc=3724,freq=1.0), product of:
              0.07510631 = queryWeight, product of:
                1.9774562 = boost
                4.7353325 = idf(docFreq=1019, maxDocs=42740)
                0.008020826 = queryNorm
              0.517927 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7353325 = idf(docFreq=1019, maxDocs=42740)
                0.109375 = fieldNorm(doc=3724)
          0.6423607 = weight(abstract_txt:indices in 3724) [ClassicSimilarity], result of:
            0.6423607 = score(doc=3724,freq=5.0), product of:
              0.35885507 = queryWeight, product of:
                6.1128426 = boost
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.008020826 = queryNorm
              1.7900282 = fieldWeight in 3724, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.109375 = fieldNorm(doc=3724)
        0.25 = coord(5/20)
    
  4. Ibáñez, A.; Armañanzas, R.; Bielza, C.; Larrañaga, P.: Genetic algorithms and Gaussian Bayesian networks to uncover the predictive core set of bibliometric indices (2016) 0.15
    0.15139987 = sum of:
      0.15139987 = product of:
        0.5046662 = sum of:
          0.0064279325 = weight(abstract_txt:that in 5042) [ClassicSimilarity], result of:
            0.0064279325 = score(doc=5042,freq=5.0), product of:
              0.019207139 = queryWeight, product of:
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.008020826 = queryNorm
              0.33466372 = fieldWeight in 5042, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=5042)
          0.0033406208 = weight(abstract_txt:with in 5042) [ClassicSimilarity], result of:
            0.0033406208 = score(doc=5042,freq=1.0), product of:
              0.021230323 = queryWeight, product of:
                1.0513492 = boost
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.008020826 = queryNorm
              0.15735139 = fieldWeight in 5042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.0625 = fieldNorm(doc=5042)
          0.008458883 = weight(abstract_txt:also in 5042) [ClassicSimilarity], result of:
            0.008458883 = score(doc=5042,freq=1.0), product of:
              0.03944093 = queryWeight, product of:
                1.432987 = boost
                3.4315145 = idf(docFreq=3756, maxDocs=42740)
                0.008020826 = queryNorm
              0.21446966 = fieldWeight in 5042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4315145 = idf(docFreq=3756, maxDocs=42740)
                0.0625 = fieldNorm(doc=5042)
          0.013623059 = weight(abstract_txt:model in 5042) [ClassicSimilarity], result of:
            0.013623059 = score(doc=5042,freq=1.0), product of:
              0.0541903 = queryWeight, product of:
                1.6796912 = boost
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.008020826 = queryNorm
              0.25139293 = fieldWeight in 5042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.0625 = fieldNorm(doc=5042)
          0.038500603 = weight(abstract_txt:index in 5042) [ClassicSimilarity], result of:
            0.038500603 = score(doc=5042,freq=3.0), product of:
              0.07510631 = queryWeight, product of:
                1.9774562 = boost
                4.7353325 = idf(docFreq=1019, maxDocs=42740)
                0.008020826 = queryNorm
              0.5126148 = fieldWeight in 5042, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7353325 = idf(docFreq=1019, maxDocs=42740)
                0.0625 = fieldNorm(doc=5042)
          0.4343151 = weight(abstract_txt:indices in 5042) [ClassicSimilarity], result of:
            0.4343151 = score(doc=5042,freq=7.0), product of:
              0.35885507 = queryWeight, product of:
                6.1128426 = boost
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.008020826 = queryNorm
              1.21028 = fieldWeight in 5042, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.0625 = fieldNorm(doc=5042)
        0.3 = coord(6/20)
    
  5. Rousseau, R.; Jin, B.: ¬The age-dependent h-type AR**2-index : basic properties and a case study (2008) 0.15
    0.14778958 = sum of:
      0.14778958 = product of:
        0.4926319 = sum of:
          0.0050817267 = weight(abstract_txt:that in 4639) [ClassicSimilarity], result of:
            0.0050817267 = score(doc=4639,freq=2.0), product of:
              0.019207139 = queryWeight, product of:
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.008020826 = queryNorm
              0.2645749 = fieldWeight in 4639, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.078125 = fieldNorm(doc=4639)
          0.004175776 = weight(abstract_txt:with in 4639) [ClassicSimilarity], result of:
            0.004175776 = score(doc=4639,freq=1.0), product of:
              0.021230323 = queryWeight, product of:
                1.0513492 = boost
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.008020826 = queryNorm
              0.19668923 = fieldWeight in 4639, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.078125 = fieldNorm(doc=4639)
          0.062130082 = weight(abstract_txt:index in 4639) [ClassicSimilarity], result of:
            0.062130082 = score(doc=4639,freq=5.0), product of:
              0.07510631 = queryWeight, product of:
                1.9774562 = boost
                4.7353325 = idf(docFreq=1019, maxDocs=42740)
                0.008020826 = queryNorm
              0.82722855 = fieldWeight in 4639, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7353325 = idf(docFreq=1019, maxDocs=42740)
                0.078125 = fieldNorm(doc=4639)
          0.05591133 = weight(abstract_txt:called in 4639) [ClassicSimilarity], result of:
            0.05591133 = score(doc=4639,freq=2.0), product of:
              0.09501416 = queryWeight, product of:
                2.2241437 = boost
                5.3260646 = idf(docFreq=564, maxDocs=42740)
                0.008020826 = queryNorm
              0.5884526 = fieldWeight in 4639, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3260646 = idf(docFreq=564, maxDocs=42740)
                0.078125 = fieldNorm(doc=4639)
          0.07514402 = weight(abstract_txt:does in 4639) [ClassicSimilarity], result of:
            0.07514402 = score(doc=4639,freq=1.0), product of:
              0.18368404 = queryWeight, product of:
                4.3734016 = boost
                5.236402 = idf(docFreq=617, maxDocs=42740)
                0.008020826 = queryNorm
              0.40909392 = fieldWeight in 4639, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.236402 = idf(docFreq=617, maxDocs=42740)
                0.078125 = fieldNorm(doc=4639)
          0.29018897 = weight(abstract_txt:indices in 4639) [ClassicSimilarity], result of:
            0.29018897 = score(doc=4639,freq=2.0), product of:
              0.35885507 = queryWeight, product of:
                6.1128426 = boost
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.008020826 = queryNorm
              0.8086523 = fieldWeight in 4639, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.078125 = fieldNorm(doc=4639)
        0.3 = coord(6/20)