Document (#21502)

Author
Gonnet, G.H.
Snider, T.
Baeza-Yates, R.A.
Title
New indices for text : PAT trees and PAT arrays
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes and R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
pp. 66-82
Abstract
We survey new indices for text, with emphasis on PAT arrays (also called suffix arrays). A PAT array is an index based on a new model of text that does not use the concept of a word and does not need to know the structure of the text.
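The idea in the abstract can be illustrated with a minimal sketch: a suffix array lists every start position of the text, sorted by the suffix that begins there, so any substring can be located by binary search without splitting the text into words. This is a naive construction for illustration only, not the authors' algorithm; the function names `build_suffix_array` and `find_occurrences` are hypothetical.

```python
def build_suffix_array(text):
    # Naive construction: sort all start positions by the suffix beginning there.
    # Fine for short examples; practical indexes use faster algorithms.
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, sa, pattern):
    # Suffixes that start with `pattern` form one contiguous run in `sa`.
    m = len(pattern)

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # Return the matching start positions in text order.
    return sorted(sa[start:lo])


text = "abracadabra"
sa = build_suffix_array(text)
positions = find_occurrences(text, sa, "abra")
```

Because the index is built over character positions rather than words, the same lookup works for patterns that span word boundaries or that begin in the middle of a word.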
Theme
Retrieval algorithms

Similar documents (author)

  1. Baeza-Yates, R.A.: Introduction to data structures and algorithms related to information retrieval (1992) 6.28
    6.2847443 = sum of:
      6.2847443 = sum of:
        3.0091636 = weight(author_txt:yates in 4083) [ClassicSimilarity], result of:
          3.0091636 = score(doc=4083,freq=1.0), product of:
            0.6868423 = queryWeight, product of:
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.078385964 = queryNorm
            4.3811564 = fieldWeight in 4083, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.5 = fieldNorm(doc=4083)
        3.2755806 = weight(author_txt:baeza in 4083) [ClassicSimilarity], result of:
          3.2755806 = score(doc=4083,freq=1.0), product of:
            0.72680634 = queryWeight, product of:
              1.0286813 = boost
              9.013627 = idf(docFreq=13, maxDocs=42306)
              0.078385964 = queryNorm
            4.5068135 = fieldWeight in 4083, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.013627 = idf(docFreq=13, maxDocs=42306)
              0.5 = fieldNorm(doc=4083)
    
  2. Baeza-Yates, R.A.: String searching algorithms (1992) 6.28
    6.2847443 = sum of:
      6.2847443 = sum of:
        3.0091636 = weight(author_txt:yates in 4506) [ClassicSimilarity], result of:
          3.0091636 = score(doc=4506,freq=1.0), product of:
            0.6868423 = queryWeight, product of:
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.078385964 = queryNorm
            4.3811564 = fieldWeight in 4506, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.5 = fieldNorm(doc=4506)
        3.2755806 = weight(author_txt:baeza in 4506) [ClassicSimilarity], result of:
          3.2755806 = score(doc=4506,freq=1.0), product of:
            0.72680634 = queryWeight, product of:
              1.0286813 = boost
              9.013627 = idf(docFreq=13, maxDocs=42306)
              0.078385964 = queryNorm
            4.5068135 = fieldWeight in 4506, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.013627 = idf(docFreq=13, maxDocs=42306)
              0.5 = fieldNorm(doc=4506)
    
  3. Baeza-Yates, R.; Navarro, G.: Block addressing indices for approximate text retrieval (2000) 5.50
    5.499151 = sum of:
      5.499151 = sum of:
        2.6330183 = weight(author_txt:yates in 5296) [ClassicSimilarity], result of:
          2.6330183 = score(doc=5296,freq=1.0), product of:
            0.6868423 = queryWeight, product of:
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.078385964 = queryNorm
            3.8335118 = fieldWeight in 5296, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.4375 = fieldNorm(doc=5296)
        2.8661332 = weight(author_txt:baeza in 5296) [ClassicSimilarity], result of:
          2.8661332 = score(doc=5296,freq=1.0), product of:
            0.72680634 = queryWeight, product of:
              1.0286813 = boost
              9.013627 = idf(docFreq=13, maxDocs=42306)
              0.078385964 = queryNorm
            3.943462 = fieldWeight in 5296, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.013627 = idf(docFreq=13, maxDocs=42306)
              0.4375 = fieldNorm(doc=5296)
    
  4. Baeza-Yates, R.; Navarro, G.: XQL and proximal nodes (2002) 5.50
    5.499151 = sum of:
      5.499151 = sum of:
        2.6330183 = weight(author_txt:yates in 1455) [ClassicSimilarity], result of:
          2.6330183 = score(doc=1455,freq=1.0), product of:
            0.6868423 = queryWeight, product of:
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.078385964 = queryNorm
            3.8335118 = fieldWeight in 1455, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.4375 = fieldNorm(doc=1455)
        2.8661332 = weight(author_txt:baeza in 1455) [ClassicSimilarity], result of:
          2.8661332 = score(doc=1455,freq=1.0), product of:
            0.72680634 = queryWeight, product of:
              1.0286813 = boost
              9.013627 = idf(docFreq=13, maxDocs=42306)
              0.078385964 = queryNorm
            3.943462 = fieldWeight in 1455, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.013627 = idf(docFreq=13, maxDocs=42306)
              0.4375 = fieldNorm(doc=1455)
    
  5. Castillo, C.; Baeza-Yates, R.: Web retrieval and mining (2009) 5.50
    5.499151 = sum of:
      5.499151 = sum of:
        2.6330183 = weight(author_txt:yates in 905) [ClassicSimilarity], result of:
          2.6330183 = score(doc=905,freq=1.0), product of:
            0.6868423 = queryWeight, product of:
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.078385964 = queryNorm
            3.8335118 = fieldWeight in 905, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.762313 = idf(docFreq=17, maxDocs=42306)
              0.4375 = fieldNorm(doc=905)
        2.8661332 = weight(author_txt:baeza in 905) [ClassicSimilarity], result of:
          2.8661332 = score(doc=905,freq=1.0), product of:
            0.72680634 = queryWeight, product of:
              1.0286813 = boost
              9.013627 = idf(docFreq=13, maxDocs=42306)
              0.078385964 = queryNorm
            3.943462 = fieldWeight in 905, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.013627 = idf(docFreq=13, maxDocs=42306)
              0.4375 = fieldNorm(doc=905)
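
The arithmetic in the score trees above can be reproduced with a short sketch. It assumes the classic Lucene TF-IDF formulas that the `[ClassicSimilarity]` label suggests: tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The values of `queryNorm`, `fieldNorm`, and `boost` are copied from the listing for the first match (doc=4083) rather than derived, and the function names are hypothetical.

```python
import math


def idf(doc_freq, max_docs):
    # Inverse document frequency, assuming the classic formula.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # score = queryWeight * fieldWeight, as in each "product of:" node.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


MAX_DOCS = 42306
QUERY_NORM = 0.078385964  # taken from the listing, not computed here

# The two author terms matched in doc=4083 (fieldNorm 0.5).
yates = term_score(1.0, 17, MAX_DOCS, QUERY_NORM, 0.5)
baeza = term_score(1.0, 13, MAX_DOCS, QUERY_NORM, 0.5, boost=1.0286813)
total = yates + baeza

print(round(yates, 4), round(baeza, 4), round(total, 4))
```

The results should agree with the listed 3.0091636, 3.2755806, and 6.2847443 up to small rounding differences, since the listing uses single-precision floats.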
    

Similar documents (content)

  1. Will, L.: The ISO 25964 data model for the structure of an information retrieval thesaurus (2012) 0.23
    0.22610456 = sum of:
      0.22610456 = product of:
        0.75368184 = sum of:
          0.0043701525 = weight(abstract_txt:that in 2863) [ClassicSimilarity], result of:
            0.0043701525 = score(doc=2863,freq=1.0), product of:
              0.019383686 = queryWeight, product of:
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.008060229 = queryNorm
              0.2254552 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.09375 = fieldNorm(doc=2863)
          0.01278626 = weight(abstract_txt:also in 2863) [ClassicSimilarity], result of:
            0.01278626 = score(doc=2863,freq=1.0), product of:
              0.039652232 = queryWeight, product of:
                1.4302621 = boost
                3.4395735 = idf(docFreq=3688, maxDocs=42306)
                0.008060229 = queryNorm
              0.32246003 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4395735 = idf(docFreq=3688, maxDocs=42306)
                0.09375 = fieldNorm(doc=2863)
          0.020626169 = weight(abstract_txt:model in 2863) [ClassicSimilarity], result of:
            0.020626169 = score(doc=2863,freq=1.0), product of:
              0.054540318 = queryWeight, product of:
                1.6774155 = boost
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.008060229 = queryNorm
              0.37818205 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.09375 = fieldNorm(doc=2863)
          0.026270691 = weight(abstract_txt:structure in 2863) [ClassicSimilarity], result of:
            0.026270691 = score(doc=2863,freq=1.0), product of:
              0.06408449 = queryWeight, product of:
                1.8182697 = boost
                4.372676 = idf(docFreq=1450, maxDocs=42306)
                0.008060229 = queryNorm
              0.40993837 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.372676 = idf(docFreq=1450, maxDocs=42306)
                0.09375 = fieldNorm(doc=2863)
          0.029537601 = weight(abstract_txt:concept in 2863) [ClassicSimilarity], result of:
            0.029537601 = score(doc=2863,freq=1.0), product of:
              0.0692929 = queryWeight, product of:
                1.8907156 = boost
                4.546898 = idf(docFreq=1218, maxDocs=42306)
                0.008060229 = queryNorm
              0.42627168 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.546898 = idf(docFreq=1218, maxDocs=42306)
                0.09375 = fieldNorm(doc=2863)
          0.660091 = weight(abstract_txt:arrays in 2863) [ClassicSimilarity], result of:
            0.660091 = score(doc=2863,freq=1.0), product of:
              0.7928936 = queryWeight, product of:
                11.077707 = boost
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.008060229 = queryNorm
              0.8325089 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.09375 = fieldNorm(doc=2863)
        0.3 = coord(6/20)
    
  2. Harman, D.; Fox, E.; Baeza-Yates, R.; Lee, W.: Inverted files (1992) 0.19
    0.18951285 = sum of:
      0.18951285 = product of:
        0.94756424 = sum of:
          0.008240439 = weight(abstract_txt:that in 4498) [ClassicSimilarity], result of:
            0.008240439 = score(doc=4498,freq=2.0), product of:
              0.019383686 = queryWeight, product of:
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.008060229 = queryNorm
              0.4251224 = fieldWeight in 4498, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.125 = fieldNorm(doc=4498)
          0.0067573357 = weight(abstract_txt:with in 4498) [ClassicSimilarity], result of:
            0.0067573357 = score(doc=4498,freq=1.0), product of:
              0.021395862 = queryWeight, product of:
                1.0506226 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.008060229 = queryNorm
              0.31582442 = fieldWeight in 4498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.125 = fieldNorm(doc=4498)
          0.052445117 = weight(abstract_txt:survey in 4498) [ClassicSimilarity], result of:
            0.052445117 = score(doc=4498,freq=1.0), product of:
              0.08387184 = queryWeight, product of:
                2.0801272 = boost
                5.002405 = idf(docFreq=772, maxDocs=42306)
                0.008060229 = queryNorm
              0.62530065 = fieldWeight in 4498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.002405 = idf(docFreq=772, maxDocs=42306)
                0.125 = fieldNorm(doc=4498)
          0.88012135 = weight(abstract_txt:arrays in 4498) [ClassicSimilarity], result of:
            0.88012135 = score(doc=4498,freq=1.0), product of:
              0.7928936 = queryWeight, product of:
                11.077707 = boost
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.008060229 = queryNorm
              1.1100119 = fieldWeight in 4498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.125 = fieldNorm(doc=4498)
        0.2 = coord(4/20)
    
  3. Hartman, J.H.; Proebsting, T.A.; Sundaram, R.: Index-based hyperlinks (1997) 0.18
    0.17953737 = sum of:
      0.17953737 = product of:
        0.7181495 = sum of:
          0.0050985115 = weight(abstract_txt:that in 3724) [ClassicSimilarity], result of:
            0.0050985115 = score(doc=3724,freq=1.0), product of:
              0.019383686 = queryWeight, product of:
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.008060229 = queryNorm
              0.26303107 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.109375 = fieldNorm(doc=3724)
          0.012184112 = weight(abstract_txt:based in 3724) [ClassicSimilarity], result of:
            0.012184112 = score(doc=3724,freq=1.0), product of:
              0.03464735 = queryWeight, product of:
                1.3369551 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.008060229 = queryNorm
              0.35166073 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.109375 = fieldNorm(doc=3724)
          0.014917303 = weight(abstract_txt:also in 3724) [ClassicSimilarity], result of:
            0.014917303 = score(doc=3724,freq=1.0), product of:
              0.039652232 = queryWeight, product of:
                1.4302621 = boost
                3.4395735 = idf(docFreq=3688, maxDocs=42306)
                0.008060229 = queryNorm
              0.37620336 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4395735 = idf(docFreq=3688, maxDocs=42306)
                0.109375 = fieldNorm(doc=3724)
          0.03891621 = weight(abstract_txt:index in 3724) [ClassicSimilarity], result of:
            0.03891621 = score(doc=3724,freq=1.0), product of:
              0.07514402 = queryWeight, product of:
                1.9689244 = boost
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.008060229 = queryNorm
              0.5178883 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.109375 = fieldNorm(doc=3724)
          0.64703333 = weight(abstract_txt:indices in 3724) [ClassicSimilarity], result of:
            0.64703333 = score(doc=3724,freq=5.0), product of:
              0.36067152 = queryWeight, product of:
                6.100322 = boost
                7.335196 = idf(docFreq=74, maxDocs=42306)
                0.008060229 = queryNorm
              1.7939684 = fieldWeight in 3724, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.335196 = idf(docFreq=74, maxDocs=42306)
                0.109375 = fieldNorm(doc=3724)
        0.25 = coord(5/20)
    
  4. Ibáñez, A.; Armañanzas, R.; Bielza, C.; Larrañaga, P.: Genetic algorithms and Gaussian Bayesian networks to uncover the predictive core set of bibliometric indices (2016) 0.15
    0.15244791 = sum of:
      0.15244791 = product of:
        0.5081597 = sum of:
          0.006514639 = weight(abstract_txt:that in 42) [ClassicSimilarity], result of:
            0.006514639 = score(doc=42,freq=5.0), product of:
              0.019383686 = queryWeight, product of:
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.008060229 = queryNorm
              0.33608878 = fieldWeight in 42, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=42)
          0.0033786679 = weight(abstract_txt:with in 42) [ClassicSimilarity], result of:
            0.0033786679 = score(doc=42,freq=1.0), product of:
              0.021395862 = queryWeight, product of:
                1.0506226 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.008060229 = queryNorm
              0.15791221 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0625 = fieldNorm(doc=42)
          0.008524173 = weight(abstract_txt:also in 42) [ClassicSimilarity], result of:
            0.008524173 = score(doc=42,freq=1.0), product of:
              0.039652232 = queryWeight, product of:
                1.4302621 = boost
                3.4395735 = idf(docFreq=3688, maxDocs=42306)
                0.008060229 = queryNorm
              0.21497335 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4395735 = idf(docFreq=3688, maxDocs=42306)
                0.0625 = fieldNorm(doc=42)
          0.0137507785 = weight(abstract_txt:model in 42) [ClassicSimilarity], result of:
            0.0137507785 = score(doc=42,freq=1.0), product of:
              0.054540318 = queryWeight, product of:
                1.6774155 = boost
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.008060229 = queryNorm
              0.25212136 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.0625 = fieldNorm(doc=42)
          0.038517058 = weight(abstract_txt:index in 42) [ClassicSimilarity], result of:
            0.038517058 = score(doc=42,freq=3.0), product of:
              0.07514402 = queryWeight, product of:
                1.9689244 = boost
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.008060229 = queryNorm
              0.51257646 = fieldWeight in 42, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.0625 = fieldNorm(doc=42)
          0.43747437 = weight(abstract_txt:indices in 42) [ClassicSimilarity], result of:
            0.43747437 = score(doc=42,freq=7.0), product of:
              0.36067152 = queryWeight, product of:
                6.100322 = boost
                7.335196 = idf(docFreq=74, maxDocs=42306)
                0.008060229 = queryNorm
              1.212944 = fieldWeight in 42, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.335196 = idf(docFreq=74, maxDocs=42306)
                0.0625 = fieldNorm(doc=42)
        0.3 = coord(6/20)
    
  5. Rousseau, R.; Jin, B.: The age-dependent h-type AR²-index : basic properties and a case study (2008) 0.15
    0.14850289 = sum of:
      0.14850289 = product of:
        0.4950096 = sum of:
          0.0051502744 = weight(abstract_txt:that in 458) [ClassicSimilarity], result of:
            0.0051502744 = score(doc=458,freq=2.0), product of:
              0.019383686 = queryWeight, product of:
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.008060229 = queryNorm
              0.2657015 = fieldWeight in 458, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=458)
          0.0042233346 = weight(abstract_txt:with in 458) [ClassicSimilarity], result of:
            0.0042233346 = score(doc=458,freq=1.0), product of:
              0.021395862 = queryWeight, product of:
                1.0506226 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.008060229 = queryNorm
              0.19739026 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.078125 = fieldNorm(doc=458)
          0.062156636 = weight(abstract_txt:index in 458) [ClassicSimilarity], result of:
            0.062156636 = score(doc=458,freq=5.0), product of:
              0.07514402 = queryWeight, product of:
                1.9689244 = boost
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.008060229 = queryNorm
              0.82716674 = fieldWeight in 458, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.078125 = fieldNorm(doc=458)
          0.05579404 = weight(abstract_txt:called in 458) [ClassicSimilarity], result of:
            0.05579404 = score(doc=458,freq=2.0), product of:
              0.094901845 = queryWeight, product of:
                2.2126827 = boost
                5.3211823 = idf(docFreq=561, maxDocs=42306)
                0.008060229 = queryNorm
              0.58791316 = fieldWeight in 458, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3211823 = idf(docFreq=561, maxDocs=42306)
                0.078125 = fieldNorm(doc=458)
          0.075385444 = weight(abstract_txt:does in 458) [ClassicSimilarity], result of:
            0.075385444 = score(doc=458,freq=1.0), product of:
              0.18411723 = queryWeight, product of:
                4.35857 = boost
                5.2408657 = idf(docFreq=608, maxDocs=42306)
                0.008060229 = queryNorm
              0.40944263 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2408657 = idf(docFreq=608, maxDocs=42306)
                0.078125 = fieldNorm(doc=458)
          0.29229987 = weight(abstract_txt:indices in 458) [ClassicSimilarity], result of:
            0.29229987 = score(doc=458,freq=2.0), product of:
              0.36067152 = queryWeight, product of:
                6.100322 = boost
                7.335196 = idf(docFreq=74, maxDocs=42306)
                0.008060229 = queryNorm
              0.8104323 = fieldWeight in 458, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.335196 = idf(docFreq=74, maxDocs=42306)
                0.078125 = fieldNorm(doc=458)
        0.3 = coord(6/20)
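
For the content-based matches, the final score is the sum of the per-term scores multiplied by a coordination factor, coord(matched/total), which rewards documents that match more of the 20 query terms. A minimal sketch of that last step, using the per-term scores listed for the first content match (doc=2863); the function name `combined_score` is hypothetical.

```python
def combined_score(term_scores, total_query_terms):
    # coord = (number of matching terms) / (number of query terms)
    coord = len(term_scores) / total_query_terms
    return sum(term_scores) * coord


# Per-term scores copied from the listing for doc=2863:
# that, also, model, structure, concept, arrays
doc_2863_terms = [
    0.0043701525,
    0.01278626,
    0.020626169,
    0.026270691,
    0.029537601,
    0.660091,
]

score_2863 = combined_score(doc_2863_terms, 20)
print(round(score_2863, 6))
```

The result should be close to the listed 0.22610456. Note that the single rare term "arrays" contributes most of the sum, which is why documents unrelated to text indexing can still rank highly.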