Document (#21512)

Author
Wartik, S.
Fox, E.
Heath, L.
Chen, Q.-F.
Title
Hashing algorithms
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes u. R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
S.293-362
Abstract
Discusses hashing, an information storage and retrieval technique useful for implementing many of the other structures in this book. The concepts underlying hashing are presented, along with 2 implementation strategies. The chapter also contains an extensive discussion of perfect hashing, an important optimization in information retrieval, and an O(n) algorithm to find minimal perfect hash functions for a set of keys
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Heath, F.: Libraries, information technology, and the future (1995) 2.87
    2.8708858 = sum of:
      2.8708858 = product of:
        5.7417717 = sum of:
          5.7417717 = weight(author_txt:heath in 3733) [ClassicSimilarity], result of:
            5.7417717 = score(doc=3733,freq=1.0), product of:
              0.93099535 = queryWeight, product of:
                1.5970145 = boost
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.059077248 = queryNorm
              6.1673474 = fieldWeight in 3733, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.625 = fieldNorm(doc=3733)
        0.5 = coord(1/2)
    
  2. Bizer, C.; Heath, T.: Linked Data : evolving the web into a global data space (2011) 2.30
    2.2967086 = sum of:
      2.2967086 = product of:
        4.593417 = sum of:
          4.593417 = weight(author_txt:heath in 726) [ClassicSimilarity], result of:
            4.593417 = score(doc=726,freq=1.0), product of:
              0.93099535 = queryWeight, product of:
                1.5970145 = boost
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.059077248 = queryNorm
              4.933878 = fieldWeight in 726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.5 = fieldNorm(doc=726)
        0.5 = coord(1/2)
    
  3. Vikor, D.L.; Gaumond, G.; Heath, F.M.: Building electronic cooperation in the 1990s : the Maryland, Georgia, and Texas experiences (1997) 1.72
    1.7225316 = sum of:
      1.7225316 = product of:
        3.445063 = sum of:
          3.445063 = weight(author_txt:heath in 2681) [ClassicSimilarity], result of:
            3.445063 = score(doc=2681,freq=1.0), product of:
              0.93099535 = queryWeight, product of:
                1.5970145 = boost
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.059077248 = queryNorm
              3.7004085 = fieldWeight in 2681, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.375 = fieldNorm(doc=2681)
        0.5 = coord(1/2)
    
  4. Bizer, C.; Cyganiak, R.; Heath, T.: How to publish Linked Data on the Web (2007) 1.72
    1.7225316 = sum of:
      1.7225316 = product of:
        3.445063 = sum of:
          3.445063 = weight(author_txt:heath in 45) [ClassicSimilarity], result of:
            3.445063 = score(doc=45,freq=1.0), product of:
              0.93099535 = queryWeight, product of:
                1.5970145 = boost
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.059077248 = queryNorm
              3.7004085 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.375 = fieldNorm(doc=45)
        0.5 = coord(1/2)
    
  5. Chen, Y.N.; Chen, S.J.: ¬A metadata practice of the OFLA FRBR model : a case study for the National Palace Museum in Taipai (2004) 0.80
    0.79743326 = sum of:
      0.79743326 = product of:
        1.5948665 = sum of:
          1.5948665 = weight(author_txt:chen in 4385) [ClassicSimilarity], result of:
            1.5948665 = score(doc=4385,freq=2.0), product of:
              0.36503103 = queryWeight, product of:
                6.178877 = idf(docFreq=239, maxDocs=42596)
                0.059077248 = queryNorm
              4.369126 = fieldWeight in 4385, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.178877 = idf(docFreq=239, maxDocs=42596)
                0.5 = fieldNorm(doc=4385)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Wartik, S.: Boolean operators (1992) 0.18
    0.18146059 = sum of:
      0.18146059 = product of:
        1.1341287 = sum of:
          0.008299039 = weight(abstract_txt:information in 4510) [ClassicSimilarity], result of:
            0.008299039 = score(doc=4510,freq=1.0), product of:
              0.027323479 = queryWeight, product of:
                1.0048585 = boost
                2.429863 = idf(docFreq=10194, maxDocs=42596)
                0.0111904945 = queryNorm
              0.30373287 = fieldWeight in 4510, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.429863 = idf(docFreq=10194, maxDocs=42596)
                0.125 = fieldNorm(doc=4510)
          0.041448735 = weight(abstract_txt:implementation in 4510) [ClassicSimilarity], result of:
            0.041448735 = score(doc=4510,freq=1.0), product of:
              0.06336483 = queryWeight, product of:
                1.0820469 = boost
                5.233027 = idf(docFreq=617, maxDocs=42596)
                0.0111904945 = queryNorm
              0.6541284 = fieldWeight in 4510, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.233027 = idf(docFreq=617, maxDocs=42596)
                0.125 = fieldNorm(doc=4510)
          0.0240196 = weight(abstract_txt:retrieval in 4510) [ClassicSimilarity], result of:
            0.0240196 = score(doc=4510,freq=1.0), product of:
              0.055491693 = queryWeight, product of:
                1.432026 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.0111904945 = queryNorm
              0.4328504 = fieldWeight in 4510, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.125 = fieldNorm(doc=4510)
          1.0603613 = weight(abstract_txt:hashing in 4510) [ClassicSimilarity], result of:
            1.0603613 = score(doc=4510,freq=1.0), product of:
              0.8732998 = queryWeight, product of:
                8.034033 = boost
                9.713606 = idf(docFreq=6, maxDocs=42596)
                0.0111904945 = queryNorm
              1.2142007 = fieldWeight in 4510, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.713606 = idf(docFreq=6, maxDocs=42596)
                0.125 = fieldNorm(doc=4510)
        0.16 = coord(4/25)
    
  2. Nelson, M.J.: ¬A prefix trie index for inverted files (1997) 0.17
    0.16957943 = sum of:
      0.16957943 = product of:
        1.0598714 = sum of:
          0.005186899 = weight(abstract_txt:information in 1496) [ClassicSimilarity], result of:
            0.005186899 = score(doc=1496,freq=1.0), product of:
              0.027323479 = queryWeight, product of:
                1.0048585 = boost
                2.429863 = idf(docFreq=10194, maxDocs=42596)
                0.0111904945 = queryNorm
              0.18983305 = fieldWeight in 1496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.429863 = idf(docFreq=10194, maxDocs=42596)
                0.078125 = fieldNorm(doc=1496)
          0.021230526 = weight(abstract_txt:retrieval in 1496) [ClassicSimilarity], result of:
            0.021230526 = score(doc=1496,freq=2.0), product of:
              0.055491693 = queryWeight, product of:
                1.432026 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.0111904945 = queryNorm
              0.38258928 = fieldWeight in 1496, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.078125 = fieldNorm(doc=1496)
          0.09621831 = weight(abstract_txt:keys in 1496) [ClassicSimilarity], result of:
            0.09621831 = score(doc=1496,freq=1.0), product of:
              0.15197049 = queryWeight, product of:
                1.6757203 = boost
                8.104168 = idf(docFreq=34, maxDocs=42596)
                0.0111904945 = queryNorm
              0.6331381 = fieldWeight in 1496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.104168 = idf(docFreq=34, maxDocs=42596)
                0.078125 = fieldNorm(doc=1496)
          0.9372357 = weight(abstract_txt:hashing in 1496) [ClassicSimilarity], result of:
            0.9372357 = score(doc=1496,freq=2.0), product of:
              0.8732998 = queryWeight, product of:
                8.034033 = boost
                9.713606 = idf(docFreq=6, maxDocs=42596)
                0.0111904945 = queryNorm
              1.0732119 = fieldWeight in 1496, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.713606 = idf(docFreq=6, maxDocs=42596)
                0.078125 = fieldNorm(doc=1496)
        0.16 = coord(4/25)
    
  3. Hoad, T.C.; Zobel, J.: Methods for identifying versioned and plagiarized documents (2003) 0.16
    0.16430151 = sum of:
      0.16430151 = product of:
        0.8215076 = sum of:
          0.0041495194 = weight(abstract_txt:information in 160) [ClassicSimilarity], result of:
            0.0041495194 = score(doc=160,freq=1.0), product of:
              0.027323479 = queryWeight, product of:
                1.0048585 = boost
                2.429863 = idf(docFreq=10194, maxDocs=42596)
                0.0111904945 = queryNorm
              0.15186644 = fieldWeight in 160, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.429863 = idf(docFreq=10194, maxDocs=42596)
                0.0625 = fieldNorm(doc=160)
          0.019813718 = weight(abstract_txt:strategies in 160) [ClassicSimilarity], result of:
            0.019813718 = score(doc=160,freq=1.0), product of:
              0.06149476 = queryWeight, product of:
                1.0659602 = boost
                5.1552277 = idf(docFreq=667, maxDocs=42596)
                0.0111904945 = queryNorm
              0.32220173 = fieldWeight in 160, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1552277 = idf(docFreq=667, maxDocs=42596)
                0.0625 = fieldNorm(doc=160)
          0.035745952 = weight(abstract_txt:technique in 160) [ClassicSimilarity], result of:
            0.035745952 = score(doc=160,freq=2.0), product of:
              0.072332814 = queryWeight, product of:
                1.1560845 = boost
                5.59109 = idf(docFreq=431, maxDocs=42596)
                0.0111904945 = queryNorm
              0.4941872 = fieldWeight in 160, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.59109 = idf(docFreq=431, maxDocs=42596)
                0.0625 = fieldNorm(doc=160)
          0.0120098 = weight(abstract_txt:retrieval in 160) [ClassicSimilarity], result of:
            0.0120098 = score(doc=160,freq=1.0), product of:
              0.055491693 = queryWeight, product of:
                1.432026 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.0111904945 = queryNorm
              0.2164252 = fieldWeight in 160, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.0625 = fieldNorm(doc=160)
          0.7497886 = weight(abstract_txt:hashing in 160) [ClassicSimilarity], result of:
            0.7497886 = score(doc=160,freq=2.0), product of:
              0.8732998 = queryWeight, product of:
                8.034033 = boost
                9.713606 = idf(docFreq=6, maxDocs=42596)
                0.0111904945 = queryNorm
              0.85856956 = fieldWeight in 160, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.713606 = idf(docFreq=6, maxDocs=42596)
                0.0625 = fieldNorm(doc=160)
        0.2 = coord(5/25)
    
  4. Ford, D.A.; Christodoukalis, S.: File organizations for optical disks (1992) 0.15
    0.14578466 = sum of:
      0.14578466 = product of:
        0.91115415 = sum of:
          0.0294137 = weight(abstract_txt:structures in 4502) [ClassicSimilarity], result of:
            0.0294137 = score(doc=4502,freq=1.0), product of:
              0.06107072 = queryWeight, product of:
                1.0622786 = boost
                5.137423 = idf(docFreq=679, maxDocs=42596)
                0.0111904945 = queryNorm
              0.48163342 = fieldWeight in 4502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.137423 = idf(docFreq=679, maxDocs=42596)
                0.09375 = fieldNorm(doc=4502)
          0.06099291 = weight(abstract_txt:storage in 4502) [ClassicSimilarity], result of:
            0.06099291 = score(doc=4502,freq=2.0), product of:
              0.07882117 = queryWeight, product of:
                1.2068224 = boost
                5.8364697 = idf(docFreq=337, maxDocs=42596)
                0.0111904945 = queryNorm
              0.77381384 = fieldWeight in 4502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8364697 = idf(docFreq=337, maxDocs=42596)
                0.09375 = fieldNorm(doc=4502)
          0.025476635 = weight(abstract_txt:retrieval in 4502) [ClassicSimilarity], result of:
            0.025476635 = score(doc=4502,freq=2.0), product of:
              0.055491693 = queryWeight, product of:
                1.432026 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.0111904945 = queryNorm
              0.45910716 = fieldWeight in 4502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.09375 = fieldNorm(doc=4502)
          0.7952709 = weight(abstract_txt:hashing in 4502) [ClassicSimilarity], result of:
            0.7952709 = score(doc=4502,freq=1.0), product of:
              0.8732998 = queryWeight, product of:
                8.034033 = boost
                9.713606 = idf(docFreq=6, maxDocs=42596)
                0.0111904945 = queryNorm
              0.91065055 = fieldWeight in 4502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.713606 = idf(docFreq=6, maxDocs=42596)
                0.09375 = fieldNorm(doc=4502)
        0.16 = coord(4/25)
    
  5. Lam, W.; Wong, K.-F.; Wong, C.-Y.: Chinese document indexing based on new partitioned signature file : model and evaluation (2001) 0.13
    0.12855873 = sum of:
      0.12855873 = product of:
        0.803492 = sum of:
          0.0041495194 = weight(abstract_txt:information in 1304) [ClassicSimilarity], result of:
            0.0041495194 = score(doc=1304,freq=1.0), product of:
              0.027323479 = queryWeight, product of:
                1.0048585 = boost
                2.429863 = idf(docFreq=10194, maxDocs=42596)
                0.0111904945 = queryNorm
              0.15186644 = fieldWeight in 1304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.429863 = idf(docFreq=10194, maxDocs=42596)
                0.0625 = fieldNorm(doc=1304)
          0.028752334 = weight(abstract_txt:storage in 1304) [ClassicSimilarity], result of:
            0.028752334 = score(doc=1304,freq=1.0), product of:
              0.07882117 = queryWeight, product of:
                1.2068224 = boost
                5.8364697 = idf(docFreq=337, maxDocs=42596)
                0.0111904945 = queryNorm
              0.36477935 = fieldWeight in 1304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8364697 = idf(docFreq=337, maxDocs=42596)
                0.0625 = fieldNorm(doc=1304)
          0.020801583 = weight(abstract_txt:retrieval in 1304) [ClassicSimilarity], result of:
            0.020801583 = score(doc=1304,freq=3.0), product of:
              0.055491693 = queryWeight, product of:
                1.432026 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.0111904945 = queryNorm
              0.37485942 = fieldWeight in 1304, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.0625 = fieldNorm(doc=1304)
          0.7497886 = weight(abstract_txt:hashing in 1304) [ClassicSimilarity], result of:
            0.7497886 = score(doc=1304,freq=2.0), product of:
              0.8732998 = queryWeight, product of:
                8.034033 = boost
                9.713606 = idf(docFreq=6, maxDocs=42596)
                0.0111904945 = queryNorm
              0.85856956 = fieldWeight in 1304, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.713606 = idf(docFreq=6, maxDocs=42596)
                0.0625 = fieldNorm(doc=1304)
        0.16 = coord(4/25)