Document (#21512)

Author
Wartik, S.
Fox, E.
Heath, L.
Chen, Q.-F.
Title
Hashing algorithms
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes and R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
pp. 293-362
Abstract
Discusses hashing, an information storage and retrieval technique useful for implementing many of the other structures in this book. The concepts underlying hashing are presented, along with two implementation strategies. The chapter also contains an extensive discussion of perfect hashing, an important optimization in information retrieval, and an O(n) algorithm to find minimal perfect hash functions for a set of keys.
Theme
Retrieval algorithms

Similar documents (author)

  1. Heath, F.: Libraries, information technology, and the future (1995) 2.88
    2.8786802 = sum of:
      2.8786802 = product of:
        5.7573605 = sum of:
          5.7573605 = weight(author_txt:heath in 4733) [ClassicSimilarity], result of:
            5.7573605 = score(doc=4733,freq=1.0), product of:
              0.93207496 = queryWeight, product of:
                1.6040282 = boost
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.0587958 = queryNorm
              6.1769285 = fieldWeight in 4733, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.625 = fieldNorm(doc=4733)
        0.5 = coord(1/2)
    
  2. Bizer, C.; Heath, T.: Linked Data : evolving the web into a global data space (2011) 2.30
    2.3029442 = sum of:
      2.3029442 = product of:
        4.6058884 = sum of:
          4.6058884 = weight(author_txt:heath in 1190) [ClassicSimilarity], result of:
            4.6058884 = score(doc=1190,freq=1.0), product of:
              0.93207496 = queryWeight, product of:
                1.6040282 = boost
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.0587958 = queryNorm
              4.9415426 = fieldWeight in 1190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.5 = fieldNorm(doc=1190)
        0.5 = coord(1/2)
    
  3. Vikor, D.L.; Gaumond, G.; Heath, F.M.: Building electronic cooperation in the 1990s : the Maryland, Georgia, and Texas experiences (1997) 1.73
    1.727208 = sum of:
      1.727208 = product of:
        3.454416 = sum of:
          3.454416 = weight(author_txt:heath in 3681) [ClassicSimilarity], result of:
            3.454416 = score(doc=3681,freq=1.0), product of:
              0.93207496 = queryWeight, product of:
                1.6040282 = boost
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.0587958 = queryNorm
              3.706157 = fieldWeight in 3681, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.375 = fieldNorm(doc=3681)
        0.5 = coord(1/2)
    
  4. Bizer, C.; Cyganiak, R.; Heath, T.: How to publish Linked Data on the Web (2007) 1.73
    1.727208 = sum of:
      1.727208 = product of:
        3.454416 = sum of:
          3.454416 = weight(author_txt:heath in 256) [ClassicSimilarity], result of:
            3.454416 = score(doc=256,freq=1.0), product of:
              0.93207496 = queryWeight, product of:
                1.6040282 = boost
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.0587958 = queryNorm
              3.706157 = fieldWeight in 256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.375 = fieldNorm(doc=256)
        0.5 = coord(1/2)
    
  5. Chen, Y.N.; Chen, S.J.: ¬A metadata practice of the IFLA FRBR model : a case study for the National Palace Museum in Taipei (2004) 0.79
    0.7891551 = sum of:
      0.7891551 = product of:
        1.5783103 = sum of:
          1.5783103 = weight(author_txt:chen in 5385) [ClassicSimilarity], result of:
            1.5783103 = score(doc=5385,freq=2.0), product of:
              0.36226538 = queryWeight, product of:
                6.161416 = idf(docFreq=247, maxDocs=43254)
                0.0587958 = queryNorm
              4.356779 = fieldWeight in 5385, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.161416 = idf(docFreq=247, maxDocs=43254)
                0.5 = fieldNorm(doc=5385)
        0.5 = coord(1/2)
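
The author-match trees above all follow the same arithmetic, so a short sketch may make them easier to read. It assumes the standard Lucene ClassicSimilarity definitions, idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), which agree with the values printed in the trees. The function names (classic_idf, classic_tf, term_score) are hypothetical helpers introduced here for illustration and are not part of the catalog or of Lucene.

```python
import math

MAX_DOCS = 43254          # maxDocs, as printed in the explain trees
QUERY_NORM = 0.0587958    # queryNorm for the author query, copied from the trees


def classic_idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as ClassicSimilarity defines it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq):
    """Term frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, field_norm, boost=1.0, query_norm=QUERY_NORM):
    """queryWeight * fieldWeight for one matching term."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight


# Hit 1 (doc 4733): "heath" matches once, one of two query clauses matched.
heath_score = 0.5 * term_score(freq=1.0, doc_freq=5, field_norm=0.625,
                               boost=1.6040282)

# Hit 5 (doc 5385): "chen" matches twice, no boost shown in the tree.
chen_score = 0.5 * term_score(freq=2.0, doc_freq=247, field_norm=0.5)

print(f"heath in 4733: {heath_score:.4f}")
print(f"chen in 5385:  {chen_score:.4f}")
```

Only fieldNorm differs among the four "heath" hits (0.625, 0.5, 0.375, 0.375), which is why their scores fall in the same proportion. The small differences from the printed values in the last digits are expected, since Lucene computes these factors in single precision.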
    

Similar documents (content)

  1. Wartik, S.: Boolean operators (1992) 0.18
    0.18175013 = sum of:
      0.18175013 = product of:
        1.1359384 = sum of:
          0.008246192 = weight(abstract_txt:information in 5510) [ClassicSimilarity], result of:
            0.008246192 = score(doc=5510,freq=1.0), product of:
              0.027182411 = queryWeight, product of:
                1.0042644 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.011152814 = queryNorm
              0.303365 = fieldWeight in 5510, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.125 = fieldNorm(doc=5510)
          0.041130126 = weight(abstract_txt:implementation in 5510) [ClassicSimilarity], result of:
            0.041130126 = score(doc=5510,freq=1.0), product of:
              0.06298189 = queryWeight, product of:
                1.0809283 = boost
                5.224375 = idf(docFreq=632, maxDocs=43254)
                0.011152814 = queryNorm
              0.65304685 = fieldWeight in 5510, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.224375 = idf(docFreq=632, maxDocs=43254)
                0.125 = fieldNorm(doc=5510)
          0.024101157 = weight(abstract_txt:retrieval in 5510) [ClassicSimilarity], result of:
            0.024101157 = score(doc=5510,freq=1.0), product of:
              0.05556623 = queryWeight, product of:
                1.4358515 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.011152814 = queryNorm
              0.4337375 = fieldWeight in 5510, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.125 = fieldNorm(doc=5510)
          1.0624609 = weight(abstract_txt:hashing in 5510) [ClassicSimilarity], result of:
            1.0624609 = score(doc=5510,freq=1.0), product of:
              0.8736503 = queryWeight, product of:
                8.051705 = boost
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.011152814 = queryNorm
              1.2161169 = fieldWeight in 5510, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.125 = fieldNorm(doc=5510)
        0.16 = coord(4/25)
    
  2. Nelson, M.J.: ¬A prefix trie index for inverted files (1997) 0.17
    0.1699276 = sum of:
      0.1699276 = product of:
        1.0620475 = sum of:
          0.00515387 = weight(abstract_txt:information in 2496) [ClassicSimilarity], result of:
            0.00515387 = score(doc=2496,freq=1.0), product of:
              0.027182411 = queryWeight, product of:
                1.0042644 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.011152814 = queryNorm
              0.18960312 = fieldWeight in 2496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.078125 = fieldNorm(doc=2496)
          0.021302612 = weight(abstract_txt:retrieval in 2496) [ClassicSimilarity], result of:
            0.021302612 = score(doc=2496,freq=2.0), product of:
              0.05556623 = queryWeight, product of:
                1.4358515 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.011152814 = queryNorm
              0.38337338 = fieldWeight in 2496, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.078125 = fieldNorm(doc=2496)
          0.09649938 = weight(abstract_txt:keys in 2496) [ClassicSimilarity], result of:
            0.09649938 = score(doc=2496,freq=1.0), product of:
              0.15212667 = queryWeight, product of:
                1.6799321 = boost
                8.119497 = idf(docFreq=34, maxDocs=43254)
                0.011152814 = queryNorm
              0.63433576 = fieldWeight in 2496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.119497 = idf(docFreq=34, maxDocs=43254)
                0.078125 = fieldNorm(doc=2496)
          0.9390916 = weight(abstract_txt:hashing in 2496) [ClassicSimilarity], result of:
            0.9390916 = score(doc=2496,freq=2.0), product of:
              0.8736503 = queryWeight, product of:
                8.051705 = boost
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.011152814 = queryNorm
              1.0749056 = fieldWeight in 2496, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.078125 = fieldNorm(doc=2496)
        0.16 = coord(4/25)
    
  3. Hoad, T.C.; Zobel, J.: Methods for identifying versioned and plagiarized documents (2003) 0.16
    0.16451806 = sum of:
      0.16451806 = product of:
        0.8225903 = sum of:
          0.004123096 = weight(abstract_txt:information in 160) [ClassicSimilarity], result of:
            0.004123096 = score(doc=160,freq=1.0), product of:
              0.027182411 = queryWeight, product of:
                1.0042644 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.011152814 = queryNorm
              0.1516825 = fieldWeight in 160, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0625 = fieldNorm(doc=160)
          0.01959675 = weight(abstract_txt:strategies in 160) [ClassicSimilarity], result of:
            0.01959675 = score(doc=160,freq=1.0), product of:
              0.060989026 = queryWeight, product of:
                1.0636896 = boost
                5.141056 = idf(docFreq=687, maxDocs=43254)
                0.011152814 = queryNorm
              0.321316 = fieldWeight in 160, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.141056 = idf(docFreq=687, maxDocs=43254)
                0.0625 = fieldNorm(doc=160)
          0.035546612 = weight(abstract_txt:technique in 160) [ClassicSimilarity], result of:
            0.035546612 = score(doc=160,freq=2.0), product of:
              0.07199757 = queryWeight, product of:
                1.1557076 = boost
                5.5858 = idf(docFreq=440, maxDocs=43254)
                0.011152814 = queryNorm
              0.49371964 = fieldWeight in 160, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5858 = idf(docFreq=440, maxDocs=43254)
                0.0625 = fieldNorm(doc=160)
          0.012050578 = weight(abstract_txt:retrieval in 160) [ClassicSimilarity], result of:
            0.012050578 = score(doc=160,freq=1.0), product of:
              0.05556623 = queryWeight, product of:
                1.4358515 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.011152814 = queryNorm
              0.21686874 = fieldWeight in 160, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=160)
          0.7512733 = weight(abstract_txt:hashing in 160) [ClassicSimilarity], result of:
            0.7512733 = score(doc=160,freq=2.0), product of:
              0.8736503 = queryWeight, product of:
                8.051705 = boost
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.011152814 = queryNorm
              0.8599245 = fieldWeight in 160, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.0625 = fieldNorm(doc=160)
        0.2 = coord(5/25)
    
  4. Ford, D.A.; Christodoulakis, S.: File organizations for optical disks (1992) 0.15
    0.146056 = sum of:
      0.146056 = product of:
        0.91284996 = sum of:
          0.029320562 = weight(abstract_txt:structures in 5502) [ClassicSimilarity], result of:
            0.029320562 = score(doc=5502,freq=1.0), product of:
              0.060885847 = queryWeight, product of:
                1.0627894 = boost
                5.1367054 = idf(docFreq=690, maxDocs=43254)
                0.011152814 = queryNorm
              0.48156613 = fieldWeight in 5502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1367054 = idf(docFreq=690, maxDocs=43254)
                0.09375 = fieldNorm(doc=5502)
          0.06112057 = weight(abstract_txt:storage in 5502) [ClassicSimilarity], result of:
            0.06112057 = score(doc=5502,freq=2.0), product of:
              0.078858726 = queryWeight, product of:
                1.2095225 = boost
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.011152814 = queryNorm
              0.7750641 = fieldWeight in 5502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.09375 = fieldNorm(doc=5502)
          0.025563138 = weight(abstract_txt:retrieval in 5502) [ClassicSimilarity], result of:
            0.025563138 = score(doc=5502,freq=2.0), product of:
              0.05556623 = queryWeight, product of:
                1.4358515 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.011152814 = queryNorm
              0.46004808 = fieldWeight in 5502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.09375 = fieldNorm(doc=5502)
          0.7968457 = weight(abstract_txt:hashing in 5502) [ClassicSimilarity], result of:
            0.7968457 = score(doc=5502,freq=1.0), product of:
              0.8736503 = queryWeight, product of:
                8.051705 = boost
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.011152814 = queryNorm
              0.9120877 = fieldWeight in 5502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.09375 = fieldNorm(doc=5502)
        0.16 = coord(4/25)
    
  5. Lam, W.; Wong, K.-F.; Wong, C.-Y.: Chinese document indexing based on new partitioned signature file : model and evaluation (2001) 0.13
    0.12881297 = sum of:
      0.12881297 = product of:
        0.80508107 = sum of:
          0.004123096 = weight(abstract_txt:information in 2304) [ClassicSimilarity], result of:
            0.004123096 = score(doc=2304,freq=1.0), product of:
              0.027182411 = queryWeight, product of:
                1.0042644 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.011152814 = queryNorm
              0.1516825 = fieldWeight in 2304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0625 = fieldNorm(doc=2304)
          0.028812513 = weight(abstract_txt:storage in 2304) [ClassicSimilarity], result of:
            0.028812513 = score(doc=2304,freq=1.0), product of:
              0.078858726 = queryWeight, product of:
                1.2095225 = boost
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.011152814 = queryNorm
              0.36536872 = fieldWeight in 2304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.0625 = fieldNorm(doc=2304)
          0.020872213 = weight(abstract_txt:retrieval in 2304) [ClassicSimilarity], result of:
            0.020872213 = score(doc=2304,freq=3.0), product of:
              0.05556623 = queryWeight, product of:
                1.4358515 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.011152814 = queryNorm
              0.37562767 = fieldWeight in 2304, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=2304)
          0.7512733 = weight(abstract_txt:hashing in 2304) [ClassicSimilarity], result of:
            0.7512733 = score(doc=2304,freq=2.0), product of:
              0.8736503 = queryWeight, product of:
                8.051705 = boost
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.011152814 = queryNorm
              0.8599245 = fieldWeight in 2304, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.0625 = fieldNorm(doc=2304)
        0.16 = coord(4/25)
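
The content-match trees add one step to the single-term case: per-term scores are summed and then scaled by coord, the fraction of query terms that matched. The sketch below recomputes the first content hit (doc 5510) from the numbers printed in its tree. It assumes the same ClassicSimilarity formulas, idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq); the names term_score, matched_terms and QUERY_TERM_COUNT are hypothetical and used only for illustration.

```python
import math

MAX_DOCS = 43254
QUERY_NORM = 0.011152814   # queryNorm for the abstract query, from the trees
FIELD_NORM = 0.125         # fieldNorm(doc=5510)
QUERY_TERM_COUNT = 25      # denominator of coord(4/25)

# (term, boost, docFreq, termFreq) copied from the tree for doc 5510
matched_terms = [
    ("information", 1.0042644, 10382, 1.0),
    ("implementation", 1.0809283, 632, 1.0),
    ("retrieval", 1.4358515, 3658, 1.0),
    ("hashing", 8.051705, 6, 1.0),
]


def term_score(boost, doc_freq, freq):
    """queryWeight * fieldWeight for one matching term."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


contributions = {term: term_score(boost, df, freq)
                 for term, boost, df, freq in matched_terms}
raw_sum = sum(contributions.values())
coord = len(matched_terms) / QUERY_TERM_COUNT
total = coord * raw_sum

for term, value in contributions.items():
    print(f"{term:15s} {value:.6f}  ({value / raw_sum:.1%} of the sum)")
print(f"total score: {total:.4f}")
```

In this hit the rare, heavily boosted term "hashing" (docFreq=6) accounts for most of the sum, and the same pattern holds in the other four content hits, where its contribution is likewise the largest by a wide margin.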