Document (#19103)

Author
Cunningham, S.J.
Title
Approximating document descriptors : what to do when a catalog isn't available
Source
Electronic library and visual information research: Proceedings of the 4th ELVIRA Conference (ELVIRA 4), Electronic Library and Visual Information Research, De Montfort University, Milton Keynes, May 1997. Ed. by C. Davies u. A. Ramsden
Imprint
London : Aslib
Year
1997
Pages
S.125-131
Abstract
The New Zealand Computer Science Technical Reports collection provides a central index to over 32.000 working papers distributed in archives around the world. The collection is not formally catalogued and cataloguing information is available only for a minority of the documents. However it is possible to access and index the full text of the documents, not simply the title and abstract, as is common in bibliographic databases. Techniques for using this expanded keyword access to the full text so as to create 'approximate' document descriptions are being investigated to allow the user to carry out searches similar to (although not as precise as) those supported by formally catalogued systems

Similar documents (author)

  1. Cunningham, E.R.: Classification for medical literature (1946) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:cunningham in 3561) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 3561, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=3561)
    
  2. Cunningham, M.: Document imaging : present and future (1994) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:cunningham in 8396) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 8396, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=8396)
    
  3. Cunningham, J.: Getting the most from Alta Vista (1996) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:cunningham in 7699) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 7699, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=7699)
    
  4. Cunningham, A.: ¬A new direction for the National Bibliography (1997) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:cunningham in 1617) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 1617, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=1617)
    
  5. Cunningham, S.: Hybrid WWW and CD-ROM systems (1998) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:cunningham in 5220) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 5220, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=5220)
    

Similar documents (content)

  1. Jianchao, X.; Ming, H.; Milin, S.: On indexing descriptors for document archive (1998) 0.16
    0.15801673 = sum of:
      0.15801673 = product of:
        0.65840304 = sum of:
          0.098235175 = weight(abstract_txt:keyword in 3567) [ClassicSimilarity], result of:
            0.098235175 = score(doc=3567,freq=1.0), product of:
              0.13016874 = queryWeight, product of:
                1.038503 = boost
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.020761017 = queryNorm
              0.7546756 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.125 = fieldNorm(doc=3567)
          0.22839502 = weight(abstract_txt:descriptors in 3567) [ClassicSimilarity], result of:
            0.22839502 = score(doc=3567,freq=3.0), product of:
              0.15839666 = queryWeight, product of:
                1.1455853 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.020761017 = queryNorm
              1.4419181 = fieldWeight in 3567, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.125 = fieldNorm(doc=3567)
          0.059038673 = weight(abstract_txt:text in 3567) [ClassicSimilarity], result of:
            0.059038673 = score(doc=3567,freq=1.0), product of:
              0.11679648 = queryWeight, product of:
                1.3911831 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020761017 = queryNorm
              0.5054833 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=3567)
          0.070617095 = weight(abstract_txt:document in 3567) [ClassicSimilarity], result of:
            0.070617095 = score(doc=3567,freq=1.0), product of:
              0.13160688 = queryWeight, product of:
                1.476756 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020761017 = queryNorm
              0.53657603 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.125 = fieldNorm(doc=3567)
          0.09561784 = weight(abstract_txt:index in 3567) [ClassicSimilarity], result of:
            0.09561784 = score(doc=3567,freq=1.0), product of:
              0.16107617 = queryWeight, product of:
                1.633748 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.020761017 = queryNorm
              0.59361875 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.125 = fieldNorm(doc=3567)
          0.10649924 = weight(abstract_txt:full in 3567) [ClassicSimilarity], result of:
            0.10649924 = score(doc=3567,freq=1.0), product of:
              0.17307581 = queryWeight, product of:
                1.6935093 = boost
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.020761017 = queryNorm
              0.6153329 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.125 = fieldNorm(doc=3567)
        0.24 = coord(6/25)
    
  2. Preston, L.A.; Ebbs, C.M.; Luther, J.: 'Full text' access evaluation : are we getting the real thing? (1998) 0.13
    0.13119264 = sum of:
      0.13119264 = product of:
        0.546636 = sum of:
          0.075254865 = weight(abstract_txt:access in 2695) [ClassicSimilarity], result of:
            0.075254865 = score(doc=2695,freq=3.0), product of:
              0.095203884 = queryWeight, product of:
                1.2560205 = boost
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.020761017 = queryNorm
              0.79046005 = fieldWeight in 2695, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.125 = fieldNorm(doc=2695)
          0.08349329 = weight(abstract_txt:text in 2695) [ClassicSimilarity], result of:
            0.08349329 = score(doc=2695,freq=2.0), product of:
              0.11679648 = queryWeight, product of:
                1.3911831 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020761017 = queryNorm
              0.7148614 = fieldWeight in 2695, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=2695)
          0.070617095 = weight(abstract_txt:document in 2695) [ClassicSimilarity], result of:
            0.070617095 = score(doc=2695,freq=1.0), product of:
              0.13160688 = queryWeight, product of:
                1.476756 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020761017 = queryNorm
              0.53657603 = fieldWeight in 2695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.125 = fieldNorm(doc=2695)
          0.07104027 = weight(abstract_txt:available in 2695) [ClassicSimilarity], result of:
            0.07104027 = score(doc=2695,freq=1.0), product of:
              0.13213213 = queryWeight, product of:
                1.4796999 = boost
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.020761017 = queryNorm
              0.5376457 = fieldWeight in 2695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.125 = fieldNorm(doc=2695)
          0.09561784 = weight(abstract_txt:index in 2695) [ClassicSimilarity], result of:
            0.09561784 = score(doc=2695,freq=1.0), product of:
              0.16107617 = queryWeight, product of:
                1.633748 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.020761017 = queryNorm
              0.59361875 = fieldWeight in 2695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.125 = fieldNorm(doc=2695)
          0.15061267 = weight(abstract_txt:full in 2695) [ClassicSimilarity], result of:
            0.15061267 = score(doc=2695,freq=2.0), product of:
              0.17307581 = queryWeight, product of:
                1.6935093 = boost
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.020761017 = queryNorm
              0.87021214 = fieldWeight in 2695, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.125 = fieldNorm(doc=2695)
        0.24 = coord(6/25)
    
  3. Veenema, F.: To index or not to index (1996) 0.11
    0.10777133 = sum of:
      0.10777133 = product of:
        0.4490472 = sum of:
          0.05165884 = weight(abstract_txt:text in 7247) [ClassicSimilarity], result of:
            0.05165884 = score(doc=7247,freq=1.0), product of:
              0.11679648 = queryWeight, product of:
                1.3911831 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020761017 = queryNorm
              0.4422979 = fieldWeight in 7247, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.109375 = fieldNorm(doc=7247)
          0.05468367 = weight(abstract_txt:documents in 7247) [ClassicSimilarity], result of:
            0.05468367 = score(doc=7247,freq=1.0), product of:
              0.12131237 = queryWeight, product of:
                1.4178228 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020761017 = queryNorm
              0.45076746 = fieldWeight in 7247, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.109375 = fieldNorm(doc=7247)
          0.0873842 = weight(abstract_txt:document in 7247) [ClassicSimilarity], result of:
            0.0873842 = score(doc=7247,freq=2.0), product of:
              0.13160688 = queryWeight, product of:
                1.476756 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020761017 = queryNorm
              0.663979 = fieldWeight in 7247, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.109375 = fieldNorm(doc=7247)
          0.078468055 = weight(abstract_txt:collection in 7247) [ClassicSimilarity], result of:
            0.078468055 = score(doc=7247,freq=1.0), product of:
              0.15433411 = queryWeight, product of:
                1.5991912 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.020761017 = queryNorm
              0.50842977 = fieldWeight in 7247, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.109375 = fieldNorm(doc=7247)
          0.0836656 = weight(abstract_txt:index in 7247) [ClassicSimilarity], result of:
            0.0836656 = score(doc=7247,freq=1.0), product of:
              0.16107617 = queryWeight, product of:
                1.633748 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.020761017 = queryNorm
              0.5194164 = fieldWeight in 7247, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.109375 = fieldNorm(doc=7247)
          0.09318683 = weight(abstract_txt:full in 7247) [ClassicSimilarity], result of:
            0.09318683 = score(doc=7247,freq=1.0), product of:
              0.17307581 = queryWeight, product of:
                1.6935093 = boost
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.020761017 = queryNorm
              0.53841627 = fieldWeight in 7247, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.109375 = fieldNorm(doc=7247)
        0.24 = coord(6/25)
    
  4. Lu, K.; Mao, J.; Li, G.: Toward effective automated weighted subject indexing : a comparison of different approaches in different environments (2018) 0.10
    0.104703814 = sum of:
      0.104703814 = product of:
        0.3739422 = sum of:
          0.07536678 = weight(abstract_txt:abstract in 4292) [ClassicSimilarity], result of:
            0.07536678 = score(doc=4292,freq=2.0), product of:
              0.13744386 = queryWeight, product of:
                1.0671294 = boost
                6.203826 = idf(docFreq=242, maxDocs=44218)
                0.020761017 = queryNorm
              0.5483459 = fieldWeight in 4292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.203826 = idf(docFreq=242, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.09324187 = weight(abstract_txt:descriptors in 4292) [ClassicSimilarity], result of:
            0.09324187 = score(doc=4292,freq=2.0), product of:
              0.15839666 = queryWeight, product of:
                1.1455853 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.020761017 = queryNorm
              0.5886606 = fieldWeight in 4292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.02172421 = weight(abstract_txt:access in 4292) [ClassicSimilarity], result of:
            0.02172421 = score(doc=4292,freq=1.0), product of:
              0.095203884 = queryWeight, product of:
                1.2560205 = boost
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.020761017 = queryNorm
              0.22818616 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.041746646 = weight(abstract_txt:text in 4292) [ClassicSimilarity], result of:
            0.041746646 = score(doc=4292,freq=2.0), product of:
              0.11679648 = queryWeight, product of:
                1.3911831 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020761017 = queryNorm
              0.3574307 = fieldWeight in 4292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.031247811 = weight(abstract_txt:documents in 4292) [ClassicSimilarity], result of:
            0.031247811 = score(doc=4292,freq=1.0), product of:
              0.12131237 = queryWeight, product of:
                1.4178228 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020761017 = queryNorm
              0.2575814 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.035308547 = weight(abstract_txt:document in 4292) [ClassicSimilarity], result of:
            0.035308547 = score(doc=4292,freq=1.0), product of:
              0.13160688 = queryWeight, product of:
                1.476756 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020761017 = queryNorm
              0.26828802 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
          0.07530633 = weight(abstract_txt:full in 4292) [ClassicSimilarity], result of:
            0.07530633 = score(doc=4292,freq=2.0), product of:
              0.17307581 = queryWeight, product of:
                1.6935093 = boost
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.020761017 = queryNorm
              0.43510607 = fieldWeight in 4292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.0625 = fieldNorm(doc=4292)
        0.28 = coord(7/25)
    
  5. Mischo, W.H.: Expanded subject access to reference collection materials (1979) 0.10
    0.0963309 = sum of:
      0.0963309 = product of:
        0.48165452 = sum of:
          0.115380935 = weight(abstract_txt:descriptors in 837) [ClassicSimilarity], result of:
            0.115380935 = score(doc=837,freq=1.0), product of:
              0.15839666 = queryWeight, product of:
                1.1455853 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.020761017 = queryNorm
              0.72843033 = fieldWeight in 837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.109375 = fieldNorm(doc=837)
          0.11571984 = weight(abstract_txt:expanded in 837) [ClassicSimilarity], result of:
            0.11571984 = score(doc=837,freq=1.0), product of:
              0.15870668 = queryWeight, product of:
                1.1467059 = boost
                6.666449 = idf(docFreq=152, maxDocs=44218)
                0.020761017 = queryNorm
              0.72914284 = fieldWeight in 837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.666449 = idf(docFreq=152, maxDocs=44218)
                0.109375 = fieldNorm(doc=837)
          0.053764675 = weight(abstract_txt:access in 837) [ClassicSimilarity], result of:
            0.053764675 = score(doc=837,freq=2.0), product of:
              0.095203884 = queryWeight, product of:
                1.2560205 = boost
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.020761017 = queryNorm
              0.56473196 = fieldWeight in 837, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.109375 = fieldNorm(doc=837)
          0.078468055 = weight(abstract_txt:collection in 837) [ClassicSimilarity], result of:
            0.078468055 = score(doc=837,freq=1.0), product of:
              0.15433411 = queryWeight, product of:
                1.5991912 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.020761017 = queryNorm
              0.50842977 = fieldWeight in 837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.109375 = fieldNorm(doc=837)
          0.11832103 = weight(abstract_txt:index in 837) [ClassicSimilarity], result of:
            0.11832103 = score(doc=837,freq=2.0), product of:
              0.16107617 = queryWeight, product of:
                1.633748 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.020761017 = queryNorm
              0.7345657 = fieldWeight in 837, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.109375 = fieldNorm(doc=837)
        0.2 = coord(5/25)