Document (#1605)

Author
Byrne, J.R.
Title
Relative effectiveness of titles, abstracts, and subject headings for machine retrieval from the COMPENDEX services
Source
Journal of the American Society for Information Science. 26(1975), S.223-229
Year
1975
Abstract
We have investigated the relative merits of searching on titles, subject headings, abstracts, free-language terms, and combinations of these elements. The COMPENDEX data base was used for this study since it combined all of these data elements of interest. In general, the results obtained from the experiments indicate that, as expected, titles alone are not satisfactory for efficient retrieval. The combination of titles and abstracts came the closest to 100% retrieval, with searching of abstracts alone doing almost as well. Indexer input, although necessary for 100% retrieval in almost all cases, was found to be relatively unimportant
Theme
Retrievalstudien
Object
COMPENDEX

Similar documents (content)

  1. Orton, D.: Database review : engineering (1995) 0.14
    0.14378376 = sum of:
      0.14378376 = product of:
        1.1981981 = sum of:
          0.06730968 = weight(abstract_txt:searching in 3863) [ClassicSimilarity], result of:
            0.06730968 = score(doc=3863,freq=1.0), product of:
              0.10053895 = queryWeight, product of:
                1.43541 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.016346892 = queryNorm
              0.6694886 = fieldWeight in 3863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.15625 = fieldNorm(doc=3863)
          0.20935313 = weight(abstract_txt:almost in 3863) [ClassicSimilarity], result of:
            0.20935313 = score(doc=3863,freq=1.0), product of:
              0.21422441 = queryWeight, product of:
                2.095286 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.016346892 = queryNorm
              0.9772608 = fieldWeight in 3863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.15625 = fieldNorm(doc=3863)
          0.9215352 = weight(abstract_txt:compendex in 3863) [ClassicSimilarity], result of:
            0.9215352 = score(doc=3863,freq=2.0), product of:
              0.45668203 = queryWeight, product of:
                3.0592556 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.016346892 = queryNorm
              2.0178924 = fieldWeight in 3863, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.15625 = fieldNorm(doc=3863)
        0.12 = coord(3/25)
    
  2. Hook, P.A.; Gantchev, A.: Using combined metadata sources to visualize a small library (OBL's English Language Books) (2017) 0.12
    0.12194258 = sum of:
      0.12194258 = product of:
        0.4355092 = sum of:
          0.06307107 = weight(abstract_txt:combined in 3870) [ClassicSimilarity], result of:
            0.06307107 = score(doc=3870,freq=3.0), product of:
              0.097591594 = queryWeight, product of:
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.016346892 = queryNorm
              0.6462757 = fieldWeight in 3870, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.0625 = fieldNorm(doc=3870)
          0.011091132 = weight(abstract_txt:these in 3870) [ClassicSimilarity], result of:
            0.011091132 = score(doc=3870,freq=1.0), product of:
              0.05566214 = queryWeight, product of:
                1.068043 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.016346892 = queryNorm
              0.19925809 = fieldWeight in 3870, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.0625 = fieldNorm(doc=3870)
          0.028422808 = weight(abstract_txt:data in 3870) [ClassicSimilarity], result of:
            0.028422808 = score(doc=3870,freq=5.0), product of:
              0.06095799 = queryWeight, product of:
                1.1176971 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016346892 = queryNorm
              0.46626878 = fieldWeight in 3870, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=3870)
          0.04564461 = weight(abstract_txt:subject in 3870) [ClassicSimilarity], result of:
            0.04564461 = score(doc=3870,freq=5.0), product of:
              0.08359475 = queryWeight, product of:
                1.3088753 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.016346892 = queryNorm
              0.5460225 = fieldWeight in 3870, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0625 = fieldNorm(doc=3870)
          0.08104743 = weight(abstract_txt:headings in 3870) [ClassicSimilarity], result of:
            0.08104743 = score(doc=3870,freq=3.0), product of:
              0.14533216 = queryWeight, product of:
                1.7257968 = boost
                5.1515374 = idf(docFreq=695, maxDocs=44218)
                0.016346892 = queryNorm
              0.5576703 = fieldWeight in 3870, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1515374 = idf(docFreq=695, maxDocs=44218)
                0.0625 = fieldNorm(doc=3870)
          0.077914305 = weight(abstract_txt:relative in 3870) [ClassicSimilarity], result of:
            0.077914305 = score(doc=3870,freq=1.0), product of:
              0.2041679 = queryWeight, product of:
                2.0455143 = boost
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.016346892 = queryNorm
              0.38161877 = fieldWeight in 3870, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.0625 = fieldNorm(doc=3870)
          0.12831782 = weight(abstract_txt:titles in 3870) [ClassicSimilarity], result of:
            0.12831782 = score(doc=3870,freq=1.0), product of:
              0.35873795 = queryWeight, product of:
                3.8345327 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.016346892 = queryNorm
              0.35769236 = fieldWeight in 3870, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.0625 = fieldNorm(doc=3870)
        0.28 = coord(7/25)
    
  3. Ekmekcioglu, F.C.; Robertson, A.M.; Willett, P.: Effectiveness of query expansion in ranked-output document retrieval systems (1992) 0.12
    0.12011831 = sum of:
      0.12011831 = product of:
        0.60059154 = sum of:
          0.01940948 = weight(abstract_txt:these in 5689) [ClassicSimilarity], result of:
            0.01940948 = score(doc=5689,freq=1.0), product of:
              0.05566214 = queryWeight, product of:
                1.068043 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.016346892 = queryNorm
              0.34870166 = fieldWeight in 5689, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.109375 = fieldNorm(doc=5689)
          0.031458285 = weight(abstract_txt:data in 5689) [ClassicSimilarity], result of:
            0.031458285 = score(doc=5689,freq=2.0), product of:
              0.06095799 = queryWeight, product of:
                1.1176971 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016346892 = queryNorm
              0.516065 = fieldWeight in 5689, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.109375 = fieldNorm(doc=5689)
          0.07109969 = weight(abstract_txt:retrieval in 5689) [ClassicSimilarity], result of:
            0.07109969 = score(doc=5689,freq=2.0), product of:
              0.13227034 = queryWeight, product of:
                2.3283863 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016346892 = queryNorm
              0.53753316 = fieldWeight in 5689, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=5689)
          0.22455621 = weight(abstract_txt:titles in 5689) [ClassicSimilarity], result of:
            0.22455621 = score(doc=5689,freq=1.0), product of:
              0.35873795 = queryWeight, product of:
                3.8345327 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.016346892 = queryNorm
              0.62596166 = fieldWeight in 5689, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.109375 = fieldNorm(doc=5689)
          0.25406787 = weight(abstract_txt:abstracts in 5689) [ClassicSimilarity], result of:
            0.25406787 = score(doc=5689,freq=1.0), product of:
              0.38951764 = queryWeight, product of:
                3.9956493 = boost
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.016346892 = queryNorm
              0.6522628 = fieldWeight in 5689, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.109375 = fieldNorm(doc=5689)
        0.2 = coord(5/25)
    
  4. Roberts, D.; Souter, C.: ¬The automation of controlled vocabulary subject indexing of medical journal articles (2000) 0.11
    0.1148941 = sum of:
      0.1148941 = product of:
        0.47872543 = sum of:
          0.04896627 = weight(abstract_txt:input in 711) [ClassicSimilarity], result of:
            0.04896627 = score(doc=711,freq=1.0), product of:
              0.102460705 = queryWeight, product of:
                1.0246427 = boost
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.016346892 = queryNorm
              0.47790292 = fieldWeight in 711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
          0.015888833 = weight(abstract_txt:data in 711) [ClassicSimilarity], result of:
            0.015888833 = score(doc=711,freq=1.0), product of:
              0.06095799 = queryWeight, product of:
                1.1176971 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016346892 = queryNorm
              0.26065218 = fieldWeight in 711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
          0.03608523 = weight(abstract_txt:subject in 711) [ClassicSimilarity], result of:
            0.03608523 = score(doc=711,freq=2.0), product of:
              0.08359475 = queryWeight, product of:
                1.3088753 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.016346892 = queryNorm
              0.43166864 = fieldWeight in 711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
          0.035910767 = weight(abstract_txt:retrieval in 711) [ClassicSimilarity], result of:
            0.035910767 = score(doc=711,freq=1.0), product of:
              0.13227034 = queryWeight, product of:
                2.3283863 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016346892 = queryNorm
              0.27149525 = fieldWeight in 711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
          0.16039728 = weight(abstract_txt:titles in 711) [ClassicSimilarity], result of:
            0.16039728 = score(doc=711,freq=1.0), product of:
              0.35873795 = queryWeight, product of:
                3.8345327 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.016346892 = queryNorm
              0.44711545 = fieldWeight in 711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
          0.18147705 = weight(abstract_txt:abstracts in 711) [ClassicSimilarity], result of:
            0.18147705 = score(doc=711,freq=1.0), product of:
              0.38951764 = queryWeight, product of:
                3.9956493 = boost
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.016346892 = queryNorm
              0.46590203 = fieldWeight in 711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
        0.24 = coord(6/25)
    
  5. Voorbij, H.: ¬Een goede titel behoeft geen trefwoord, of toch wel? : een vergelijkend oderzoek titelwoorden - trefwoorden (1997) 0.11
    0.11145977 = sum of:
      0.11145977 = product of:
        0.55729884 = sum of:
          0.06123867 = weight(abstract_txt:subject in 1446) [ClassicSimilarity], result of:
            0.06123867 = score(doc=1446,freq=4.0), product of:
              0.08359475 = queryWeight, product of:
                1.3088753 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.016346892 = queryNorm
              0.732566 = fieldWeight in 1446, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
          0.04038581 = weight(abstract_txt:searching in 1446) [ClassicSimilarity], result of:
            0.04038581 = score(doc=1446,freq=1.0), product of:
              0.10053895 = queryWeight, product of:
                1.43541 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.016346892 = queryNorm
              0.40169317 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
          0.14037827 = weight(abstract_txt:headings in 1446) [ClassicSimilarity], result of:
            0.14037827 = score(doc=1446,freq=4.0), product of:
              0.14533216 = queryWeight, product of:
                1.7257968 = boost
                5.1515374 = idf(docFreq=695, maxDocs=44218)
                0.016346892 = queryNorm
              0.9659133 = fieldWeight in 1446, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1515374 = idf(docFreq=695, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
          0.04309292 = weight(abstract_txt:retrieval in 1446) [ClassicSimilarity], result of:
            0.04309292 = score(doc=1446,freq=1.0), product of:
              0.13227034 = queryWeight, product of:
                2.3283863 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016346892 = queryNorm
              0.3257943 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
          0.27220318 = weight(abstract_txt:titles in 1446) [ClassicSimilarity], result of:
            0.27220318 = score(doc=1446,freq=2.0), product of:
              0.35873795 = queryWeight, product of:
                3.8345327 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.016346892 = queryNorm
              0.75878 = fieldWeight in 1446, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
        0.2 = coord(5/25)