Document (#1605)

Author
Byrne, J.R.
Title
Relative effectiveness of titles, abstracts, and subject headings for machine retrieval from the COMPENDEX services
Source
Journal of the American Society for Information Science. 26(1975), pp. 223-229
Year
1975
Abstract
We have investigated the relative merits of searching on titles, subject headings, abstracts, free-language terms, and combinations of these elements. The COMPENDEX data base was used for this study since it combined all of these data elements of interest. In general, the results obtained from the experiments indicate that, as expected, titles alone are not satisfactory for efficient retrieval. The combination of titles and abstracts came the closest to 100% retrieval, with searching of abstracts alone doing almost as well. Indexer input, although necessary for 100% retrieval in almost all cases, was found to be relatively unimportant.
Theme
Retrieval studies
Object
COMPENDEX

Similar documents (content)

  1. Orton, D.: Database review : engineering (1995) 0.14
    0.14263569 = sum of:
      0.14263569 = product of:
        1.1886308 = sum of:
          0.06659756 = weight(abstract_txt:searching in 3932) [ClassicSimilarity], result of:
            0.06659756 = score(doc=3932,freq=1.0), product of:
              0.09989244 = queryWeight, product of:
                1.4222316 = boost
                4.2668333 = idf(docFreq=1612, maxDocs=42306)
                0.016461015 = queryNorm
              0.66669273 = fieldWeight in 3932, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2668333 = idf(docFreq=1612, maxDocs=42306)
                0.15625 = fieldNorm(doc=3932)
          0.21207127 = weight(abstract_txt:almost in 3932) [ClassicSimilarity], result of:
            0.21207127 = score(doc=3932,freq=1.0), product of:
              0.21621291 = queryWeight, product of:
                2.0924006 = boost
                6.2774057 = idf(docFreq=215, maxDocs=42306)
                0.016461015 = queryNorm
              0.9808446 = fieldWeight in 3932, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2774057 = idf(docFreq=215, maxDocs=42306)
                0.15625 = fieldNorm(doc=3932)
          0.909962 = weight(abstract_txt:compendex in 3932) [ClassicSimilarity], result of:
            0.909962 = score(doc=3932,freq=2.0), product of:
              0.45314017 = queryWeight, product of:
                3.0291467 = boost
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.016461015 = queryNorm
              2.0081248 = fieldWeight in 3932, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.15625 = fieldNorm(doc=3932)
        0.12 = coord(3/25)
    
  2. Hook, P.A.; Gantchev, A.: Using combined metadata sources to visualize a small library (OBL's English Language Books) (2017) 0.12
    0.12297937 = sum of:
      0.12297937 = product of:
        0.43921205 = sum of:
          0.064154625 = weight(abstract_txt:combined in 789) [ClassicSimilarity], result of:
            0.064154625 = score(doc=789,freq=3.0), product of:
              0.0987693 = queryWeight, product of:
                6.000195 = idf(docFreq=284, maxDocs=42306)
                0.016461015 = queryNorm
              0.6495401 = fieldWeight in 789, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.000195 = idf(docFreq=284, maxDocs=42306)
                0.0625 = fieldNorm(doc=789)
          0.011523313 = weight(abstract_txt:these in 789) [ClassicSimilarity], result of:
            0.011523313 = score(doc=789,freq=1.0), product of:
              0.057135403 = queryWeight, product of:
                1.0756146 = boost
                3.2269485 = idf(docFreq=4562, maxDocs=42306)
                0.016461015 = queryNorm
              0.20168428 = fieldWeight in 789, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2269485 = idf(docFreq=4562, maxDocs=42306)
                0.0625 = fieldNorm(doc=789)
          0.029680116 = weight(abstract_txt:data in 789) [ClassicSimilarity], result of:
            0.029680116 = score(doc=789,freq=5.0), product of:
              0.06278282 = queryWeight, product of:
                1.1275204 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.016461015 = queryNorm
              0.47274268 = fieldWeight in 789, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.0625 = fieldNorm(doc=789)
          0.045816448 = weight(abstract_txt:subject in 789) [ClassicSimilarity], result of:
            0.045816448 = score(doc=789,freq=5.0), product of:
              0.083858036 = queryWeight, product of:
                1.3030958 = boost
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.016461015 = queryNorm
              0.5463573 = fieldWeight in 789, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.0625 = fieldNorm(doc=789)
          0.0811963 = weight(abstract_txt:headings in 789) [ClassicSimilarity], result of:
            0.0811963 = score(doc=789,freq=3.0), product of:
              0.14560317 = queryWeight, product of:
                1.7170756 = boost
                5.1513944 = idf(docFreq=665, maxDocs=42306)
                0.016461015 = queryNorm
              0.5576548 = fieldWeight in 789, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1513944 = idf(docFreq=665, maxDocs=42306)
                0.0625 = fieldNorm(doc=789)
          0.078275844 = weight(abstract_txt:relative in 789) [ClassicSimilarity], result of:
            0.078275844 = score(doc=789,freq=1.0), product of:
              0.20493002 = queryWeight, product of:
                2.037074 = boost
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.016461015 = queryNorm
              0.3819638 = fieldWeight in 789, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.0625 = fieldNorm(doc=789)
          0.1285654 = weight(abstract_txt:titles in 789) [ClassicSimilarity], result of:
            0.1285654 = score(doc=789,freq=1.0), product of:
              0.359429 = queryWeight, product of:
                3.8152726 = boost
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.016461015 = queryNorm
              0.35769343 = fieldWeight in 789, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.0625 = fieldNorm(doc=789)
        0.28 = coord(7/25)
    
  3. Ekmekcioglu, F.C.; Robertson, A.M.; Willett, P.: Effectiveness of query expansion in ranked-output document retrieval systems (1992) 0.12
    0.12040239 = sum of:
      0.12040239 = product of:
        0.6020119 = sum of:
          0.020165797 = weight(abstract_txt:these in 690) [ClassicSimilarity], result of:
            0.020165797 = score(doc=690,freq=1.0), product of:
              0.057135403 = queryWeight, product of:
                1.0756146 = boost
                3.2269485 = idf(docFreq=4562, maxDocs=42306)
                0.016461015 = queryNorm
              0.3529475 = fieldWeight in 690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2269485 = idf(docFreq=4562, maxDocs=42306)
                0.109375 = fieldNorm(doc=690)
          0.032849867 = weight(abstract_txt:data in 690) [ClassicSimilarity], result of:
            0.032849867 = score(doc=690,freq=2.0), product of:
              0.06278282 = queryWeight, product of:
                1.1275204 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.016461015 = queryNorm
              0.5232302 = fieldWeight in 690, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.109375 = fieldNorm(doc=690)
          0.07046833 = weight(abstract_txt:retrieval in 690) [ClassicSimilarity], result of:
            0.07046833 = score(doc=690,freq=2.0), product of:
              0.13157025 = queryWeight, product of:
                2.3083298 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.016461015 = queryNorm
              0.5355947 = fieldWeight in 690, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.109375 = fieldNorm(doc=690)
          0.22498944 = weight(abstract_txt:titles in 690) [ClassicSimilarity], result of:
            0.22498944 = score(doc=690,freq=1.0), product of:
              0.359429 = queryWeight, product of:
                3.8152726 = boost
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.016461015 = queryNorm
              0.6259635 = fieldWeight in 690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.109375 = fieldNorm(doc=690)
          0.25353846 = weight(abstract_txt:abstracts in 690) [ClassicSimilarity], result of:
            0.25353846 = score(doc=690,freq=1.0), product of:
              0.38922516 = queryWeight, product of:
                3.9702647 = boost
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.016461015 = queryNorm
              0.65139276 = fieldWeight in 690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.109375 = fieldNorm(doc=690)
        0.2 = coord(5/25)
    
  4. Roberts, D.; Souter, C.: The automation of controlled vocabulary subject indexing of medical journal articles (2000) 0.12
    0.115106344 = sum of:
      0.115106344 = product of:
        0.4796098 = sum of:
          0.049399514 = weight(abstract_txt:input in 1837) [ClassicSimilarity], result of:
            0.049399514 = score(doc=1837,freq=1.0), product of:
              0.10313012 = queryWeight, product of:
                1.0218374 = boost
                6.131223 = idf(docFreq=249, maxDocs=42306)
                0.016461015 = queryNorm
              0.47900182 = fieldWeight in 1837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.131223 = idf(docFreq=249, maxDocs=42306)
                0.078125 = fieldNorm(doc=1837)
          0.016591689 = weight(abstract_txt:data in 1837) [ClassicSimilarity], result of:
            0.016591689 = score(doc=1837,freq=1.0), product of:
              0.06278282 = queryWeight, product of:
                1.1275204 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.016461015 = queryNorm
              0.26427117 = fieldWeight in 1837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.078125 = fieldNorm(doc=1837)
          0.036221083 = weight(abstract_txt:subject in 1837) [ClassicSimilarity], result of:
            0.036221083 = score(doc=1837,freq=2.0), product of:
              0.083858036 = queryWeight, product of:
                1.3030958 = boost
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.016461015 = queryNorm
              0.43193337 = fieldWeight in 1837, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.078125 = fieldNorm(doc=1837)
          0.03559188 = weight(abstract_txt:retrieval in 1837) [ClassicSimilarity], result of:
            0.03559188 = score(doc=1837,freq=1.0), product of:
              0.13157025 = queryWeight, product of:
                2.3083298 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.016461015 = queryNorm
              0.2705162 = fieldWeight in 1837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.078125 = fieldNorm(doc=1837)
          0.16070674 = weight(abstract_txt:titles in 1837) [ClassicSimilarity], result of:
            0.16070674 = score(doc=1837,freq=1.0), product of:
              0.359429 = queryWeight, product of:
                3.8152726 = boost
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.016461015 = queryNorm
              0.4471168 = fieldWeight in 1837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.078125 = fieldNorm(doc=1837)
          0.1810989 = weight(abstract_txt:abstracts in 1837) [ClassicSimilarity], result of:
            0.1810989 = score(doc=1837,freq=1.0), product of:
              0.38922516 = queryWeight, product of:
                3.9702647 = boost
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.016461015 = queryNorm
              0.46528053 = fieldWeight in 1837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.078125 = fieldNorm(doc=1837)
        0.24 = coord(6/25)
    
  5. Voorbij, H.: Een goede titel behoeft geen trefwoord, of toch wel? : een vergelijkend onderzoek titelwoorden - trefwoorden (1997) 0.11
    0.1115005 = sum of:
      0.1115005 = product of:
        0.5575025 = sum of:
          0.06146922 = weight(abstract_txt:subject in 2447) [ClassicSimilarity], result of:
            0.06146922 = score(doc=2447,freq=4.0), product of:
              0.083858036 = queryWeight, product of:
                1.3030958 = boost
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.016461015 = queryNorm
              0.73301524 = fieldWeight in 2447, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9094145 = idf(docFreq=2305, maxDocs=42306)
                0.09375 = fieldNorm(doc=2447)
          0.039958537 = weight(abstract_txt:searching in 2447) [ClassicSimilarity], result of:
            0.039958537 = score(doc=2447,freq=1.0), product of:
              0.09989244 = queryWeight, product of:
                1.4222316 = boost
                4.2668333 = idf(docFreq=1612, maxDocs=42306)
                0.016461015 = queryNorm
              0.40001562 = fieldWeight in 2447, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2668333 = idf(docFreq=1612, maxDocs=42306)
                0.09375 = fieldNorm(doc=2447)
          0.14063613 = weight(abstract_txt:headings in 2447) [ClassicSimilarity], result of:
            0.14063613 = score(doc=2447,freq=4.0), product of:
              0.14560317 = queryWeight, product of:
                1.7170756 = boost
                5.1513944 = idf(docFreq=665, maxDocs=42306)
                0.016461015 = queryNorm
              0.9658865 = fieldWeight in 2447, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1513944 = idf(docFreq=665, maxDocs=42306)
                0.09375 = fieldNorm(doc=2447)
          0.042710256 = weight(abstract_txt:retrieval in 2447) [ClassicSimilarity], result of:
            0.042710256 = score(doc=2447,freq=1.0), product of:
              0.13157025 = queryWeight, product of:
                2.3083298 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.016461015 = queryNorm
              0.3246194 = fieldWeight in 2447, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.09375 = fieldNorm(doc=2447)
          0.27272838 = weight(abstract_txt:titles in 2447) [ClassicSimilarity], result of:
            0.27272838 = score(doc=2447,freq=2.0), product of:
              0.359429 = queryWeight, product of:
                3.8152726 = boost
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.016461015 = queryNorm
              0.7587824 = fieldWeight in 2447, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.09375 = fieldNorm(doc=2447)
        0.2 = coord(5/25)
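
The score trees above follow a regular pattern, which the sketch below recomputes for illustration. It assumes the formulas that the listed figures are consistent with: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final score equal to the sum of the term scores times coord. The function and variable names are hypothetical, and every number is copied from the listing. This is not the catalogue's own code.

```python
import math

MAX_DOCS = 42306          # maxDocs, as shown in every idf(...) line
QUERY_NORM = 0.016461015  # queryNorm, identical for all terms
QUERY_TERMS = 25          # denominator of coord(n/25)


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf formula; it reproduces the listed idf values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm):
    # One "weight(abstract_txt:term in doc)" node: queryWeight * fieldWeight.
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (termFreq, docFreq, boost) for the three terms matched in doc 3932
terms_3932 = [
    (1.0, 1612, 1.4222316),  # searching
    (1.0, 215, 2.0924006),   # almost
    (2.0, 12, 3.0291467),    # compendex
]
raw_sum = sum(term_score(f, df, b, 0.15625) for f, df, b in terms_3932)
score_3932 = raw_sum * len(terms_3932) / QUERY_TERMS  # apply coord(3/25)

# (sum of term scores, number of matched terms) as listed for each document
listed = {
    3932: (1.1886308, 3),
    789: (0.43921205, 7),
    690: (0.6020119, 5),
    1837: (0.4796098, 6),
    2447: (0.5575025, 5),
}
final = {doc: s * n / QUERY_TERMS for doc, (s, n) in listed.items()}
ranked = sorted(final, key=final.get, reverse=True)

print(round(score_3932, 4), ranked)
```

The ranking shows the effect of the coord factor. Doc 789 has the smallest raw sum of the five, yet it ranks second because it matches 7 of the 25 query terms.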