Document (#2291)

MacCain, K.W.
Descriptor and citation retrieval in the medical behavioral sciences literature : retrieval overlaps and novelty distribution
Journal of the American Society for Information Science. 40(1989), pp. 110-114
Search results for nine topics in the medical behavioral sciences are reanalyzed to compare the overall performance of descriptor and citation search strategies in identifying relevant and novel documents. Overlap percentages between an aggregate "descriptor-based" database (MEDLINE, EXCERPTA MEDICA, PSYCINFO) and an aggregate "citation-based" database (SCISEARCH, SOCIAL SCISEARCH) ranged from 1% to 26%, with a median overlap of 8% relevant retrievals found using both search strategies. For seven topics in which both descriptor and citation strategies produced reasonably substantial retrievals, two patterns of search performance and novelty distribution were observed: (1) where descriptor and citation retrieval showed little overlap, novelty retrieval percentages differed by 17-23% between the two strategies; (2) topics with a relatively high percentage retrieval overlap showed little difference (1-4%) in descriptor and citation novelty retrieval percentages. These results reflect the varying partial congruence of two literature networks and represent two different types of subject relevance.
Citation indexing

Similar documents (content)

  1. MacCain, K.W.; White, H.D.; Griffith, B.C.: Comparing retrieval performance in online data bases (1987) 0.39
    0.3894196 = sum of:
      0.3894196 = product of:
        1.0817211 = sum of:
          0.02764321 = weight(abstract_txt:relevant in 1167) [ClassicSimilarity], result of:
            0.02764321 = score(doc=1167,freq=2.0), product of:
              0.06746708 = queryWeight, product of:
                1.0935472 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.013309225 = queryNorm
              0.40972885 = fieldWeight in 1167, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0625 = fieldNorm(doc=1167)
          0.028311647 = weight(abstract_txt:sciences in 1167) [ClassicSimilarity], result of:
            0.028311647 = score(doc=1167,freq=1.0), product of:
              0.08636803 = queryWeight, product of:
                1.2372804 = boost
                5.244838 = idf(docFreq=633, maxDocs=44218)
                0.013309225 = queryNorm
              0.3278024 = fieldWeight in 1167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.244838 = idf(docFreq=633, maxDocs=44218)
                0.0625 = fieldNorm(doc=1167)
          0.038501076 = weight(abstract_txt:medical in 1167) [ClassicSimilarity], result of:
            0.038501076 = score(doc=1167,freq=1.0), product of:
              0.10601277 = queryWeight, product of:
                1.3707893 = boost
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.013309225 = queryNorm
              0.36317396 = fieldWeight in 1167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.0625 = fieldNorm(doc=1167)
          0.07419252 = weight(abstract_txt:behavioral in 1167) [ClassicSimilarity], result of:
            0.07419252 = score(doc=1167,freq=1.0), product of:
              0.16416591 = queryWeight, product of:
                1.7058196 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.013309225 = queryNorm
              0.4519362 = fieldWeight in 1167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.0625 = fieldNorm(doc=1167)
          0.067080885 = weight(abstract_txt:topics in 1167) [ClassicSimilarity], result of:
            0.067080885 = score(doc=1167,freq=3.0), product of:
              0.12183314 = queryWeight, product of:
                1.7997822 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.013309225 = queryNorm
              0.55059636 = fieldWeight in 1167, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.0625 = fieldNorm(doc=1167)
          0.12801129 = weight(abstract_txt:retrievals in 1167) [ClassicSimilarity], result of:
            0.12801129 = score(doc=1167,freq=2.0), product of:
              0.18744123 = queryWeight, product of:
                1.8227377 = boost
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.013309225 = queryNorm
              0.68294096 = fieldWeight in 1167, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.0625 = fieldNorm(doc=1167)
          0.18606833 = weight(abstract_txt:scisearch in 1167) [ClassicSimilarity], result of:
            0.18606833 = score(doc=1167,freq=2.0), product of:
              0.24051817 = queryWeight, product of:
                2.064741 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.013309225 = queryNorm
              0.7736144 = fieldWeight in 1167, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0625 = fieldNorm(doc=1167)
          0.254169 = weight(abstract_txt:novelty in 1167) [ClassicSimilarity], result of:
            0.254169 = score(doc=1167,freq=2.0), product of:
              0.37307084 = queryWeight, product of:
                3.6366565 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.013309225 = queryNorm
              0.68128884 = fieldWeight in 1167, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0625 = fieldNorm(doc=1167)
          0.27774304 = weight(abstract_txt:descriptor in 1167) [ClassicSimilarity], result of:
            0.27774304 = score(doc=1167,freq=1.0), product of:
              0.570837 = queryWeight, product of:
                5.509451 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.013309225 = queryNorm
              0.48655403 = fieldWeight in 1167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0625 = fieldNorm(doc=1167)
        0.36 = coord(9/25)
  2. Roberts, D.; Souter, C.: The automation of controlled vocabulary subject indexing of medical journal articles (2000) 0.13
    0.12809156 = sum of:
      0.12809156 = product of:
        0.53371483 = sum of:
          0.0271955 = weight(abstract_txt:database in 711) [ClassicSimilarity], result of:
            0.0271955 = score(doc=711,freq=2.0), product of:
              0.05751189 = queryWeight, product of:
                1.009649 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.013309225 = queryNorm
              0.47286746 = fieldWeight in 711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
          0.021086777 = weight(abstract_txt:literature in 711) [ClassicSimilarity], result of:
            0.021086777 = score(doc=711,freq=1.0), product of:
              0.06115656 = queryWeight, product of:
                1.0411496 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.013309225 = queryNorm
              0.3447999 = fieldWeight in 711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
          0.08335728 = weight(abstract_txt:medical in 711) [ClassicSimilarity], result of:
            0.08335728 = score(doc=711,freq=3.0), product of:
              0.10601277 = queryWeight, product of:
                1.3707893 = boost
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.013309225 = queryNorm
              0.7862947 = fieldWeight in 711, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
          0.024013674 = weight(abstract_txt:search in 711) [ClassicSimilarity], result of:
            0.024013674 = score(doc=711,freq=1.0), product of:
              0.08402696 = queryWeight, product of:
                1.7259012 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.013309225 = queryNorm
              0.28578535 = fieldWeight in 711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
          0.030882806 = weight(abstract_txt:retrieval in 711) [ClassicSimilarity], result of:
            0.030882806 = score(doc=711,freq=1.0), product of:
              0.113750815 = queryWeight, product of:
                2.4594018 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.013309225 = queryNorm
              0.27149525 = fieldWeight in 711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
          0.34717882 = weight(abstract_txt:descriptor in 711) [ClassicSimilarity], result of:
            0.34717882 = score(doc=711,freq=1.0), product of:
              0.570837 = queryWeight, product of:
                5.509451 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.013309225 = queryNorm
              0.60819256 = fieldWeight in 711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.078125 = fieldNorm(doc=711)
        0.24 = coord(6/25)
  3. Tsay, M.-y.: Literature growth, journal characteristics, and author productivity in subject indexing, 1977 to 2000 (2004) 0.12
    0.12302927 = sum of:
      0.12302927 = product of:
        0.43939024 = sum of:
          0.0153840985 = weight(abstract_txt:database in 2070) [ClassicSimilarity], result of:
            0.0153840985 = score(doc=2070,freq=1.0), product of:
              0.05751189 = queryWeight, product of:
                1.009649 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.013309225 = queryNorm
              0.26749423 = fieldWeight in 2070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.0625 = fieldNorm(doc=2070)
          0.029218692 = weight(abstract_txt:literature in 2070) [ClassicSimilarity], result of:
            0.029218692 = score(doc=2070,freq=3.0), product of:
              0.06115656 = queryWeight, product of:
                1.0411496 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.013309225 = queryNorm
              0.47776875 = fieldWeight in 2070, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.0625 = fieldNorm(doc=2070)
          0.03439808 = weight(abstract_txt:distribution in 2070) [ClassicSimilarity], result of:
            0.03439808 = score(doc=2070,freq=1.0), product of:
              0.09834049 = queryWeight, product of:
                1.3202549 = boost
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.013309225 = queryNorm
              0.3497855 = fieldWeight in 2070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.0625 = fieldNorm(doc=2070)
          0.01921094 = weight(abstract_txt:search in 2070) [ClassicSimilarity], result of:
            0.01921094 = score(doc=2070,freq=1.0), product of:
              0.08402696 = queryWeight, product of:
                1.7259012 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.013309225 = queryNorm
              0.22862828 = fieldWeight in 2070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=2070)
          0.038729165 = weight(abstract_txt:topics in 2070) [ClassicSimilarity], result of:
            0.038729165 = score(doc=2070,freq=1.0), product of:
              0.12183314 = queryWeight, product of:
                1.7997822 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.013309225 = queryNorm
              0.31788695 = fieldWeight in 2070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.0625 = fieldNorm(doc=2070)
          0.024706246 = weight(abstract_txt:retrieval in 2070) [ClassicSimilarity], result of:
            0.024706246 = score(doc=2070,freq=1.0), product of:
              0.113750815 = queryWeight, product of:
                2.4594018 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.013309225 = queryNorm
              0.21719621 = fieldWeight in 2070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2070)
          0.27774304 = weight(abstract_txt:descriptor in 2070) [ClassicSimilarity], result of:
            0.27774304 = score(doc=2070,freq=1.0), product of:
              0.570837 = queryWeight, product of:
                5.509451 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.013309225 = queryNorm
              0.48655403 = fieldWeight in 2070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0625 = fieldNorm(doc=2070)
        0.28 = coord(7/25)
  4. Krause, J.: Current research information as part of digital libraries and the heterogeneity problem : integrated searches in the context of databases with different content analyses (2002) 0.12
    0.12296605 = sum of:
      0.12296605 = product of:
        0.43916446 = sum of:
          0.0163173 = weight(abstract_txt:database in 3593) [ClassicSimilarity], result of:
            0.0163173 = score(doc=3593,freq=2.0), product of:
              0.05751189 = queryWeight, product of:
                1.009649 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.013309225 = queryNorm
              0.28372046 = fieldWeight in 3593, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.046875 = fieldNorm(doc=3593)
          0.017892722 = weight(abstract_txt:literature in 3593) [ClassicSimilarity], result of:
            0.017892722 = score(doc=3593,freq=2.0), product of:
              0.06115656 = queryWeight, product of:
                1.0411496 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.013309225 = queryNorm
              0.2925724 = fieldWeight in 3593, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.046875 = fieldNorm(doc=3593)
          0.020732407 = weight(abstract_txt:relevant in 3593) [ClassicSimilarity], result of:
            0.020732407 = score(doc=3593,freq=2.0), product of:
              0.06746708 = queryWeight, product of:
                1.0935472 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.013309225 = queryNorm
              0.30729663 = fieldWeight in 3593, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.046875 = fieldNorm(doc=3593)
          0.021233736 = weight(abstract_txt:sciences in 3593) [ClassicSimilarity], result of:
            0.021233736 = score(doc=3593,freq=1.0), product of:
              0.08636803 = queryWeight, product of:
                1.2372804 = boost
                5.244838 = idf(docFreq=633, maxDocs=44218)
                0.013309225 = queryNorm
              0.24585178 = fieldWeight in 3593, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.244838 = idf(docFreq=633, maxDocs=44218)
                0.046875 = fieldNorm(doc=3593)
          0.02881641 = weight(abstract_txt:search in 3593) [ClassicSimilarity], result of:
            0.02881641 = score(doc=3593,freq=4.0), product of:
              0.08402696 = queryWeight, product of:
                1.7259012 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.013309225 = queryNorm
              0.34294242 = fieldWeight in 3593, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.046875 = fieldNorm(doc=3593)
          0.03958091 = weight(abstract_txt:strategies in 3593) [ClassicSimilarity], result of:
            0.03958091 = score(doc=3593,freq=1.0), product of:
              0.16481723 = queryWeight, product of:
                2.417174 = boost
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.013309225 = queryNorm
              0.24015033 = fieldWeight in 3593, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.046875 = fieldNorm(doc=3593)
          0.29459098 = weight(abstract_txt:descriptor in 3593) [ClassicSimilarity], result of:
            0.29459098 = score(doc=3593,freq=2.0), product of:
              0.570837 = queryWeight, product of:
                5.509451 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.013309225 = queryNorm
              0.51606846 = fieldWeight in 3593, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.046875 = fieldNorm(doc=3593)
        0.28 = coord(7/25)
  5. Hallet, K.S.: Separate but equal? : A system comparison study of MEDLINE's controlled vocabulary MeSH (1998) 0.11
    0.11321211 = sum of:
      0.11321211 = product of:
        0.56606054 = sum of:
          0.038501076 = weight(abstract_txt:medical in 3553) [ClassicSimilarity], result of:
            0.038501076 = score(doc=3553,freq=1.0), product of:
              0.10601277 = queryWeight, product of:
                1.3707893 = boost
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.013309225 = queryNorm
              0.36317396 = fieldWeight in 3553, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.0625 = fieldNorm(doc=3553)
          0.047057003 = weight(abstract_txt:search in 3553) [ClassicSimilarity], result of:
            0.047057003 = score(doc=3553,freq=6.0), product of:
              0.08402696 = queryWeight, product of:
                1.7259012 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.013309225 = queryNorm
              0.56002265 = fieldWeight in 3553, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=3553)
          0.05277455 = weight(abstract_txt:strategies in 3553) [ClassicSimilarity], result of:
            0.05277455 = score(doc=3553,freq=1.0), product of:
              0.16481723 = queryWeight, product of:
                2.417174 = boost
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.013309225 = queryNorm
              0.32020044 = fieldWeight in 3553, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.0625 = fieldNorm(doc=3553)
          0.034939907 = weight(abstract_txt:retrieval in 3553) [ClassicSimilarity], result of:
            0.034939907 = score(doc=3553,freq=2.0), product of:
              0.113750815 = queryWeight, product of:
                2.4594018 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.013309225 = queryNorm
              0.3071618 = fieldWeight in 3553, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=3553)
          0.39278796 = weight(abstract_txt:descriptor in 3553) [ClassicSimilarity], result of:
            0.39278796 = score(doc=3553,freq=2.0), product of:
              0.570837 = queryWeight, product of:
                5.509451 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.013309225 = queryNorm
              0.6880913 = fieldWeight in 3553, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0625 = fieldNorm(doc=3553)
        0.2 = coord(5/25)
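
The score trees above follow the pattern of Lucene's ClassicSimilarity (TF-IDF): each matching term contributes queryWeight × fieldWeight, and the summed contributions are scaled by the coord factor (matched query terms / total query terms). As a rough sketch, the arithmetic can be reproduced as shown here. The function names are illustrative and are not part of Lucene's API; the input values are copied from the first listing (doc 1167). Results may differ from the listing in the last digits, since Lucene computes in 32-bit floats.

```python
import math


def idf(doc_freq, max_docs):
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score contribution of one term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                      # tf(freq) = sqrt(termFreq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, matched_terms, query_terms):
    """Sum of the term contributions, scaled by coord(matched/total)."""
    return sum(term_scores) * (matched_terms / query_terms)


# Term "relevant" in doc 1167; the listing reports 0.02764321.
relevant = term_score(freq=2.0, doc_freq=1165, max_docs=44218,
                      boost=1.0935472, query_norm=0.013309225,
                      field_norm=0.0625)
print(relevant)

# All nine term contributions listed for doc 1167, with coord(9/25);
# the listing reports 0.3894196.
doc_1167_terms = [0.02764321, 0.028311647, 0.038501076, 0.07419252,
                  0.067080885, 0.12801129, 0.18606833, 0.254169,
                  0.27774304]
print(document_score(doc_1167_terms, matched_terms=9, query_terms=25))
```

The same functions apply to the other four listings; only the per-term inputs and the coord fraction change.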