Document (#6504)

Author
Paijmans, H.
Title
Comparing the document representation of two IR-systems : CLARIT and TOPIC
Source
Journal of the American Society for Information Science. 44(1993) no.7, pp.383-392
Year
1993
Abstract
Discusses the TOPIC and CLARIT information retrieval systems in terms of assigned versus derived and precoordinate versus postcoordinate indexing. Compares the document representation of the two systems. Reports on a test done on a small sample of Wall Street Journal articles. The positive results found for CLARIT in an earlier test on medical documents were not observed in this general database.
Theme
Automatic indexing
Object
CLARIT
TOPIC

Similar documents (content)

  1. O'Donnell, R.; Smeaton, A.F.: ¬A linguistic approach to information retrieval (1996) 0.19
    0.19076023 = sum of:
      0.19076023 = product of:
        0.6812865 = sum of:
          0.043721337 = weight(abstract_txt:reports in 2575) [ClassicSimilarity], result of:
            0.043721337 = score(doc=2575,freq=2.0), product of:
              0.08630054 = queryWeight, product of:
                1.024151 = boost
                4.5853753 = idf(docFreq=1225, maxDocs=44218)
                0.018377 = queryNorm
              0.5066172 = fieldWeight in 2575, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5853753 = idf(docFreq=1225, maxDocs=44218)
                0.078125 = fieldNorm(doc=2575)
          0.043766167 = weight(abstract_txt:journal in 2575) [ClassicSimilarity], result of:
            0.043766167 = score(doc=2575,freq=1.0), product of:
              0.10880618 = queryWeight, product of:
                1.1499634 = boost
                5.1486683 = idf(docFreq=697, maxDocs=44218)
                0.018377 = queryNorm
              0.4022397 = fieldWeight in 2575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1486683 = idf(docFreq=697, maxDocs=44218)
                0.078125 = fieldNorm(doc=2575)
          0.18090533 = weight(abstract_txt:street in 2575) [ClassicSimilarity], result of:
            0.18090533 = score(doc=2575,freq=1.0), product of:
              0.28023914 = queryWeight, product of:
                1.8455322 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.018377 = queryNorm
              0.6455391 = fieldWeight in 2575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.078125 = fieldNorm(doc=2575)
          0.18306749 = weight(abstract_txt:wall in 2575) [ClassicSimilarity], result of:
            0.18306749 = score(doc=2575,freq=1.0), product of:
              0.28246763 = queryWeight, product of:
                1.8528557 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.018377 = queryNorm
              0.64810073 = fieldWeight in 2575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.078125 = fieldNorm(doc=2575)
          0.10841961 = weight(abstract_txt:representation in 2575) [ClassicSimilarity], result of:
            0.10841961 = score(doc=2575,freq=2.0), product of:
              0.1992048 = queryWeight, product of:
                2.2005038 = boost
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.018377 = queryNorm
              0.54426205 = fieldWeight in 2575, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.078125 = fieldNorm(doc=2575)
          0.08319851 = weight(abstract_txt:topic in 2575) [ClassicSimilarity], result of:
            0.08319851 = score(doc=2575,freq=1.0), product of:
              0.21036893 = queryWeight, product of:
                2.2613254 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.018377 = queryNorm
              0.3954886 = fieldWeight in 2575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.078125 = fieldNorm(doc=2575)
          0.038208116 = weight(abstract_txt:systems in 2575) [ClassicSimilarity], result of:
            0.038208116 = score(doc=2575,freq=1.0), product of:
              0.14334154 = queryWeight, product of:
                2.2861457 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.018377 = queryNorm
              0.26655298 = fieldWeight in 2575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.078125 = fieldNorm(doc=2575)
        0.28 = coord(7/25)
    
  2. Foskett, A.C.: ¬The subject approach to information (1996) 0.19
    0.18672268 = sum of:
      0.18672268 = product of:
        1.1670167 = sum of:
          0.08517932 = weight(abstract_txt:derived in 749) [ClassicSimilarity], result of:
            0.08517932 = score(doc=749,freq=1.0), product of:
              0.13552892 = queryWeight, product of:
                1.2834331 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.018377 = queryNorm
              0.6284955 = fieldWeight in 749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.109375 = fieldNorm(doc=749)
          0.37219918 = weight(abstract_txt:postcoordinate in 749) [ClassicSimilarity], result of:
            0.37219918 = score(doc=749,freq=1.0), product of:
              0.362237 = queryWeight, product of:
                2.0982327 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.018377 = queryNorm
              1.0275018 = fieldWeight in 749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.109375 = fieldNorm(doc=749)
          0.6169884 = weight(abstract_txt:precoordinate in 749) [ClassicSimilarity], result of:
            0.6169884 = score(doc=749,freq=2.0), product of:
              0.4027021 = queryWeight, product of:
                2.2123263 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.018377 = queryNorm
              1.5321212 = fieldWeight in 749, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.109375 = fieldNorm(doc=749)
          0.09264976 = weight(abstract_txt:systems in 749) [ClassicSimilarity], result of:
            0.09264976 = score(doc=749,freq=3.0), product of:
              0.14334154 = queryWeight, product of:
                2.2861457 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.018377 = queryNorm
              0.64635664 = fieldWeight in 749, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.109375 = fieldNorm(doc=749)
        0.16 = coord(4/25)
    
  3. Ruocco, A.S.; Frieder, O.: Clustering and classification of large document bases in a parallel environment (1997) 0.17
    0.17189276 = sum of:
      0.17189276 = product of:
        0.71621984 = sum of:
          0.042082965 = weight(abstract_txt:articles in 1661) [ClassicSimilarity], result of:
            0.042082965 = score(doc=1661,freq=1.0), product of:
              0.09386664 = queryWeight, product of:
                1.0681024 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.018377 = queryNorm
              0.44832718 = fieldWeight in 1661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.09375 = fieldNorm(doc=1661)
          0.0525194 = weight(abstract_txt:journal in 1661) [ClassicSimilarity], result of:
            0.0525194 = score(doc=1661,freq=1.0), product of:
              0.10880618 = queryWeight, product of:
                1.1499634 = boost
                5.1486683 = idf(docFreq=697, maxDocs=44218)
                0.018377 = queryNorm
              0.48268765 = fieldWeight in 1661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1486683 = idf(docFreq=697, maxDocs=44218)
                0.09375 = fieldNorm(doc=1661)
          0.21708637 = weight(abstract_txt:street in 1661) [ClassicSimilarity], result of:
            0.21708637 = score(doc=1661,freq=1.0), product of:
              0.28023914 = queryWeight, product of:
                1.8455322 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.018377 = queryNorm
              0.7746469 = fieldWeight in 1661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.09375 = fieldNorm(doc=1661)
          0.219681 = weight(abstract_txt:wall in 1661) [ClassicSimilarity], result of:
            0.219681 = score(doc=1661,freq=1.0), product of:
              0.28246763 = queryWeight, product of:
                1.8528557 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.018377 = queryNorm
              0.7777209 = fieldWeight in 1661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.09375 = fieldNorm(doc=1661)
          0.10543611 = weight(abstract_txt:document in 1661) [ClassicSimilarity], result of:
            0.10543611 = score(doc=1661,freq=3.0), product of:
              0.15126422 = queryWeight, product of:
                1.9175221 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.018377 = queryNorm
              0.6970327 = fieldWeight in 1661, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.09375 = fieldNorm(doc=1661)
          0.07941408 = weight(abstract_txt:systems in 1661) [ClassicSimilarity], result of:
            0.07941408 = score(doc=1661,freq=3.0), product of:
              0.14334154 = queryWeight, product of:
                2.2861457 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.018377 = queryNorm
              0.55402 = fieldWeight in 1661, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.09375 = fieldNorm(doc=1661)
        0.24 = coord(6/25)
    
  4. Bjorner, S.: Let your fingers do the searching : call for the fax (1993) 0.14
    0.1416986 = sum of:
      0.1416986 = product of:
        0.8856163 = sum of:
          0.070138276 = weight(abstract_txt:articles in 6519) [ClassicSimilarity], result of:
            0.070138276 = score(doc=6519,freq=1.0), product of:
              0.09386664 = queryWeight, product of:
                1.0681024 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.018377 = queryNorm
              0.74721193 = fieldWeight in 6519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.15625 = fieldNorm(doc=6519)
          0.087532334 = weight(abstract_txt:journal in 6519) [ClassicSimilarity], result of:
            0.087532334 = score(doc=6519,freq=1.0), product of:
              0.10880618 = queryWeight, product of:
                1.1499634 = boost
                5.1486683 = idf(docFreq=697, maxDocs=44218)
                0.018377 = queryNorm
              0.8044794 = fieldWeight in 6519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1486683 = idf(docFreq=697, maxDocs=44218)
                0.15625 = fieldNorm(doc=6519)
          0.36181065 = weight(abstract_txt:street in 6519) [ClassicSimilarity], result of:
            0.36181065 = score(doc=6519,freq=1.0), product of:
              0.28023914 = queryWeight, product of:
                1.8455322 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.018377 = queryNorm
              1.2910782 = fieldWeight in 6519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.15625 = fieldNorm(doc=6519)
          0.36613497 = weight(abstract_txt:wall in 6519) [ClassicSimilarity], result of:
            0.36613497 = score(doc=6519,freq=1.0), product of:
              0.28246763 = queryWeight, product of:
                1.8528557 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.018377 = queryNorm
              1.2962015 = fieldWeight in 6519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.15625 = fieldNorm(doc=6519)
        0.16 = coord(4/25)
    
  5. Névéol, A.; Deserno, T.M.; Darmoni, S.J.; Güld, M.O.; Aronson, A.R.: Natural language processing versus content-based image analysis for medical document retrieval (2009) 0.14
    0.13542709 = sum of:
      0.13542709 = product of:
        0.48366815 = sum of:
          0.07118078 = weight(abstract_txt:medical in 2702) [ClassicSimilarity], result of:
            0.07118078 = score(doc=2702,freq=2.0), product of:
              0.13859038 = queryWeight, product of:
                1.2978479 = boost
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.018377 = queryNorm
              0.51360554 = fieldWeight in 2702, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8107834 = idf(docFreq=359, maxDocs=44218)
                0.0625 = fieldNorm(doc=2702)
          0.05694846 = weight(abstract_txt:comparing in 2702) [ClassicSimilarity], result of:
            0.05694846 = score(doc=2702,freq=1.0), product of:
              0.15048362 = queryWeight, product of:
                1.3523897 = boost
                6.0549803 = idf(docFreq=281, maxDocs=44218)
                0.018377 = queryNorm
              0.37843627 = fieldWeight in 2702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0549803 = idf(docFreq=281, maxDocs=44218)
                0.0625 = fieldNorm(doc=2702)
          0.059382893 = weight(abstract_txt:assigned in 2702) [ClassicSimilarity], result of:
            0.059382893 = score(doc=2702,freq=1.0), product of:
              0.15474221 = queryWeight, product of:
                1.3713921 = boost
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.018377 = queryNorm
              0.3837537 = fieldWeight in 2702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.0625 = fieldNorm(doc=2702)
          0.05739215 = weight(abstract_txt:document in 2702) [ClassicSimilarity], result of:
            0.05739215 = score(doc=2702,freq=2.0), product of:
              0.15126422 = queryWeight, product of:
                1.9175221 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.018377 = queryNorm
              0.37941656 = fieldWeight in 2702, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=2702)
          0.06594358 = weight(abstract_txt:test in 2702) [ClassicSimilarity], result of:
            0.06594358 = score(doc=2702,freq=1.0), product of:
              0.20907056 = queryWeight, product of:
                2.254336 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.018377 = queryNorm
              0.315413 = fieldWeight in 2702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.0625 = fieldNorm(doc=2702)
          0.030566493 = weight(abstract_txt:systems in 2702) [ClassicSimilarity], result of:
            0.030566493 = score(doc=2702,freq=1.0), product of:
              0.14334154 = queryWeight, product of:
                2.2861457 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.018377 = queryNorm
              0.2132424 = fieldWeight in 2702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0625 = fieldNorm(doc=2702)
          0.14225382 = weight(abstract_txt:versus in 2702) [ClassicSimilarity], result of:
            0.14225382 = score(doc=2702,freq=1.0), product of:
              0.34904963 = queryWeight, product of:
                2.9128346 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.018377 = queryNorm
              0.4075461 = fieldWeight in 2702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.0625 = fieldNorm(doc=2702)
        0.28 = coord(7/25)
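
The score breakdowns above appear to follow the usual TF-IDF pattern of Lucene's ClassicSimilarity: each matching term contributes queryWeight times fieldWeight, and the sum is scaled by the coord factor (matched terms over query terms). The following is a minimal sketch of that arithmetic, not the catalog's actual code. The function names are hypothetical, and boost, queryNorm, and fieldNorm are simply taken as given from the listing rather than derived.

```python
import math


def idf(doc_freq, max_docs):
    # Inverse document frequency as it appears in the listing:
    # 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def tf(freq):
    # Term frequency factor: square root of the raw count
    # (freq=2.0 gives 1.4142135 in the listing).
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = tf * idf * fieldNorm
    # The term's contribution is the product of the two.
    i = idf(doc_freq, max_docs)
    query_weight = boost * i * query_norm
    field_weight = tf(freq) * i * field_norm
    return query_weight * field_weight


def document_score(term_scores, matched_terms, query_terms):
    # Sum of the per-term contributions, scaled by coord(matched/query).
    return sum(term_scores) * (matched_terms / query_terms)


# Values copied from entry 1, term "reports" (doc 2575).
reports = term_score(freq=2.0, doc_freq=1225, max_docs=44218,
                     boost=1.024151, query_norm=0.018377,
                     field_norm=0.078125)

# Per-term scores copied from entry 2 (doc 749), with coord(4/25).
foskett = document_score(
    [0.08517932, 0.37219918, 0.6169884, 0.09264976], 4, 25)

print(round(reports, 6), round(foskett, 6))
```

Small differences in the last digits against the listing are expected, since this sketch uses double precision while the listed values look like single-precision floats.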