Document (#6504)

Author
Paijmans, H.
Title
Comparing the document representation of two IR-systems : CLARIT and TOPIC
Source
Journal of the American Society for Information Science. 44(1993) no.7, S.383-392
Year
1993
Abstract
Discusses the TOPIC and CLARIT information retrieval systems in terms of assigned versus derived and precoordinate versus postcoordinate indexing. Compares the document representation of the two systems. Reports on a test done on a small sample of Wall Street Journal articles. The positive results found for CLARIT in earlier test on medical documents were not observed in this general database
Theme
Automatisches Indexieren
Object
CLARIT
TOPIC

Similar documents (content)

  1. O'Donnell, R.; Smeaton, A.F.: ¬A linguistic approach to information retrieval (1996) 0.19
    0.19133797 = sum of:
      0.19133797 = product of:
        0.6833499 = sum of:
          0.043235805 = weight(abstract_txt:reports in 3576) [ClassicSimilarity], result of:
            0.043235805 = score(doc=3576,freq=2.0), product of:
              0.0856582 = queryWeight, product of:
                1.0115709 = boost
                4.5684576 = idf(docFreq=1192, maxDocs=42306)
                0.018535445 = queryNorm
              0.504748 = fieldWeight in 3576, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5684576 = idf(docFreq=1192, maxDocs=42306)
                0.078125 = fieldNorm(doc=3576)
          0.044775844 = weight(abstract_txt:journal in 3576) [ClassicSimilarity], result of:
            0.044775844 = score(doc=3576,freq=1.0), product of:
              0.110470355 = queryWeight, product of:
                1.1487744 = boost
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.018535445 = queryNorm
              0.40532 = fieldWeight in 3576, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.078125 = fieldNorm(doc=3576)
          0.1801426 = weight(abstract_txt:wall in 3576) [ClassicSimilarity], result of:
            0.1801426 = score(doc=3576,freq=1.0), product of:
              0.2794436 = queryWeight, product of:
                1.8270859 = boost
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.018535445 = queryNorm
              0.6446474 = fieldWeight in 3576, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.078125 = fieldNorm(doc=3576)
          0.1801426 = weight(abstract_txt:street in 3576) [ClassicSimilarity], result of:
            0.1801426 = score(doc=3576,freq=1.0), product of:
              0.2794436 = queryWeight, product of:
                1.8270859 = boost
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.018535445 = queryNorm
              0.6446474 = fieldWeight in 3576, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.078125 = fieldNorm(doc=3576)
          0.11137427 = weight(abstract_txt:representation in 3576) [ClassicSimilarity], result of:
            0.11137427 = score(doc=3576,freq=2.0), product of:
              0.20280242 = queryWeight, product of:
                2.2012198 = boost
                4.970576 = idf(docFreq=797, maxDocs=42306)
                0.018535445 = queryNorm
              0.5491762 = fieldWeight in 3576, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.970576 = idf(docFreq=797, maxDocs=42306)
                0.078125 = fieldNorm(doc=3576)
          0.08529047 = weight(abstract_txt:topic in 3576) [ClassicSimilarity], result of:
            0.08529047 = score(doc=3576,freq=1.0), product of:
              0.2138751 = queryWeight, product of:
                2.2605128 = boost
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.018535445 = queryNorm
              0.39878634 = fieldWeight in 3576, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.078125 = fieldNorm(doc=3576)
          0.038388338 = weight(abstract_txt:systems in 3576) [ClassicSimilarity], result of:
            0.038388338 = score(doc=3576,freq=1.0), product of:
              0.14378817 = queryWeight, product of:
                2.270043 = boost
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.018535445 = queryNorm
              0.2669784 = fieldWeight in 3576, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.078125 = fieldNorm(doc=3576)
        0.28 = coord(7/25)
    
  2. Foskett, A.C.: ¬The subject approach to information (1996) 0.18
    0.18473244 = sum of:
      0.18473244 = product of:
        1.1545777 = sum of:
          0.08583225 = weight(abstract_txt:derived in 1750) [ClassicSimilarity], result of:
            0.08583225 = score(doc=1750,freq=1.0), product of:
              0.13621707 = queryWeight, product of:
                1.2756386 = boost
                5.76104 = idf(docFreq=361, maxDocs=42306)
                0.018535445 = queryNorm
              0.6301138 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.76104 = idf(docFreq=361, maxDocs=42306)
                0.109375 = fieldNorm(doc=1750)
          0.36694124 = weight(abstract_txt:postcoordinate in 1750) [ClassicSimilarity], result of:
            0.36694124 = score(doc=1750,freq=1.0), product of:
              0.35880807 = queryWeight, product of:
                2.070346 = boost
                9.3501 = idf(docFreq=9, maxDocs=42306)
                0.018535445 = queryNorm
              1.0226672 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.3501 = idf(docFreq=9, maxDocs=42306)
                0.109375 = fieldNorm(doc=1750)
          0.60871744 = weight(abstract_txt:precoordinate in 1750) [ClassicSimilarity], result of:
            0.60871744 = score(doc=1750,freq=2.0), product of:
              0.39908466 = queryWeight, product of:
                2.1834557 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.018535445 = queryNorm
              1.525284 = fieldWeight in 1750, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.109375 = fieldNorm(doc=1750)
          0.09308677 = weight(abstract_txt:systems in 1750) [ClassicSimilarity], result of:
            0.09308677 = score(doc=1750,freq=3.0), product of:
              0.14378817 = queryWeight, product of:
                2.270043 = boost
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.018535445 = queryNorm
              0.6473882 = fieldWeight in 1750, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.109375 = fieldNorm(doc=1750)
        0.16 = coord(4/25)
    
  3. Ruocco, A.S.; Frieder, O.: Clustering and classification of large document bases in a parallel environment (1997) 0.17
    0.17122059 = sum of:
      0.17122059 = product of:
        0.71341914 = sum of:
          0.04310619 = weight(abstract_txt:articles in 2662) [ClassicSimilarity], result of:
            0.04310619 = score(doc=2662,freq=1.0), product of:
              0.0953796 = queryWeight, product of:
                1.0674305 = boost
                4.8207307 = idf(docFreq=926, maxDocs=42306)
                0.018535445 = queryNorm
              0.45194352 = fieldWeight in 2662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8207307 = idf(docFreq=926, maxDocs=42306)
                0.09375 = fieldNorm(doc=2662)
          0.053731013 = weight(abstract_txt:journal in 2662) [ClassicSimilarity], result of:
            0.053731013 = score(doc=2662,freq=1.0), product of:
              0.110470355 = queryWeight, product of:
                1.1487744 = boost
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.018535445 = queryNorm
              0.486384 = fieldWeight in 2662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.09375 = fieldNorm(doc=2662)
          0.2161711 = weight(abstract_txt:wall in 2662) [ClassicSimilarity], result of:
            0.2161711 = score(doc=2662,freq=1.0), product of:
              0.2794436 = queryWeight, product of:
                1.8270859 = boost
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.018535445 = queryNorm
              0.77357686 = fieldWeight in 2662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.09375 = fieldNorm(doc=2662)
          0.2161711 = weight(abstract_txt:street in 2662) [ClassicSimilarity], result of:
            0.2161711 = score(doc=2662,freq=1.0), product of:
              0.2794436 = queryWeight, product of:
                1.8270859 = boost
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.018535445 = queryNorm
              0.77357686 = fieldWeight in 2662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.09375 = fieldNorm(doc=2662)
          0.104451045 = weight(abstract_txt:document in 2662) [ClassicSimilarity], result of:
            0.104451045 = score(doc=2662,freq=3.0), product of:
              0.15031669 = queryWeight, product of:
                1.8950927 = boost
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.018535445 = queryNorm
              0.6948733 = fieldWeight in 2662, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.09375 = fieldNorm(doc=2662)
          0.07978866 = weight(abstract_txt:systems in 2662) [ClassicSimilarity], result of:
            0.07978866 = score(doc=2662,freq=3.0), product of:
              0.14378817 = queryWeight, product of:
                2.270043 = boost
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.018535445 = queryNorm
              0.5549042 = fieldWeight in 2662, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.09375 = fieldNorm(doc=2662)
        0.24 = coord(6/25)
    
  4. Bjorner, S.: Let your fingers do the searching : call for the fax (1993) 0.14
    0.14111452 = sum of:
      0.14111452 = product of:
        0.88196576 = sum of:
          0.07184365 = weight(abstract_txt:articles in 6519) [ClassicSimilarity], result of:
            0.07184365 = score(doc=6519,freq=1.0), product of:
              0.0953796 = queryWeight, product of:
                1.0674305 = boost
                4.8207307 = idf(docFreq=926, maxDocs=42306)
                0.018535445 = queryNorm
              0.75323915 = fieldWeight in 6519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8207307 = idf(docFreq=926, maxDocs=42306)
                0.15625 = fieldNorm(doc=6519)
          0.08955169 = weight(abstract_txt:journal in 6519) [ClassicSimilarity], result of:
            0.08955169 = score(doc=6519,freq=1.0), product of:
              0.110470355 = queryWeight, product of:
                1.1487744 = boost
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.018535445 = queryNorm
              0.81064 = fieldWeight in 6519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.15625 = fieldNorm(doc=6519)
          0.3602852 = weight(abstract_txt:wall in 6519) [ClassicSimilarity], result of:
            0.3602852 = score(doc=6519,freq=1.0), product of:
              0.2794436 = queryWeight, product of:
                1.8270859 = boost
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.018535445 = queryNorm
              1.2892948 = fieldWeight in 6519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.15625 = fieldNorm(doc=6519)
          0.3602852 = weight(abstract_txt:street in 6519) [ClassicSimilarity], result of:
            0.3602852 = score(doc=6519,freq=1.0), product of:
              0.2794436 = queryWeight, product of:
                1.8270859 = boost
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.018535445 = queryNorm
              1.2892948 = fieldWeight in 6519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.15625 = fieldNorm(doc=6519)
        0.16 = coord(4/25)
    
  5. Névéol, A.; Deserno, T.M.; Darmoni, S.J.; Güld, M.O.; Aronson, A.R.: Natural language processing versus content-based image analysis for medical document retrieval (2009) 0.14
    0.1359881 = sum of:
      0.1359881 = product of:
        0.4856718 = sum of:
          0.070797354 = weight(abstract_txt:medical in 522) [ClassicSimilarity], result of:
            0.070797354 = score(doc=522,freq=2.0), product of:
              0.13808863 = queryWeight, product of:
                1.284372 = boost
                5.800482 = idf(docFreq=347, maxDocs=42306)
                0.018535445 = queryNorm
              0.512695 = fieldWeight in 522, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.800482 = idf(docFreq=347, maxDocs=42306)
                0.0625 = fieldNorm(doc=522)
          0.058327008 = weight(abstract_txt:comparing in 522) [ClassicSimilarity], result of:
            0.058327008 = score(doc=522,freq=1.0), product of:
              0.15289843 = queryWeight, product of:
                1.3514917 = boost
                6.103608 = idf(docFreq=256, maxDocs=42306)
                0.018535445 = queryNorm
              0.3814755 = fieldWeight in 522, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.103608 = idf(docFreq=256, maxDocs=42306)
                0.0625 = fieldNorm(doc=522)
          0.05982782 = weight(abstract_txt:assigned in 522) [ClassicSimilarity], result of:
            0.05982782 = score(doc=522,freq=1.0), product of:
              0.15551013 = queryWeight, product of:
                1.3629854 = boost
                6.155516 = idf(docFreq=243, maxDocs=42306)
                0.018535445 = queryNorm
              0.38471976 = fieldWeight in 522, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.155516 = idf(docFreq=243, maxDocs=42306)
                0.0625 = fieldNorm(doc=522)
          0.05685595 = weight(abstract_txt:document in 522) [ClassicSimilarity], result of:
            0.05685595 = score(doc=522,freq=2.0), product of:
              0.15031669 = queryWeight, product of:
                1.8950927 = boost
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.018535445 = queryNorm
              0.37824112 = fieldWeight in 522, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.0625 = fieldNorm(doc=522)
          0.06677625 = weight(abstract_txt:test in 522) [ClassicSimilarity], result of:
            0.06677625 = score(doc=522,freq=1.0), product of:
              0.21082136 = queryWeight, product of:
                2.2443168 = boost
                5.067893 = idf(docFreq=723, maxDocs=42306)
                0.018535445 = queryNorm
              0.3167433 = fieldWeight in 522, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.067893 = idf(docFreq=723, maxDocs=42306)
                0.0625 = fieldNorm(doc=522)
          0.03071067 = weight(abstract_txt:systems in 522) [ClassicSimilarity], result of:
            0.03071067 = score(doc=522,freq=1.0), product of:
              0.14378817 = queryWeight, product of:
                2.270043 = boost
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.018535445 = queryNorm
              0.21358272 = fieldWeight in 522, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.0625 = fieldNorm(doc=522)
          0.14237675 = weight(abstract_txt:versus in 522) [ClassicSimilarity], result of:
            0.14237675 = score(doc=522,freq=1.0), product of:
              0.34924158 = queryWeight, product of:
                2.888616 = boost
                6.5227857 = idf(docFreq=168, maxDocs=42306)
                0.018535445 = queryNorm
              0.4076741 = fieldWeight in 522, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5227857 = idf(docFreq=168, maxDocs=42306)
                0.0625 = fieldNorm(doc=522)
        0.28 = coord(7/25)