Document (#2295)

Author
Paice, C.D.
Title
¬A thesaural model of information retrieval
Source
Information processing and management. 27(1991) no.5, S.433-447
Year
1991
Abstract
In an information retrieval system both queries and document representatives can be viewed as representations of topics. The central activity of such a system is thus the comparison of one topic representation with another. The set of terms contained in a typical topic representation may be adjusted or extended by reference to a domain thesaurus. This paper proposes that topic representations should actually consist of excerpts from a domain thesaurus, generated by a spreading activation technique. An algorithm for generating excerpts is outlines and exemplified, and the problem of assessing the resemblance between two excerpts is discussed. The paper questions whether existing thesauri would be adequate for this purpose, and offers some ideas on how suitable thesauri might be constructed

Similar documents (author)

  1. Paice, C.D.: Expert systems for information retrieval? (1986) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:paice in 1101) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 1101, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=1101)
    
  2. Paice, C.D.: Automatic abstracting (1994) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:paice in 917) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 917, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=917)
    
  3. Paice, C.D.: Automatic abstracting (1994) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:paice in 1255) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 1255, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=1255)
    
  4. Paice, C.D.: Method for evaluation of stemming algorithms based on error counting (1996) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:paice in 5799) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 5799, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=5799)
    
  5. Paice, C.D.: Soft evaluation of Boolean search queries in information retrieval systems (1984) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:paice in 789) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 789, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=789)
    

Similar documents (content)

  1. Diaz, I.: Semi-automatic construction of thesaurus applying domain analysis techniques (1998) 0.14
    0.14183114 = sum of:
      0.14183114 = product of:
        0.5909631 = sum of:
          0.022985311 = weight(abstract_txt:system in 3744) [ClassicSimilarity], result of:
            0.022985311 = score(doc=3744,freq=1.0), product of:
              0.062316783 = queryWeight, product of:
                1.0597798 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.017436612 = queryNorm
              0.36884624 = fieldWeight in 3744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.109375 = fieldNorm(doc=3744)
          0.12729788 = weight(abstract_txt:domain in 3744) [ClassicSimilarity], result of:
            0.12729788 = score(doc=3744,freq=4.0), product of:
              0.122885 = queryWeight, product of:
                1.4882042 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.017436612 = queryNorm
              1.0359106 = fieldWeight in 3744, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.109375 = fieldNorm(doc=3744)
          0.071643636 = weight(abstract_txt:representation in 3744) [ClassicSimilarity], result of:
            0.071643636 = score(doc=3744,freq=1.0), product of:
              0.13297087 = queryWeight, product of:
                1.5480727 = boost
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.017436612 = queryNorm
              0.53879195 = fieldWeight in 3744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.109375 = fieldNorm(doc=3744)
          0.14311814 = weight(abstract_txt:thesaurus in 3744) [ClassicSimilarity], result of:
            0.14311814 = score(doc=3744,freq=3.0), product of:
              0.1462382 = queryWeight, product of:
                1.6234672 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017436612 = queryNorm
              0.97866464 = fieldWeight in 3744, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.109375 = fieldNorm(doc=3744)
          0.09603917 = weight(abstract_txt:thesauri in 3744) [ClassicSimilarity], result of:
            0.09603917 = score(doc=3744,freq=1.0), product of:
              0.16166042 = queryWeight, product of:
                1.7069271 = boost
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.017436612 = queryNorm
              0.5940797 = fieldWeight in 3744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.109375 = fieldNorm(doc=3744)
          0.12987903 = weight(abstract_txt:representations in 3744) [ClassicSimilarity], result of:
            0.12987903 = score(doc=3744,freq=1.0), product of:
              0.1976958 = queryWeight, product of:
                1.887608 = boost
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.017436612 = queryNorm
              0.656964 = fieldWeight in 3744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.109375 = fieldNorm(doc=3744)
        0.24 = coord(6/25)
    
  2. Salton, G.; Buckley, C.: Approaches to global text analysis (1990) 0.13
    0.12678091 = sum of:
      0.12678091 = product of:
        1.0565076 = sum of:
          0.022985311 = weight(abstract_txt:system in 4901) [ClassicSimilarity], result of:
            0.022985311 = score(doc=4901,freq=1.0), product of:
              0.062316783 = queryWeight, product of:
                1.0597798 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.017436612 = queryNorm
              0.36884624 = fieldWeight in 4901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.109375 = fieldNorm(doc=4901)
          0.12987903 = weight(abstract_txt:representations in 4901) [ClassicSimilarity], result of:
            0.12987903 = score(doc=4901,freq=1.0), product of:
              0.1976958 = queryWeight, product of:
                1.887608 = boost
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.017436612 = queryNorm
              0.656964 = fieldWeight in 4901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.109375 = fieldNorm(doc=4901)
          0.9036433 = weight(abstract_txt:excerpts in 4901) [ClassicSimilarity], result of:
            0.9036433 = score(doc=4901,freq=2.0), product of:
              0.65462095 = queryWeight, product of:
                4.2068176 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.017436612 = queryNorm
              1.380407 = fieldWeight in 4901, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.109375 = fieldNorm(doc=4901)
        0.12 = coord(3/25)
    
  3. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.12
    0.12373473 = sum of:
      0.12373473 = product of:
        0.44190973 = sum of:
          0.011492656 = weight(abstract_txt:system in 175) [ClassicSimilarity], result of:
            0.011492656 = score(doc=175,freq=1.0), product of:
              0.062316783 = queryWeight, product of:
                1.0597798 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.017436612 = queryNorm
              0.18442312 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.012576366 = weight(abstract_txt:retrieval in 175) [ClassicSimilarity], result of:
            0.012576366 = score(doc=175,freq=1.0), product of:
              0.06617514 = queryWeight, product of:
                1.0920953 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.017436612 = queryNorm
              0.19004668 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.10046091 = weight(abstract_txt:activation in 175) [ClassicSimilarity], result of:
            0.10046091 = score(doc=175,freq=1.0), product of:
              0.20988408 = queryWeight, product of:
                1.3752697 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.017436612 = queryNorm
              0.4786495 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.104339756 = weight(abstract_txt:spreading in 175) [ClassicSimilarity], result of:
            0.104339756 = score(doc=175,freq=1.0), product of:
              0.21525238 = queryWeight, product of:
                1.3927466 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.017436612 = queryNorm
              0.48473218 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.035821818 = weight(abstract_txt:representation in 175) [ClassicSimilarity], result of:
            0.035821818 = score(doc=175,freq=1.0), product of:
              0.13297087 = queryWeight, product of:
                1.5480727 = boost
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.017436612 = queryNorm
              0.26939598 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.10930828 = weight(abstract_txt:thesaurus in 175) [ClassicSimilarity], result of:
            0.10930828 = score(doc=175,freq=7.0), product of:
              0.1462382 = queryWeight, product of:
                1.6234672 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017436612 = queryNorm
              0.7474674 = fieldWeight in 175, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.06790995 = weight(abstract_txt:thesauri in 175) [ClassicSimilarity], result of:
            0.06790995 = score(doc=175,freq=2.0), product of:
              0.16166042 = queryWeight, product of:
                1.7069271 = boost
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.017436612 = queryNorm
              0.42007777 = fieldWeight in 175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
        0.28 = coord(7/25)
    
  4. Mazzocchi, F.; Tiberi, M.: Knowledge organization in the philosophical domain : dealing with polysemy in thesaurus building (2009) 0.11
    0.10849513 = sum of:
      0.10849513 = product of:
        0.3874826 = sum of:
          0.016418079 = weight(abstract_txt:system in 3267) [ClassicSimilarity], result of:
            0.016418079 = score(doc=3267,freq=1.0), product of:
              0.062316783 = queryWeight, product of:
                1.0597798 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.017436612 = queryNorm
              0.2634616 = fieldWeight in 3267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=3267)
          0.017846098 = weight(abstract_txt:paper in 3267) [ClassicSimilarity], result of:
            0.017846098 = score(doc=3267,freq=1.0), product of:
              0.06587981 = queryWeight, product of:
                1.0896556 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.017436612 = queryNorm
              0.27088875 = fieldWeight in 3267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.078125 = fieldNorm(doc=3267)
          0.017966237 = weight(abstract_txt:retrieval in 3267) [ClassicSimilarity], result of:
            0.017966237 = score(doc=3267,freq=1.0), product of:
              0.06617514 = queryWeight, product of:
                1.0920953 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.017436612 = queryNorm
              0.27149525 = fieldWeight in 3267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=3267)
          0.12863259 = weight(abstract_txt:thesaural in 3267) [ClassicSimilarity], result of:
            0.12863259 = score(doc=3267,freq=1.0), product of:
              0.19511057 = queryWeight, product of:
                1.3259847 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.017436612 = queryNorm
              0.6592805 = fieldWeight in 3267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.078125 = fieldNorm(doc=3267)
          0.064295135 = weight(abstract_txt:domain in 3267) [ClassicSimilarity], result of:
            0.064295135 = score(doc=3267,freq=2.0), product of:
              0.122885 = queryWeight, product of:
                1.4882042 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.017436612 = queryNorm
              0.52321386 = fieldWeight in 3267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.078125 = fieldNorm(doc=3267)
          0.05902093 = weight(abstract_txt:thesaurus in 3267) [ClassicSimilarity], result of:
            0.05902093 = score(doc=3267,freq=1.0), product of:
              0.1462382 = queryWeight, product of:
                1.6234672 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017436612 = queryNorm
              0.4035945 = fieldWeight in 3267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.078125 = fieldNorm(doc=3267)
          0.08330355 = weight(abstract_txt:topic in 3267) [ClassicSimilarity], result of:
            0.08330355 = score(doc=3267,freq=1.0), product of:
              0.21063451 = queryWeight, product of:
                2.3862915 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.017436612 = queryNorm
              0.3954886 = fieldWeight in 3267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.078125 = fieldNorm(doc=3267)
        0.28 = coord(7/25)
    
  5. Tudhope, D.; Binding, C.; Blocks, D.; Cuncliffe, D.: Representation and retrieval in faceted systems (2003) 0.11
    0.10510823 = sum of:
      0.10510823 = product of:
        0.37538654 = sum of:
          0.018574936 = weight(abstract_txt:system in 2703) [ClassicSimilarity], result of:
            0.018574936 = score(doc=2703,freq=2.0), product of:
              0.062316783 = queryWeight, product of:
                1.0597798 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.017436612 = queryNorm
              0.2980728 = fieldWeight in 2703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=2703)
          0.020190556 = weight(abstract_txt:paper in 2703) [ClassicSimilarity], result of:
            0.020190556 = score(doc=2703,freq=2.0), product of:
              0.06587981 = queryWeight, product of:
                1.0896556 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.017436612 = queryNorm
              0.30647564 = fieldWeight in 2703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.0625 = fieldNorm(doc=2703)
          0.01437299 = weight(abstract_txt:retrieval in 2703) [ClassicSimilarity], result of:
            0.01437299 = score(doc=2703,freq=1.0), product of:
              0.06617514 = queryWeight, product of:
                1.0920953 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.017436612 = queryNorm
              0.21719621 = fieldWeight in 2703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2703)
          0.057896797 = weight(abstract_txt:representation in 2703) [ClassicSimilarity], result of:
            0.057896797 = score(doc=2703,freq=2.0), product of:
              0.13297087 = queryWeight, product of:
                1.5480727 = boost
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.017436612 = queryNorm
              0.43540964 = fieldWeight in 2703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.0625 = fieldNorm(doc=2703)
          0.0817818 = weight(abstract_txt:thesaurus in 2703) [ClassicSimilarity], result of:
            0.0817818 = score(doc=2703,freq=3.0), product of:
              0.1462382 = queryWeight, product of:
                1.6234672 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017436612 = queryNorm
              0.55923694 = fieldWeight in 2703, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0625 = fieldNorm(doc=2703)
          0.07761137 = weight(abstract_txt:thesauri in 2703) [ClassicSimilarity], result of:
            0.07761137 = score(doc=2703,freq=2.0), product of:
              0.16166042 = queryWeight, product of:
                1.7069271 = boost
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.017436612 = queryNorm
              0.4800889 = fieldWeight in 2703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.0625 = fieldNorm(doc=2703)
          0.10495811 = weight(abstract_txt:representations in 2703) [ClassicSimilarity], result of:
            0.10495811 = score(doc=2703,freq=2.0), product of:
              0.1976958 = queryWeight, product of:
                1.887608 = boost
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.017436612 = queryNorm
              0.5309071 = fieldWeight in 2703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.0625 = fieldNorm(doc=2703)
        0.28 = coord(7/25)