Document (#3910)

Author
Kim, Y.W.
Kim, J.H.
Title
¬A model of knowledge based information retrieval with hierarchical concept graph
Source
Journal of documentation. 46(1990) no.2, S.113-136
Year
1990
Abstract
This paper discusses a knowledge based information retrieval model with hierarchical thesaurus. The model computes the conceptual distance between a query and an object and both are indexed with weighted terms from a hierarchical thesaurus. The hierarchical thesaurus is represented by a hierarchical-concept graph (HCG) in which nodes represent concepts and directed edges represent generalised relationships. Rada et al. have developed a similar model. However, their model considered only a binary indexing schemes and revealed some counter-intuitive results. Our proposed model extends theirs by allowing the index term and the edge of the HCG to be weighted. A new concept mapping method is devised to overcome Rada's counter-intuitive results. In addition, a scheme for allowing Boolean operators in user queries is provided with a formula for computing conceptual destance from negated index terms. Experimental results have shown that our model simulates human performance more closely than Rada's model

Similar documents (content)

  1. Tang, X.; Chen, L.; Cui, J.; Wei, B.: Knowledge representation learning with entity descriptions, hierarchical types, and textual relations (2019) 0.20
    0.19508594 = sum of:
      0.19508594 = product of:
        0.8128581 = sum of:
          0.020760942 = weight(abstract_txt:with in 5101) [ClassicSimilarity], result of:
            0.020760942 = score(doc=5101,freq=3.0), product of:
              0.06137658 = queryWeight, product of:
                1.378358 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.017813405 = queryNorm
              0.3382551 = fieldWeight in 5101, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=5101)
          0.024305852 = weight(abstract_txt:results in 5101) [ClassicSimilarity], result of:
            0.024305852 = score(doc=5101,freq=1.0), product of:
              0.08933865 = queryWeight, product of:
                1.4401609 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.017813405 = queryNorm
              0.27206424 = fieldWeight in 5101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=5101)
          0.10895596 = weight(abstract_txt:graph in 5101) [ClassicSimilarity], result of:
            0.10895596 = score(doc=5101,freq=1.0), product of:
              0.21217899 = queryWeight, product of:
                1.8121614 = boost
                6.572923 = idf(docFreq=167, maxDocs=44218)
                0.017813405 = queryNorm
              0.51350963 = fieldWeight in 5101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.572923 = idf(docFreq=167, maxDocs=44218)
                0.078125 = fieldNorm(doc=5101)
          0.1293993 = weight(abstract_txt:weighted in 5101) [ClassicSimilarity], result of:
            0.1293993 = score(doc=5101,freq=1.0), product of:
              0.23795219 = queryWeight, product of:
                1.9190688 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.017813405 = queryNorm
              0.5438038 = fieldWeight in 5101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.078125 = fieldNorm(doc=5101)
          0.36105847 = weight(abstract_txt:hierarchical in 5101) [ClassicSimilarity], result of:
            0.36105847 = score(doc=5101,freq=4.0), product of:
              0.40322438 = queryWeight, product of:
                3.949927 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.017813405 = queryNorm
              0.8954282 = fieldWeight in 5101, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.078125 = fieldNorm(doc=5101)
          0.16837758 = weight(abstract_txt:model in 5101) [ClassicSimilarity], result of:
            0.16837758 = score(doc=5101,freq=3.0), product of:
              0.3121554 = queryWeight, product of:
                4.3960347 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.017813405 = queryNorm
              0.5394031 = fieldWeight in 5101, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.078125 = fieldNorm(doc=5101)
        0.24 = coord(6/25)
    
  2. Yang, C.C.; Liu, N.: Web site topic-hierarchy generation based on link structure (2009) 0.19
    0.19110574 = sum of:
      0.19110574 = product of:
        0.6825205 = sum of:
          0.11717644 = weight(abstract_txt:directed in 2738) [ClassicSimilarity], result of:
            0.11717644 = score(doc=2738,freq=4.0), product of:
              0.12922265 = queryWeight, product of:
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.017813405 = queryNorm
              0.90677947 = fieldWeight in 2738, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.0625 = fieldNorm(doc=2738)
          0.07738315 = weight(abstract_txt:edge in 2738) [ClassicSimilarity], result of:
            0.07738315 = score(doc=2738,freq=1.0), product of:
              0.15555932 = queryWeight, product of:
                1.097182 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.017813405 = queryNorm
              0.4974511 = fieldWeight in 2738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.0625 = fieldNorm(doc=2738)
          0.10687512 = weight(abstract_txt:edges in 2738) [ClassicSimilarity], result of:
            0.10687512 = score(doc=2738,freq=1.0), product of:
              0.19292247 = queryWeight, product of:
                1.2218618 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.017813405 = queryNorm
              0.55397964 = fieldWeight in 2738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0625 = fieldNorm(doc=2738)
          0.016608752 = weight(abstract_txt:with in 2738) [ClassicSimilarity], result of:
            0.016608752 = score(doc=2738,freq=3.0), product of:
              0.06137658 = queryWeight, product of:
                1.378358 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.017813405 = queryNorm
              0.27060407 = fieldWeight in 2738, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=2738)
          0.1509738 = weight(abstract_txt:graph in 2738) [ClassicSimilarity], result of:
            0.1509738 = score(doc=2738,freq=3.0), product of:
              0.21217899 = queryWeight, product of:
                1.8121614 = boost
                6.572923 = idf(docFreq=167, maxDocs=44218)
                0.017813405 = queryNorm
              0.7115398 = fieldWeight in 2738, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.572923 = idf(docFreq=167, maxDocs=44218)
                0.0625 = fieldNorm(doc=2738)
          0.10351944 = weight(abstract_txt:weighted in 2738) [ClassicSimilarity], result of:
            0.10351944 = score(doc=2738,freq=1.0), product of:
              0.23795219 = queryWeight, product of:
                1.9190688 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.017813405 = queryNorm
              0.43504304 = fieldWeight in 2738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.0625 = fieldNorm(doc=2738)
          0.10998377 = weight(abstract_txt:model in 2738) [ClassicSimilarity], result of:
            0.10998377 = score(doc=2738,freq=2.0), product of:
              0.3121554 = queryWeight, product of:
                4.3960347 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.017813405 = queryNorm
              0.35233662 = fieldWeight in 2738, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=2738)
        0.28 = coord(7/25)
    
  3. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.18
    0.18145855 = sum of:
      0.18145855 = product of:
        0.56705797 = sum of:
          0.039714478 = weight(abstract_txt:terms in 175) [ClassicSimilarity], result of:
            0.039714478 = score(doc=175,freq=5.0), product of:
              0.08031172 = queryWeight, product of:
                1.1148981 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017813405 = queryNorm
              0.49450412 = fieldWeight in 175, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.031427525 = weight(abstract_txt:conceptual in 175) [ClassicSimilarity], result of:
            0.031427525 = score(doc=175,freq=1.0), product of:
              0.117492415 = queryWeight, product of:
                1.348499 = boost
                4.891165 = idf(docFreq=902, maxDocs=44218)
                0.017813405 = queryNorm
              0.26748556 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.891165 = idf(docFreq=902, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.018761583 = weight(abstract_txt:with in 175) [ClassicSimilarity], result of:
            0.018761583 = score(doc=175,freq=5.0), product of:
              0.06137658 = queryWeight, product of:
                1.378358 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.017813405 = queryNorm
              0.30567983 = fieldWeight in 175, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.024061566 = weight(abstract_txt:results in 175) [ClassicSimilarity], result of:
            0.024061566 = score(doc=175,freq=2.0), product of:
              0.08933865 = queryWeight, product of:
                1.4401609 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.017813405 = queryNorm
              0.26932985 = fieldWeight in 175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.03684524 = weight(abstract_txt:concept in 175) [ClassicSimilarity], result of:
            0.03684524 = score(doc=175,freq=1.0), product of:
              0.14953898 = queryWeight, product of:
                1.8632388 = boost
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.017813405 = queryNorm
              0.24639222 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.09057951 = weight(abstract_txt:weighted in 175) [ClassicSimilarity], result of:
            0.09057951 = score(doc=175,freq=1.0), product of:
              0.23795219 = queryWeight, product of:
                1.9190688 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.017813405 = queryNorm
              0.38066265 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.14695323 = weight(abstract_txt:thesaurus in 175) [ClassicSimilarity], result of:
            0.14695323 = score(doc=175,freq=7.0), product of:
              0.19660151 = queryWeight, product of:
                2.1364107 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017813405 = queryNorm
              0.7474674 = fieldWeight in 175, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.17871483 = weight(abstract_txt:hierarchical in 175) [ClassicSimilarity], result of:
            0.17871483 = score(doc=175,freq=2.0), product of:
              0.40322438 = queryWeight, product of:
                3.949927 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.017813405 = queryNorm
              0.44321436 = fieldWeight in 175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
        0.32 = coord(8/25)
    
  4. Buizza, G.: Subject analysis and indexing : an "Italian version" of the analytico-synthetic model (2011) 0.16
    0.15811689 = sum of:
      0.15811689 = product of:
        0.56470317 = sum of:
          0.08964585 = weight(abstract_txt:binary in 1812) [ClassicSimilarity], result of:
            0.08964585 = score(doc=1812,freq=1.0), product of:
              0.13094564 = queryWeight, product of:
                1.0066447 = boost
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.017813405 = queryNorm
              0.6846036 = fieldWeight in 1812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.09375 = fieldNorm(doc=1812)
          0.05387576 = weight(abstract_txt:conceptual in 1812) [ClassicSimilarity], result of:
            0.05387576 = score(doc=1812,freq=1.0), product of:
              0.117492415 = queryWeight, product of:
                1.348499 = boost
                4.891165 = idf(docFreq=902, maxDocs=44218)
                0.017813405 = queryNorm
              0.4585467 = fieldWeight in 1812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.891165 = idf(docFreq=902, maxDocs=44218)
                0.09375 = fieldNorm(doc=1812)
          0.020341484 = weight(abstract_txt:with in 1812) [ClassicSimilarity], result of:
            0.020341484 = score(doc=1812,freq=2.0), product of:
              0.06137658 = queryWeight, product of:
                1.378358 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.017813405 = queryNorm
              0.33142096 = fieldWeight in 1812, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.09375 = fieldNorm(doc=1812)
          0.07748444 = weight(abstract_txt:represent in 1812) [ClassicSimilarity], result of:
            0.07748444 = score(doc=1812,freq=1.0), product of:
              0.14970072 = queryWeight, product of:
                1.5221506 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.017813405 = queryNorm
              0.51759565 = fieldWeight in 1812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.09375 = fieldNorm(doc=1812)
          0.06316327 = weight(abstract_txt:concept in 1812) [ClassicSimilarity], result of:
            0.06316327 = score(doc=1812,freq=1.0), product of:
              0.14953898 = queryWeight, product of:
                1.8632388 = boost
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.017813405 = queryNorm
              0.42238668 = fieldWeight in 1812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.505458 = idf(docFreq=1327, maxDocs=44218)
                0.09375 = fieldNorm(doc=1812)
          0.095216736 = weight(abstract_txt:thesaurus in 1812) [ClassicSimilarity], result of:
            0.095216736 = score(doc=1812,freq=1.0), product of:
              0.19660151 = queryWeight, product of:
                2.1364107 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017813405 = queryNorm
              0.48431337 = fieldWeight in 1812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.09375 = fieldNorm(doc=1812)
          0.16497566 = weight(abstract_txt:model in 1812) [ClassicSimilarity], result of:
            0.16497566 = score(doc=1812,freq=2.0), product of:
              0.3121554 = queryWeight, product of:
                4.3960347 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.017813405 = queryNorm
              0.5285049 = fieldWeight in 1812, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.09375 = fieldNorm(doc=1812)
        0.28 = coord(7/25)
    
  5. Ma, X.; Carranza, E.J.M.; Wu, C.; Meer, F.D. van der; Liu, G.: ¬A SKOS-based multilingual thesaurus of geological time scale for interoperability of online geological maps (2011) 0.14
    0.14185388 = sum of:
      0.14185388 = product of:
        0.506621 = sum of:
          0.040596236 = weight(abstract_txt:terms in 4800) [ClassicSimilarity], result of:
            0.040596236 = score(doc=4800,freq=4.0), product of:
              0.08031172 = queryWeight, product of:
                1.1148981 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017813405 = queryNorm
              0.5054833 = fieldWeight in 4800, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4800)
          0.01356099 = weight(abstract_txt:with in 4800) [ClassicSimilarity], result of:
            0.01356099 = score(doc=4800,freq=2.0), product of:
              0.06137658 = queryWeight, product of:
                1.378358 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.017813405 = queryNorm
              0.22094731 = fieldWeight in 4800, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=4800)
          0.019444682 = weight(abstract_txt:results in 4800) [ClassicSimilarity], result of:
            0.019444682 = score(doc=4800,freq=1.0), product of:
              0.08933865 = queryWeight, product of:
                1.4401609 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.017813405 = queryNorm
              0.21765138 = fieldWeight in 4800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=4800)
          0.05165629 = weight(abstract_txt:represent in 4800) [ClassicSimilarity], result of:
            0.05165629 = score(doc=4800,freq=1.0), product of:
              0.14970072 = queryWeight, product of:
                1.5221506 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.017813405 = queryNorm
              0.34506375 = fieldWeight in 4800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.0625 = fieldNorm(doc=4800)
          0.12695566 = weight(abstract_txt:thesaurus in 4800) [ClassicSimilarity], result of:
            0.12695566 = score(doc=4800,freq=4.0), product of:
              0.19660151 = queryWeight, product of:
                2.1364107 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017813405 = queryNorm
              0.6457512 = fieldWeight in 4800, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0625 = fieldNorm(doc=4800)
          0.1444234 = weight(abstract_txt:hierarchical in 4800) [ClassicSimilarity], result of:
            0.1444234 = score(doc=4800,freq=1.0), product of:
              0.40322438 = queryWeight, product of:
                3.949927 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.017813405 = queryNorm
              0.35817128 = fieldWeight in 4800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0625 = fieldNorm(doc=4800)
          0.10998377 = weight(abstract_txt:model in 4800) [ClassicSimilarity], result of:
            0.10998377 = score(doc=4800,freq=2.0), product of:
              0.3121554 = queryWeight, product of:
                4.3960347 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.017813405 = queryNorm
              0.35233662 = fieldWeight in 4800, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=4800)
        0.28 = coord(7/25)