Document (#3910)

Author
Kim, Y.W.
Kim, J.H.
Title
¬A model of knowledge based information retrieval with hierarchical concept graph
Source
Journal of documentation. 46(1990) no.2, S.113-136
Year
1990
Abstract
This paper discusses a knowledge based information retrieval model with hierarchical thesaurus. The model computes the conceptual distance between a query and an object and both are indexed with weighted terms from a hierarchical thesaurus. The hierarchical thesaurus is represented by a hierarchical-concept graph (HCG) in which nodes represent concepts and directed edges represent generalised relationships. Rada et al. have developed a similar model. However, their model considered only a binary indexing schemes and revealed some counter-intuitive results. Our proposed model extends theirs by allowing the index term and the edge of the HCG to be weighted. A new concept mapping method is devised to overcome Rada's counter-intuitive results. In addition, a scheme for allowing Boolean operators in user queries is provided with a formula for computing conceptual destance from negated index terms. Experimental results have shown that our model simulates human performance more closely than Rada's model

Similar documents (content)

  1. Tang, X.; Chen, L.; Cui, J.; Wei, B.: Knowledge representation learning with entity descriptions, hierarchical types, and textual relations (2019) 0.20
    0.19568542 = sum of:
      0.19568542 = product of:
        0.8153559 = sum of:
          0.020893082 = weight(abstract_txt:with in 102) [ClassicSimilarity], result of:
            0.020893082 = score(doc=102,freq=3.0), product of:
              0.061498582 = queryWeight, product of:
                1.3794048 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.017757697 = queryNorm
              0.33973274 = fieldWeight in 102, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.078125 = fieldNorm(doc=102)
          0.02452504 = weight(abstract_txt:results in 102) [ClassicSimilarity], result of:
            0.02452504 = score(doc=102,freq=1.0), product of:
              0.0896735 = queryWeight, product of:
                1.44252 = boost
                3.5007057 = idf(docFreq=3547, maxDocs=43254)
                0.017757697 = queryNorm
              0.27349263 = fieldWeight in 102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5007057 = idf(docFreq=3547, maxDocs=43254)
                0.078125 = fieldNorm(doc=102)
          0.1108172 = weight(abstract_txt:graph in 102) [ClassicSimilarity], result of:
            0.1108172 = score(doc=102,freq=1.0), product of:
              0.21410754 = queryWeight, product of:
                1.819953 = boost
                6.624989 = idf(docFreq=155, maxDocs=43254)
                0.017757697 = queryNorm
              0.5175773 = fieldWeight in 102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.624989 = idf(docFreq=155, maxDocs=43254)
                0.078125 = fieldNorm(doc=102)
          0.12829071 = weight(abstract_txt:weighted in 102) [ClassicSimilarity], result of:
            0.12829071 = score(doc=102,freq=1.0), product of:
              0.23606086 = queryWeight, product of:
                1.9109801 = boost
                6.956346 = idf(docFreq=111, maxDocs=43254)
                0.017757697 = queryNorm
              0.54346454 = fieldWeight in 102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.956346 = idf(docFreq=111, maxDocs=43254)
                0.078125 = fieldNorm(doc=102)
          0.36086717 = weight(abstract_txt:hierarchical in 102) [ClassicSimilarity], result of:
            0.36086717 = score(doc=102,freq=4.0), product of:
              0.40217844 = queryWeight, product of:
                3.943879 = boost
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.017757697 = queryNorm
              0.8972812 = fieldWeight in 102, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.078125 = fieldNorm(doc=102)
          0.16996266 = weight(abstract_txt:model in 102) [ClassicSimilarity], result of:
            0.16996266 = score(doc=102,freq=3.0), product of:
              0.3134073 = queryWeight, product of:
                4.4038115 = boost
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.017757697 = queryNorm
              0.542306 = fieldWeight in 102, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.078125 = fieldNorm(doc=102)
        0.24 = coord(6/25)
    
  2. Yang, C.C.; Liu, N.: Web site topic-hierarchy generation based on link structure (2009) 0.19
    0.19327366 = sum of:
      0.19327366 = product of:
        0.6902631 = sum of:
          0.117653996 = weight(abstract_txt:directed in 4739) [ClassicSimilarity], result of:
            0.117653996 = score(doc=4739,freq=4.0), product of:
              0.12928307 = queryWeight, product of:
                7.280396 = idf(docFreq=80, maxDocs=43254)
                0.017757697 = queryNorm
              0.9100495 = fieldWeight in 4739, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.280396 = idf(docFreq=80, maxDocs=43254)
                0.0625 = fieldNorm(doc=4739)
          0.07622713 = weight(abstract_txt:edge in 4739) [ClassicSimilarity], result of:
            0.07622713 = score(doc=4739,freq=1.0), product of:
              0.15366097 = queryWeight, product of:
                1.090212 = boost
                7.9371753 = idf(docFreq=41, maxDocs=43254)
                0.017757697 = queryNorm
              0.49607345 = fieldWeight in 4739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9371753 = idf(docFreq=41, maxDocs=43254)
                0.0625 = fieldNorm(doc=4739)
          0.112463005 = weight(abstract_txt:edges in 4739) [ClassicSimilarity], result of:
            0.112463005 = score(doc=4739,freq=1.0), product of:
              0.19914237 = queryWeight, product of:
                1.2411121 = boost
                9.035788 = idf(docFreq=13, maxDocs=43254)
                0.017757697 = queryNorm
              0.5647367 = fieldWeight in 4739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.035788 = idf(docFreq=13, maxDocs=43254)
                0.0625 = fieldNorm(doc=4739)
          0.016714465 = weight(abstract_txt:with in 4739) [ClassicSimilarity], result of:
            0.016714465 = score(doc=4739,freq=3.0), product of:
              0.061498582 = queryWeight, product of:
                1.3794048 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.017757697 = queryNorm
              0.27178618 = fieldWeight in 4739, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.0625 = fieldNorm(doc=4739)
          0.15355282 = weight(abstract_txt:graph in 4739) [ClassicSimilarity], result of:
            0.15355282 = score(doc=4739,freq=3.0), product of:
              0.21410754 = queryWeight, product of:
                1.819953 = boost
                6.624989 = idf(docFreq=155, maxDocs=43254)
                0.017757697 = queryNorm
              0.7171761 = fieldWeight in 4739, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.624989 = idf(docFreq=155, maxDocs=43254)
                0.0625 = fieldNorm(doc=4739)
          0.10263256 = weight(abstract_txt:weighted in 4739) [ClassicSimilarity], result of:
            0.10263256 = score(doc=4739,freq=1.0), product of:
              0.23606086 = queryWeight, product of:
                1.9109801 = boost
                6.956346 = idf(docFreq=111, maxDocs=43254)
                0.017757697 = queryNorm
              0.43477163 = fieldWeight in 4739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.956346 = idf(docFreq=111, maxDocs=43254)
                0.0625 = fieldNorm(doc=4739)
          0.11101914 = weight(abstract_txt:model in 4739) [ClassicSimilarity], result of:
            0.11101914 = score(doc=4739,freq=2.0), product of:
              0.3134073 = queryWeight, product of:
                4.4038115 = boost
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.017757697 = queryNorm
              0.3542328 = fieldWeight in 4739, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.0625 = fieldNorm(doc=4739)
        0.28 = coord(7/25)
    
  3. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.18
    0.18118556 = sum of:
      0.18118556 = product of:
        0.5662049 = sum of:
          0.03986512 = weight(abstract_txt:terms in 2176) [ClassicSimilarity], result of:
            0.03986512 = score(doc=2176,freq=5.0), product of:
              0.08033422 = queryWeight, product of:
                1.1147935 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.017757697 = queryNorm
              0.49624085 = fieldWeight in 2176, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.03170912 = weight(abstract_txt:conceptual in 2176) [ClassicSimilarity], result of:
            0.03170912 = score(doc=2176,freq=1.0), product of:
              0.11792828 = queryWeight, product of:
                1.3506821 = boost
                4.9167504 = idf(docFreq=860, maxDocs=43254)
                0.017757697 = queryNorm
              0.26888478 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9167504 = idf(docFreq=860, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.018880997 = weight(abstract_txt:with in 2176) [ClassicSimilarity], result of:
            0.018880997 = score(doc=2176,freq=5.0), product of:
              0.061498582 = queryWeight, product of:
                1.3794048 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.017757697 = queryNorm
              0.30701515 = fieldWeight in 2176, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.024278553 = weight(abstract_txt:results in 2176) [ClassicSimilarity], result of:
            0.024278553 = score(doc=2176,freq=2.0), product of:
              0.0896735 = queryWeight, product of:
                1.44252 = boost
                3.5007057 = idf(docFreq=3547, maxDocs=43254)
                0.017757697 = queryNorm
              0.2707439 = fieldWeight in 2176, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5007057 = idf(docFreq=3547, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.037074562 = weight(abstract_txt:concept in 2176) [ClassicSimilarity], result of:
            0.037074562 = score(doc=2176,freq=1.0), product of:
              0.14982224 = queryWeight, product of:
                1.8645668 = boost
                4.524928 = idf(docFreq=1273, maxDocs=43254)
                0.017757697 = queryNorm
              0.247457 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.524928 = idf(docFreq=1273, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.089803495 = weight(abstract_txt:weighted in 2176) [ClassicSimilarity], result of:
            0.089803495 = score(doc=2176,freq=1.0), product of:
              0.23606086 = queryWeight, product of:
                1.9109801 = boost
                6.956346 = idf(docFreq=111, maxDocs=43254)
                0.017757697 = queryNorm
              0.38042518 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.956346 = idf(docFreq=111, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.1459729 = weight(abstract_txt:thesaurus in 2176) [ClassicSimilarity], result of:
            0.1459729 = score(doc=2176,freq=7.0), product of:
              0.19528748 = queryWeight, product of:
                2.1287615 = boost
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.017757697 = queryNorm
              0.747477 = fieldWeight in 2176, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.17862013 = weight(abstract_txt:hierarchical in 2176) [ClassicSimilarity], result of:
            0.17862013 = score(doc=2176,freq=2.0), product of:
              0.40217844 = queryWeight, product of:
                3.943879 = boost
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.017757697 = queryNorm
              0.44413155 = fieldWeight in 2176, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
        0.32 = coord(8/25)
    
  4. Buizza, G.: Subject analysis and indexing : an "Italian version" of the analytico-synthetic model (2011) 0.16
    0.15885004 = sum of:
      0.15885004 = product of:
        0.5673216 = sum of:
          0.09009477 = weight(abstract_txt:binary in 3277) [ClassicSimilarity], result of:
            0.09009477 = score(doc=3277,freq=1.0), product of:
              0.13108794 = queryWeight, product of:
                1.0069561 = boost
                7.3310394 = idf(docFreq=76, maxDocs=43254)
                0.017757697 = queryNorm
              0.68728495 = fieldWeight in 3277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3310394 = idf(docFreq=76, maxDocs=43254)
                0.09375 = fieldNorm(doc=3277)
          0.054358494 = weight(abstract_txt:conceptual in 3277) [ClassicSimilarity], result of:
            0.054358494 = score(doc=3277,freq=1.0), product of:
              0.11792828 = queryWeight, product of:
                1.3506821 = boost
                4.9167504 = idf(docFreq=860, maxDocs=43254)
                0.017757697 = queryNorm
              0.46094537 = fieldWeight in 3277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9167504 = idf(docFreq=860, maxDocs=43254)
                0.09375 = fieldNorm(doc=3277)
          0.020470954 = weight(abstract_txt:with in 3277) [ClassicSimilarity], result of:
            0.020470954 = score(doc=3277,freq=2.0), product of:
              0.061498582 = queryWeight, product of:
                1.3794048 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.017757697 = queryNorm
              0.33286873 = fieldWeight in 3277, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.09375 = fieldNorm(doc=3277)
          0.077730745 = weight(abstract_txt:represent in 3277) [ClassicSimilarity], result of:
            0.077730745 = score(doc=3277,freq=1.0), product of:
              0.14968154 = queryWeight, product of:
                1.5216974 = boost
                5.53928 = idf(docFreq=461, maxDocs=43254)
                0.017757697 = queryNorm
              0.5193075 = fieldWeight in 3277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.53928 = idf(docFreq=461, maxDocs=43254)
                0.09375 = fieldNorm(doc=3277)
          0.06355639 = weight(abstract_txt:concept in 3277) [ClassicSimilarity], result of:
            0.06355639 = score(doc=3277,freq=1.0), product of:
              0.14982224 = queryWeight, product of:
                1.8645668 = boost
                4.524928 = idf(docFreq=1273, maxDocs=43254)
                0.017757697 = queryNorm
              0.424212 = fieldWeight in 3277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.524928 = idf(docFreq=1273, maxDocs=43254)
                0.09375 = fieldNorm(doc=3277)
          0.09458155 = weight(abstract_txt:thesaurus in 3277) [ClassicSimilarity], result of:
            0.09458155 = score(doc=3277,freq=1.0), product of:
              0.19528748 = queryWeight, product of:
                2.1287615 = boost
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.017757697 = queryNorm
              0.4843196 = fieldWeight in 3277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.09375 = fieldNorm(doc=3277)
          0.16652872 = weight(abstract_txt:model in 3277) [ClassicSimilarity], result of:
            0.16652872 = score(doc=3277,freq=2.0), product of:
              0.3134073 = queryWeight, product of:
                4.4038115 = boost
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.017757697 = queryNorm
              0.5313492 = fieldWeight in 3277, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.09375 = fieldNorm(doc=3277)
        0.28 = coord(7/25)
    
  5. Ma, X.; Carranza, E.J.M.; Wu, C.; Meer, F.D. van der; Liu, G.: ¬A SKOS-based multilingual thesaurus of geological time scale for interoperability of online geological maps (2011) 0.14
    0.1420476 = sum of:
      0.1420476 = product of:
        0.50731283 = sum of:
          0.040750228 = weight(abstract_txt:terms in 1265) [ClassicSimilarity], result of:
            0.040750228 = score(doc=1265,freq=4.0), product of:
              0.08033422 = queryWeight, product of:
                1.1147935 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.017757697 = queryNorm
              0.50725865 = fieldWeight in 1265, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.0625 = fieldNorm(doc=1265)
          0.013647303 = weight(abstract_txt:with in 1265) [ClassicSimilarity], result of:
            0.013647303 = score(doc=1265,freq=2.0), product of:
              0.061498582 = queryWeight, product of:
                1.3794048 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.017757697 = queryNorm
              0.22191249 = fieldWeight in 1265, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.0625 = fieldNorm(doc=1265)
          0.019620033 = weight(abstract_txt:results in 1265) [ClassicSimilarity], result of:
            0.019620033 = score(doc=1265,freq=1.0), product of:
              0.0896735 = queryWeight, product of:
                1.44252 = boost
                3.5007057 = idf(docFreq=3547, maxDocs=43254)
                0.017757697 = queryNorm
              0.2187941 = fieldWeight in 1265, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5007057 = idf(docFreq=3547, maxDocs=43254)
                0.0625 = fieldNorm(doc=1265)
          0.051820498 = weight(abstract_txt:represent in 1265) [ClassicSimilarity], result of:
            0.051820498 = score(doc=1265,freq=1.0), product of:
              0.14968154 = queryWeight, product of:
                1.5216974 = boost
                5.53928 = idf(docFreq=461, maxDocs=43254)
                0.017757697 = queryNorm
              0.346205 = fieldWeight in 1265, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.53928 = idf(docFreq=461, maxDocs=43254)
                0.0625 = fieldNorm(doc=1265)
          0.12610874 = weight(abstract_txt:thesaurus in 1265) [ClassicSimilarity], result of:
            0.12610874 = score(doc=1265,freq=4.0), product of:
              0.19528748 = queryWeight, product of:
                2.1287615 = boost
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.017757697 = queryNorm
              0.64575946 = fieldWeight in 1265, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.0625 = fieldNorm(doc=1265)
          0.14434686 = weight(abstract_txt:hierarchical in 1265) [ClassicSimilarity], result of:
            0.14434686 = score(doc=1265,freq=1.0), product of:
              0.40217844 = queryWeight, product of:
                3.943879 = boost
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.017757697 = queryNorm
              0.3589125 = fieldWeight in 1265, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.0625 = fieldNorm(doc=1265)
          0.11101914 = weight(abstract_txt:model in 1265) [ClassicSimilarity], result of:
            0.11101914 = score(doc=1265,freq=2.0), product of:
              0.3134073 = queryWeight, product of:
                4.4038115 = boost
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.017757697 = queryNorm
              0.3542328 = fieldWeight in 1265, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0076866 = idf(docFreq=2136, maxDocs=43254)
                0.0625 = fieldNorm(doc=1265)
        0.28 = coord(7/25)