Document (#3910)

Author
Kim, Y.W.
Kim, J.H.
Title
¬A model of knowledge based information retrieval with hierarchical concept graph
Source
Journal of documentation. 46(1990) no.2, S.113-136
Year
1990
Abstract
This paper discusses a knowledge based information retrieval model with hierarchical thesaurus. The model computes the conceptual distance between a query and an object and both are indexed with weighted terms from a hierarchical thesaurus. The hierarchical thesaurus is represented by a hierarchical-concept graph (HCG) in which nodes represent concepts and directed edges represent generalised relationships. Rada et al. have developed a similar model. However, their model considered only a binary indexing schemes and revealed some counter-intuitive results. Our proposed model extends theirs by allowing the index term and the edge of the HCG to be weighted. A new concept mapping method is devised to overcome Rada's counter-intuitive results. In addition, a scheme for allowing Boolean operators in user queries is provided with a formula for computing conceptual destance from negated index terms. Experimental results have shown that our model simulates human performance more closely than Rada's model

Similar documents (content)

  1. Tang, X.; Chen, L.; Cui, J.; Wei, B.: Knowledge representation learning with entity descriptions, hierarchical types, and textual relations (2019) 0.20
    0.19656833 = sum of:
      0.19656833 = product of:
        0.8190347 = sum of:
          0.021044075 = weight(abstract_txt:with in 699) [ClassicSimilarity], result of:
            0.021044075 = score(doc=699,freq=3.0), product of:
              0.06174942 = queryWeight, product of:
                1.3842826 = boost
                2.5185254 = idf(docFreq=9329, maxDocs=42596)
                0.017711764 = queryNorm
              0.34079793 = fieldWeight in 699, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.5185254 = idf(docFreq=9329, maxDocs=42596)
                0.078125 = fieldNorm(doc=699)
          0.024604235 = weight(abstract_txt:results in 699) [ClassicSimilarity], result of:
            0.024604235 = score(doc=699,freq=1.0), product of:
              0.089800835 = queryWeight, product of:
                1.4457031 = boost
                3.5070295 = idf(docFreq=3471, maxDocs=42596)
                0.017711764 = queryNorm
              0.2739867 = fieldWeight in 699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5070295 = idf(docFreq=3471, maxDocs=42596)
                0.078125 = fieldNorm(doc=699)
          0.11245363 = weight(abstract_txt:graph in 699) [ClassicSimilarity], result of:
            0.11245363 = score(doc=699,freq=1.0), product of:
              0.21605238 = queryWeight, product of:
                1.8309346 = boost
                6.6623034 = idf(docFreq=147, maxDocs=42596)
                0.017711764 = queryNorm
              0.52049243 = fieldWeight in 699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6623034 = idf(docFreq=147, maxDocs=42596)
                0.078125 = fieldNorm(doc=699)
          0.12716556 = weight(abstract_txt:weighted in 699) [ClassicSimilarity], result of:
            0.12716556 = score(doc=699,freq=1.0), product of:
              0.23450732 = queryWeight, product of:
                1.9075305 = boost
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.017711764 = queryNorm
              0.5422669 = fieldWeight in 699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.078125 = fieldNorm(doc=699)
          0.36225525 = weight(abstract_txt:hierarchical in 699) [ClassicSimilarity], result of:
            0.36225525 = score(doc=699,freq=4.0), product of:
              0.40291476 = queryWeight, product of:
                3.9533923 = boost
                5.7541537 = idf(docFreq=366, maxDocs=42596)
                0.017711764 = queryNorm
              0.89908653 = fieldWeight in 699, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7541537 = idf(docFreq=366, maxDocs=42596)
                0.078125 = fieldNorm(doc=699)
          0.17151196 = weight(abstract_txt:model in 699) [ClassicSimilarity], result of:
            0.17151196 = score(doc=699,freq=3.0), product of:
              0.31507885 = queryWeight, product of:
                4.4221444 = boost
                4.0227637 = idf(docFreq=2072, maxDocs=42596)
                0.017711764 = queryNorm
              0.54434615 = fieldWeight in 699, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0227637 = idf(docFreq=2072, maxDocs=42596)
                0.078125 = fieldNorm(doc=699)
        0.24 = coord(6/25)
    
  2. Yang, C.C.; Liu, N.: Web site topic-hierarchy generation based on link structure (2009) 0.19
    0.1936344 = sum of:
      0.1936344 = product of:
        0.69155145 = sum of:
          0.117865026 = weight(abstract_txt:directed in 3918) [ClassicSimilarity], result of:
            0.117865026 = score(doc=3918,freq=4.0), product of:
              0.12934314 = queryWeight, product of:
                1.0017284 = boost
                7.2900677 = idf(docFreq=78, maxDocs=42596)
                0.017711764 = queryNorm
              0.91125846 = fieldWeight in 3918, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2900677 = idf(docFreq=78, maxDocs=42596)
                0.0625 = fieldNorm(doc=3918)
          0.07562049 = weight(abstract_txt:edge in 3918) [ClassicSimilarity], result of:
            0.07562049 = score(doc=3918,freq=1.0), product of:
              0.15273306 = queryWeight, product of:
                1.088541 = boost
                7.921846 = idf(docFreq=41, maxDocs=42596)
                0.017711764 = queryNorm
              0.49511537 = fieldWeight in 3918, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.921846 = idf(docFreq=41, maxDocs=42596)
                0.0625 = fieldNorm(doc=3918)
          0.11164674 = weight(abstract_txt:edges in 3918) [ClassicSimilarity], result of:
            0.11164674 = score(doc=3918,freq=1.0), product of:
              0.19803295 = queryWeight, product of:
                1.2395015 = boost
                9.020458 = idf(docFreq=13, maxDocs=42596)
                0.017711764 = queryNorm
              0.56377864 = fieldWeight in 3918, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.020458 = idf(docFreq=13, maxDocs=42596)
                0.0625 = fieldNorm(doc=3918)
          0.016835261 = weight(abstract_txt:with in 3918) [ClassicSimilarity], result of:
            0.016835261 = score(doc=3918,freq=3.0), product of:
              0.06174942 = queryWeight, product of:
                1.3842826 = boost
                2.5185254 = idf(docFreq=9329, maxDocs=42596)
                0.017711764 = queryNorm
              0.27263835 = fieldWeight in 3918, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.5185254 = idf(docFreq=9329, maxDocs=42596)
                0.0625 = fieldNorm(doc=3918)
          0.15582033 = weight(abstract_txt:graph in 3918) [ClassicSimilarity], result of:
            0.15582033 = score(doc=3918,freq=3.0), product of:
              0.21605238 = queryWeight, product of:
                1.8309346 = boost
                6.6623034 = idf(docFreq=147, maxDocs=42596)
                0.017711764 = queryNorm
              0.7212155 = fieldWeight in 3918, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.6623034 = idf(docFreq=147, maxDocs=42596)
                0.0625 = fieldNorm(doc=3918)
          0.101732455 = weight(abstract_txt:weighted in 3918) [ClassicSimilarity], result of:
            0.101732455 = score(doc=3918,freq=1.0), product of:
              0.23450732 = queryWeight, product of:
                1.9075305 = boost
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.017711764 = queryNorm
              0.43381354 = fieldWeight in 3918, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.0625 = fieldNorm(doc=3918)
          0.11203115 = weight(abstract_txt:model in 3918) [ClassicSimilarity], result of:
            0.11203115 = score(doc=3918,freq=2.0), product of:
              0.31507885 = queryWeight, product of:
                4.4221444 = boost
                4.0227637 = idf(docFreq=2072, maxDocs=42596)
                0.017711764 = queryNorm
              0.35556543 = fieldWeight in 3918, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0227637 = idf(docFreq=2072, maxDocs=42596)
                0.0625 = fieldNorm(doc=3918)
        0.28 = coord(7/25)
    
  3. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.18
    0.18112175 = sum of:
      0.18112175 = product of:
        0.56600547 = sum of:
          0.039808605 = weight(abstract_txt:terms in 1355) [ClassicSimilarity], result of:
            0.039808605 = score(doc=1355,freq=5.0), product of:
              0.08019969 = queryWeight, product of:
                1.1155258 = boost
                4.0591135 = idf(docFreq=1998, maxDocs=42596)
                0.017711764 = queryNorm
              0.4963686 = fieldWeight in 1355, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.0591135 = idf(docFreq=1998, maxDocs=42596)
                0.0546875 = fieldNorm(doc=1355)
          0.032100495 = weight(abstract_txt:conceptual in 1355) [ClassicSimilarity], result of:
            0.032100495 = score(doc=1355,freq=1.0), product of:
              0.118809864 = queryWeight, product of:
                1.3577492 = boost
                4.9405026 = idf(docFreq=827, maxDocs=42596)
                0.017711764 = queryNorm
              0.27018374 = fieldWeight in 1355, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9405026 = idf(docFreq=827, maxDocs=42596)
                0.0546875 = fieldNorm(doc=1355)
          0.01901745 = weight(abstract_txt:with in 1355) [ClassicSimilarity], result of:
            0.01901745 = score(doc=1355,freq=5.0), product of:
              0.06174942 = queryWeight, product of:
                1.3842826 = boost
                2.5185254 = idf(docFreq=9329, maxDocs=42596)
                0.017711764 = queryNorm
              0.3079778 = fieldWeight in 1355, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.5185254 = idf(docFreq=9329, maxDocs=42596)
                0.0546875 = fieldNorm(doc=1355)
          0.024356946 = weight(abstract_txt:results in 1355) [ClassicSimilarity], result of:
            0.024356946 = score(doc=1355,freq=2.0), product of:
              0.089800835 = queryWeight, product of:
                1.4457031 = boost
                3.5070295 = idf(docFreq=3471, maxDocs=42596)
                0.017711764 = queryNorm
              0.27123296 = fieldWeight in 1355, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5070295 = idf(docFreq=3471, maxDocs=42596)
                0.0546875 = fieldNorm(doc=1355)
          0.037281644 = weight(abstract_txt:concept in 1355) [ClassicSimilarity], result of:
            0.037281644 = score(doc=1355,freq=1.0), product of:
              0.15026984 = queryWeight, product of:
                1.870143 = boost
                4.5366488 = idf(docFreq=1239, maxDocs=42596)
                0.017711764 = queryNorm
              0.24809799 = fieldWeight in 1355, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5366488 = idf(docFreq=1239, maxDocs=42596)
                0.0546875 = fieldNorm(doc=1355)
          0.08901589 = weight(abstract_txt:weighted in 1355) [ClassicSimilarity], result of:
            0.08901589 = score(doc=1355,freq=1.0), product of:
              0.23450732 = queryWeight, product of:
                1.9075305 = boost
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.017711764 = queryNorm
              0.37958685 = fieldWeight in 1355, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.0546875 = fieldNorm(doc=1355)
          0.14511728 = weight(abstract_txt:thesaurus in 1355) [ClassicSimilarity], result of:
            0.14511728 = score(doc=1355,freq=7.0), product of:
              0.19438162 = queryWeight, product of:
                2.1269953 = boost
                5.1597285 = idf(docFreq=664, maxDocs=42596)
                0.017711764 = queryNorm
              0.74655867 = fieldWeight in 1355, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1597285 = idf(docFreq=664, maxDocs=42596)
                0.0546875 = fieldNorm(doc=1355)
          0.17930718 = weight(abstract_txt:hierarchical in 1355) [ClassicSimilarity], result of:
            0.17930718 = score(doc=1355,freq=2.0), product of:
              0.40291476 = queryWeight, product of:
                3.9533923 = boost
                5.7541537 = idf(docFreq=366, maxDocs=42596)
                0.017711764 = queryNorm
              0.4450251 = fieldWeight in 1355, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7541537 = idf(docFreq=366, maxDocs=42596)
                0.0546875 = fieldNorm(doc=1355)
        0.32 = coord(8/25)
    
  4. Buizza, G.: Subject analysis and indexing : an "Italian version" of the analytico-synthetic model (2011) 0.16
    0.15985972 = sum of:
      0.15985972 = product of:
        0.57092756 = sum of:
          0.09079865 = weight(abstract_txt:binary in 2813) [ClassicSimilarity], result of:
            0.09079865 = score(doc=2813,freq=1.0), product of:
              0.13167363 = queryWeight, product of:
                1.0107127 = boost
                7.3554506 = idf(docFreq=73, maxDocs=42596)
                0.017711764 = queryNorm
              0.6895735 = fieldWeight in 2813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3554506 = idf(docFreq=73, maxDocs=42596)
                0.09375 = fieldNorm(doc=2813)
          0.05502942 = weight(abstract_txt:conceptual in 2813) [ClassicSimilarity], result of:
            0.05502942 = score(doc=2813,freq=1.0), product of:
              0.118809864 = queryWeight, product of:
                1.3577492 = boost
                4.9405026 = idf(docFreq=827, maxDocs=42596)
                0.017711764 = queryNorm
              0.46317214 = fieldWeight in 2813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9405026 = idf(docFreq=827, maxDocs=42596)
                0.09375 = fieldNorm(doc=2813)
          0.020618899 = weight(abstract_txt:with in 2813) [ClassicSimilarity], result of:
            0.020618899 = score(doc=2813,freq=2.0), product of:
              0.06174942 = queryWeight, product of:
                1.3842826 = boost
                2.5185254 = idf(docFreq=9329, maxDocs=42596)
                0.017711764 = queryNorm
              0.33391243 = fieldWeight in 2813, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5185254 = idf(docFreq=9329, maxDocs=42596)
                0.09375 = fieldNorm(doc=2813)
          0.078495294 = weight(abstract_txt:represent in 2813) [ClassicSimilarity], result of:
            0.078495294 = score(doc=2813,freq=1.0), product of:
              0.15055147 = queryWeight, product of:
                1.5283957 = boost
                5.5614414 = idf(docFreq=444, maxDocs=42596)
                0.017711764 = queryNorm
              0.52138513 = fieldWeight in 2813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5614414 = idf(docFreq=444, maxDocs=42596)
                0.09375 = fieldNorm(doc=2813)
          0.063911386 = weight(abstract_txt:concept in 2813) [ClassicSimilarity], result of:
            0.063911386 = score(doc=2813,freq=1.0), product of:
              0.15026984 = queryWeight, product of:
                1.870143 = boost
                4.5366488 = idf(docFreq=1239, maxDocs=42596)
                0.017711764 = queryNorm
              0.42531082 = fieldWeight in 2813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5366488 = idf(docFreq=1239, maxDocs=42596)
                0.09375 = fieldNorm(doc=2813)
          0.09402716 = weight(abstract_txt:thesaurus in 2813) [ClassicSimilarity], result of:
            0.09402716 = score(doc=2813,freq=1.0), product of:
              0.19438162 = queryWeight, product of:
                2.1269953 = boost
                5.1597285 = idf(docFreq=664, maxDocs=42596)
                0.017711764 = queryNorm
              0.48372453 = fieldWeight in 2813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1597285 = idf(docFreq=664, maxDocs=42596)
                0.09375 = fieldNorm(doc=2813)
          0.16804673 = weight(abstract_txt:model in 2813) [ClassicSimilarity], result of:
            0.16804673 = score(doc=2813,freq=2.0), product of:
              0.31507885 = queryWeight, product of:
                4.4221444 = boost
                4.0227637 = idf(docFreq=2072, maxDocs=42596)
                0.017711764 = queryNorm
              0.53334814 = fieldWeight in 2813, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0227637 = idf(docFreq=2072, maxDocs=42596)
                0.09375 = fieldNorm(doc=2813)
        0.28 = coord(7/25)
    
  5. Ma, X.; Carranza, E.J.M.; Wu, C.; Meer, F.D. van der; Liu, G.: ¬A SKOS-based multilingual thesaurus of geological time scale for interoperability of online geological maps (2011) 0.14
    0.14245135 = sum of:
      0.14245135 = product of:
        0.5087548 = sum of:
          0.040692456 = weight(abstract_txt:terms in 801) [ClassicSimilarity], result of:
            0.040692456 = score(doc=801,freq=4.0), product of:
              0.08019969 = queryWeight, product of:
                1.1155258 = boost
                4.0591135 = idf(docFreq=1998, maxDocs=42596)
                0.017711764 = queryNorm
              0.5073892 = fieldWeight in 801, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0591135 = idf(docFreq=1998, maxDocs=42596)
                0.0625 = fieldNorm(doc=801)
          0.013745934 = weight(abstract_txt:with in 801) [ClassicSimilarity], result of:
            0.013745934 = score(doc=801,freq=2.0), product of:
              0.06174942 = queryWeight, product of:
                1.3842826 = boost
                2.5185254 = idf(docFreq=9329, maxDocs=42596)
                0.017711764 = queryNorm
              0.2226083 = fieldWeight in 801, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5185254 = idf(docFreq=9329, maxDocs=42596)
                0.0625 = fieldNorm(doc=801)
          0.019683387 = weight(abstract_txt:results in 801) [ClassicSimilarity], result of:
            0.019683387 = score(doc=801,freq=1.0), product of:
              0.089800835 = queryWeight, product of:
                1.4457031 = boost
                3.5070295 = idf(docFreq=3471, maxDocs=42596)
                0.017711764 = queryNorm
              0.21918935 = fieldWeight in 801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5070295 = idf(docFreq=3471, maxDocs=42596)
                0.0625 = fieldNorm(doc=801)
          0.0523302 = weight(abstract_txt:represent in 801) [ClassicSimilarity], result of:
            0.0523302 = score(doc=801,freq=1.0), product of:
              0.15055147 = queryWeight, product of:
                1.5283957 = boost
                5.5614414 = idf(docFreq=444, maxDocs=42596)
                0.017711764 = queryNorm
              0.3475901 = fieldWeight in 801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5614414 = idf(docFreq=444, maxDocs=42596)
                0.0625 = fieldNorm(doc=801)
          0.12536955 = weight(abstract_txt:thesaurus in 801) [ClassicSimilarity], result of:
            0.12536955 = score(doc=801,freq=4.0), product of:
              0.19438162 = queryWeight, product of:
                2.1269953 = boost
                5.1597285 = idf(docFreq=664, maxDocs=42596)
                0.017711764 = queryNorm
              0.64496607 = fieldWeight in 801, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1597285 = idf(docFreq=664, maxDocs=42596)
                0.0625 = fieldNorm(doc=801)
          0.1449021 = weight(abstract_txt:hierarchical in 801) [ClassicSimilarity], result of:
            0.1449021 = score(doc=801,freq=1.0), product of:
              0.40291476 = queryWeight, product of:
                3.9533923 = boost
                5.7541537 = idf(docFreq=366, maxDocs=42596)
                0.017711764 = queryNorm
              0.3596346 = fieldWeight in 801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7541537 = idf(docFreq=366, maxDocs=42596)
                0.0625 = fieldNorm(doc=801)
          0.11203115 = weight(abstract_txt:model in 801) [ClassicSimilarity], result of:
            0.11203115 = score(doc=801,freq=2.0), product of:
              0.31507885 = queryWeight, product of:
                4.4221444 = boost
                4.0227637 = idf(docFreq=2072, maxDocs=42596)
                0.017711764 = queryNorm
              0.35556543 = fieldWeight in 801, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0227637 = idf(docFreq=2072, maxDocs=42596)
                0.0625 = fieldNorm(doc=801)
        0.28 = coord(7/25)