Document (#38106)

Author
Xamena, E.
Brignole, N.B.
Maguitman, A.G.
Title
A study of relevance propagation in large topic ontologies
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.11, pp.2238-2255
Year
2013
Abstract
Topic ontologies or web directories consist of large collections of links to websites, arranged by topic in different categories. The structure of these ontologies is typically not flat because there are hierarchical and nonhierarchical relationships among topics. As a consequence, websites classified under a certain topic may be relevant to other topics. Although some of these relevance relations are explicit, most of them must be discovered by an analysis of the structure of the ontologies. This article proposes a family of models of relevance propagation in topic ontologies. An efficient computational framework is described and used to compute nine different models for a portion of the Open Directory Project graph consisting of more than half a million nodes and approximately 1.5 million edges of different types. After performing a quantitative analysis, a user study was carried out to compare the most promising models. It was found that some general difficulties rule out the possibility of defining flawless models of relevance propagation that only take into account structural aspects of an ontology. However, there is a clear indication that including transitive relations induced by the nonhierarchical components of the ontology results in relevance propagation models that are superior to more basic approaches.
Theme
Semantic context in indexing and retrieval

Similar documents (content)

  1. Calì, A.; Gottlob, G.; Pieris, A.: The return of the entity-relationship model : ontological query answering (2012) 0.18
    0.18442161 = sum of:
      0.18442161 = product of:
        0.57631755 = sum of:
          0.060082894 = weight(abstract_txt:compute in 434) [ClassicSimilarity], result of:
            0.060082894 = score(doc=434,freq=1.0), product of:
              0.12560059 = queryWeight, product of:
                1.0371895 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.015821747 = queryNorm
              0.47836474 = fieldWeight in 434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=434)
          0.022182215 = weight(abstract_txt:structure in 434) [ClassicSimilarity], result of:
            0.022182215 = score(doc=434,freq=1.0), product of:
              0.081439994 = queryWeight, product of:
                1.1811258 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.015821747 = queryNorm
              0.27237496 = fieldWeight in 434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=434)
          0.023682097 = weight(abstract_txt:large in 434) [ClassicSimilarity], result of:
            0.023682097 = score(doc=434,freq=1.0), product of:
              0.08507094 = queryWeight, product of:
                1.2071685 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.015821747 = queryNorm
              0.27838057 = fieldWeight in 434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=434)
          0.014261326 = weight(abstract_txt:that in 434) [ClassicSimilarity], result of:
            0.014261326 = score(doc=434,freq=4.0), product of:
              0.04815016 = queryWeight, product of:
                1.2843729 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015821747 = queryNorm
              0.2961844 = fieldWeight in 434, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=434)
          0.07856436 = weight(abstract_txt:ontology in 434) [ClassicSimilarity], result of:
            0.07856436 = score(doc=434,freq=3.0), product of:
              0.1312032 = queryWeight, product of:
                1.4991652 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.015821747 = queryNorm
              0.5987991 = fieldWeight in 434, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=434)
          0.045724597 = weight(abstract_txt:relations in 434) [ClassicSimilarity], result of:
            0.045724597 = score(doc=434,freq=1.0), product of:
              0.13190696 = queryWeight, product of:
                1.5031805 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.015821747 = queryNorm
              0.3466428 = fieldWeight in 434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.0625 = fieldNorm(doc=434)
          0.06700072 = weight(abstract_txt:models in 434) [ClassicSimilarity], result of:
            0.06700072 = score(doc=434,freq=1.0), product of:
              0.23095858 = queryWeight, product of:
                3.1449542 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015821747 = queryNorm
              0.2900984 = fieldWeight in 434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=434)
          0.26481935 = weight(abstract_txt:ontologies in 434) [ClassicSimilarity], result of:
            0.26481935 = score(doc=434,freq=4.0), product of:
              0.36371478 = queryWeight, product of:
                3.9466422 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.015821747 = queryNorm
              0.7280962 = fieldWeight in 434, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=434)
        0.32 = coord(8/25)
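
The score breakdown above follows the usual TF-IDF pattern of Lucene's ClassicSimilarity: each matching term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is scaled by the coord factor (matched query terms / total query terms). The following is a minimal Python sketch of that arithmetic, not the search engine's actual code. The function names are illustrative assumptions, the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are inferred from the numbers in the listing, and boost, queryNorm, and fieldNorm are taken as given from the listing rather than derived.

```python
import math


def idf(doc_freq, max_docs):
    # Inverse document frequency, matching the idf(...) lines in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # One "weight(abstract_txt:term in doc)" entry of the listing.
    tf = math.sqrt(freq)                              # tf(freq=...)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm      # queryWeight
    field_weight = tf * term_idf * field_norm         # fieldWeight
    return query_weight * field_weight


def document_score(term_scores, matched_terms, total_terms):
    # Sum of the per-term scores, scaled by coord(matched/total).
    return sum(term_scores) * (matched_terms / total_terms)


if __name__ == "__main__":
    # Values copied from the "compute" entry for doc 434.
    compute = term_score(freq=1.0, doc_freq=56, max_docs=44218,
                         boost=1.0371895, query_norm=0.015821747,
                         field_norm=0.0625)
    print(compute)

    # The eight per-term scores listed for doc 434, with coord(8/25).
    doc_434_terms = [0.060082894, 0.022182215, 0.023682097, 0.014261326,
                     0.07856436, 0.045724597, 0.06700072, 0.26481935]
    print(document_score(doc_434_terms, matched_terms=8, total_terms=25))
```

Up to floating-point rounding (the listing uses single-precision values), the first call reproduces the 0.060082894 shown for "compute" and the second reproduces the overall 0.18442161 for this entry. The same functions apply to the other entries in the list, using their own freq, fieldNorm, and coord values.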
    
  2. Solskinnsbakk, G.; Gulla, J.A.; Haderlein, V.; Myrseth, P.; Cerrato, O.: Quality of hierarchies in ontologies and folksonomies (2012) 0.18
    0.18258315 = sum of:
      0.18258315 = product of:
        0.6520827 = sum of:
          0.022182215 = weight(abstract_txt:structure in 1034) [ClassicSimilarity], result of:
            0.022182215 = score(doc=1034,freq=1.0), product of:
              0.081439994 = queryWeight, product of:
                1.1811258 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.015821747 = queryNorm
              0.27237496 = fieldWeight in 1034, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=1034)
          0.007130663 = weight(abstract_txt:that in 1034) [ClassicSimilarity], result of:
            0.007130663 = score(doc=1034,freq=1.0), product of:
              0.04815016 = queryWeight, product of:
                1.2843729 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015821747 = queryNorm
              0.1480922 = fieldWeight in 1034, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1034)
          0.027999485 = weight(abstract_txt:different in 1034) [ClassicSimilarity], result of:
            0.027999485 = score(doc=1034,freq=2.0), product of:
              0.08642146 = queryWeight, product of:
                1.4901627 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.015821747 = queryNorm
              0.32398763 = fieldWeight in 1034, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=1034)
          0.1111068 = weight(abstract_txt:ontology in 1034) [ClassicSimilarity], result of:
            0.1111068 = score(doc=1034,freq=6.0), product of:
              0.1312032 = queryWeight, product of:
                1.4991652 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.015821747 = queryNorm
              0.8468299 = fieldWeight in 1034, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=1034)
          0.06466434 = weight(abstract_txt:relations in 1034) [ClassicSimilarity], result of:
            0.06466434 = score(doc=1034,freq=2.0), product of:
              0.13190696 = queryWeight, product of:
                1.5031805 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.015821747 = queryNorm
              0.49022692 = fieldWeight in 1034, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.0625 = fieldNorm(doc=1034)
          0.122922175 = weight(abstract_txt:topic in 1034) [ClassicSimilarity], result of:
            0.122922175 = score(doc=1034,freq=2.0), product of:
              0.27472064 = queryWeight, product of:
                3.4299905 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.015821747 = queryNorm
              0.44744426 = fieldWeight in 1034, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=1034)
          0.29607704 = weight(abstract_txt:ontologies in 1034) [ClassicSimilarity], result of:
            0.29607704 = score(doc=1034,freq=5.0), product of:
              0.36371478 = queryWeight, product of:
                3.9466422 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.015821747 = queryNorm
              0.8140363 = fieldWeight in 1034, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=1034)
        0.28 = coord(7/25)
    
  3. Silvello, G.: Theory and practice of data citation (2018) 0.18
    0.18128376 = sum of:
      0.18128376 = product of:
        0.56651175 = sum of:
          0.016438143 = weight(abstract_txt:most in 4006) [ClassicSimilarity], result of:
            0.016438143 = score(doc=4006,freq=1.0), product of:
              0.06669137 = queryWeight, product of:
                1.0688385 = boost
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.015821747 = queryNorm
              0.24648081 = fieldWeight in 4006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.0625 = fieldNorm(doc=4006)
          0.018464318 = weight(abstract_txt:there in 4006) [ClassicSimilarity], result of:
            0.018464318 = score(doc=4006,freq=1.0), product of:
              0.07206482 = queryWeight, product of:
                1.1110636 = boost
                4.099491 = idf(docFreq=1992, maxDocs=44218)
                0.015821747 = queryNorm
              0.2562182 = fieldWeight in 4006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.099491 = idf(docFreq=1992, maxDocs=44218)
                0.0625 = fieldNorm(doc=4006)
          0.023682097 = weight(abstract_txt:large in 4006) [ClassicSimilarity], result of:
            0.023682097 = score(doc=4006,freq=1.0), product of:
              0.08507094 = queryWeight, product of:
                1.2071685 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.015821747 = queryNorm
              0.27838057 = fieldWeight in 4006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=4006)
          0.007130663 = weight(abstract_txt:that in 4006) [ClassicSimilarity], result of:
            0.007130663 = score(doc=4006,freq=1.0), product of:
              0.04815016 = queryWeight, product of:
                1.2843729 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015821747 = queryNorm
              0.1480922 = fieldWeight in 4006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4006)
          0.019798627 = weight(abstract_txt:different in 4006) [ClassicSimilarity], result of:
            0.019798627 = score(doc=4006,freq=1.0), product of:
              0.08642146 = queryWeight, product of:
                1.4901627 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.015821747 = queryNorm
              0.22909386 = fieldWeight in 4006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=4006)
          0.08037344 = weight(abstract_txt:relevance in 4006) [ClassicSimilarity], result of:
            0.08037344 = score(doc=4006,freq=1.0), product of:
              0.26074913 = queryWeight, product of:
                3.3416326 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.015821747 = queryNorm
              0.3082405 = fieldWeight in 4006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=4006)
          0.0869191 = weight(abstract_txt:topic in 4006) [ClassicSimilarity], result of:
            0.0869191 = score(doc=4006,freq=1.0), product of:
              0.27472064 = queryWeight, product of:
                3.4299905 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.015821747 = queryNorm
              0.31639087 = fieldWeight in 4006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=4006)
          0.31370535 = weight(abstract_txt:propagation in 4006) [ClassicSimilarity], result of:
            0.31370535 = score(doc=4006,freq=1.0), product of:
              0.6000569 = queryWeight, product of:
                4.534073 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.015821747 = queryNorm
              0.5227927 = fieldWeight in 4006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.0625 = fieldNorm(doc=4006)
        0.32 = coord(8/25)
    
  4. Alkhodair, S.A.; Fung, B.C.M.; Patrick, O.R.; Hung, C.K.: Improving interpretations of topic modeling in microblogs (2018) 0.15
    0.14808331 = sum of:
      0.14808331 = product of:
        0.528869 = sum of:
          0.022182215 = weight(abstract_txt:structure in 4181) [ClassicSimilarity], result of:
            0.022182215 = score(doc=4181,freq=1.0), product of:
              0.081439994 = queryWeight, product of:
                1.1811258 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.015821747 = queryNorm
              0.27237496 = fieldWeight in 4181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
          0.023682097 = weight(abstract_txt:large in 4181) [ClassicSimilarity], result of:
            0.023682097 = score(doc=4181,freq=1.0), product of:
              0.08507094 = queryWeight, product of:
                1.2071685 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.015821747 = queryNorm
              0.27838057 = fieldWeight in 4181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
          0.014261326 = weight(abstract_txt:that in 4181) [ClassicSimilarity], result of:
            0.014261326 = score(doc=4181,freq=4.0), product of:
              0.04815016 = queryWeight, product of:
                1.2843729 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015821747 = queryNorm
              0.2961844 = fieldWeight in 4181, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
          0.07885087 = weight(abstract_txt:topics in 4181) [ClassicSimilarity], result of:
            0.07885087 = score(doc=4181,freq=5.0), product of:
              0.11092994 = queryWeight, product of:
                1.3784838 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.015821747 = queryNorm
              0.71081686 = fieldWeight in 4181, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
          0.027999485 = weight(abstract_txt:different in 4181) [ClassicSimilarity], result of:
            0.027999485 = score(doc=4181,freq=2.0), product of:
              0.08642146 = queryWeight, product of:
                1.4901627 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.015821747 = queryNorm
              0.32398763 = fieldWeight in 4181, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
          0.116048634 = weight(abstract_txt:models in 4181) [ClassicSimilarity], result of:
            0.116048634 = score(doc=4181,freq=3.0), product of:
              0.23095858 = queryWeight, product of:
                3.1449542 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015821747 = queryNorm
              0.5024651 = fieldWeight in 4181, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
          0.24584435 = weight(abstract_txt:topic in 4181) [ClassicSimilarity], result of:
            0.24584435 = score(doc=4181,freq=8.0), product of:
              0.27472064 = queryWeight, product of:
                3.4299905 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.015821747 = queryNorm
              0.8948885 = fieldWeight in 4181, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
        0.28 = coord(7/25)
    
  5. King, B.E.; Reinold, K.: Finding the concept, not just the word : a librarian's guide to ontologies and semantics (2008) 0.14
    0.14147882 = sum of:
      0.14147882 = product of:
        0.5052815 = sum of:
          0.010273838 = weight(abstract_txt:most in 2863) [ClassicSimilarity], result of:
            0.010273838 = score(doc=2863,freq=1.0), product of:
              0.06669137 = queryWeight, product of:
                1.0688385 = boost
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.015821747 = queryNorm
              0.1540505 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2863)
          0.013863885 = weight(abstract_txt:structure in 2863) [ClassicSimilarity], result of:
            0.013863885 = score(doc=2863,freq=1.0), product of:
              0.081439994 = queryWeight, product of:
                1.1811258 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.015821747 = queryNorm
              0.17023435 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2863)
          0.022039488 = weight(abstract_txt:topics in 2863) [ClassicSimilarity], result of:
            0.022039488 = score(doc=2863,freq=1.0), product of:
              0.11092994 = queryWeight, product of:
                1.3784838 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.015821747 = queryNorm
              0.19867934 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2863)
          0.012374141 = weight(abstract_txt:different in 2863) [ClassicSimilarity], result of:
            0.012374141 = score(doc=2863,freq=1.0), product of:
              0.08642146 = queryWeight, product of:
                1.4901627 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.015821747 = queryNorm
              0.14318366 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2863)
          0.09402456 = weight(abstract_txt:ontology in 2863) [ClassicSimilarity], result of:
            0.09402456 = score(doc=2863,freq=11.0), product of:
              0.1312032 = queryWeight, product of:
                1.4991652 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.015821747 = queryNorm
              0.71663314 = fieldWeight in 2863, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2863)
          0.05432444 = weight(abstract_txt:topic in 2863) [ClassicSimilarity], result of:
            0.05432444 = score(doc=2863,freq=1.0), product of:
              0.27472064 = queryWeight, product of:
                3.4299905 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.015821747 = queryNorm
              0.1977443 = fieldWeight in 2863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2863)
          0.29838115 = weight(abstract_txt:ontologies in 2863) [ClassicSimilarity], result of:
            0.29838115 = score(doc=2863,freq=13.0), product of:
              0.36371478 = queryWeight, product of:
                3.9466422 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.015821747 = queryNorm
              0.82037127 = fieldWeight in 2863, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2863)
        0.28 = coord(7/25)