Document (#38281)

Author
Koumenides, C.L.
Shadbolt, N.R.
Title
Ranking methods for entity-oriented semantic web search
Source
Journal of the Association for Information Science and Technology. 65(2014) no.6, S.1091-1106
Year
2014
Series
Advances in information science
Abstract
This article provides a technical review of semantic search methods used to support text-based search over formal Semantic Web knowledge bases. Our focus is on ranking methods and auxiliary processes explored by existing semantic search systems, outlined within broad areas of classification. We present reflective examples from the literature in some detail, which should appeal to readers interested in a deeper perspective on the various methods and systems implemented in the outlined literature. The presentation covers graph exploration and propagation methods, adaptations of classic probabilistic retrieval models, and query-independent link analysis via flexible extensions to the PageRank algorithm. Future research directions are discussed, including development of more cohesive retrieval models to unlock further potentials and uses, data indexing schemes, integration with user interfaces, and building community consensus for more systematic evaluation and gradual development.
Content
Verfügbar unter: http://onlinelibrary.wiley.com/doi/10.1002/asi.23018/pdf.
Theme
Retrievalalgorithmen

Similar documents (content)

  1. Mayr, P.; Mutschke, P.; Petras, V.: Reducing semantic complexity in distributed digital libraries : Treatment of term vagueness and document re-ranking (2008) 0.14
    0.14111054 = sum of:
      0.14111054 = product of:
        0.5039662 = sum of:
          0.013813064 = weight(abstract_txt:more in 1909) [ClassicSimilarity], result of:
            0.013813064 = score(doc=1909,freq=1.0), product of:
              0.07424316 = queryWeight, product of:
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.021822821 = queryNorm
              0.18605168 = fieldWeight in 1909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.0147221135 = weight(abstract_txt:retrieval in 1909) [ClassicSimilarity], result of:
            0.0147221135 = score(doc=1909,freq=1.0), product of:
              0.07746577 = queryWeight, product of:
                1.0214726 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021822821 = queryNorm
              0.19004668 = fieldWeight in 1909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.035079177 = weight(abstract_txt:models in 1909) [ClassicSimilarity], result of:
            0.035079177 = score(doc=1909,freq=1.0), product of:
              0.13819617 = queryWeight, product of:
                1.3643311 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.021822821 = queryNorm
              0.2538361 = fieldWeight in 1909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.1376659 = weight(abstract_txt:ranking in 1909) [ClassicSimilarity], result of:
            0.1376659 = score(doc=1909,freq=5.0), product of:
              0.20107464 = queryWeight, product of:
                1.6456991 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.021822821 = queryNorm
              0.6846507 = fieldWeight in 1909, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.068685204 = weight(abstract_txt:search in 1909) [ClassicSimilarity], result of:
            0.068685204 = score(doc=1909,freq=4.0), product of:
              0.17167032 = queryWeight, product of:
                2.150475 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.021822821 = queryNorm
              0.4000995 = fieldWeight in 1909, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.12568833 = weight(abstract_txt:semantic in 1909) [ClassicSimilarity], result of:
            0.12568833 = score(doc=1909,freq=4.0), product of:
              0.25683233 = queryWeight, product of:
                2.6303384 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.021822821 = queryNorm
              0.4893789 = fieldWeight in 1909, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.108312406 = weight(abstract_txt:methods in 1909) [ClassicSimilarity], result of:
            0.108312406 = score(doc=1909,freq=3.0), product of:
              0.27575377 = queryWeight, product of:
                3.047211 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.021822821 = queryNorm
              0.39278668 = fieldWeight in 1909, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
        0.28 = coord(7/25)
    
  2. Hubrich, J.: Intersystem relations : Characteristics and functionalities (2011) 0.14
    0.13693845 = sum of:
      0.13693845 = product of:
        0.68469226 = sum of:
          0.033650544 = weight(abstract_txt:retrieval in 4780) [ClassicSimilarity], result of:
            0.033650544 = score(doc=4780,freq=1.0), product of:
              0.07746577 = queryWeight, product of:
                1.0214726 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021822821 = queryNorm
              0.43439242 = fieldWeight in 4780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=4780)
          0.22646604 = weight(abstract_txt:outlined in 4780) [ClassicSimilarity], result of:
            0.22646604 = score(doc=4780,freq=1.0), product of:
              0.2761323 = queryWeight, product of:
                1.9285476 = boost
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.021822821 = queryNorm
              0.8201361 = fieldWeight in 4780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.125 = fieldNorm(doc=4780)
          0.07849738 = weight(abstract_txt:search in 4780) [ClassicSimilarity], result of:
            0.07849738 = score(doc=4780,freq=1.0), product of:
              0.17167032 = queryWeight, product of:
                2.150475 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.021822821 = queryNorm
              0.45725656 = fieldWeight in 4780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.125 = fieldNorm(doc=4780)
          0.203143 = weight(abstract_txt:semantic in 4780) [ClassicSimilarity], result of:
            0.203143 = score(doc=4780,freq=2.0), product of:
              0.25683233 = queryWeight, product of:
                2.6303384 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.021822821 = queryNorm
              0.7909557 = fieldWeight in 4780, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.125 = fieldNorm(doc=4780)
          0.1429353 = weight(abstract_txt:methods in 4780) [ClassicSimilarity], result of:
            0.1429353 = score(doc=4780,freq=1.0), product of:
              0.27575377 = queryWeight, product of:
                3.047211 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.021822821 = queryNorm
              0.518344 = fieldWeight in 4780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.125 = fieldNorm(doc=4780)
        0.2 = coord(5/25)
    
  3. Biagetti, M.T.: Pertinence perspective and OPAC enhancement 0.13
    0.13255996 = sum of:
      0.13255996 = product of:
        0.5523332 = sum of:
          0.03425264 = weight(abstract_txt:development in 3549) [ClassicSimilarity], result of:
            0.03425264 = score(doc=3549,freq=1.0), product of:
              0.094959185 = queryWeight, product of:
                1.1309419 = boost
                3.8475635 = idf(docFreq=2563, maxDocs=44218)
                0.021822821 = queryNorm
              0.36070907 = fieldWeight in 3549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8475635 = idf(docFreq=2563, maxDocs=44218)
                0.09375 = fieldNorm(doc=3549)
          0.051697314 = weight(abstract_txt:literature in 3549) [ClassicSimilarity], result of:
            0.051697314 = score(doc=3549,freq=1.0), product of:
              0.1249452 = queryWeight, product of:
                1.2972736 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.021822821 = queryNorm
              0.4137599 = fieldWeight in 3549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.09375 = fieldNorm(doc=3549)
          0.10554182 = weight(abstract_txt:ranking in 3549) [ClassicSimilarity], result of:
            0.10554182 = score(doc=3549,freq=1.0), product of:
              0.20107464 = queryWeight, product of:
                1.6456991 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.021822821 = queryNorm
              0.52488875 = fieldWeight in 3549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.09375 = fieldNorm(doc=3549)
          0.16984953 = weight(abstract_txt:outlined in 3549) [ClassicSimilarity], result of:
            0.16984953 = score(doc=3549,freq=1.0), product of:
              0.2761323 = queryWeight, product of:
                1.9285476 = boost
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.021822821 = queryNorm
              0.61510205 = fieldWeight in 3549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5610886 = idf(docFreq=169, maxDocs=44218)
                0.09375 = fieldNorm(doc=3549)
          0.08325904 = weight(abstract_txt:search in 3549) [ClassicSimilarity], result of:
            0.08325904 = score(doc=3549,freq=2.0), product of:
              0.17167032 = queryWeight, product of:
                2.150475 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.021822821 = queryNorm
              0.48499382 = fieldWeight in 3549, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.09375 = fieldNorm(doc=3549)
          0.10773285 = weight(abstract_txt:semantic in 3549) [ClassicSimilarity], result of:
            0.10773285 = score(doc=3549,freq=1.0), product of:
              0.25683233 = queryWeight, product of:
                2.6303384 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.021822821 = queryNorm
              0.41946763 = fieldWeight in 3549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.09375 = fieldNorm(doc=3549)
        0.24 = coord(6/25)
    
  4. Ning, X.; Jin, H.; Wu, H.: RSS: a framework enabling ranked search on the semantic web (2008) 0.12
    0.1230274 = sum of:
      0.1230274 = product of:
        0.615137 = sum of:
          0.088082984 = weight(abstract_txt:pagerank in 2069) [ClassicSimilarity], result of:
            0.088082984 = score(doc=2069,freq=1.0), product of:
              0.18537584 = queryWeight, product of:
                1.1173348 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.021822821 = queryNorm
              0.47515893 = fieldWeight in 2069, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.0625 = fieldNorm(doc=2069)
          0.099505775 = weight(abstract_txt:ranking in 2069) [ClassicSimilarity], result of:
            0.099505775 = score(doc=2069,freq=2.0), product of:
              0.20107464 = queryWeight, product of:
                1.6456991 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.021822821 = queryNorm
              0.49486983 = fieldWeight in 2069, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.0625 = fieldNorm(doc=2069)
          0.11101206 = weight(abstract_txt:search in 2069) [ClassicSimilarity], result of:
            0.11101206 = score(doc=2069,freq=8.0), product of:
              0.17167032 = queryWeight, product of:
                2.150475 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.021822821 = queryNorm
              0.6466584 = fieldWeight in 2069, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=2069)
          0.2154657 = weight(abstract_txt:semantic in 2069) [ClassicSimilarity], result of:
            0.2154657 = score(doc=2069,freq=9.0), product of:
              0.25683233 = queryWeight, product of:
                2.6303384 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.021822821 = queryNorm
              0.83893526 = fieldWeight in 2069, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=2069)
          0.10107052 = weight(abstract_txt:methods in 2069) [ClassicSimilarity], result of:
            0.10107052 = score(doc=2069,freq=2.0), product of:
              0.27575377 = queryWeight, product of:
                3.047211 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.021822821 = queryNorm
              0.36652455 = fieldWeight in 2069, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=2069)
        0.2 = coord(5/25)
    
  5. Urbain, J.; Goharian, N.; Frieder, O.: Probabilistic passage models for semantic search of genomics literature (2008) 0.12
    0.118702516 = sum of:
      0.118702516 = product of:
        0.42393756 = sum of:
          0.050475817 = weight(abstract_txt:retrieval in 2380) [ClassicSimilarity], result of:
            0.050475817 = score(doc=2380,freq=9.0), product of:
              0.07746577 = queryWeight, product of:
                1.0214726 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021822821 = queryNorm
              0.6515886 = fieldWeight in 2380, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2380)
          0.034464873 = weight(abstract_txt:literature in 2380) [ClassicSimilarity], result of:
            0.034464873 = score(doc=2380,freq=1.0), product of:
              0.1249452 = queryWeight, product of:
                1.2972736 = boost
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.021822821 = queryNorm
              0.27583992 = fieldWeight in 2380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413439 = idf(docFreq=1455, maxDocs=44218)
                0.0625 = fieldNorm(doc=2380)
          0.040090486 = weight(abstract_txt:models in 2380) [ClassicSimilarity], result of:
            0.040090486 = score(doc=2380,freq=1.0), product of:
              0.13819617 = queryWeight, product of:
                1.3643311 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.021822821 = queryNorm
              0.2900984 = fieldWeight in 2380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=2380)
          0.07036121 = weight(abstract_txt:ranking in 2380) [ClassicSimilarity], result of:
            0.07036121 = score(doc=2380,freq=1.0), product of:
              0.20107464 = queryWeight, product of:
                1.6456991 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.021822821 = queryNorm
              0.34992582 = fieldWeight in 2380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.0625 = fieldNorm(doc=2380)
          0.05550603 = weight(abstract_txt:search in 2380) [ClassicSimilarity], result of:
            0.05550603 = score(doc=2380,freq=2.0), product of:
              0.17167032 = queryWeight, product of:
                2.150475 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.021822821 = queryNorm
              0.3233292 = fieldWeight in 2380, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=2380)
          0.1015715 = weight(abstract_txt:semantic in 2380) [ClassicSimilarity], result of:
            0.1015715 = score(doc=2380,freq=2.0), product of:
              0.25683233 = queryWeight, product of:
                2.6303384 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.021822821 = queryNorm
              0.39547786 = fieldWeight in 2380, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=2380)
          0.07146765 = weight(abstract_txt:methods in 2380) [ClassicSimilarity], result of:
            0.07146765 = score(doc=2380,freq=1.0), product of:
              0.27575377 = queryWeight, product of:
                3.047211 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.021822821 = queryNorm
              0.259172 = fieldWeight in 2380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=2380)
        0.28 = coord(7/25)