Document (#35281)

Author
Köhler, J.
Philippi, S.
Specht, M.
Rüegg, A.
Title
Ontology based text indexing and querying for the semantic web
Source
Knowledge-based systems. 19(2006) no.8, S.744-754
Year
2006
Abstract
This publication shows how the gap between the HTML based internet and the RDF based vision of the semantic web might be bridged, by linking words in texts to concepts of ontologies. Most current search engines use indexes that are built at the syntactical level and return hits based on simple string comparisons. However, the indexes do not contain synonyms, cannot differentiate between homonyms ('mouse' as a pointing vs. 'mouse' as an animal) and users receive different search results when they use different conjugation forms of the same word. In this publication, we present a system that uses ontologies and Natural Language Processing techniques to index texts, and thus supports word sense disambiguation and the retrieval of texts that contain equivalent words, by indexing them to concepts of ontologies. For this purpose, we developed fully automated methods for mapping equivalent concepts of imported RDF ontologies (for this prototype WordNet, SUMO and OpenCyc). These methods will thus allow the seamless integration of domain specific ontologies for concept based information retrieval in different domains. To demonstrate the practical workability of this approach, a set of web pages that contain synonyms and homonyms were indexed and can be queried via a search engine like query frontend. However, the ontology based indexing approach can also be used for other data mining applications such text clustering, relation mining and for searching free text fields in biological databases. The ontology alignment methods and some of the text mining principles described in this publication are now incorporated into the ONDEX system http://ondex.sourceforge.net/.
Content
Volltext unter: köhler_et_al._-_ontology_based_text_indexing_and_querying_for_the_semantic_web.pdf; köhler_et_al._-_ontology_based_text_indexing_and_querying_for_the_semantic_web_(annex).pdf (Anhang).
Footnote
Vgl.: http://ondex.sourceforge.net/.
Theme
Wissensrepräsentation
Object
ONDEX

Similar documents (author)

  1. Specht, K.: Methoden der Information : Methoden, Theorie, Praxis, Trends (1993) 2.18
    2.184183 = sum of:
      2.184183 = product of:
        4.368366 = sum of:
          4.368366 = weight(author_txt:specht in 352) [ClassicSimilarity], result of:
            4.368366 = score(doc=352,freq=1.0), product of:
              0.7167882 = queryWeight, product of:
                1.0138843 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.07250272 = queryNorm
              6.094361 = fieldWeight in 352, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.625 = fieldNorm(doc=352)
        0.5 = coord(1/2)
    
  2. Specht, G.: Architekturen von Multimedia-Datenbanksystemen zur Speicherung von Bildern und Videos (1998) 2.18
    2.184183 = sum of:
      2.184183 = product of:
        4.368366 = sum of:
          4.368366 = weight(author_txt:specht in 17) [ClassicSimilarity], result of:
            4.368366 = score(doc=17,freq=1.0), product of:
              0.7167882 = queryWeight, product of:
                1.0138843 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.07250272 = queryNorm
              6.094361 = fieldWeight in 17, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.625 = fieldNorm(doc=17)
        0.5 = coord(1/2)
    
  3. Köhler, O.: ¬Der Brockhaus und sein Weltbild (1975) 2.10
    2.0956745 = sum of:
      2.0956745 = product of:
        4.191349 = sum of:
          4.191349 = weight(author_txt:köhler in 3612) [ClassicSimilarity], result of:
            4.191349 = score(doc=3612,freq=1.0), product of:
              0.69729096 = queryWeight, product of:
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.07250272 = queryNorm
              6.010904 = fieldWeight in 3612, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.625 = fieldNorm(doc=3612)
        0.5 = coord(1/2)
    
  4. Köhler, M.: Internationale Verbundsysteme : Projektmanagement und Realisierung durch die Dynix GmbH am Beispiel der kooperierenden Verbünde Deutschlands (1998) 2.10
    2.0956745 = sum of:
      2.0956745 = product of:
        4.191349 = sum of:
          4.191349 = weight(author_txt:köhler in 3421) [ClassicSimilarity], result of:
            4.191349 = score(doc=3421,freq=1.0), product of:
              0.69729096 = queryWeight, product of:
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.07250272 = queryNorm
              6.010904 = fieldWeight in 3421, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.625 = fieldNorm(doc=3421)
        0.5 = coord(1/2)
    
  5. Köhler, R.: Zeitschriftenaufsatz-Datenbanken : ein kritischer Vergleich zwischen Zeitschrifteninhaltsdienst Theologie und Religion Database unter besonderer Berücksichtigung weiterer Datenbanken (1998) 2.10
    2.0956745 = sum of:
      2.0956745 = product of:
        4.191349 = sum of:
          4.191349 = weight(author_txt:köhler in 1245) [ClassicSimilarity], result of:
            4.191349 = score(doc=1245,freq=1.0), product of:
              0.69729096 = queryWeight, product of:
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.07250272 = queryNorm
              6.010904 = fieldWeight in 1245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.625 = fieldNorm(doc=1245)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Saruladha, K.; Aghila, G.; Penchala, S.K.: Design of new indexing techniques based on ontology for information retrieval systems (2010) 0.74
    0.74153167 = sum of:
      0.74153167 = product of:
        1.0904877 = sum of:
          0.010447128 = weight(abstract_txt:that in 4317) [ClassicSimilarity], result of:
            0.010447128 = score(doc=4317,freq=2.0), product of:
              0.049882676 = queryWeight, product of:
                1.0192395 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.020654816 = queryNorm
              0.20943399 = fieldWeight in 4317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.044585023 = weight(abstract_txt:word in 4317) [ClassicSimilarity], result of:
            0.044585023 = score(doc=4317,freq=1.0), product of:
              0.1312435 = queryWeight, product of:
                1.169029 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.020654816 = queryNorm
              0.33971223 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.028830454 = weight(abstract_txt:search in 4317) [ClassicSimilarity], result of:
            0.028830454 = score(doc=4317,freq=2.0), product of:
              0.08916749 = queryWeight, product of:
                1.1801448 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.020654816 = queryNorm
              0.3233292 = fieldWeight in 4317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.029006949 = weight(abstract_txt:different in 4317) [ClassicSimilarity], result of:
            0.029006949 = score(doc=4317,freq=2.0), product of:
              0.089531034 = queryWeight, product of:
                1.1825482 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.020654816 = queryNorm
              0.32398763 = fieldWeight in 4317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.07389971 = weight(abstract_txt:indexes in 4317) [ClassicSimilarity], result of:
            0.07389971 = score(doc=4317,freq=2.0), product of:
              0.14589384 = queryWeight, product of:
                1.2325509 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.020654816 = queryNorm
              0.5065307 = fieldWeight in 4317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.029696885 = weight(abstract_txt:methods in 4317) [ClassicSimilarity], result of:
            0.029696885 = score(doc=4317,freq=1.0), product of:
              0.1145837 = queryWeight, product of:
                1.3378069 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020654816 = queryNorm
              0.259172 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.034271322 = weight(abstract_txt:indexing in 4317) [ClassicSimilarity], result of:
            0.034271322 = score(doc=4317,freq=1.0), product of:
              0.1260674 = queryWeight, product of:
                1.4032447 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.020654816 = queryNorm
              0.27184922 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.039314862 = weight(abstract_txt:concepts in 4317) [ClassicSimilarity], result of:
            0.039314862 = score(doc=4317,freq=1.0), product of:
              0.1381508 = queryWeight, product of:
                1.4689558 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.020654816 = queryNorm
              0.28457934 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.016550578 = weight(abstract_txt:this in 4317) [ClassicSimilarity], result of:
            0.016550578 = score(doc=4317,freq=2.0), product of:
              0.07759928 = queryWeight, product of:
                1.5569543 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.020654816 = queryNorm
              0.21328263 = fieldWeight in 4317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.124489546 = weight(abstract_txt:synonyms in 4317) [ClassicSimilarity], result of:
            0.124489546 = score(doc=4317,freq=1.0), product of:
              0.2602398 = queryWeight, product of:
                1.6461645 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.020654816 = queryNorm
              0.47836474 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.036721118 = weight(abstract_txt:text in 4317) [ClassicSimilarity], result of:
            0.036721118 = score(doc=4317,freq=1.0), product of:
              0.14529112 = queryWeight, product of:
                1.739486 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020654816 = queryNorm
              0.25274166 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.07048687 = weight(abstract_txt:ontology in 4317) [ClassicSimilarity], result of:
            0.07048687 = score(doc=4317,freq=1.0), product of:
              0.20388615 = queryWeight, product of:
                1.7845384 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.020654816 = queryNorm
              0.34571683 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.07528574 = weight(abstract_txt:texts in 4317) [ClassicSimilarity], result of:
            0.07528574 = score(doc=4317,freq=1.0), product of:
              0.21303815 = queryWeight, product of:
                1.8241507 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.020654816 = queryNorm
              0.3533909 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.19334671 = weight(abstract_txt:homonyms in 4317) [ClassicSimilarity], result of:
            0.19334671 = score(doc=4317,freq=1.0), product of:
              0.3490141 = queryWeight, product of:
                1.906373 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.020654816 = queryNorm
              0.55397964 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.09963943 = weight(abstract_txt:contain in 4317) [ClassicSimilarity], result of:
            0.09963943 = score(doc=4317,freq=1.0), product of:
              0.25680473 = queryWeight, product of:
                2.0027814 = boost
                6.2079496 = idf(docFreq=241, maxDocs=44218)
                0.020654816 = queryNorm
              0.38799685 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2079496 = idf(docFreq=241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.04674151 = weight(abstract_txt:based in 4317) [ClassicSimilarity], result of:
            0.04674151 = score(doc=4317,freq=3.0), product of:
              0.1354421 = queryWeight, product of:
                2.0569506 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.020654816 = queryNorm
              0.3451033 = fieldWeight in 4317, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
          0.13717397 = weight(abstract_txt:ontologies in 4317) [ClassicSimilarity], result of:
            0.13717397 = score(doc=4317,freq=1.0), product of:
              0.3768018 = queryWeight, product of:
                3.131936 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.020654816 = queryNorm
              0.3640481 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=4317)
        0.68 = coord(17/25)
    
  2. Jiang, X.; Tan, A.-H.: CRCTOL: a semantic-based domain ontology learning system (2009) 0.55
    0.5522202 = sum of:
      0.5522202 = product of:
        1.061962 = sum of:
          0.014453564 = weight(abstract_txt:that in 3320) [ClassicSimilarity], result of:
            0.014453564 = score(doc=3320,freq=5.0), product of:
              0.049882676 = queryWeight, product of:
                1.0192395 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.020654816 = queryNorm
              0.28975117 = fieldWeight in 3320, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.037264638 = weight(abstract_txt:words in 3320) [ClassicSimilarity], result of:
            0.037264638 = score(doc=3320,freq=1.0), product of:
              0.12729491 = queryWeight, product of:
                1.151309 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.020654816 = queryNorm
              0.29274255 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.039011896 = weight(abstract_txt:word in 3320) [ClassicSimilarity], result of:
            0.039011896 = score(doc=3320,freq=1.0), product of:
              0.1312435 = queryWeight, product of:
                1.169029 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.020654816 = queryNorm
              0.2972482 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.03674802 = weight(abstract_txt:methods in 3320) [ClassicSimilarity], result of:
            0.03674802 = score(doc=3320,freq=2.0), product of:
              0.1145837 = queryWeight, product of:
                1.3378069 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020654816 = queryNorm
              0.320709 = fieldWeight in 3320, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.06880101 = weight(abstract_txt:concepts in 3320) [ClassicSimilarity], result of:
            0.06880101 = score(doc=3320,freq=4.0), product of:
              0.1381508 = queryWeight, product of:
                1.4689558 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.020654816 = queryNorm
              0.49801385 = fieldWeight in 3320, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.010240148 = weight(abstract_txt:this in 3320) [ClassicSimilarity], result of:
            0.010240148 = score(doc=3320,freq=1.0), product of:
              0.07759928 = queryWeight, product of:
                1.5569543 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.020654816 = queryNorm
              0.1319619 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.055652488 = weight(abstract_txt:text in 3320) [ClassicSimilarity], result of:
            0.055652488 = score(doc=3320,freq=3.0), product of:
              0.14529112 = queryWeight, product of:
                1.739486 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020654816 = queryNorm
              0.38304123 = fieldWeight in 3320, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.1631794 = weight(abstract_txt:ontology in 3320) [ClassicSimilarity], result of:
            0.1631794 = score(doc=3320,freq=7.0), product of:
              0.20388615 = queryWeight, product of:
                1.7845384 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.020654816 = queryNorm
              0.80034566 = fieldWeight in 3320, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.06587502 = weight(abstract_txt:texts in 3320) [ClassicSimilarity], result of:
            0.06587502 = score(doc=3320,freq=1.0), product of:
              0.21303815 = queryWeight, product of:
                1.8241507 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.020654816 = queryNorm
              0.30921704 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.14864704 = weight(abstract_txt:mining in 3320) [ClassicSimilarity], result of:
            0.14864704 = score(doc=3320,freq=3.0), product of:
              0.25412104 = queryWeight, product of:
                1.992289 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.020654816 = queryNorm
              0.58494586 = fieldWeight in 3320, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.087184496 = weight(abstract_txt:contain in 3320) [ClassicSimilarity], result of:
            0.087184496 = score(doc=3320,freq=1.0), product of:
              0.25680473 = queryWeight, product of:
                2.0027814 = boost
                6.2079496 = idf(docFreq=241, maxDocs=44218)
                0.020654816 = queryNorm
              0.33949724 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2079496 = idf(docFreq=241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.040898822 = weight(abstract_txt:based in 3320) [ClassicSimilarity], result of:
            0.040898822 = score(doc=3320,freq=3.0), product of:
              0.1354421 = queryWeight, product of:
                2.0569506 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.020654816 = queryNorm
              0.3019654 = fieldWeight in 3320, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.29400545 = weight(abstract_txt:ontologies in 3320) [ClassicSimilarity], result of:
            0.29400545 = score(doc=3320,freq=6.0), product of:
              0.3768018 = queryWeight, product of:
                3.131936 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.020654816 = queryNorm
              0.78026557 = fieldWeight in 3320, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
        0.52 = coord(13/25)
    
  3. Kiren, T.: ¬A clustering based indexing technique of modularized ontologies for information retrieval (2017) 0.34
    0.33744395 = sum of:
      0.33744395 = product of:
        0.84360987 = sum of:
          0.005540426 = weight(abstract_txt:that in 4399) [ClassicSimilarity], result of:
            0.005540426 = score(doc=4399,freq=1.0), product of:
              0.049882676 = queryWeight, product of:
                1.0192395 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.020654816 = queryNorm
              0.11106914 = fieldWeight in 4399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.046875 = fieldNorm(doc=4399)
          0.045171563 = weight(abstract_txt:words in 4399) [ClassicSimilarity], result of:
            0.045171563 = score(doc=4399,freq=2.0), product of:
              0.12729491 = queryWeight, product of:
                1.151309 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.020654816 = queryNorm
              0.35485756 = fieldWeight in 4399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.046875 = fieldNorm(doc=4399)
          0.02162284 = weight(abstract_txt:search in 4399) [ClassicSimilarity], result of:
            0.02162284 = score(doc=4399,freq=2.0), product of:
              0.08916749 = queryWeight, product of:
                1.1801448 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.020654816 = queryNorm
              0.24249691 = fieldWeight in 4399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.046875 = fieldNorm(doc=4399)
          0.015383258 = weight(abstract_txt:different in 4399) [ClassicSimilarity], result of:
            0.015383258 = score(doc=4399,freq=1.0), product of:
              0.089531034 = queryWeight, product of:
                1.1825482 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.020654816 = queryNorm
              0.1718204 = fieldWeight in 4399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.046875 = fieldNorm(doc=4399)
          0.08128158 = weight(abstract_txt:indexing in 4399) [ClassicSimilarity], result of:
            0.08128158 = score(doc=4399,freq=10.0), product of:
              0.1260674 = queryWeight, product of:
                1.4032447 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.020654816 = queryNorm
              0.644747 = fieldWeight in 4399, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.046875 = fieldNorm(doc=4399)
          0.051071502 = weight(abstract_txt:concepts in 4399) [ClassicSimilarity], result of:
            0.051071502 = score(doc=4399,freq=3.0), product of:
              0.1381508 = queryWeight, product of:
                1.4689558 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.020654816 = queryNorm
              0.3696794 = fieldWeight in 4399, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.046875 = fieldNorm(doc=4399)
          0.012412934 = weight(abstract_txt:this in 4399) [ClassicSimilarity], result of:
            0.012412934 = score(doc=4399,freq=2.0), product of:
              0.07759928 = queryWeight, product of:
                1.5569543 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.020654816 = queryNorm
              0.15996197 = fieldWeight in 4399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.046875 = fieldNorm(doc=4399)
          0.19060802 = weight(abstract_txt:ontology in 4399) [ClassicSimilarity], result of:
            0.19060802 = score(doc=4399,freq=13.0), product of:
              0.20388615 = queryWeight, product of:
                1.7845384 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.020654816 = queryNorm
              0.93487483 = fieldWeight in 4399, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.046875 = fieldNorm(doc=4399)
          0.049576864 = weight(abstract_txt:based in 4399) [ClassicSimilarity], result of:
            0.049576864 = score(doc=4399,freq=6.0), product of:
              0.1354421 = queryWeight, product of:
                2.0569506 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.020654816 = queryNorm
              0.36603734 = fieldWeight in 4399, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.046875 = fieldNorm(doc=4399)
          0.37094086 = weight(abstract_txt:ontologies in 4399) [ClassicSimilarity], result of:
            0.37094086 = score(doc=4399,freq=13.0), product of:
              0.3768018 = queryWeight, product of:
                3.131936 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.020654816 = queryNorm
              0.9844456 = fieldWeight in 4399, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.046875 = fieldNorm(doc=4399)
        0.4 = coord(10/25)
    
  4. Styltsvig, H.B.: Ontology-based information retrieval (2006) 0.26
    0.26125184 = sum of:
      0.26125184 = product of:
        0.5937542 = sum of:
          0.01238877 = weight(abstract_txt:that in 1154) [ClassicSimilarity], result of:
            0.01238877 = score(doc=1154,freq=5.0), product of:
              0.049882676 = queryWeight, product of:
                1.0192395 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.020654816 = queryNorm
              0.24835816 = fieldWeight in 1154, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.015383258 = weight(abstract_txt:different in 1154) [ClassicSimilarity], result of:
            0.015383258 = score(doc=1154,freq=1.0), product of:
              0.089531034 = queryWeight, product of:
                1.1825482 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.020654816 = queryNorm
              0.1718204 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.022272665 = weight(abstract_txt:methods in 1154) [ClassicSimilarity], result of:
            0.022272665 = score(doc=1154,freq=1.0), product of:
              0.1145837 = queryWeight, product of:
                1.3378069 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020654816 = queryNorm
              0.194379 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.036350228 = weight(abstract_txt:indexing in 1154) [ClassicSimilarity], result of:
            0.036350228 = score(doc=1154,freq=2.0), product of:
              0.1260674 = queryWeight, product of:
                1.4032447 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.020654816 = queryNorm
              0.28833964 = fieldWeight in 1154, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.065933034 = weight(abstract_txt:concepts in 1154) [ClassicSimilarity], result of:
            0.065933034 = score(doc=1154,freq=5.0), product of:
              0.1381508 = queryWeight, product of:
                1.4689558 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.020654816 = queryNorm
              0.4772541 = fieldWeight in 1154, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.01755454 = weight(abstract_txt:this in 1154) [ClassicSimilarity], result of:
            0.01755454 = score(doc=1154,freq=4.0), product of:
              0.07759928 = queryWeight, product of:
                1.5569543 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.020654816 = queryNorm
              0.2262204 = fieldWeight in 1154, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.027540838 = weight(abstract_txt:text in 1154) [ClassicSimilarity], result of:
            0.027540838 = score(doc=1154,freq=1.0), product of:
              0.14529112 = queryWeight, product of:
                1.739486 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020654816 = queryNorm
              0.18955624 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.07476262 = weight(abstract_txt:ontology in 1154) [ClassicSimilarity], result of:
            0.07476262 = score(doc=1154,freq=2.0), product of:
              0.20388615 = queryWeight, product of:
                1.7845384 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.020654816 = queryNorm
              0.36668807 = fieldWeight in 1154, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.056464307 = weight(abstract_txt:texts in 1154) [ClassicSimilarity], result of:
            0.056464307 = score(doc=1154,freq=1.0), product of:
              0.21303815 = queryWeight, product of:
                1.8241507 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.020654816 = queryNorm
              0.26504317 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.035056137 = weight(abstract_txt:based in 1154) [ClassicSimilarity], result of:
            0.035056137 = score(doc=1154,freq=3.0), product of:
              0.1354421 = queryWeight, product of:
                2.0569506 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.020654816 = queryNorm
              0.25882748 = fieldWeight in 1154, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.23004775 = weight(abstract_txt:ontologies in 1154) [ClassicSimilarity], result of:
            0.23004775 = score(doc=1154,freq=5.0), product of:
              0.3768018 = queryWeight, product of:
                3.131936 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.020654816 = queryNorm
              0.6105272 = fieldWeight in 1154, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
        0.44 = coord(11/25)
    
  5. Dumais, S.T.: Latent semantic analysis (2003) 0.24
    0.23942052 = sum of:
      0.23942052 = product of:
        0.4604241 = sum of:
          0.011080852 = weight(abstract_txt:that in 2462) [ClassicSimilarity], result of:
            0.011080852 = score(doc=2462,freq=9.0), product of:
              0.049882676 = queryWeight, product of:
                1.0192395 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.020654816 = queryNorm
              0.22213829 = fieldWeight in 2462, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.018412381 = weight(abstract_txt:thus in 2462) [ClassicSimilarity], result of:
            0.018412381 = score(doc=2462,freq=1.0), product of:
              0.11553452 = queryWeight, product of:
                1.0968374 = boost
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.020654816 = queryNorm
              0.15936692 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.07376485 = weight(abstract_txt:words in 2462) [ClassicSimilarity], result of:
            0.07376485 = score(doc=2462,freq=12.0), product of:
              0.12729491 = queryWeight, product of:
                1.151309 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.020654816 = queryNorm
              0.57948 = fieldWeight in 2462, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.03861176 = weight(abstract_txt:word in 2462) [ClassicSimilarity], result of:
            0.03861176 = score(doc=2462,freq=3.0), product of:
              0.1312435 = queryWeight, product of:
                1.169029 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.020654816 = queryNorm
              0.2941994 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.017654976 = weight(abstract_txt:search in 2462) [ClassicSimilarity], result of:
            0.017654976 = score(doc=2462,freq=3.0), product of:
              0.08916749 = queryWeight, product of:
                1.1801448 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.020654816 = queryNorm
              0.1979979 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.017763056 = weight(abstract_txt:different in 2462) [ClassicSimilarity], result of:
            0.017763056 = score(doc=2462,freq=3.0), product of:
              0.089531034 = queryWeight, product of:
                1.1825482 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.020654816 = queryNorm
              0.19840111 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.024233487 = weight(abstract_txt:indexing in 2462) [ClassicSimilarity], result of:
            0.024233487 = score(doc=2462,freq=2.0), product of:
              0.1260674 = queryWeight, product of:
                1.4032447 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.020654816 = queryNorm
              0.19222642 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.008275289 = weight(abstract_txt:this in 2462) [ClassicSimilarity], result of:
            0.008275289 = score(doc=2462,freq=2.0), product of:
              0.07759928 = queryWeight, product of:
                1.5569543 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.020654816 = queryNorm
              0.106641315 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.062244773 = weight(abstract_txt:synonyms in 2462) [ClassicSimilarity], result of:
            0.062244773 = score(doc=2462,freq=1.0), product of:
              0.2602398 = queryWeight, product of:
                1.6461645 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.020654816 = queryNorm
              0.23918237 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.044974003 = weight(abstract_txt:text in 2462) [ClassicSimilarity], result of:
            0.044974003 = score(doc=2462,freq=6.0), product of:
              0.14529112 = queryWeight, product of:
                1.739486 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020654816 = queryNorm
              0.30954406 = fieldWeight in 2462, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.07528574 = weight(abstract_txt:texts in 2462) [ClassicSimilarity], result of:
            0.07528574 = score(doc=2462,freq=4.0), product of:
              0.21303815 = queryWeight, product of:
                1.8241507 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.020654816 = queryNorm
              0.3533909 = fieldWeight in 2462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.049040806 = weight(abstract_txt:mining in 2462) [ClassicSimilarity], result of:
            0.049040806 = score(doc=2462,freq=1.0), product of:
              0.25412104 = queryWeight, product of:
                1.992289 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.020654816 = queryNorm
              0.19298208 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.019082142 = weight(abstract_txt:based in 2462) [ClassicSimilarity], result of:
            0.019082142 = score(doc=2462,freq=2.0), product of:
              0.1354421 = queryWeight, product of:
                2.0569506 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.020654816 = queryNorm
              0.14088783 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
        0.52 = coord(13/25)