Document (#41371)

Author
Adhikari, A.
Dutta, B.
Dutta, A.
Mondal, D.
Singh, S.
Title
¬An intrinsic information content-based semantic similarity measure considering the disjoint common subsumers of concepts of an ontology
Source
Journal of the Association for Information Science and Technology. 69(2018) no.8, S.1023-1034
Year
2018
Abstract
Finding similarity between concepts based on semantics has become a new trend in many applications (e.g., biomedical informatics, natural language processing). Measuring the Semantic Similarity (SS) with higher accuracy is a challenging task. In this context, the Information Content (IC)-based SS measure has gained popularity over the others. The notion of IC evolves from the science of information theory. Information theory has very high potential to characterize the semantics of concepts. Designing an IC-based SS framework comprises (i) an IC calculator, and (ii) an SS calculator. In this article, we propose a generic intrinsic IC-based SS calculator. We also introduce here a new structural aspect of an ontology called DCS (Disjoint Common Subsumers) that plays a significant role in deciding the similarity between two concepts. We evaluated our proposed similarity calculator with the existing intrinsic IC-based similarity calculators, as well as corpora-dependent similarity calculators using several benchmark data sets. The experimental results show that the proposed similarity calculator produces a high correlation with human evaluation over the existing state-of-the-art IC-based similarity calculators.
Content
Vgl.: https://onlinelibrary.wiley.com/doi/abs/10.1002/asi.24021.
Theme
Semantisches Umfeld in Indexierung u. Retrieval

Similar documents (author)

  1. Dutta, B.: Organizing knowledge : then and now (2015) 2.63
    2.6342568 = sum of:
      2.6342568 = product of:
        5.2685137 = sum of:
          5.2685137 = weight(author_txt:dutta in 6631) [ClassicSimilarity], result of:
            5.2685137 = score(doc=6631,freq=1.0), product of:
              0.91657245 = queryWeight, product of:
                2.1411135 = boost
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.04654637 = queryNorm
              5.74806 = fieldWeight in 6631, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.625 = fieldNorm(doc=6631)
        0.5 = coord(1/2)
    
  2. Dutta, A.: ¬A journey from Cutter to Austin : critical analysis of their contribution in subject indexing (2017) 2.63
    2.6342568 = sum of:
      2.6342568 = product of:
        5.2685137 = sum of:
          5.2685137 = weight(author_txt:dutta in 2533) [ClassicSimilarity], result of:
            5.2685137 = score(doc=2533,freq=1.0), product of:
              0.91657245 = queryWeight, product of:
                2.1411135 = boost
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.04654637 = queryNorm
              5.74806 = fieldWeight in 2533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.625 = fieldNorm(doc=2533)
        0.5 = coord(1/2)
    
  3. Dutta, B.: Ranganathan's elucidation of subject in the light of 'Infinity (8)' (2015) 2.63
    2.6342568 = sum of:
      2.6342568 = product of:
        5.2685137 = sum of:
          5.2685137 = weight(author_txt:dutta in 4792) [ClassicSimilarity], result of:
            5.2685137 = score(doc=4792,freq=1.0), product of:
              0.91657245 = queryWeight, product of:
                2.1411135 = boost
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.04654637 = queryNorm
              5.74806 = fieldWeight in 4792, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.625 = fieldNorm(doc=4792)
        0.5 = coord(1/2)
    
  4. Sinha, P.K.; Dutta, B.: ¬A systematic analysis of flood ontologies : a parametric approach (2020) 2.11
    2.1074054 = sum of:
      2.1074054 = product of:
        4.214811 = sum of:
          4.214811 = weight(author_txt:dutta in 2044) [ClassicSimilarity], result of:
            4.214811 = score(doc=2044,freq=1.0), product of:
              0.91657245 = queryWeight, product of:
                2.1411135 = boost
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.04654637 = queryNorm
              4.5984483 = fieldWeight in 2044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.5 = fieldNorm(doc=2044)
        0.5 = coord(1/2)
    
  5. Giunchiglia, F.; Maltese, V.; Dutta, B.: Domains and context : first steps towards managing diversity in knowledge (2011) 1.58
    1.5805541 = sum of:
      1.5805541 = product of:
        3.1611083 = sum of:
          3.1611083 = weight(author_txt:dutta in 2601) [ClassicSimilarity], result of:
            3.1611083 = score(doc=2601,freq=1.0), product of:
              0.91657245 = queryWeight, product of:
                2.1411135 = boost
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.04654637 = queryNorm
              3.4488363 = fieldWeight in 2601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.375 = fieldNorm(doc=2601)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Jiang, Y.; Zhang, X.; Tang, Y.; Nie, R.: Feature-based approaches to semantic similarity assessment of concepts using Wikipedia (2015) 0.40
    0.3954517 = sum of:
      0.3954517 = product of:
        1.0984769 = sum of:
          0.053458974 = weight(abstract_txt:benchmark in 4680) [ClassicSimilarity], result of:
            0.053458974 = score(doc=4680,freq=1.0), product of:
              0.116768956 = queryWeight, product of:
                1.0351101 = boost
                7.3250937 = idf(docFreq=77, maxDocs=43556)
                0.015400246 = queryNorm
              0.45781836 = fieldWeight in 4680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3250937 = idf(docFreq=77, maxDocs=43556)
                0.0625 = fieldNorm(doc=4680)
          0.009110346 = weight(abstract_txt:with in 4680) [ClassicSimilarity], result of:
            0.009110346 = score(doc=4680,freq=2.0), product of:
              0.041086644 = queryWeight, product of:
                1.063491 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.015400246 = queryNorm
              0.22173499 = fieldWeight in 4680, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.0625 = fieldNorm(doc=4680)
          0.049037375 = weight(abstract_txt:semantic in 4680) [ClassicSimilarity], result of:
            0.049037375 = score(doc=4680,freq=4.0), product of:
              0.08749606 = queryWeight, product of:
                1.2671618 = boost
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.015400246 = queryNorm
              0.5604524 = fieldWeight in 4680, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.0625 = fieldNorm(doc=4680)
          0.038053557 = weight(abstract_txt:proposed in 4680) [ClassicSimilarity], result of:
            0.038053557 = score(doc=4680,freq=2.0), product of:
              0.0930916 = queryWeight, product of:
                1.3070527 = boost
                4.624766 = idf(docFreq=1160, maxDocs=43556)
                0.015400246 = queryNorm
              0.40877542 = fieldWeight in 4680, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.624766 = idf(docFreq=1160, maxDocs=43556)
                0.0625 = fieldNorm(doc=4680)
          0.027812395 = weight(abstract_txt:existing in 4680) [ClassicSimilarity], result of:
            0.027812395 = score(doc=4680,freq=1.0), product of:
              0.09516617 = queryWeight, product of:
                1.3215364 = boost
                4.676014 = idf(docFreq=1102, maxDocs=43556)
                0.015400246 = queryNorm
              0.29225087 = fieldWeight in 4680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.676014 = idf(docFreq=1102, maxDocs=43556)
                0.0625 = fieldNorm(doc=4680)
          0.046738386 = weight(abstract_txt:ontology in 4680) [ClassicSimilarity], result of:
            0.046738386 = score(doc=4680,freq=1.0), product of:
              0.13451564 = queryWeight, product of:
                1.5711739 = boost
                5.55931 = idf(docFreq=455, maxDocs=43556)
                0.015400246 = queryNorm
              0.34745687 = fieldWeight in 4680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.55931 = idf(docFreq=455, maxDocs=43556)
                0.0625 = fieldNorm(doc=4680)
          0.11594729 = weight(abstract_txt:concepts in 4680) [ClassicSimilarity], result of:
            0.11594729 = score(doc=4680,freq=5.0), product of:
              0.18162853 = queryWeight, product of:
                2.5819325 = boost
                4.567847 = idf(docFreq=1228, maxDocs=43556)
                0.015400246 = queryNorm
              0.638376 = fieldWeight in 4680, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.567847 = idf(docFreq=1228, maxDocs=43556)
                0.0625 = fieldNorm(doc=4680)
          0.07640687 = weight(abstract_txt:based in 4680) [ClassicSimilarity], result of:
            0.07640687 = score(doc=4680,freq=6.0), product of:
              0.15597351 = queryWeight, product of:
                3.1651719 = boost
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.015400246 = queryNorm
              0.4898708 = fieldWeight in 4680, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.0625 = fieldNorm(doc=4680)
          0.68191165 = weight(abstract_txt:similarity in 4680) [ClassicSimilarity], result of:
            0.68191165 = score(doc=4680,freq=8.0), product of:
              0.66300464 = queryWeight, product of:
                7.3995004 = boost
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.015400246 = queryNorm
              1.0285171 = fieldWeight in 4680, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.0625 = fieldNorm(doc=4680)
        0.36 = coord(9/25)
    
  2. Jiang, Y.; Bai, W.; Zhang, X.; Hu, J.: Wikipedia-based information content and semantic similarity computation (2017) 0.39
    0.3872779 = sum of:
      0.3872779 = product of:
        0.806829 = sum of:
          0.048201542 = weight(abstract_txt:corpora in 4875) [ClassicSimilarity], result of:
            0.048201542 = score(doc=4875,freq=1.0), product of:
              0.10898188 = queryWeight, product of:
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.015400246 = queryNorm
              0.44228953 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.053458974 = weight(abstract_txt:benchmark in 4875) [ClassicSimilarity], result of:
            0.053458974 = score(doc=4875,freq=1.0), product of:
              0.116768956 = queryWeight, product of:
                1.0351101 = boost
                7.3250937 = idf(docFreq=77, maxDocs=43556)
                0.015400246 = queryNorm
              0.45781836 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3250937 = idf(docFreq=77, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.009110346 = weight(abstract_txt:with in 4875) [ClassicSimilarity], result of:
            0.009110346 = score(doc=4875,freq=2.0), product of:
              0.041086644 = queryWeight, product of:
                1.063491 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.015400246 = queryNorm
              0.22173499 = fieldWeight in 4875, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.020044545 = weight(abstract_txt:content in 4875) [ClassicSimilarity], result of:
            0.020044545 = score(doc=4875,freq=1.0), product of:
              0.07649877 = queryWeight, product of:
                1.1848546 = boost
                4.1923904 = idf(docFreq=1788, maxDocs=43556)
                0.015400246 = queryNorm
              0.2620244 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1923904 = idf(docFreq=1788, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.042467613 = weight(abstract_txt:semantic in 4875) [ClassicSimilarity], result of:
            0.042467613 = score(doc=4875,freq=3.0), product of:
              0.08749606 = queryWeight, product of:
                1.2671618 = boost
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.015400246 = queryNorm
              0.48536602 = fieldWeight in 4875, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.038053557 = weight(abstract_txt:proposed in 4875) [ClassicSimilarity], result of:
            0.038053557 = score(doc=4875,freq=2.0), product of:
              0.0930916 = queryWeight, product of:
                1.3070527 = boost
                4.624766 = idf(docFreq=1160, maxDocs=43556)
                0.015400246 = queryNorm
              0.40877542 = fieldWeight in 4875, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.624766 = idf(docFreq=1160, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.039332666 = weight(abstract_txt:existing in 4875) [ClassicSimilarity], result of:
            0.039332666 = score(doc=4875,freq=2.0), product of:
              0.09516617 = queryWeight, product of:
                1.3215364 = boost
                4.676014 = idf(docFreq=1102, maxDocs=43556)
                0.015400246 = queryNorm
              0.41330513 = fieldWeight in 4875, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.676014 = idf(docFreq=1102, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.0077608437 = weight(abstract_txt:information in 4875) [ClassicSimilarity], result of:
            0.0077608437 = score(doc=4875,freq=1.0), product of:
              0.051200353 = queryWeight, product of:
                1.3708481 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.015400246 = queryNorm
              0.15157793 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.059623107 = weight(abstract_txt:semantics in 4875) [ClassicSimilarity], result of:
            0.059623107 = score(doc=4875,freq=1.0), product of:
              0.15822195 = queryWeight, product of:
                1.7040064 = boost
                6.0293136 = idf(docFreq=284, maxDocs=43556)
                0.015400246 = queryNorm
              0.3768321 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0293136 = idf(docFreq=284, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.10370641 = weight(abstract_txt:concepts in 4875) [ClassicSimilarity], result of:
            0.10370641 = score(doc=4875,freq=4.0), product of:
              0.18162853 = queryWeight, product of:
                2.5819325 = boost
                4.567847 = idf(docFreq=1228, maxDocs=43556)
                0.015400246 = queryNorm
              0.57098085 = fieldWeight in 4875, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.567847 = idf(docFreq=1228, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.044113524 = weight(abstract_txt:based in 4875) [ClassicSimilarity], result of:
            0.044113524 = score(doc=4875,freq=2.0), product of:
              0.15597351 = queryWeight, product of:
                3.1651719 = boost
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.015400246 = queryNorm
              0.28282702 = fieldWeight in 4875, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
          0.34095582 = weight(abstract_txt:similarity in 4875) [ClassicSimilarity], result of:
            0.34095582 = score(doc=4875,freq=2.0), product of:
              0.66300464 = queryWeight, product of:
                7.3995004 = boost
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.015400246 = queryNorm
              0.51425856 = fieldWeight in 4875, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.0625 = fieldNorm(doc=4875)
        0.48 = coord(12/25)
    
  3. Wan, X.; Yang, J.; Xiao, J.: Towards a unified approach to document similarity search using manifold-ranking of blocks (2008) 0.29
    0.28861105 = sum of:
      0.28861105 = product of:
        0.9019096 = sum of:
          0.006441988 = weight(abstract_txt:with in 4079) [ClassicSimilarity], result of:
            0.006441988 = score(doc=4079,freq=1.0), product of:
              0.041086644 = queryWeight, product of:
                1.063491 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.015400246 = queryNorm
              0.15679032 = fieldWeight in 4079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.0625 = fieldNorm(doc=4079)
          0.020959813 = weight(abstract_txt:over in 4079) [ClassicSimilarity], result of:
            0.020959813 = score(doc=4079,freq=1.0), product of:
              0.07881011 = queryWeight, product of:
                1.2026211 = boost
                4.255254 = idf(docFreq=1679, maxDocs=43556)
                0.015400246 = queryNorm
              0.26595336 = fieldWeight in 4079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.255254 = idf(docFreq=1679, maxDocs=43556)
                0.0625 = fieldNorm(doc=4079)
          0.038053557 = weight(abstract_txt:proposed in 4079) [ClassicSimilarity], result of:
            0.038053557 = score(doc=4079,freq=2.0), product of:
              0.0930916 = queryWeight, product of:
                1.3070527 = boost
                4.624766 = idf(docFreq=1160, maxDocs=43556)
                0.015400246 = queryNorm
              0.40877542 = fieldWeight in 4079, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.624766 = idf(docFreq=1160, maxDocs=43556)
                0.0625 = fieldNorm(doc=4079)
          0.039332666 = weight(abstract_txt:existing in 4079) [ClassicSimilarity], result of:
            0.039332666 = score(doc=4079,freq=2.0), product of:
              0.09516617 = queryWeight, product of:
                1.3215364 = boost
                4.676014 = idf(docFreq=1102, maxDocs=43556)
                0.015400246 = queryNorm
              0.41330513 = fieldWeight in 4079, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.676014 = idf(docFreq=1102, maxDocs=43556)
                0.0625 = fieldNorm(doc=4079)
          0.043689445 = weight(abstract_txt:measure in 4079) [ClassicSimilarity], result of:
            0.043689445 = score(doc=4079,freq=1.0), product of:
              0.1286001 = queryWeight, product of:
                1.5362381 = boost
                5.435696 = idf(docFreq=515, maxDocs=43556)
                0.015400246 = queryNorm
              0.339731 = fieldWeight in 4079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.435696 = idf(docFreq=515, maxDocs=43556)
                0.0625 = fieldNorm(doc=4079)
          0.031192971 = weight(abstract_txt:based in 4079) [ClassicSimilarity], result of:
            0.031192971 = score(doc=4079,freq=1.0), product of:
              0.15597351 = queryWeight, product of:
                3.1651719 = boost
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.015400246 = queryNorm
              0.1999889 = fieldWeight in 4079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.0625 = fieldNorm(doc=4079)
          0.18314068 = weight(abstract_txt:intrinsic in 4079) [ClassicSimilarity], result of:
            0.18314068 = score(doc=4079,freq=1.0), product of:
              0.3827166 = queryWeight, product of:
                3.2458026 = boost
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.015400246 = queryNorm
              0.4785282 = fieldWeight in 4079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.0625 = fieldNorm(doc=4079)
          0.5390985 = weight(abstract_txt:similarity in 4079) [ClassicSimilarity], result of:
            0.5390985 = score(doc=4079,freq=5.0), product of:
              0.66300464 = queryWeight, product of:
                7.3995004 = boost
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.015400246 = queryNorm
              0.8131142 = fieldWeight in 4079, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.0625 = fieldNorm(doc=4079)
        0.32 = coord(8/25)
    
  4. Styltsvig, H.B.: Ontology-based information retrieval (2006) 0.28
    0.279684 = sum of:
      0.279684 = product of:
        0.7769 = sum of:
          0.0394777 = weight(abstract_txt:comprises in 3152) [ClassicSimilarity], result of:
            0.0394777 = score(doc=3152,freq=1.0), product of:
              0.115568824 = queryWeight, product of:
                1.029777 = boost
                7.2873535 = idf(docFreq=80, maxDocs=43556)
                0.015400246 = queryNorm
              0.3415947 = fieldWeight in 3152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2873535 = idf(docFreq=80, maxDocs=43556)
                0.046875 = fieldNorm(doc=3152)
          0.009662982 = weight(abstract_txt:with in 3152) [ClassicSimilarity], result of:
            0.009662982 = score(doc=3152,freq=4.0), product of:
              0.041086644 = queryWeight, product of:
                1.063491 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.015400246 = queryNorm
              0.23518547 = fieldWeight in 3152, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.046875 = fieldNorm(doc=3152)
          0.03677803 = weight(abstract_txt:semantic in 3152) [ClassicSimilarity], result of:
            0.03677803 = score(doc=3152,freq=4.0), product of:
              0.08749606 = queryWeight, product of:
                1.2671618 = boost
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.015400246 = queryNorm
              0.4203393 = fieldWeight in 3152, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.046875 = fieldNorm(doc=3152)
          0.014257581 = weight(abstract_txt:information in 3152) [ClassicSimilarity], result of:
            0.014257581 = score(doc=3152,freq=6.0), product of:
              0.051200353 = queryWeight, product of:
                1.3708481 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.015400246 = queryNorm
              0.27846646 = fieldWeight in 3152, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.046875 = fieldNorm(doc=3152)
          0.05675425 = weight(abstract_txt:measure in 3152) [ClassicSimilarity], result of:
            0.05675425 = score(doc=3152,freq=3.0), product of:
              0.1286001 = queryWeight, product of:
                1.5362381 = boost
                5.435696 = idf(docFreq=515, maxDocs=43556)
                0.015400246 = queryNorm
              0.44132352 = fieldWeight in 3152, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.435696 = idf(docFreq=515, maxDocs=43556)
                0.046875 = fieldNorm(doc=3152)
          0.049573544 = weight(abstract_txt:ontology in 3152) [ClassicSimilarity], result of:
            0.049573544 = score(doc=3152,freq=2.0), product of:
              0.13451564 = queryWeight, product of:
                1.5711739 = boost
                5.55931 = idf(docFreq=455, maxDocs=43556)
                0.015400246 = queryNorm
              0.36853367 = fieldWeight in 3152, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.55931 = idf(docFreq=455, maxDocs=43556)
                0.046875 = fieldNorm(doc=3152)
          0.086960465 = weight(abstract_txt:concepts in 3152) [ClassicSimilarity], result of:
            0.086960465 = score(doc=3152,freq=5.0), product of:
              0.18162853 = queryWeight, product of:
                2.5819325 = boost
                4.567847 = idf(docFreq=1228, maxDocs=43556)
                0.015400246 = queryNorm
              0.478782 = fieldWeight in 3152, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.567847 = idf(docFreq=1228, maxDocs=43556)
                0.046875 = fieldNorm(doc=3152)
          0.040520854 = weight(abstract_txt:based in 3152) [ClassicSimilarity], result of:
            0.040520854 = score(doc=3152,freq=3.0), product of:
              0.15597351 = queryWeight, product of:
                3.1651719 = boost
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.015400246 = queryNorm
              0.2597932 = fieldWeight in 3152, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.046875 = fieldNorm(doc=3152)
          0.4429146 = weight(abstract_txt:similarity in 3152) [ClassicSimilarity], result of:
            0.4429146 = score(doc=3152,freq=6.0), product of:
              0.66300464 = queryWeight, product of:
                7.3995004 = boost
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.015400246 = queryNorm
              0.66804147 = fieldWeight in 3152, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.046875 = fieldNorm(doc=3152)
        0.36 = coord(9/25)
    
  5. Gipp, B.; Meuschke, N.; Breitinger, C.: Citation-based plagiarism detection : practicability on a large-scale scientific corpus (2014) 0.28
    0.2775287 = sum of:
      0.2775287 = product of:
        0.770913 = sum of:
          0.04992553 = weight(abstract_txt:biomedical in 330) [ClassicSimilarity], result of:
            0.04992553 = score(doc=330,freq=1.0), product of:
              0.1115652 = queryWeight, product of:
                1.0117826 = boost
                7.160014 = idf(docFreq=91, maxDocs=43556)
                0.015400246 = queryNorm
              0.44750088 = fieldWeight in 330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.160014 = idf(docFreq=91, maxDocs=43556)
                0.0625 = fieldNorm(doc=330)
          0.053458974 = weight(abstract_txt:benchmark in 330) [ClassicSimilarity], result of:
            0.053458974 = score(doc=330,freq=1.0), product of:
              0.116768956 = queryWeight, product of:
                1.0351101 = boost
                7.3250937 = idf(docFreq=77, maxDocs=43556)
                0.015400246 = queryNorm
              0.45781836 = fieldWeight in 330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3250937 = idf(docFreq=77, maxDocs=43556)
                0.0625 = fieldNorm(doc=330)
          0.009110346 = weight(abstract_txt:with in 330) [ClassicSimilarity], result of:
            0.009110346 = score(doc=330,freq=2.0), product of:
              0.041086644 = queryWeight, product of:
                1.063491 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.015400246 = queryNorm
              0.22173499 = fieldWeight in 330, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.0625 = fieldNorm(doc=330)
          0.024518687 = weight(abstract_txt:semantic in 330) [ClassicSimilarity], result of:
            0.024518687 = score(doc=330,freq=1.0), product of:
              0.08749606 = queryWeight, product of:
                1.2671618 = boost
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.015400246 = queryNorm
              0.2802262 = fieldWeight in 330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.0625 = fieldNorm(doc=330)
          0.026907928 = weight(abstract_txt:proposed in 330) [ClassicSimilarity], result of:
            0.026907928 = score(doc=330,freq=1.0), product of:
              0.0930916 = queryWeight, product of:
                1.3070527 = boost
                4.624766 = idf(docFreq=1160, maxDocs=43556)
                0.015400246 = queryNorm
              0.28904787 = fieldWeight in 330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.624766 = idf(docFreq=1160, maxDocs=43556)
                0.0625 = fieldNorm(doc=330)
          0.01097549 = weight(abstract_txt:information in 330) [ClassicSimilarity], result of:
            0.01097549 = score(doc=330,freq=2.0), product of:
              0.051200353 = queryWeight, product of:
                1.3708481 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.015400246 = queryNorm
              0.21436356 = fieldWeight in 330, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.0625 = fieldNorm(doc=330)
          0.031302877 = weight(abstract_txt:high in 330) [ClassicSimilarity], result of:
            0.031302877 = score(doc=330,freq=1.0), product of:
              0.10297058 = queryWeight, product of:
                1.3746573 = boost
                4.863972 = idf(docFreq=913, maxDocs=43556)
                0.015400246 = queryNorm
              0.30399826 = fieldWeight in 330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.863972 = idf(docFreq=913, maxDocs=43556)
                0.0625 = fieldNorm(doc=330)
          0.08252884 = weight(abstract_txt:based in 330) [ClassicSimilarity], result of:
            0.08252884 = score(doc=330,freq=7.0), product of:
              0.15597351 = queryWeight, product of:
                3.1651719 = boost
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.015400246 = queryNorm
              0.52912086 = fieldWeight in 330, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.0625 = fieldNorm(doc=330)
          0.48218432 = weight(abstract_txt:similarity in 330) [ClassicSimilarity], result of:
            0.48218432 = score(doc=330,freq=4.0), product of:
              0.66300464 = queryWeight, product of:
                7.3995004 = boost
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.015400246 = queryNorm
              0.72727144 = fieldWeight in 330, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.0625 = fieldNorm(doc=330)
        0.36 = coord(9/25)