Document (#37690)

Author
Wicaksana, I.W.S.
Wahyudi, B.
Title
Comparison Latent Semantic and WordNet approach for semantic similarity calculation
Source
http://arxiv.org/find/all/1/all:+EXACT+semantic_interoperability/0/1/0/all/0/1. [arXiv:1105.1406]
Year
2011
Abstract
Information exchange among many sources on the Internet is more autonomous, dynamic and free. This situation leads to different views of concepts among sources. For example, the word 'bank' means an economic institution in the economy domain, but in the ecology domain it is defined as the slope of a river or lake. In this paper, we evaluate the latent semantic and WordNet approaches to calculating semantic similarity. The evaluation is run on some concepts from different domains, with expert or human judgments as the reference. The results of the evaluation can contribute to concept mapping, query rewriting, interoperability, etc.
Theme
Semantic interoperability
Object
Latent semantic indexing
WordNet

Similar documents (content)

  1. Kiren, T.; Shoaib, M.: A novel ontology matching approach using key concepts (2016) 0.18
    0.17543896 = sum of:
      0.17543896 = product of:
        0.6265677 = sum of:
          0.029531213 = weight(abstract_txt:approach in 2589) [ClassicSimilarity], result of:
            0.029531213 = score(doc=2589,freq=2.0), product of:
              0.08920649 = queryWeight, product of:
                1.2463837 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.019109743 = queryNorm
              0.33104333 = fieldWeight in 2589, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=2589)
          0.035883475 = weight(abstract_txt:evaluation in 2589) [ClassicSimilarity], result of:
            0.035883475 = score(doc=2589,freq=1.0), product of:
              0.1279819 = queryWeight, product of:
                1.49289 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.019109743 = queryNorm
              0.2803793 = fieldWeight in 2589, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=2589)
          0.08389804 = weight(abstract_txt:concepts in 2589) [ClassicSimilarity], result of:
            0.08389804 = score(doc=2589,freq=5.0), product of:
              0.13184492 = queryWeight, product of:
                1.5152533 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.019109743 = queryNorm
              0.6363388 = fieldWeight in 2589, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0625 = fieldNorm(doc=2589)
          0.1566411 = weight(abstract_txt:similarity in 2589) [ClassicSimilarity], result of:
            0.1566411 = score(doc=2589,freq=4.0), product of:
              0.21534562 = queryWeight, product of:
                1.936518 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.019109743 = queryNorm
              0.7273939 = fieldWeight in 2589, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=2589)
          0.063315585 = weight(abstract_txt:domain in 2589) [ClassicSimilarity], result of:
            0.063315585 = score(doc=2589,freq=1.0), product of:
              0.21392247 = queryWeight, product of:
                2.3638904 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.019109743 = queryNorm
              0.29597446 = fieldWeight in 2589, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=2589)
          0.1860936 = weight(abstract_txt:wordnet in 2589) [ClassicSimilarity], result of:
            0.1860936 = score(doc=2589,freq=1.0), product of:
              0.38344803 = queryWeight, product of:
                2.584085 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.019109743 = queryNorm
              0.48531634 = fieldWeight in 2589, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=2589)
          0.07120463 = weight(abstract_txt:semantic in 2589) [ClassicSimilarity], result of:
            0.07120463 = score(doc=2589,freq=1.0), product of:
              0.25462502 = queryWeight, product of:
                2.9779613 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.019109743 = queryNorm
              0.2796451 = fieldWeight in 2589, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=2589)
        0.28 = coord(7/25)
    
  2. Kim, H.H.; Kim, Y.H.: Generic speech summarization of transcribed lecture videos : using tags and their semantic relations (2016) 0.15
    0.14791764 = sum of:
      0.14791764 = product of:
        0.61632353 = sum of:
          0.044992317 = weight(abstract_txt:difference in 2640) [ClassicSimilarity], result of:
            0.044992317 = score(doc=2640,freq=1.0), product of:
              0.118113935 = queryWeight, product of:
                1.0141194 = boost
                6.0947685 = idf(docFreq=270, maxDocs=44218)
                0.019109743 = queryNorm
              0.38092303 = fieldWeight in 2640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0947685 = idf(docFreq=270, maxDocs=44218)
                0.0625 = fieldNorm(doc=2640)
          0.0507469 = weight(abstract_txt:evaluation in 2640) [ClassicSimilarity], result of:
            0.0507469 = score(doc=2640,freq=2.0), product of:
              0.1279819 = queryWeight, product of:
                1.49289 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.019109743 = queryNorm
              0.3965162 = fieldWeight in 2640, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=2640)
          0.07504068 = weight(abstract_txt:concepts in 2640) [ClassicSimilarity], result of:
            0.07504068 = score(doc=2640,freq=4.0), product of:
              0.13184492 = queryWeight, product of:
                1.5152533 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.019109743 = queryNorm
              0.5691587 = fieldWeight in 2640, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0625 = fieldNorm(doc=2640)
          0.13611999 = weight(abstract_txt:latent in 2640) [ClassicSimilarity], result of:
            0.13611999 = score(doc=2640,freq=1.0), product of:
              0.3112912 = queryWeight, product of:
                2.3282893 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.019109743 = queryNorm
              0.43727544 = fieldWeight in 2640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0625 = fieldNorm(doc=2640)
          0.1860936 = weight(abstract_txt:wordnet in 2640) [ClassicSimilarity], result of:
            0.1860936 = score(doc=2640,freq=1.0), product of:
              0.38344803 = queryWeight, product of:
                2.584085 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.019109743 = queryNorm
              0.48531634 = fieldWeight in 2640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=2640)
          0.12333004 = weight(abstract_txt:semantic in 2640) [ClassicSimilarity], result of:
            0.12333004 = score(doc=2640,freq=3.0), product of:
              0.25462502 = queryWeight, product of:
                2.9779613 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.019109743 = queryNorm
              0.48435947 = fieldWeight in 2640, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=2640)
        0.24 = coord(6/25)
    
  3. Burke, R.D.: Question answering from frequently asked question files : experiences with the FAQ Finder System (1997) 0.15
    0.14765728 = sum of:
      0.14765728 = product of:
        0.7382864 = sum of:
          0.036543015 = weight(abstract_txt:approach in 1191) [ClassicSimilarity], result of:
            0.036543015 = score(doc=1191,freq=1.0), product of:
              0.08920649 = queryWeight, product of:
                1.2463837 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.019109743 = queryNorm
              0.40964526 = fieldWeight in 1191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.109375 = fieldNorm(doc=1191)
          0.06279608 = weight(abstract_txt:evaluation in 1191) [ClassicSimilarity], result of:
            0.06279608 = score(doc=1191,freq=1.0), product of:
              0.1279819 = queryWeight, product of:
                1.49289 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.019109743 = queryNorm
              0.49066377 = fieldWeight in 1191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.109375 = fieldNorm(doc=1191)
          0.13706096 = weight(abstract_txt:similarity in 1191) [ClassicSimilarity], result of:
            0.13706096 = score(doc=1191,freq=1.0), product of:
              0.21534562 = queryWeight, product of:
                1.936518 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.019109743 = queryNorm
              0.63646966 = fieldWeight in 1191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.109375 = fieldNorm(doc=1191)
          0.3256638 = weight(abstract_txt:wordnet in 1191) [ClassicSimilarity], result of:
            0.3256638 = score(doc=1191,freq=1.0), product of:
              0.38344803 = queryWeight, product of:
                2.584085 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.019109743 = queryNorm
              0.8493036 = fieldWeight in 1191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.109375 = fieldNorm(doc=1191)
          0.17622249 = weight(abstract_txt:semantic in 1191) [ClassicSimilarity], result of:
            0.17622249 = score(doc=1191,freq=2.0), product of:
              0.25462502 = queryWeight, product of:
                2.9779613 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.019109743 = queryNorm
              0.6920863 = fieldWeight in 1191, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.109375 = fieldNorm(doc=1191)
        0.2 = coord(5/25)
    
  4. Green, R.: WordNet (2009) 0.13
    0.12932712 = sum of:
      0.12932712 = product of:
        0.80829453 = sum of:
          0.056280512 = weight(abstract_txt:concepts in 4696) [ClassicSimilarity], result of:
            0.056280512 = score(doc=4696,freq=1.0), product of:
              0.13184492 = queryWeight, product of:
                1.5152533 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.019109743 = queryNorm
              0.426869 = fieldWeight in 4696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.09375 = fieldNorm(doc=4696)
          0.11748083 = weight(abstract_txt:similarity in 4696) [ClassicSimilarity], result of:
            0.11748083 = score(doc=4696,freq=1.0), product of:
              0.21534562 = queryWeight, product of:
                1.936518 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.019109743 = queryNorm
              0.54554546 = fieldWeight in 4696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.09375 = fieldNorm(doc=4696)
          0.48348534 = weight(abstract_txt:wordnet in 4696) [ClassicSimilarity], result of:
            0.48348534 = score(doc=4696,freq=3.0), product of:
              0.38344803 = queryWeight, product of:
                2.584085 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.019109743 = queryNorm
              1.2608888 = fieldWeight in 4696, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.09375 = fieldNorm(doc=4696)
          0.15104784 = weight(abstract_txt:semantic in 4696) [ClassicSimilarity], result of:
            0.15104784 = score(doc=4696,freq=2.0), product of:
              0.25462502 = queryWeight, product of:
                2.9779613 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.019109743 = queryNorm
              0.5932168 = fieldWeight in 4696, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.09375 = fieldNorm(doc=4696)
        0.16 = coord(4/25)
    
  5. K., Vani; Gupta, D.: Unmasking text plagiarism using syntactic-semantic based natural language processing techniques : comparisons, analysis and challenges (2018) 0.12
    0.1239349 = sum of:
      0.1239349 = product of:
        0.51639545 = sum of:
          0.036168203 = weight(abstract_txt:approach in 5084) [ClassicSimilarity], result of:
            0.036168203 = score(doc=5084,freq=3.0), product of:
              0.08920649 = queryWeight, product of:
                1.2463837 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.019109743 = queryNorm
              0.40544364 = fieldWeight in 5084, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.035883475 = weight(abstract_txt:evaluation in 5084) [ClassicSimilarity], result of:
            0.035883475 = score(doc=5084,freq=1.0), product of:
              0.1279819 = queryWeight, product of:
                1.49289 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.019109743 = queryNorm
              0.2803793 = fieldWeight in 5084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.03752034 = weight(abstract_txt:concepts in 5084) [ClassicSimilarity], result of:
            0.03752034 = score(doc=5084,freq=1.0), product of:
              0.13184492 = queryWeight, product of:
                1.5152533 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.019109743 = queryNorm
              0.28457934 = fieldWeight in 5084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.07832055 = weight(abstract_txt:similarity in 5084) [ClassicSimilarity], result of:
            0.07832055 = score(doc=5084,freq=1.0), product of:
              0.21534562 = queryWeight, product of:
                1.936518 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.019109743 = queryNorm
              0.36369696 = fieldWeight in 5084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.1860936 = weight(abstract_txt:wordnet in 5084) [ClassicSimilarity], result of:
            0.1860936 = score(doc=5084,freq=1.0), product of:
              0.38344803 = queryWeight, product of:
                2.584085 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.019109743 = queryNorm
              0.48531634 = fieldWeight in 5084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.14240927 = weight(abstract_txt:semantic in 5084) [ClassicSimilarity], result of:
            0.14240927 = score(doc=5084,freq=4.0), product of:
              0.25462502 = queryWeight, product of:
                2.9779613 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.019109743 = queryNorm
              0.5592902 = fieldWeight in 5084, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
        0.24 = coord(6/25)
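
The score trees above all follow the same arithmetic, which can be checked by hand. The sketch below recomputes the score of the first entry (doc 2589) from the values printed in its tree. It assumes the relations the tree itself shows: tf is the square root of termFreq, idf is 1 + ln(maxDocs / (docFreq + 1)), queryWeight is boost * idf * queryNorm, fieldWeight is tf * idf * fieldNorm, and the final score is the sum of queryWeight * fieldWeight over the matched terms, multiplied by coord. The function and variable names are illustrative only and are not part of any search-engine API.

```python
import math

# Constants copied from the explain output above.
MAX_DOCS = 44218
QUERY_NORM = 0.019109743


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as it appears in the tree."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    """Score contribution of one matched term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, matched, total):
    """Sum of the term scores, scaled by coord(matched/total)."""
    return sum(term_score(*t) for t in terms.values()) * matched / total


# (boost, docFreq, termFreq, fieldNorm) for entry 1, doc 2589.
ENTRY_1_TERMS = {
    "approach":   (1.2463837, 2839, 2.0, 0.0625),
    "evaluation": (1.49289,   1353, 1.0, 0.0625),
    "concepts":   (1.5152533, 1265, 5.0, 0.0625),
    "similarity": (1.936518,   356, 4.0, 0.0625),
    "domain":     (2.3638904, 1054, 1.0, 0.0625),
    "wordnet":    (2.584085,    50, 1.0, 0.0625),
    "semantic":   (2.9779613, 1369, 1.0, 0.0625),
}

score = document_score(ENTRY_1_TERMS, matched=7, total=25)
print(f"recomputed score for doc 2589: {score:.6f}")
```

The result should agree with the 0.17543896 shown for entry 1 up to small floating-point differences, since the tree prints single-precision values. The same function can be applied to the other entries by substituting their boost, docFreq, termFreq, fieldNorm and coord values.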