Document (#40383)

Author
Li, N.
Sun, J.
Title
Improving Chinese term association from the linguistic perspective
Source
Knowledge organization. 44(2017) no.1, pp. 13-23
Year
2017
Abstract
This study addresses how to construct the semantic relations among domain-specific terms by applying linguistic rules. Semantic structure analysis at the morpheme level was used for semantic measurement, and a morpheme-based term association model was proposed by improving and combining the literal-based similarity algorithm and co-occurrence relatedness methods. The study offers a novel perspective on semantic analysis and calculation through morpheme parsing, and the proposed solution is feasible for the automatic association of compound terms. The results show that this approach can be used to construct appropriate term associations and to form a reasonable structural knowledge graph. However, because of linguistic differences, the viability and effectiveness of the method in non-Chinese linguistic environments still need to be verified.
Theme
Semantic context in indexing and retrieval
Computational linguistics

Similar documents (content)

  1. Galvez, C.; Moya-Anegón, F. de; Solana, V.H.: Term conflation methods in information retrieval : non-linguistic and linguistic approaches (2005) 0.20
    0.1950031 = sum of:
      0.1950031 = product of:
        0.81251293 = sum of:
          0.017838584 = weight(abstract_txt:used in 395) [ClassicSimilarity], result of:
            0.017838584 = score(doc=395,freq=1.0), product of:
              0.06779753 = queryWeight, product of:
                1.0300826 = boost
                3.3678792 = idf(docFreq=4051, maxDocs=43254)
                0.019542733 = queryNorm
              0.26311556 = fieldWeight in 395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3678792 = idf(docFreq=4051, maxDocs=43254)
                0.078125 = fieldNorm(doc=395)
          0.0312068 = weight(abstract_txt:terms in 395) [ClassicSimilarity], result of:
            0.0312068 = score(doc=395,freq=1.0), product of:
              0.09843278 = queryWeight, product of:
                1.2411807 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.019542733 = queryNorm
              0.31703666 = fieldWeight in 395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.078125 = fieldNorm(doc=395)
          0.042906918 = weight(abstract_txt:method in 395) [ClassicSimilarity], result of:
            0.042906918 = score(doc=395,freq=1.0), product of:
              0.12170968 = queryWeight, product of:
                1.3801545 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.019542733 = queryNorm
              0.35253495 = fieldWeight in 395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.078125 = fieldNorm(doc=395)
          0.1358099 = weight(abstract_txt:term in 395) [ClassicSimilarity], result of:
            0.1358099 = score(doc=395,freq=3.0), product of:
              0.20824978 = queryWeight, product of:
                2.2110727 = boost
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.019542733 = queryNorm
              0.6521491 = fieldWeight in 395, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.078125 = fieldNorm(doc=395)
          0.16879019 = weight(abstract_txt:association in 395) [ClassicSimilarity], result of:
            0.16879019 = score(doc=395,freq=1.0), product of:
              0.38213345 = queryWeight, product of:
                3.458499 = boost
                5.6538215 = idf(docFreq=411, maxDocs=43254)
                0.019542733 = queryNorm
              0.4417048 = fieldWeight in 395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6538215 = idf(docFreq=411, maxDocs=43254)
                0.078125 = fieldNorm(doc=395)
          0.41596055 = weight(abstract_txt:linguistic in 395) [ClassicSimilarity], result of:
            0.41596055 = score(doc=395,freq=5.0), product of:
              0.4077197 = queryWeight, product of:
                3.5724072 = boost
                5.8400345 = idf(docFreq=341, maxDocs=43254)
                0.019542733 = queryNorm
              1.020212 = fieldWeight in 395, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8400345 = idf(docFreq=341, maxDocs=43254)
                0.078125 = fieldNorm(doc=395)
        0.24 = coord(6/25)
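
The explain tree above repeats one pattern for every matching term: the term score is `queryWeight` times `fieldWeight`, where `queryWeight = boost * idf * queryNorm` and `fieldWeight = tf * idf * fieldNorm`. The following is a minimal sketch of that arithmetic, not the search engine's actual code; the function name `term_score` is hypothetical, and `tf = sqrt(freq)` is read off the tree (for example `1.7320508 = tf(freq=3.0)`). The input numbers are copied from the entries for "used" and "linguistic" in document 395.

```python
import math

def term_score(boost, idf, query_norm, freq, field_norm):
    """Recompute one 'weight(...)' node of the explain tree (hypothetical helper)."""
    query_weight = boost * idf * query_norm   # query side: boost * idf * queryNorm
    tf = math.sqrt(freq)                      # tf as shown in the tree: sqrt(termFreq)
    field_weight = tf * idf * field_norm      # document side: tf * idf * fieldNorm
    return query_weight * field_weight

QUERY_NORM = 0.019542733  # same value in every node of this result list

# abstract_txt:used in 395 (listed score: 0.017838584)
used = term_score(1.0300826, 3.3678792, QUERY_NORM, 1.0, 0.078125)

# abstract_txt:linguistic in 395 (listed score: 0.41596055)
linguistic = term_score(3.5724072, 5.8400345, QUERY_NORM, 5.0, 0.078125)

print(f"used:       {used:.9f}")
print(f"linguistic: {linguistic:.9f}")
```

The recomputed values should agree with the listed ones up to single-precision rounding, since the tree prints 32-bit floats while Python uses 64-bit floats.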
    
  2. Guo, L.; Wan, X.: Exploiting syntactic and semantic relationships between terms for opinion retrieval (2012) 0.17
    0.17374061 = sum of:
      0.17374061 = product of:
        0.5429394 = sum of:
          0.025227569 = weight(abstract_txt:used in 1957) [ClassicSimilarity], result of:
            0.025227569 = score(doc=1957,freq=2.0), product of:
              0.06779753 = queryWeight, product of:
                1.0300826 = boost
                3.3678792 = idf(docFreq=4051, maxDocs=43254)
                0.019542733 = queryNorm
              0.3721016 = fieldWeight in 1957, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3678792 = idf(docFreq=4051, maxDocs=43254)
                0.078125 = fieldNorm(doc=1957)
          0.019213794 = weight(abstract_txt:study in 1957) [ClassicSimilarity], result of:
            0.019213794 = score(doc=1957,freq=1.0), product of:
              0.07123865 = queryWeight, product of:
                1.0559005 = boost
                3.4522913 = idf(docFreq=3723, maxDocs=43254)
                0.019542733 = queryNorm
              0.26971024 = fieldWeight in 1957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4522913 = idf(docFreq=3723, maxDocs=43254)
                0.078125 = fieldNorm(doc=1957)
          0.05405176 = weight(abstract_txt:terms in 1957) [ClassicSimilarity], result of:
            0.05405176 = score(doc=1957,freq=3.0), product of:
              0.09843278 = queryWeight, product of:
                1.2411807 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.019542733 = queryNorm
              0.5491236 = fieldWeight in 1957, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.078125 = fieldNorm(doc=1957)
          0.1263256 = weight(abstract_txt:relatedness in 1957) [ClassicSimilarity], result of:
            0.1263256 = score(doc=1957,freq=1.0), product of:
              0.19843785 = queryWeight, product of:
                1.2461272 = boost
                8.148484 = idf(docFreq=33, maxDocs=43254)
                0.019542733 = queryNorm
              0.6366003 = fieldWeight in 1957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.148484 = idf(docFreq=33, maxDocs=43254)
                0.078125 = fieldNorm(doc=1957)
          0.074316956 = weight(abstract_txt:method in 1957) [ClassicSimilarity], result of:
            0.074316956 = score(doc=1957,freq=3.0), product of:
              0.12170968 = queryWeight, product of:
                1.3801545 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.019542733 = queryNorm
              0.6106084 = fieldWeight in 1957, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.078125 = fieldNorm(doc=1957)
          0.046242032 = weight(abstract_txt:proposed in 1957) [ClassicSimilarity], result of:
            0.046242032 = score(doc=1957,freq=1.0), product of:
              0.1279376 = queryWeight, product of:
                1.4150254 = boost
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.019542733 = queryNorm
              0.3614421 = fieldWeight in 1957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.078125 = fieldNorm(doc=1957)
          0.07840988 = weight(abstract_txt:term in 1957) [ClassicSimilarity], result of:
            0.07840988 = score(doc=1957,freq=1.0), product of:
              0.20824978 = queryWeight, product of:
                2.2110727 = boost
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.019542733 = queryNorm
              0.37651843 = fieldWeight in 1957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.078125 = fieldNorm(doc=1957)
          0.119151846 = weight(abstract_txt:semantic in 1957) [ClassicSimilarity], result of:
            0.119151846 = score(doc=1957,freq=2.0), product of:
              0.24045886 = queryWeight, product of:
                2.7434719 = boost
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.019542733 = queryNorm
              0.49551862 = fieldWeight in 1957, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.078125 = fieldNorm(doc=1957)
        0.32 = coord(8/25)
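
The `idf(docFreq=…, maxDocs=…)` values in these trees can be recovered from the two counts they quote. The sketch below assumes the classic TF-IDF form `1 + ln(maxDocs / (docFreq + 1))`; that formula is an assumption about the index configuration, although it does reproduce the numbers shown here. The helper name `classic_idf` is hypothetical.

```python
import math

MAX_DOCS = 43254  # maxDocs as quoted in every idf node

def classic_idf(doc_freq, max_docs=MAX_DOCS):
    """Assumed idf formula: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

# (term, docFreq, idf listed in the explain tree)
listed = [
    ("used", 4051, 3.3678792),
    ("terms", 2031, 4.058069),
    ("relatedness", 33, 8.148484),
]

for term, doc_freq, shown in listed:
    print(f"{term:12s} computed={classic_idf(doc_freq):.6f} listed={shown}")
```

Rare terms such as "relatedness" (33 documents) receive a much larger idf than common ones such as "used" (4051 documents), which is why a single match on a rare term can outweigh several matches on common terms.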
    
  3. Lee, C.-H.; Khoo, C.; Na, J.-C.: Automatic identification of treatment relations for medical ontology learning : an exploratory study (2004) 0.17
    0.17119081 = sum of:
      0.17119081 = product of:
        0.71329504 = sum of:
          0.020182054 = weight(abstract_txt:used in 4662) [ClassicSimilarity], result of:
            0.020182054 = score(doc=4662,freq=2.0), product of:
              0.06779753 = queryWeight, product of:
                1.0300826 = boost
                3.3678792 = idf(docFreq=4051, maxDocs=43254)
                0.019542733 = queryNorm
              0.29768127 = fieldWeight in 4662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3678792 = idf(docFreq=4051, maxDocs=43254)
                0.0625 = fieldNorm(doc=4662)
          0.030742072 = weight(abstract_txt:study in 4662) [ClassicSimilarity], result of:
            0.030742072 = score(doc=4662,freq=4.0), product of:
              0.07123865 = queryWeight, product of:
                1.0559005 = boost
                3.4522913 = idf(docFreq=3723, maxDocs=43254)
                0.019542733 = queryNorm
              0.4315364 = fieldWeight in 4662, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4522913 = idf(docFreq=3723, maxDocs=43254)
                0.0625 = fieldNorm(doc=4662)
          0.048543636 = weight(abstract_txt:method in 4662) [ClassicSimilarity], result of:
            0.048543636 = score(doc=4662,freq=2.0), product of:
              0.12170968 = queryWeight, product of:
                1.3801545 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.019542733 = queryNorm
              0.39884776 = fieldWeight in 4662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.0625 = fieldNorm(doc=4662)
          0.16510166 = weight(abstract_txt:semantic in 4662) [ClassicSimilarity], result of:
            0.16510166 = score(doc=4662,freq=6.0), product of:
              0.24045886 = queryWeight, product of:
                2.7434719 = boost
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.019542733 = queryNorm
              0.6866108 = fieldWeight in 4662, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.0625 = fieldNorm(doc=4662)
          0.19096428 = weight(abstract_txt:association in 4662) [ClassicSimilarity], result of:
            0.19096428 = score(doc=4662,freq=2.0), product of:
              0.38213345 = queryWeight, product of:
                3.458499 = boost
                5.6538215 = idf(docFreq=411, maxDocs=43254)
                0.019542733 = queryNorm
              0.49973193 = fieldWeight in 4662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6538215 = idf(docFreq=411, maxDocs=43254)
                0.0625 = fieldNorm(doc=4662)
          0.25776133 = weight(abstract_txt:linguistic in 4662) [ClassicSimilarity], result of:
            0.25776133 = score(doc=4662,freq=3.0), product of:
              0.4077197 = queryWeight, product of:
                3.5724072 = boost
                5.8400345 = idf(docFreq=341, maxDocs=43254)
                0.019542733 = queryNorm
              0.63220227 = fieldWeight in 4662, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8400345 = idf(docFreq=341, maxDocs=43254)
                0.0625 = fieldNorm(doc=4662)
        0.24 = coord(6/25)
    
  4. Atlam, E.-S.; Morita, K.; Fuketa, M.; Aoe, J.-i.: A new method for selecting English field association terms of compound words and its knowledge representation (2002) 0.16
    0.16029648 = sum of:
      0.16029648 = product of:
        0.66790205 = sum of:
          0.13932827 = weight(abstract_txt:compound in 4591) [ClassicSimilarity], result of:
            0.13932827 = score(doc=4591,freq=2.0), product of:
              0.16813047 = queryWeight, product of:
                1.1470262 = boost
                7.500458 = idf(docFreq=64, maxDocs=43254)
                0.019542733 = queryNorm
              0.82869136 = fieldWeight in 4591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.500458 = idf(docFreq=64, maxDocs=43254)
                0.078125 = fieldNorm(doc=4591)
          0.0624136 = weight(abstract_txt:terms in 4591) [ClassicSimilarity], result of:
            0.0624136 = score(doc=4591,freq=4.0), product of:
              0.09843278 = queryWeight, product of:
                1.2411807 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.019542733 = queryNorm
              0.6340733 = fieldWeight in 4591, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.078125 = fieldNorm(doc=4591)
          0.042906918 = weight(abstract_txt:method in 4591) [ClassicSimilarity], result of:
            0.042906918 = score(doc=4591,freq=1.0), product of:
              0.12170968 = queryWeight, product of:
                1.3801545 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.019542733 = queryNorm
              0.35253495 = fieldWeight in 4591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.078125 = fieldNorm(doc=4591)
          0.06539611 = weight(abstract_txt:proposed in 4591) [ClassicSimilarity], result of:
            0.06539611 = score(doc=4591,freq=2.0), product of:
              0.1279376 = queryWeight, product of:
                1.4150254 = boost
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.019542733 = queryNorm
              0.51115626 = fieldWeight in 4591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.078125 = fieldNorm(doc=4591)
          0.119151846 = weight(abstract_txt:semantic in 4591) [ClassicSimilarity], result of:
            0.119151846 = score(doc=4591,freq=2.0), product of:
              0.24045886 = queryWeight, product of:
                2.7434719 = boost
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.019542733 = queryNorm
              0.49551862 = fieldWeight in 4591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.078125 = fieldNorm(doc=4591)
          0.23870535 = weight(abstract_txt:association in 4591) [ClassicSimilarity], result of:
            0.23870535 = score(doc=4591,freq=2.0), product of:
              0.38213345 = queryWeight, product of:
                3.458499 = boost
                5.6538215 = idf(docFreq=411, maxDocs=43254)
                0.019542733 = queryNorm
              0.6246649 = fieldWeight in 4591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6538215 = idf(docFreq=411, maxDocs=43254)
                0.078125 = fieldNorm(doc=4591)
        0.24 = coord(6/25)
    
  5. Fu, T.; Abbasi, A.; Chen, H.: A hybrid approach to Web forum interactional coherence analysis (2008) 0.16
    0.15795867 = sum of:
      0.15795867 = product of:
        0.49362084 = sum of:
          0.017838584 = weight(abstract_txt:used in 3873) [ClassicSimilarity], result of:
            0.017838584 = score(doc=3873,freq=1.0), product of:
              0.06779753 = queryWeight, product of:
                1.0300826 = boost
                3.3678792 = idf(docFreq=4051, maxDocs=43254)
                0.019542733 = queryNorm
              0.26311556 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3678792 = idf(docFreq=4051, maxDocs=43254)
                0.078125 = fieldNorm(doc=3873)
          0.019213794 = weight(abstract_txt:study in 3873) [ClassicSimilarity], result of:
            0.019213794 = score(doc=3873,freq=1.0), product of:
              0.07123865 = queryWeight, product of:
                1.0559005 = boost
                3.4522913 = idf(docFreq=3723, maxDocs=43254)
                0.019542733 = queryNorm
              0.26971024 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4522913 = idf(docFreq=3723, maxDocs=43254)
                0.078125 = fieldNorm(doc=3873)
          0.040094804 = weight(abstract_txt:analysis in 3873) [ClassicSimilarity], result of:
            0.040094804 = score(doc=3873,freq=3.0), product of:
              0.08066007 = queryWeight, product of:
                1.1235552 = boost
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.019542733 = queryNorm
              0.4970837 = fieldWeight in 3873, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.078125 = fieldNorm(doc=3873)
          0.0312068 = weight(abstract_txt:terms in 3873) [ClassicSimilarity], result of:
            0.0312068 = score(doc=3873,freq=1.0), product of:
              0.09843278 = queryWeight, product of:
                1.2411807 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.019542733 = queryNorm
              0.31703666 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.078125 = fieldNorm(doc=3873)
          0.042906918 = weight(abstract_txt:method in 3873) [ClassicSimilarity], result of:
            0.042906918 = score(doc=3873,freq=1.0), product of:
              0.12170968 = queryWeight, product of:
                1.3801545 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.019542733 = queryNorm
              0.35253495 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.078125 = fieldNorm(doc=3873)
          0.046242032 = weight(abstract_txt:proposed in 3873) [ClassicSimilarity], result of:
            0.046242032 = score(doc=3873,freq=1.0), product of:
              0.1279376 = queryWeight, product of:
                1.4150254 = boost
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.019542733 = queryNorm
              0.3614421 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.078125 = fieldNorm(doc=3873)
          0.110094704 = weight(abstract_txt:construct in 3873) [ClassicSimilarity], result of:
            0.110094704 = score(doc=3873,freq=1.0), product of:
              0.22811362 = queryWeight, product of:
                1.8894732 = boost
                6.1776767 = idf(docFreq=243, maxDocs=43254)
                0.019542733 = queryNorm
              0.482631 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1776767 = idf(docFreq=243, maxDocs=43254)
                0.078125 = fieldNorm(doc=3873)
          0.1860232 = weight(abstract_txt:linguistic in 3873) [ClassicSimilarity], result of:
            0.1860232 = score(doc=3873,freq=1.0), product of:
              0.4077197 = queryWeight, product of:
                3.5724072 = boost
                5.8400345 = idf(docFreq=341, maxDocs=43254)
                0.019542733 = queryNorm
              0.4562527 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8400345 = idf(docFreq=341, maxDocs=43254)
                0.078125 = fieldNorm(doc=3873)
        0.32 = coord(8/25)
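
Each result's final score is the sum of its term scores multiplied by `coord(matched/total)`, the fraction of the 25 query terms that the document matched. As a rough check on the ranking, the sketch below recomputes the five final scores from the sums and coord counts listed above; the dictionary layout and the name `final_score` are illustrative only.

```python
TOTAL_QUERY_TERMS = 25  # denominator of every coord(...) line

# doc id -> (sum of term scores, number of matched query terms), copied from the trees
results = {
    395: (0.81251293, 6),
    1957: (0.5429394, 8),
    4662: (0.71329504, 6),
    4591: (0.66790205, 6),
    3873: (0.49362084, 8),
}

def final_score(clause_sum, matched, total=TOTAL_QUERY_TERMS):
    """Final score = sum of term scores * coord, with coord = matched / total."""
    return clause_sum * matched / total

ranked = sorted(results, key=lambda doc: final_score(*results[doc]), reverse=True)

for doc in ranked:
    print(f"doc {doc}: {final_score(*results[doc]):.8f}")
```

Document 1957 has a smaller raw sum than 4662 or 4591 but matches eight query terms rather than six, so its larger coord factor lifts it to second place.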