Document (#40382)

Author
Li, N.
Sun, J.
Title
Improving Chinese term association from the linguistic perspective
Source
Knowledge organization. 44(2017) no.1, p.13-23
Year
2017
Abstract
This study addresses how to construct semantic relations among domain-specific terms by applying linguistic rules. Semantic structure analysis at the morpheme level was used as the semantic measure, and a morpheme-based term association model was proposed by improving and combining a literal-based similarity algorithm with co-occurrence relatedness methods. The study offers a novel approach to semantic analysis and calculation through morpheme parsing, and the proposed solution is feasible for the automatic association of compound terms. The results show that the approach can be used to construct appropriate term associations and to form a reasonable structural knowledge graph. However, because of linguistic differences, the viability and effectiveness of the method in non-Chinese linguistic environments remain to be verified.
Theme
Semantic environment in indexing and retrieval
Computational linguistics

Similar documents (content)

  1. Galvez, C.; Moya-Anegón, F. de; Solana, V.H.: Term conflation methods in information retrieval : non-linguistic and linguistic approaches (2005) 0.19
    0.1938729 = sum of:
      0.1938729 = product of:
        0.80780375 = sum of:
          0.017780513 = weight(abstract_txt:used in 4394) [ClassicSimilarity], result of:
            0.017780513 = score(doc=4394,freq=1.0), product of:
              0.06774942 = queryWeight, product of:
                1.0303433 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.019573791 = queryNorm
              0.26244524 = fieldWeight in 4394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.031016208 = weight(abstract_txt:terms in 4394) [ClassicSimilarity], result of:
            0.031016208 = score(doc=4394,freq=1.0), product of:
              0.09817521 = queryWeight, product of:
                1.2403096 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019573791 = queryNorm
              0.3159271 = fieldWeight in 4394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.042767197 = weight(abstract_txt:method in 4394) [ClassicSimilarity], result of:
            0.042767197 = score(doc=4394,freq=1.0), product of:
              0.12162324 = queryWeight, product of:
                1.3805033 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.019573791 = queryNorm
              0.3516367 = fieldWeight in 4394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.13486546 = weight(abstract_txt:term in 4394) [ClassicSimilarity], result of:
            0.13486546 = score(doc=4394,freq=3.0), product of:
              0.20758687 = queryWeight, product of:
                2.2088933 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.019573791 = queryNorm
              0.64968204 = fieldWeight in 4394, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.16685058 = weight(abstract_txt:association in 4394) [ClassicSimilarity], result of:
            0.16685058 = score(doc=4394,freq=1.0), product of:
              0.3797559 = queryWeight, product of:
                3.44982 = boost
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.019573791 = queryNorm
              0.4393627 = fieldWeight in 4394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.41452378 = weight(abstract_txt:linguistic in 4394) [ClassicSimilarity], result of:
            0.41452378 = score(doc=4394,freq=5.0), product of:
              0.40737623 = queryWeight, product of:
                3.5730739 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.019573791 = queryNorm
              1.0175453 = fieldWeight in 4394, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
        0.24 = coord(6/25)
    
  2. Guo, L.; Wan, X.: Exploiting syntactic and semantic relationships between terms for opinion retrieval (2012) 0.17
    0.17353794 = sum of:
      0.17353794 = product of:
        0.54230607 = sum of:
          0.025145441 = weight(abstract_txt:used in 492) [ClassicSimilarity], result of:
            0.025145441 = score(doc=492,freq=2.0), product of:
              0.06774942 = queryWeight, product of:
                1.0303433 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.019573791 = queryNorm
              0.37115362 = fieldWeight in 492, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.078125 = fieldNorm(doc=492)
          0.018824598 = weight(abstract_txt:study in 492) [ClassicSimilarity], result of:
            0.018824598 = score(doc=492,freq=1.0), product of:
              0.070376314 = queryWeight, product of:
                1.0501285 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.019573791 = queryNorm
              0.26748484 = fieldWeight in 492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.078125 = fieldNorm(doc=492)
          0.053721648 = weight(abstract_txt:terms in 492) [ClassicSimilarity], result of:
            0.053721648 = score(doc=492,freq=3.0), product of:
              0.09817521 = queryWeight, product of:
                1.2403096 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019573791 = queryNorm
              0.54720175 = fieldWeight in 492, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=492)
          0.12791409 = weight(abstract_txt:relatedness in 492) [ClassicSimilarity], result of:
            0.12791409 = score(doc=492,freq=1.0), product of:
              0.20039104 = queryWeight, product of:
                1.2530065 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.019573791 = queryNorm
              0.63832235 = fieldWeight in 492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.078125 = fieldNorm(doc=492)
          0.07407496 = weight(abstract_txt:method in 492) [ClassicSimilarity], result of:
            0.07407496 = score(doc=492,freq=3.0), product of:
              0.12162324 = queryWeight, product of:
                1.3805033 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.019573791 = queryNorm
              0.60905266 = fieldWeight in 492, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=492)
          0.045931112 = weight(abstract_txt:proposed in 492) [ClassicSimilarity], result of:
            0.045931112 = score(doc=492,freq=1.0), product of:
              0.12755007 = queryWeight, product of:
                1.4137399 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.019573791 = queryNorm
              0.36010262 = fieldWeight in 492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.078125 = fieldNorm(doc=492)
          0.07786461 = weight(abstract_txt:term in 492) [ClassicSimilarity], result of:
            0.07786461 = score(doc=492,freq=1.0), product of:
              0.20758687 = queryWeight, product of:
                2.2088933 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.019573791 = queryNorm
              0.37509412 = fieldWeight in 492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.078125 = fieldNorm(doc=492)
          0.11882962 = weight(abstract_txt:semantic in 492) [ClassicSimilarity], result of:
            0.11882962 = score(doc=492,freq=2.0), product of:
              0.24037679 = queryWeight, product of:
                2.7446718 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.019573791 = queryNorm
              0.49434733 = fieldWeight in 492, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=492)
        0.32 = coord(8/25)
    
  3. Lee, C.-H.; Khoo, C.; Na, J.-C.: Automatic identification of treatment relations for medical ontology learning : an exploratory study (2004) 0.17
    0.17014013 = sum of:
      0.17014013 = product of:
        0.70891726 = sum of:
          0.020116353 = weight(abstract_txt:used in 2661) [ClassicSimilarity], result of:
            0.020116353 = score(doc=2661,freq=2.0), product of:
              0.06774942 = queryWeight, product of:
                1.0303433 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.019573791 = queryNorm
              0.2969229 = fieldWeight in 2661, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=2661)
          0.030119356 = weight(abstract_txt:study in 2661) [ClassicSimilarity], result of:
            0.030119356 = score(doc=2661,freq=4.0), product of:
              0.070376314 = queryWeight, product of:
                1.0501285 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.019573791 = queryNorm
              0.42797574 = fieldWeight in 2661, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.0625 = fieldNorm(doc=2661)
          0.048385557 = weight(abstract_txt:method in 2661) [ClassicSimilarity], result of:
            0.048385557 = score(doc=2661,freq=2.0), product of:
              0.12162324 = queryWeight, product of:
                1.3805033 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.019573791 = queryNorm
              0.3978315 = fieldWeight in 2661, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=2661)
          0.16465516 = weight(abstract_txt:semantic in 2661) [ClassicSimilarity], result of:
            0.16465516 = score(doc=2661,freq=6.0), product of:
              0.24037679 = queryWeight, product of:
                2.7446718 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.019573791 = queryNorm
              0.6849878 = fieldWeight in 2661, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=2661)
          0.18876988 = weight(abstract_txt:association in 2661) [ClassicSimilarity], result of:
            0.18876988 = score(doc=2661,freq=2.0), product of:
              0.3797559 = queryWeight, product of:
                3.44982 = boost
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.019573791 = queryNorm
              0.49708214 = fieldWeight in 2661, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.0625 = fieldNorm(doc=2661)
          0.25687099 = weight(abstract_txt:linguistic in 2661) [ClassicSimilarity], result of:
            0.25687099 = score(doc=2661,freq=3.0), product of:
              0.40737623 = queryWeight, product of:
                3.5730739 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.019573791 = queryNorm
              0.6305498 = fieldWeight in 2661, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=2661)
        0.24 = coord(6/25)
    
  4. Atlam, E.-S.; Morita, K.; Fuketa, M.; Aoe, J.-i.: A new method for selecting English field association terms of compound words and its knowledge representation (2002) 0.16
    0.15936658 = sum of:
      0.15936658 = product of:
        0.66402745 = sum of:
          0.1394795 = weight(abstract_txt:compound in 2590) [ClassicSimilarity], result of:
            0.1394795 = score(doc=2590,freq=2.0), product of:
              0.16849862 = queryWeight, product of:
                1.1489797 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.019573791 = queryNorm
              0.82777834 = fieldWeight in 2590, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.078125 = fieldNorm(doc=2590)
          0.062032416 = weight(abstract_txt:terms in 2590) [ClassicSimilarity], result of:
            0.062032416 = score(doc=2590,freq=4.0), product of:
              0.09817521 = queryWeight, product of:
                1.2403096 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019573791 = queryNorm
              0.6318542 = fieldWeight in 2590, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2590)
          0.042767197 = weight(abstract_txt:method in 2590) [ClassicSimilarity], result of:
            0.042767197 = score(doc=2590,freq=1.0), product of:
              0.12162324 = queryWeight, product of:
                1.3805033 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.019573791 = queryNorm
              0.3516367 = fieldWeight in 2590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=2590)
          0.064956404 = weight(abstract_txt:proposed in 2590) [ClassicSimilarity], result of:
            0.064956404 = score(doc=2590,freq=2.0), product of:
              0.12755007 = queryWeight, product of:
                1.4137399 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.019573791 = queryNorm
              0.509262 = fieldWeight in 2590, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.078125 = fieldNorm(doc=2590)
          0.11882962 = weight(abstract_txt:semantic in 2590) [ClassicSimilarity], result of:
            0.11882962 = score(doc=2590,freq=2.0), product of:
              0.24037679 = queryWeight, product of:
                2.7446718 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.019573791 = queryNorm
              0.49434733 = fieldWeight in 2590, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=2590)
          0.23596235 = weight(abstract_txt:association in 2590) [ClassicSimilarity], result of:
            0.23596235 = score(doc=2590,freq=2.0), product of:
              0.3797559 = queryWeight, product of:
                3.44982 = boost
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.019573791 = queryNorm
              0.6213527 = fieldWeight in 2590, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.078125 = fieldNorm(doc=2590)
        0.24 = coord(6/25)
    
  5. Fu, T.; Abbasi, A.; Chen, H.: A hybrid approach to Web forum interactional coherence analysis (2008) 0.16
    0.15716442 = sum of:
      0.15716442 = product of:
        0.49113882 = sum of:
          0.017780513 = weight(abstract_txt:used in 1872) [ClassicSimilarity], result of:
            0.017780513 = score(doc=1872,freq=1.0), product of:
              0.06774942 = queryWeight, product of:
                1.0303433 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.019573791 = queryNorm
              0.26244524 = fieldWeight in 1872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.078125 = fieldNorm(doc=1872)
          0.018824598 = weight(abstract_txt:study in 1872) [ClassicSimilarity], result of:
            0.018824598 = score(doc=1872,freq=1.0), product of:
              0.070376314 = queryWeight, product of:
                1.0501285 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.019573791 = queryNorm
              0.26748484 = fieldWeight in 1872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.078125 = fieldNorm(doc=1872)
          0.03961889 = weight(abstract_txt:analysis in 1872) [ClassicSimilarity], result of:
            0.03961889 = score(doc=1872,freq=3.0), product of:
              0.08013776 = queryWeight, product of:
                1.1205926 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.019573791 = queryNorm
              0.4943848 = fieldWeight in 1872, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=1872)
          0.031016208 = weight(abstract_txt:terms in 1872) [ClassicSimilarity], result of:
            0.031016208 = score(doc=1872,freq=1.0), product of:
              0.09817521 = queryWeight, product of:
                1.2403096 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019573791 = queryNorm
              0.3159271 = fieldWeight in 1872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1872)
          0.042767197 = weight(abstract_txt:method in 1872) [ClassicSimilarity], result of:
            0.042767197 = score(doc=1872,freq=1.0), product of:
              0.12162324 = queryWeight, product of:
                1.3805033 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.019573791 = queryNorm
              0.3516367 = fieldWeight in 1872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=1872)
          0.045931112 = weight(abstract_txt:proposed in 1872) [ClassicSimilarity], result of:
            0.045931112 = score(doc=1872,freq=1.0), product of:
              0.12755007 = queryWeight, product of:
                1.4137399 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.019573791 = queryNorm
              0.36010262 = fieldWeight in 1872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.078125 = fieldNorm(doc=1872)
          0.109819636 = weight(abstract_txt:construct in 1872) [ClassicSimilarity], result of:
            0.109819636 = score(doc=1872,freq=1.0), product of:
              0.22806714 = queryWeight, product of:
                1.8904296 = boost
                6.163498 = idf(docFreq=252, maxDocs=44218)
                0.019573791 = queryNorm
              0.48152328 = fieldWeight in 1872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.163498 = idf(docFreq=252, maxDocs=44218)
                0.078125 = fieldNorm(doc=1872)
          0.18538068 = weight(abstract_txt:linguistic in 1872) [ClassicSimilarity], result of:
            0.18538068 = score(doc=1872,freq=1.0), product of:
              0.40737623 = queryWeight, product of:
                3.5730739 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.019573791 = queryNorm
              0.45506012 = fieldWeight in 1872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.078125 = fieldNorm(doc=1872)
        0.32 = coord(8/25)
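
The score trees above follow the classic Lucene TF-IDF pattern, and the arithmetic can be checked by hand. The sketch below recomputes the score of the first hit (doc 4394) from the values printed in its tree. It assumes the usual ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the figures in the listing. The function names are illustrative only, and boost, queryNorm and fieldNorm are copied from the listing rather than derived.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity idf; matches e.g. idf(docFreq=4177) = 3.3592992 above.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq):
    # Assumed ClassicSimilarity tf; matches tf(freq=3.0) = 1.7320508 above.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in each "product of" node.
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight

# Constants copied from the listing for doc 4394.
MAX_DOCS = 44218
QUERY_NORM = 0.019573791
FIELD_NORM = 0.078125

# (term, freq, docFreq, boost) for the six matching query terms.
matches_4394 = [
    ("used", 1.0, 4177, 1.0303433),
    ("terms", 1.0, 2106, 1.2403096),
    ("method", 1.0, 1333, 1.3805033),
    ("term", 3.0, 987, 2.2088933),
    ("association", 1.0, 433, 3.44982),
    ("linguistic", 5.0, 354, 3.5730739),
]

term_sum = sum(
    term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for _, freq, doc_freq, boost in matches_4394
)

# coord(6/25): 6 of the 25 query terms matched this document.
coord = 6 / 25
final_score = term_sum * coord

print(f"sum of term scores: {term_sum:.6f}")
print(f"final score:        {final_score:.6f}")
```

The same function applies to the other four hits once their own freq, fieldNorm and coord values are substituted. Small differences in the last digits are expected, because the listing was computed in single precision.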