Document (#27591)

Author
Atlam, E.-S.
Morita, K.
Fuketa, M.
Aoe, J.-i.
Title
¬A new method for selecting English field association terms of compound words and its knowledge representation
Source
Information processing and management. 38(2002) no.6, S.807-821
Year
2002
Abstract
This paper presents a strategy for building a morphological machine dictionary of English that infers meaning of derivations by considering morphological affixes and their semantic classification. Derivations are grouped into a frame that is accessible to semantic stem and knowledge base. This paper also proposes an efficient method for selecting compound Field Association (FA) terms from a large pool of single FA terms for some specialized fields. For single FA terms, five levels of association are defined and two ranks are defined, based on stability and inheritance. About 85% of redundant compound FA terms can be removed effectively by using levels and ranks proposed in this paper. Recall averages of 60-80% are achieved, depending on the type of text. The proposed methods are applied to 22,000 relationships between verbs and nouns extracted from the large tagged corpus.
Theme
Computerlinguistik

Similar documents (author)

  1. Atlam, E.S.: Similarity measurement using term negative weight and its application to word similarity (2000) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:atlam in 4844) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 4844, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=4844)
    
  2. El-Sayed Atlam -> Atlam, E.-S.: 5.25
    5.252987 = sum of:
      5.252987 = weight(author_txt:atlam in 2525) [ClassicSimilarity], result of:
        5.252987 = fieldWeight in 2525, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=2525)
    
  3. Atlam, E.-S.; Morita, K.; Fuketa, M.; Aoe, J.-i.: ¬A new approach for Arabic text classification using Arabic field-association terms (2011) 3.10
    3.0953524 = sum of:
      3.0953524 = weight(author_txt:atlam in 4927) [ClassicSimilarity], result of:
        3.0953524 = fieldWeight in 4927, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.3125 = fieldNorm(doc=4927)
    
  4. Rokaya, M.; Atlam, E.; Fuketa, M.; Dorji, T.C.; Aoe, J.-i.: Ranking of field association terms using Co-word analysis (2008) 2.48
    2.476282 = sum of:
      2.476282 = weight(author_txt:atlam in 2060) [ClassicSimilarity], result of:
        2.476282 = fieldWeight in 2060, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.25 = fieldNorm(doc=2060)
    

Similar documents (content)

  1. Atlam, E.-S.; Morita, K.; Fuketa, M.; Aoe, J.-i.: ¬A new approach for Arabic text classification using Arabic field-association terms (2011) 0.17
    0.17196122 = sum of:
      0.17196122 = product of:
        0.6141472 = sum of:
          0.052969523 = weight(abstract_txt:field in 4927) [ClassicSimilarity], result of:
            0.052969523 = score(doc=4927,freq=4.0), product of:
              0.094335854 = queryWeight, product of:
                1.1218758 = boost
                4.491995 = idf(docFreq=1345, maxDocs=44218)
                0.018719437 = queryNorm
              0.56149936 = fieldWeight in 4927, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.491995 = idf(docFreq=1345, maxDocs=44218)
                0.0625 = fieldNorm(doc=4927)
          0.037679557 = weight(abstract_txt:method in 4927) [ClassicSimilarity], result of:
            0.037679557 = score(doc=4927,freq=2.0), product of:
              0.09471235 = queryWeight, product of:
                1.1241122 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.018719437 = queryNorm
              0.3978315 = fieldWeight in 4927, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=4927)
          0.041669916 = weight(abstract_txt:single in 4927) [ClassicSimilarity], result of:
            0.041669916 = score(doc=4927,freq=1.0), product of:
              0.12761287 = queryWeight, product of:
                1.304829 = boost
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.018719437 = queryNorm
              0.3265338 = fieldWeight in 4927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.0625 = fieldNorm(doc=4927)
          0.050614078 = weight(abstract_txt:english in 4927) [ClassicSimilarity], result of:
            0.050614078 = score(doc=4927,freq=1.0), product of:
              0.14527592 = queryWeight, product of:
                1.392205 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.018719437 = queryNorm
              0.34839964 = fieldWeight in 4927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0625 = fieldNorm(doc=4927)
          0.110251375 = weight(abstract_txt:association in 4927) [ClassicSimilarity], result of:
            0.110251375 = score(doc=4927,freq=2.0), product of:
              0.2217971 = queryWeight, product of:
                2.106832 = boost
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.018719437 = queryNorm
              0.49708214 = fieldWeight in 4927, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.0625 = fieldNorm(doc=4927)
          0.13663243 = weight(abstract_txt:terms in 4927) [ClassicSimilarity], result of:
            0.13663243 = score(doc=4927,freq=8.0), product of:
              0.19113135 = queryWeight, product of:
                2.5248892 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018719437 = queryNorm
              0.7148614 = fieldWeight in 4927, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4927)
          0.1843303 = weight(abstract_txt:compound in 4927) [ClassicSimilarity], result of:
            0.1843303 = score(doc=4927,freq=1.0), product of:
              0.39364764 = queryWeight, product of:
                2.8067634 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.018719437 = queryNorm
              0.46826217 = fieldWeight in 4927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.0625 = fieldNorm(doc=4927)
        0.28 = coord(7/25)
    
  2. Li, N.; Sun, J.: Improving Chinese term association from the linguistic perspective (2017) 0.16
    0.15545091 = sum of:
      0.15545091 = product of:
        0.6477121 = sum of:
          0.06543345 = weight(abstract_txt:semantic in 3381) [ClassicSimilarity], result of:
            0.06543345 = score(doc=3381,freq=4.0), product of:
              0.093595 = queryWeight, product of:
                1.1174618 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.018719437 = queryNorm
              0.6991127 = fieldWeight in 3381, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=3381)
          0.04709944 = weight(abstract_txt:method in 3381) [ClassicSimilarity], result of:
            0.04709944 = score(doc=3381,freq=2.0), product of:
              0.09471235 = queryWeight, product of:
                1.1241122 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.018719437 = queryNorm
              0.49728936 = fieldWeight in 3381, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=3381)
          0.05058387 = weight(abstract_txt:proposed in 3381) [ClassicSimilarity], result of:
            0.05058387 = score(doc=3381,freq=2.0), product of:
              0.09932779 = queryWeight, product of:
                1.1511761 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.018719437 = queryNorm
              0.509262 = fieldWeight in 3381, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.078125 = fieldNorm(doc=3381)
          0.16878726 = weight(abstract_txt:association in 3381) [ClassicSimilarity], result of:
            0.16878726 = score(doc=3381,freq=3.0), product of:
              0.2217971 = queryWeight, product of:
                2.106832 = boost
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.018719437 = queryNorm
              0.7609985 = fieldWeight in 3381, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.078125 = fieldNorm(doc=3381)
          0.08539527 = weight(abstract_txt:terms in 3381) [ClassicSimilarity], result of:
            0.08539527 = score(doc=3381,freq=2.0), product of:
              0.19113135 = queryWeight, product of:
                2.5248892 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018719437 = queryNorm
              0.44678837 = fieldWeight in 3381, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=3381)
          0.23041286 = weight(abstract_txt:compound in 3381) [ClassicSimilarity], result of:
            0.23041286 = score(doc=3381,freq=1.0), product of:
              0.39364764 = queryWeight, product of:
                2.8067634 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.018719437 = queryNorm
              0.5853277 = fieldWeight in 3381, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.078125 = fieldNorm(doc=3381)
        0.24 = coord(6/25)
    
  3. Seo, H.-C.; Kim, S.-B.; Rim, H.-C.; Myaeng, S.-H.: lmproving query translation in English-Korean Cross-language information retrieval (2005) 0.14
    0.13641535 = sum of:
      0.13641535 = product of:
        0.5683973 = sum of:
          0.066608675 = weight(abstract_txt:method in 1023) [ClassicSimilarity], result of:
            0.066608675 = score(doc=1023,freq=4.0), product of:
              0.09471235 = queryWeight, product of:
                1.1241122 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.018719437 = queryNorm
              0.7032734 = fieldWeight in 1023, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=1023)
          0.02283928 = weight(abstract_txt:paper in 1023) [ClassicSimilarity], result of:
            0.02283928 = score(doc=1023,freq=1.0), product of:
              0.0843124 = queryWeight, product of:
                1.2989658 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.018719437 = queryNorm
              0.27088875 = fieldWeight in 1023, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.078125 = fieldNorm(doc=1023)
          0.063267596 = weight(abstract_txt:english in 1023) [ClassicSimilarity], result of:
            0.063267596 = score(doc=1023,freq=1.0), product of:
              0.14527592 = queryWeight, product of:
                1.392205 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.018719437 = queryNorm
              0.43549955 = fieldWeight in 1023, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.078125 = fieldNorm(doc=1023)
          0.14284575 = weight(abstract_txt:selecting in 1023) [ClassicSimilarity], result of:
            0.14284575 = score(doc=1023,freq=2.0), product of:
              0.19844536 = queryWeight, product of:
                1.6271472 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.018719437 = queryNorm
              0.7198241 = fieldWeight in 1023, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.078125 = fieldNorm(doc=1023)
          0.13781422 = weight(abstract_txt:association in 1023) [ClassicSimilarity], result of:
            0.13781422 = score(doc=1023,freq=2.0), product of:
              0.2217971 = queryWeight, product of:
                2.106832 = boost
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.018719437 = queryNorm
              0.6213527 = fieldWeight in 1023, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6238427 = idf(docFreq=433, maxDocs=44218)
                0.078125 = fieldNorm(doc=1023)
          0.13502178 = weight(abstract_txt:terms in 1023) [ClassicSimilarity], result of:
            0.13502178 = score(doc=1023,freq=5.0), product of:
              0.19113135 = queryWeight, product of:
                2.5248892 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018719437 = queryNorm
              0.7064345 = fieldWeight in 1023, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1023)
        0.24 = coord(6/25)
    
  4. Bounhas, I.; Elayeb, B.; Evrard, F.; Slimani, Y.: Organizing contextual knowledge for Arabic text disambiguation and terminology extraction (2011) 0.12
    0.12429481 = sum of:
      0.12429481 = product of:
        0.621474 = sum of:
          0.12995183 = weight(abstract_txt:nouns in 4846) [ClassicSimilarity], result of:
            0.12995183 = score(doc=4846,freq=3.0), product of:
              0.1499054 = queryWeight, product of:
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.018719437 = queryNorm
              0.8668923 = fieldWeight in 4846, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=4846)
          0.026173381 = weight(abstract_txt:semantic in 4846) [ClassicSimilarity], result of:
            0.026173381 = score(doc=4846,freq=1.0), product of:
              0.093595 = queryWeight, product of:
                1.1174618 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.018719437 = queryNorm
              0.2796451 = fieldWeight in 4846, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=4846)
          0.018271426 = weight(abstract_txt:paper in 4846) [ClassicSimilarity], result of:
            0.018271426 = score(doc=4846,freq=1.0), product of:
              0.0843124 = queryWeight, product of:
                1.2989658 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.018719437 = queryNorm
              0.216711 = fieldWeight in 4846, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.0625 = fieldNorm(doc=4846)
          0.12780793 = weight(abstract_txt:terms in 4846) [ClassicSimilarity], result of:
            0.12780793 = score(doc=4846,freq=7.0), product of:
              0.19113135 = queryWeight, product of:
                2.5248892 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018719437 = queryNorm
              0.6686916 = fieldWeight in 4846, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4846)
          0.31926945 = weight(abstract_txt:compound in 4846) [ClassicSimilarity], result of:
            0.31926945 = score(doc=4846,freq=3.0), product of:
              0.39364764 = queryWeight, product of:
                2.8067634 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.018719437 = queryNorm
              0.8110539 = fieldWeight in 4846, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.0625 = fieldNorm(doc=4846)
        0.2 = coord(5/25)
    
  5. Kettunen, K.; Kunttu, T.; Järvelin, K.: To stem or lemmatize a highly inflectional language in a probabilistic IR environment? (2005) 0.11
    0.10770222 = sum of:
      0.10770222 = product of:
        0.67313886 = sum of:
          0.14679621 = weight(abstract_txt:stem in 4395) [ClassicSimilarity], result of:
            0.14679621 = score(doc=4395,freq=5.0), product of:
              0.1499054 = queryWeight, product of:
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.018719437 = queryNorm
              0.979259 = fieldWeight in 4395, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.04037936 = weight(abstract_txt:method in 4395) [ClassicSimilarity], result of:
            0.04037936 = score(doc=4395,freq=3.0), product of:
              0.09471235 = queryWeight, product of:
                1.1241122 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.018719437 = queryNorm
              0.42633682 = fieldWeight in 4395, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.3246743 = weight(abstract_txt:morphological in 4395) [ClassicSimilarity], result of:
            0.3246743 = score(doc=4395,freq=6.0), product of:
              0.30170944 = queryWeight, product of:
                2.0063229 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.018719437 = queryNorm
              1.0761158 = fieldWeight in 4395, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
          0.161289 = weight(abstract_txt:compound in 4395) [ClassicSimilarity], result of:
            0.161289 = score(doc=4395,freq=1.0), product of:
              0.39364764 = queryWeight, product of:
                2.8067634 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.018719437 = queryNorm
              0.4097294 = fieldWeight in 4395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4395)
        0.16 = coord(4/25)