Document (#15823)

Author
Basili, R.
Pazienza, M.T.
Velardi, P.
Title
¬An empirical symbolic approach to natural language processing
Source
Artificial intelligence. 85(1996) nos.1/2, S.59-99
Year
1996
Abstract
Describes and evaluates the results of a large scale lexical learning system, ARISTO-LEX, that uses a combination of probabilisitc and knowledge based methods for the acquisition of selectional restrictions of words in sublanguages. Presents experimental data obtained from different corpora in different doamins and languages, and shows that the acquired lexical data not only have practical applications in natural language processing, but they are useful for a comparative analysis of sublanguages
Theme
Computerlinguistik

Similar documents (content)

  1. Stede, M.: Lexicalization in natural language generation (2002) 0.19
    0.18657276 = sum of:
      0.18657276 = product of:
        0.5830399 = sum of:
          0.027236186 = weight(abstract_txt:languages in 5246) [ClassicSimilarity], result of:
            0.027236186 = score(doc=5246,freq=1.0), product of:
              0.09592656 = queryWeight, product of:
                1.0752127 = boost
                5.191817 = idf(docFreq=643, maxDocs=42596)
                0.017184034 = queryNorm
              0.28392747 = fieldWeight in 5246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.191817 = idf(docFreq=643, maxDocs=42596)
                0.0546875 = fieldNorm(doc=5246)
          0.051818963 = weight(abstract_txt:words in 5246) [ClassicSimilarity], result of:
            0.051818963 = score(doc=5246,freq=3.0), product of:
              0.102123745 = queryWeight, product of:
                1.1094004 = boost
                5.356897 = idf(docFreq=545, maxDocs=42596)
                0.017184034 = queryNorm
              0.50741345 = fieldWeight in 5246, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.356897 = idf(docFreq=545, maxDocs=42596)
                0.0546875 = fieldNorm(doc=5246)
          0.032689635 = weight(abstract_txt:scale in 5246) [ClassicSimilarity], result of:
            0.032689635 = score(doc=5246,freq=1.0), product of:
              0.10833815 = queryWeight, product of:
                1.1426564 = boost
                5.517478 = idf(docFreq=464, maxDocs=42596)
                0.017184034 = queryNorm
              0.30173707 = fieldWeight in 5246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.517478 = idf(docFreq=464, maxDocs=42596)
                0.0546875 = fieldNorm(doc=5246)
          0.019614438 = weight(abstract_txt:different in 5246) [ClassicSimilarity], result of:
            0.019614438 = score(doc=5246,freq=1.0), product of:
              0.09710358 = queryWeight, product of:
                1.5298808 = boost
                3.6936228 = idf(docFreq=2880, maxDocs=42596)
                0.017184034 = queryNorm
              0.201995 = fieldWeight in 5246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6936228 = idf(docFreq=2880, maxDocs=42596)
                0.0546875 = fieldNorm(doc=5246)
          0.06425413 = weight(abstract_txt:language in 5246) [ClassicSimilarity], result of:
            0.06425413 = score(doc=5246,freq=5.0), product of:
              0.12525508 = queryWeight, product of:
                1.7375512 = boost
                4.195006 = idf(docFreq=1744, maxDocs=42596)
                0.017184034 = queryNorm
              0.5129862 = fieldWeight in 5246, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.195006 = idf(docFreq=1744, maxDocs=42596)
                0.0546875 = fieldNorm(doc=5246)
          0.047216035 = weight(abstract_txt:processing in 5246) [ClassicSimilarity], result of:
            0.047216035 = score(doc=5246,freq=1.0), product of:
              0.17441253 = queryWeight, product of:
                2.0503538 = boost
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.017184034 = queryNorm
              0.2707147 = fieldWeight in 5246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.0546875 = fieldNorm(doc=5246)
          0.07332175 = weight(abstract_txt:natural in 5246) [ClassicSimilarity], result of:
            0.07332175 = score(doc=5246,freq=2.0), product of:
              0.18563645 = queryWeight, product of:
                2.115298 = boost
                5.107008 = idf(docFreq=700, maxDocs=42596)
                0.017184034 = queryNorm
              0.39497498 = fieldWeight in 5246, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.107008 = idf(docFreq=700, maxDocs=42596)
                0.0546875 = fieldNorm(doc=5246)
          0.26688877 = weight(abstract_txt:lexical in 5246) [ClassicSimilarity], result of:
            0.26688877 = score(doc=5246,freq=6.0), product of:
              0.30457047 = queryWeight, product of:
                2.709467 = boost
                6.541522 = idf(docFreq=166, maxDocs=42596)
                0.017184034 = queryNorm
              0.87627923 = fieldWeight in 5246, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.541522 = idf(docFreq=166, maxDocs=42596)
                0.0546875 = fieldNorm(doc=5246)
        0.32 = coord(8/25)
    
  2. Markó, K.G.: Foundation, implementation and evaluation of the MorphoSaurus system (2008) 0.16
    0.15932338 = sum of:
      0.15932338 = product of:
        0.4425649 = sum of:
          0.043501407 = weight(abstract_txt:languages in 416) [ClassicSimilarity], result of:
            0.043501407 = score(doc=416,freq=5.0), product of:
              0.09592656 = queryWeight, product of:
                1.0752127 = boost
                5.191817 = idf(docFreq=643, maxDocs=42596)
                0.017184034 = queryNorm
              0.45348656 = fieldWeight in 416, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.191817 = idf(docFreq=643, maxDocs=42596)
                0.0390625 = fieldNorm(doc=416)
          0.023349741 = weight(abstract_txt:scale in 416) [ClassicSimilarity], result of:
            0.023349741 = score(doc=416,freq=1.0), product of:
              0.10833815 = queryWeight, product of:
                1.1426564 = boost
                5.517478 = idf(docFreq=464, maxDocs=42596)
                0.017184034 = queryNorm
              0.21552649 = fieldWeight in 416, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.517478 = idf(docFreq=464, maxDocs=42596)
                0.0390625 = fieldNorm(doc=416)
          0.034050617 = weight(abstract_txt:acquisition in 416) [ClassicSimilarity], result of:
            0.034050617 = score(doc=416,freq=1.0), product of:
              0.1393189 = queryWeight, product of:
                1.2957761 = boost
                6.2568383 = idf(docFreq=221, maxDocs=42596)
                0.017184034 = queryNorm
              0.24440774 = fieldWeight in 416, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2568383 = idf(docFreq=221, maxDocs=42596)
                0.0390625 = fieldNorm(doc=416)
          0.0505522 = weight(abstract_txt:acquired in 416) [ClassicSimilarity], result of:
            0.0505522 = score(doc=416,freq=1.0), product of:
              0.1813093 = queryWeight, product of:
                1.4782062 = boost
                7.1377273 = idf(docFreq=91, maxDocs=42596)
                0.017184034 = queryNorm
              0.27881747 = fieldWeight in 416, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1377273 = idf(docFreq=91, maxDocs=42596)
                0.0390625 = fieldNorm(doc=416)
          0.019813573 = weight(abstract_txt:different in 416) [ClassicSimilarity], result of:
            0.019813573 = score(doc=416,freq=2.0), product of:
              0.09710358 = queryWeight, product of:
                1.5298808 = boost
                3.6936228 = idf(docFreq=2880, maxDocs=42596)
                0.017184034 = queryNorm
              0.20404576 = fieldWeight in 416, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6936228 = idf(docFreq=2880, maxDocs=42596)
                0.0390625 = fieldNorm(doc=416)
          0.041050453 = weight(abstract_txt:language in 416) [ClassicSimilarity], result of:
            0.041050453 = score(doc=416,freq=4.0), product of:
              0.12525508 = queryWeight, product of:
                1.7375512 = boost
                4.195006 = idf(docFreq=1744, maxDocs=42596)
                0.017184034 = queryNorm
              0.32773483 = fieldWeight in 416, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.195006 = idf(docFreq=1744, maxDocs=42596)
                0.0390625 = fieldNorm(doc=416)
          0.05841469 = weight(abstract_txt:processing in 416) [ClassicSimilarity], result of:
            0.05841469 = score(doc=416,freq=3.0), product of:
              0.17441253 = queryWeight, product of:
                2.0503538 = boost
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.017184034 = queryNorm
              0.33492255 = fieldWeight in 416, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.0390625 = fieldNorm(doc=416)
          0.037033077 = weight(abstract_txt:natural in 416) [ClassicSimilarity], result of:
            0.037033077 = score(doc=416,freq=1.0), product of:
              0.18563645 = queryWeight, product of:
                2.115298 = boost
                5.107008 = idf(docFreq=700, maxDocs=42596)
                0.017184034 = queryNorm
              0.1994925 = fieldWeight in 416, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.107008 = idf(docFreq=700, maxDocs=42596)
                0.0390625 = fieldNorm(doc=416)
          0.13479917 = weight(abstract_txt:lexical in 416) [ClassicSimilarity], result of:
            0.13479917 = score(doc=416,freq=3.0), product of:
              0.30457047 = queryWeight, product of:
                2.709467 = boost
                6.541522 = idf(docFreq=166, maxDocs=42596)
                0.017184034 = queryNorm
              0.4425878 = fieldWeight in 416, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.541522 = idf(docFreq=166, maxDocs=42596)
                0.0390625 = fieldNorm(doc=416)
        0.36 = coord(9/25)
    
  3. Conlon, S.P.N.; Evens, M.; Ahlswede, T.: Developing a large lexical database for information retrieval, parsing, and text generation systems (1993) 0.15
    0.14660716 = sum of:
      0.14660716 = product of:
        0.6108632 = sum of:
          0.037505616 = weight(abstract_txt:shows in 5813) [ClassicSimilarity], result of:
            0.037505616 = score(doc=5813,freq=1.0), product of:
              0.09360612 = queryWeight, product of:
                1.0621285 = boost
                5.128638 = idf(docFreq=685, maxDocs=42596)
                0.017184034 = queryNorm
              0.40067482 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.128638 = idf(docFreq=685, maxDocs=42596)
                0.078125 = fieldNorm(doc=5813)
          0.042739563 = weight(abstract_txt:words in 5813) [ClassicSimilarity], result of:
            0.042739563 = score(doc=5813,freq=1.0), product of:
              0.102123745 = queryWeight, product of:
                1.1094004 = boost
                5.356897 = idf(docFreq=545, maxDocs=42596)
                0.017184034 = queryNorm
              0.41850758 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.356897 = idf(docFreq=545, maxDocs=42596)
                0.078125 = fieldNorm(doc=5813)
          0.041050453 = weight(abstract_txt:language in 5813) [ClassicSimilarity], result of:
            0.041050453 = score(doc=5813,freq=1.0), product of:
              0.12525508 = queryWeight, product of:
                1.7375512 = boost
                4.195006 = idf(docFreq=1744, maxDocs=42596)
                0.017184034 = queryNorm
              0.32773483 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.195006 = idf(docFreq=1744, maxDocs=42596)
                0.078125 = fieldNorm(doc=5813)
          0.06745148 = weight(abstract_txt:processing in 5813) [ClassicSimilarity], result of:
            0.06745148 = score(doc=5813,freq=1.0), product of:
              0.17441253 = queryWeight, product of:
                2.0503538 = boost
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.017184034 = queryNorm
              0.38673526 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.078125 = fieldNorm(doc=5813)
          0.074066155 = weight(abstract_txt:natural in 5813) [ClassicSimilarity], result of:
            0.074066155 = score(doc=5813,freq=1.0), product of:
              0.18563645 = queryWeight, product of:
                2.115298 = boost
                5.107008 = idf(docFreq=700, maxDocs=42596)
                0.017184034 = queryNorm
              0.398985 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.107008 = idf(docFreq=700, maxDocs=42596)
                0.078125 = fieldNorm(doc=5813)
          0.34804997 = weight(abstract_txt:lexical in 5813) [ClassicSimilarity], result of:
            0.34804997 = score(doc=5813,freq=5.0), product of:
              0.30457047 = queryWeight, product of:
                2.709467 = boost
                6.541522 = idf(docFreq=166, maxDocs=42596)
                0.017184034 = queryNorm
              1.1427568 = fieldWeight in 5813, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.541522 = idf(docFreq=166, maxDocs=42596)
                0.078125 = fieldNorm(doc=5813)
        0.24 = coord(6/25)
    
  4. Sánchez-de-Madariaga, R.; Fernández-del-Castillo, J.R.: ¬The bootstrapping of the Yarowsky algorithm in real corpora (2009) 0.15
    0.14509204 = sum of:
      0.14509204 = product of:
        0.6045502 = sum of:
          0.04500674 = weight(abstract_txt:shows in 3631) [ClassicSimilarity], result of:
            0.04500674 = score(doc=3631,freq=1.0), product of:
              0.09360612 = queryWeight, product of:
                1.0621285 = boost
                5.128638 = idf(docFreq=685, maxDocs=42596)
                0.017184034 = queryNorm
              0.4808098 = fieldWeight in 3631, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.128638 = idf(docFreq=685, maxDocs=42596)
                0.09375 = fieldNorm(doc=3631)
          0.081721485 = weight(abstract_txt:acquisition in 3631) [ClassicSimilarity], result of:
            0.081721485 = score(doc=3631,freq=1.0), product of:
              0.1393189 = queryWeight, product of:
                1.2957761 = boost
                6.2568383 = idf(docFreq=221, maxDocs=42596)
                0.017184034 = queryNorm
              0.5865786 = fieldWeight in 3631, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2568383 = idf(docFreq=221, maxDocs=42596)
                0.09375 = fieldNorm(doc=3631)
          0.23833588 = weight(abstract_txt:corpora in 3631) [ClassicSimilarity], result of:
            0.23833588 = score(doc=3631,freq=4.0), product of:
              0.17915358 = queryWeight, product of:
                1.4693921 = boost
                7.0951676 = idf(docFreq=95, maxDocs=42596)
                0.017184034 = queryNorm
              1.330344 = fieldWeight in 3631, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.0951676 = idf(docFreq=95, maxDocs=42596)
                0.09375 = fieldNorm(doc=3631)
          0.06966493 = weight(abstract_txt:language in 3631) [ClassicSimilarity], result of:
            0.06966493 = score(doc=3631,freq=2.0), product of:
              0.12525508 = queryWeight, product of:
                1.7375512 = boost
                4.195006 = idf(docFreq=1744, maxDocs=42596)
                0.017184034 = queryNorm
              0.5561845 = fieldWeight in 3631, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.195006 = idf(docFreq=1744, maxDocs=42596)
                0.09375 = fieldNorm(doc=3631)
          0.080941774 = weight(abstract_txt:processing in 3631) [ClassicSimilarity], result of:
            0.080941774 = score(doc=3631,freq=1.0), product of:
              0.17441253 = queryWeight, product of:
                2.0503538 = boost
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.017184034 = queryNorm
              0.46408233 = fieldWeight in 3631, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.09375 = fieldNorm(doc=3631)
          0.08887939 = weight(abstract_txt:natural in 3631) [ClassicSimilarity], result of:
            0.08887939 = score(doc=3631,freq=1.0), product of:
              0.18563645 = queryWeight, product of:
                2.115298 = boost
                5.107008 = idf(docFreq=700, maxDocs=42596)
                0.017184034 = queryNorm
              0.478782 = fieldWeight in 3631, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.107008 = idf(docFreq=700, maxDocs=42596)
                0.09375 = fieldNorm(doc=3631)
        0.24 = coord(6/25)
    
  5. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.14
    0.14164406 = sum of:
      0.14164406 = product of:
        0.4426377 = sum of:
          0.025041195 = weight(abstract_txt:practical in 602) [ClassicSimilarity], result of:
            0.025041195 = score(doc=602,freq=1.0), product of:
              0.08297554 = queryWeight, product of:
                4.8286414 = idf(docFreq=925, maxDocs=42596)
                0.017184034 = queryNorm
              0.3017901 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8286414 = idf(docFreq=925, maxDocs=42596)
                0.0625 = fieldNorm(doc=602)
          0.053913668 = weight(abstract_txt:languages in 602) [ClassicSimilarity], result of:
            0.053913668 = score(doc=602,freq=3.0), product of:
              0.09592656 = queryWeight, product of:
                1.0752127 = boost
                5.191817 = idf(docFreq=643, maxDocs=42596)
                0.017184034 = queryNorm
              0.5620307 = fieldWeight in 602, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.191817 = idf(docFreq=643, maxDocs=42596)
                0.0625 = fieldNorm(doc=602)
          0.03419165 = weight(abstract_txt:words in 602) [ClassicSimilarity], result of:
            0.03419165 = score(doc=602,freq=1.0), product of:
              0.102123745 = queryWeight, product of:
                1.1094004 = boost
                5.356897 = idf(docFreq=545, maxDocs=42596)
                0.017184034 = queryNorm
              0.33480605 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.356897 = idf(docFreq=545, maxDocs=42596)
                0.0625 = fieldNorm(doc=602)
          0.037359584 = weight(abstract_txt:scale in 602) [ClassicSimilarity], result of:
            0.037359584 = score(doc=602,freq=1.0), product of:
              0.10833815 = queryWeight, product of:
                1.1426564 = boost
                5.517478 = idf(docFreq=464, maxDocs=42596)
                0.017184034 = queryNorm
              0.34484237 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.517478 = idf(docFreq=464, maxDocs=42596)
                0.0625 = fieldNorm(doc=602)
          0.11235261 = weight(abstract_txt:corpora in 602) [ClassicSimilarity], result of:
            0.11235261 = score(doc=602,freq=2.0), product of:
              0.17915358 = queryWeight, product of:
                1.4693921 = boost
                7.0951676 = idf(docFreq=95, maxDocs=42596)
                0.017184034 = queryNorm
              0.62713015 = fieldWeight in 602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0951676 = idf(docFreq=95, maxDocs=42596)
                0.0625 = fieldNorm(doc=602)
          0.0224165 = weight(abstract_txt:different in 602) [ClassicSimilarity], result of:
            0.0224165 = score(doc=602,freq=1.0), product of:
              0.09710358 = queryWeight, product of:
                1.5298808 = boost
                3.6936228 = idf(docFreq=2880, maxDocs=42596)
                0.017184034 = queryNorm
              0.23085143 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6936228 = idf(docFreq=2880, maxDocs=42596)
                0.0625 = fieldNorm(doc=602)
          0.032840364 = weight(abstract_txt:language in 602) [ClassicSimilarity], result of:
            0.032840364 = score(doc=602,freq=1.0), product of:
              0.12525508 = queryWeight, product of:
                1.7375512 = boost
                4.195006 = idf(docFreq=1744, maxDocs=42596)
                0.017184034 = queryNorm
              0.26218787 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.195006 = idf(docFreq=1744, maxDocs=42596)
                0.0625 = fieldNorm(doc=602)
          0.12452215 = weight(abstract_txt:lexical in 602) [ClassicSimilarity], result of:
            0.12452215 = score(doc=602,freq=1.0), product of:
              0.30457047 = queryWeight, product of:
                2.709467 = boost
                6.541522 = idf(docFreq=166, maxDocs=42596)
                0.017184034 = queryNorm
              0.40884513 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.541522 = idf(docFreq=166, maxDocs=42596)
                0.0625 = fieldNorm(doc=602)
        0.32 = coord(8/25)