Document (#15820)

Author
Basili, R.
Pazienza, M.T.
Velardi, P.
Title
¬An empirical symbolic approach to natural language processing
Source
Artificial intelligence. 85(1996) nos.1/2, pp.59-99
Year
1996
Abstract
Describes and evaluates the results of a large-scale lexical learning system, ARISTO-LEX, that uses a combination of probabilistic and knowledge-based methods for the acquisition of selectional restrictions of words in sublanguages. Presents experimental data obtained from different corpora in different domains and languages, and shows that the acquired lexical data not only have practical applications in natural language processing but are also useful for a comparative analysis of sublanguages.
Theme
Computational linguistics

Similar documents (content)

  1. Stede, M.: Lexicalization in natural language generation (2002) 0.19
    0.18507282 = sum of:
      0.18507282 = product of:
        0.5783526 = sum of:
          0.027201958 = weight(abstract_txt:languages in 243) [ClassicSimilarity], result of:
            0.027201958 = score(doc=243,freq=1.0), product of:
              0.095819615 = queryWeight, product of:
                1.0797642 = boost
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.01709495 = queryNorm
              0.28388715 = fieldWeight in 243, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.0546875 = fieldNorm(doc=243)
          0.051688064 = weight(abstract_txt:words in 243) [ClassicSimilarity], result of:
            0.051688064 = score(doc=243,freq=3.0), product of:
              0.10192345 = queryWeight, product of:
                1.1136246 = boost
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.01709495 = queryNorm
              0.50712633 = fieldWeight in 243, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.0546875 = fieldNorm(doc=243)
          0.03234807 = weight(abstract_txt:scale in 243) [ClassicSimilarity], result of:
            0.03234807 = score(doc=243,freq=1.0), product of:
              0.10755235 = queryWeight, product of:
                1.1439623 = boost
                5.4997177 = idf(docFreq=483, maxDocs=43556)
                0.01709495 = queryNorm
              0.3007658 = fieldWeight in 243, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4997177 = idf(docFreq=483, maxDocs=43556)
                0.0546875 = fieldNorm(doc=243)
          0.019336358 = weight(abstract_txt:different in 243) [ClassicSimilarity], result of:
            0.019336358 = score(doc=243,freq=1.0), product of:
              0.09615697 = queryWeight, product of:
                1.529703 = boost
                3.6771033 = idf(docFreq=2994, maxDocs=43556)
                0.01709495 = queryNorm
              0.20109159 = fieldWeight in 243, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6771033 = idf(docFreq=2994, maxDocs=43556)
                0.0546875 = fieldNorm(doc=243)
          0.063978374 = weight(abstract_txt:language in 243) [ClassicSimilarity], result of:
            0.063978374 = score(doc=243,freq=5.0), product of:
              0.12486185 = queryWeight, product of:
                1.7431374 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.01709495 = queryNorm
              0.5123933 = fieldWeight in 243, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.0546875 = fieldNorm(doc=243)
          0.04698945 = weight(abstract_txt:processing in 243) [ClassicSimilarity], result of:
            0.04698945 = score(doc=243,freq=1.0), product of:
              0.1738059 = queryWeight, product of:
                2.056596 = boost
                4.9436502 = idf(docFreq=843, maxDocs=43556)
                0.01709495 = queryNorm
              0.27035588 = fieldWeight in 243, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9436502 = idf(docFreq=843, maxDocs=43556)
                0.0546875 = fieldNorm(doc=243)
          0.073128924 = weight(abstract_txt:natural in 243) [ClassicSimilarity], result of:
            0.073128924 = score(doc=243,freq=2.0), product of:
              0.18525949 = queryWeight, product of:
                2.1232786 = boost
                5.1039414 = idf(docFreq=718, maxDocs=43556)
                0.01709495 = queryNorm
              0.3947378 = fieldWeight in 243, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1039414 = idf(docFreq=718, maxDocs=43556)
                0.0546875 = fieldNorm(doc=243)
          0.26368135 = weight(abstract_txt:lexical in 243) [ClassicSimilarity], result of:
            0.26368135 = score(doc=243,freq=6.0), product of:
              0.30204165 = queryWeight, product of:
                2.7111287 = boost
                6.517017 = idf(docFreq=174, maxDocs=43556)
                0.01709495 = queryNorm
              0.8729966 = fieldWeight in 243, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.517017 = idf(docFreq=174, maxDocs=43556)
                0.0546875 = fieldNorm(doc=243)
        0.32 = coord(8/25)
    
  2. Markó, K.G.: Foundation, implementation and evaluation of the MorphoSaurus system (2008) 0.16
    0.15836897 = sum of:
      0.15836897 = product of:
        0.4399138 = sum of:
          0.04344673 = weight(abstract_txt:languages in 1413) [ClassicSimilarity], result of:
            0.04344673 = score(doc=1413,freq=5.0), product of:
              0.095819615 = queryWeight, product of:
                1.0797642 = boost
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.01709495 = queryNorm
              0.4534221 = fieldWeight in 1413, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.0390625 = fieldNorm(doc=1413)
          0.023105765 = weight(abstract_txt:scale in 1413) [ClassicSimilarity], result of:
            0.023105765 = score(doc=1413,freq=1.0), product of:
              0.10755235 = queryWeight, product of:
                1.1439623 = boost
                5.4997177 = idf(docFreq=483, maxDocs=43556)
                0.01709495 = queryNorm
              0.21483272 = fieldWeight in 1413, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4997177 = idf(docFreq=483, maxDocs=43556)
                0.0390625 = fieldNorm(doc=1413)
          0.0339509 = weight(abstract_txt:acquisition in 1413) [ClassicSimilarity], result of:
            0.0339509 = score(doc=1413,freq=1.0), product of:
              0.13900824 = queryWeight, product of:
                1.3005348 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.01709495 = queryNorm
              0.2442366 = fieldWeight in 1413, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.0390625 = fieldNorm(doc=1413)
          0.050754208 = weight(abstract_txt:acquired in 1413) [ClassicSimilarity], result of:
            0.050754208 = score(doc=1413,freq=1.0), product of:
              0.18174161 = queryWeight, product of:
                1.4870615 = boost
                7.1492033 = idf(docFreq=92, maxDocs=43556)
                0.01709495 = queryNorm
              0.27926576 = fieldWeight in 1413, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1492033 = idf(docFreq=92, maxDocs=43556)
                0.0390625 = fieldNorm(doc=1413)
          0.01953267 = weight(abstract_txt:different in 1413) [ClassicSimilarity], result of:
            0.01953267 = score(doc=1413,freq=2.0), product of:
              0.09615697 = queryWeight, product of:
                1.529703 = boost
                3.6771033 = idf(docFreq=2994, maxDocs=43556)
                0.01709495 = queryNorm
              0.20313317 = fieldWeight in 1413, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6771033 = idf(docFreq=2994, maxDocs=43556)
                0.0390625 = fieldNorm(doc=1413)
          0.04087428 = weight(abstract_txt:language in 1413) [ClassicSimilarity], result of:
            0.04087428 = score(doc=1413,freq=4.0), product of:
              0.12486185 = queryWeight, product of:
                1.7431374 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.01709495 = queryNorm
              0.32735604 = fieldWeight in 1413, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.0390625 = fieldNorm(doc=1413)
          0.058134366 = weight(abstract_txt:processing in 1413) [ClassicSimilarity], result of:
            0.058134366 = score(doc=1413,freq=3.0), product of:
              0.1738059 = queryWeight, product of:
                2.056596 = boost
                4.9436502 = idf(docFreq=843, maxDocs=43556)
                0.01709495 = queryNorm
              0.33447865 = fieldWeight in 1413, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9436502 = idf(docFreq=843, maxDocs=43556)
                0.0390625 = fieldNorm(doc=1413)
          0.036935687 = weight(abstract_txt:natural in 1413) [ClassicSimilarity], result of:
            0.036935687 = score(doc=1413,freq=1.0), product of:
              0.18525949 = queryWeight, product of:
                2.1232786 = boost
                5.1039414 = idf(docFreq=718, maxDocs=43556)
                0.01709495 = queryNorm
              0.19937271 = fieldWeight in 1413, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1039414 = idf(docFreq=718, maxDocs=43556)
                0.0390625 = fieldNorm(doc=1413)
          0.13317919 = weight(abstract_txt:lexical in 1413) [ClassicSimilarity], result of:
            0.13317919 = score(doc=1413,freq=3.0), product of:
              0.30204165 = queryWeight, product of:
                2.7111287 = boost
                6.517017 = idf(docFreq=174, maxDocs=43556)
                0.01709495 = queryNorm
              0.44092986 = fieldWeight in 1413, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.517017 = idf(docFreq=174, maxDocs=43556)
                0.0390625 = fieldNorm(doc=1413)
        0.36 = coord(9/25)
    
  3. Conlon, S.P.N.; Evens, M.; Ahlswede, T.: Developing a large lexical database for information retrieval, parsing, and text generation systems (1993) 0.15
    0.14536189 = sum of:
      0.14536189 = product of:
        0.6056745 = sum of:
          0.03730228 = weight(abstract_txt:shows in 5810) [ClassicSimilarity], result of:
            0.03730228 = score(doc=5810,freq=1.0), product of:
              0.09324165 = queryWeight, product of:
                1.06514 = boost
                5.120772 = idf(docFreq=706, maxDocs=43556)
                0.01709495 = queryNorm
              0.4000603 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.120772 = idf(docFreq=706, maxDocs=43556)
                0.078125 = fieldNorm(doc=5810)
          0.042631604 = weight(abstract_txt:words in 5810) [ClassicSimilarity], result of:
            0.042631604 = score(doc=5810,freq=1.0), product of:
              0.10192345 = queryWeight, product of:
                1.1136246 = boost
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.01709495 = queryNorm
              0.4182708 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.078125 = fieldNorm(doc=5810)
          0.04087428 = weight(abstract_txt:language in 5810) [ClassicSimilarity], result of:
            0.04087428 = score(doc=5810,freq=1.0), product of:
              0.12486185 = queryWeight, product of:
                1.7431374 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.01709495 = queryNorm
              0.32735604 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.078125 = fieldNorm(doc=5810)
          0.06712778 = weight(abstract_txt:processing in 5810) [ClassicSimilarity], result of:
            0.06712778 = score(doc=5810,freq=1.0), product of:
              0.1738059 = queryWeight, product of:
                2.056596 = boost
                4.9436502 = idf(docFreq=843, maxDocs=43556)
                0.01709495 = queryNorm
              0.38622266 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9436502 = idf(docFreq=843, maxDocs=43556)
                0.078125 = fieldNorm(doc=5810)
          0.073871374 = weight(abstract_txt:natural in 5810) [ClassicSimilarity], result of:
            0.073871374 = score(doc=5810,freq=1.0), product of:
              0.18525949 = queryWeight, product of:
                2.1232786 = boost
                5.1039414 = idf(docFreq=718, maxDocs=43556)
                0.01709495 = queryNorm
              0.39874542 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1039414 = idf(docFreq=718, maxDocs=43556)
                0.078125 = fieldNorm(doc=5810)
          0.34386718 = weight(abstract_txt:lexical in 5810) [ClassicSimilarity], result of:
            0.34386718 = score(doc=5810,freq=5.0), product of:
              0.30204165 = queryWeight, product of:
                2.7111287 = boost
                6.517017 = idf(docFreq=174, maxDocs=43556)
                0.01709495 = queryNorm
              1.138476 = fieldWeight in 5810, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.517017 = idf(docFreq=174, maxDocs=43556)
                0.078125 = fieldNorm(doc=5810)
        0.24 = coord(6/25)
    
  4. Sánchez-de-Madariaga, R.; Fernández-del-Castillo, J.R.: ¬The bootstrapping of the Yarowsky algorithm in real corpora (2009) 0.14
    0.14426069 = sum of:
      0.14426069 = product of:
        0.6010862 = sum of:
          0.044762738 = weight(abstract_txt:shows in 4449) [ClassicSimilarity], result of:
            0.044762738 = score(doc=4449,freq=1.0), product of:
              0.09324165 = queryWeight, product of:
                1.06514 = boost
                5.120772 = idf(docFreq=706, maxDocs=43556)
                0.01709495 = queryNorm
              0.48007238 = fieldWeight in 4449, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.120772 = idf(docFreq=706, maxDocs=43556)
                0.09375 = fieldNorm(doc=4449)
          0.081482165 = weight(abstract_txt:acquisition in 4449) [ClassicSimilarity], result of:
            0.081482165 = score(doc=4449,freq=1.0), product of:
              0.13900824 = queryWeight, product of:
                1.3005348 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.01709495 = queryNorm
              0.5861679 = fieldWeight in 4449, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.09375 = fieldNorm(doc=4449)
          0.23627636 = weight(abstract_txt:corpora in 4449) [ClassicSimilarity], result of:
            0.23627636 = score(doc=4449,freq=4.0), product of:
              0.17807066 = queryWeight, product of:
                1.4719665 = boost
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.01709495 = queryNorm
              1.3268685 = fieldWeight in 4449, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.09375 = fieldNorm(doc=4449)
          0.069365956 = weight(abstract_txt:language in 4449) [ClassicSimilarity], result of:
            0.069365956 = score(doc=4449,freq=2.0), product of:
              0.12486185 = queryWeight, product of:
                1.7431374 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.01709495 = queryNorm
              0.55554163 = fieldWeight in 4449, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.09375 = fieldNorm(doc=4449)
          0.08055334 = weight(abstract_txt:processing in 4449) [ClassicSimilarity], result of:
            0.08055334 = score(doc=4449,freq=1.0), product of:
              0.1738059 = queryWeight, product of:
                2.056596 = boost
                4.9436502 = idf(docFreq=843, maxDocs=43556)
                0.01709495 = queryNorm
              0.4634672 = fieldWeight in 4449, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9436502 = idf(docFreq=843, maxDocs=43556)
                0.09375 = fieldNorm(doc=4449)
          0.08864565 = weight(abstract_txt:natural in 4449) [ClassicSimilarity], result of:
            0.08864565 = score(doc=4449,freq=1.0), product of:
              0.18525949 = queryWeight, product of:
                2.1232786 = boost
                5.1039414 = idf(docFreq=718, maxDocs=43556)
                0.01709495 = queryNorm
              0.47849452 = fieldWeight in 4449, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1039414 = idf(docFreq=718, maxDocs=43556)
                0.09375 = fieldNorm(doc=4449)
        0.24 = coord(6/25)
    
  5. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.14
    0.14042263 = sum of:
      0.14042263 = product of:
        0.43882072 = sum of:
          0.024694785 = weight(abstract_txt:practical in 599) [ClassicSimilarity], result of:
            0.024694785 = score(doc=599,freq=1.0), product of:
              0.08218575 = queryWeight, product of:
                4.8076043 = idf(docFreq=966, maxDocs=43556)
                0.01709495 = queryNorm
              0.30047527 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8076043 = idf(docFreq=966, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.05384591 = weight(abstract_txt:languages in 599) [ClassicSimilarity], result of:
            0.05384591 = score(doc=599,freq=3.0), product of:
              0.095819615 = queryWeight, product of:
                1.0797642 = boost
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.01709495 = queryNorm
              0.5619508 = fieldWeight in 599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.034105282 = weight(abstract_txt:words in 599) [ClassicSimilarity], result of:
            0.034105282 = score(doc=599,freq=1.0), product of:
              0.10192345 = queryWeight, product of:
                1.1136246 = boost
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.01709495 = queryNorm
              0.33461663 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.036969222 = weight(abstract_txt:scale in 599) [ClassicSimilarity], result of:
            0.036969222 = score(doc=599,freq=1.0), product of:
              0.10755235 = queryWeight, product of:
                1.1439623 = boost
                5.4997177 = idf(docFreq=483, maxDocs=43556)
                0.01709495 = queryNorm
              0.34373236 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4997177 = idf(docFreq=483, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.111381754 = weight(abstract_txt:corpora in 599) [ClassicSimilarity], result of:
            0.111381754 = score(doc=599,freq=2.0), product of:
              0.17807066 = queryWeight, product of:
                1.4719665 = boost
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.01709495 = queryNorm
              0.62549186 = fieldWeight in 599, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.022098694 = weight(abstract_txt:different in 599) [ClassicSimilarity], result of:
            0.022098694 = score(doc=599,freq=1.0), product of:
              0.09615697 = queryWeight, product of:
                1.529703 = boost
                3.6771033 = idf(docFreq=2994, maxDocs=43556)
                0.01709495 = queryNorm
              0.22981896 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6771033 = idf(docFreq=2994, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.032699425 = weight(abstract_txt:language in 599) [ClassicSimilarity], result of:
            0.032699425 = score(doc=599,freq=1.0), product of:
              0.12486185 = queryWeight, product of:
                1.7431374 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.01709495 = queryNorm
              0.26188484 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.123025656 = weight(abstract_txt:lexical in 599) [ClassicSimilarity], result of:
            0.123025656 = score(doc=599,freq=1.0), product of:
              0.30204165 = queryWeight, product of:
                2.7111287 = boost
                6.517017 = idf(docFreq=174, maxDocs=43556)
                0.01709495 = queryNorm
              0.40731356 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.517017 = idf(docFreq=174, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
        0.32 = coord(8/25)
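
The score trees above all follow the same arithmetic, which can be sketched in a few lines of Python. This is a minimal sketch, assuming the standard ClassicSimilarity definitions (tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1))); the function names `idf`, `term_score` and `document_score` are hypothetical and not part of any library. The inputs are copied from the first result (doc 243). The search engine computes in single precision, so values agree only to a few significant digits.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency, as assumed for ClassicSimilarity.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, boost, idf_value, query_norm, field_norm):
    # One "weight(abstract_txt:<term> in <doc>)" node of the tree.
    tf = math.sqrt(freq)                           # tf(freq=...)
    query_weight = boost * idf_value * query_norm  # queryWeight
    field_weight = tf * idf_value * field_norm     # fieldWeight
    return query_weight * field_weight

def document_score(term_scores, matched, total):
    # Sum of the term scores, scaled by coord(matched/total).
    return sum(term_scores) * (matched / total)

# Term "languages" in doc 243: freq=1, docFreq=658, maxDocs=43556.
languages_243 = term_score(
    freq=1.0,
    boost=1.0797642,
    idf_value=idf(658, 43556),
    query_norm=0.01709495,
    field_norm=0.0546875,
)

# The eight term scores listed for doc 243, combined with coord(8/25).
doc_243_terms = [
    0.027201958, 0.051688064, 0.03234807, 0.019336358,
    0.063978374, 0.04698945, 0.073128924, 0.26368135,
]
doc_243 = document_score(doc_243_terms, matched=8, total=25)

print(languages_243, doc_243)
```

The coord factor explains why a document matching more of the 25 query terms can outrank one with a larger raw sum: result 3 has the highest sum (0.6056745) but matches only 6 terms, so it ends up below results 1 and 2.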