Document (#15823)

Author
Basili, R.
Pazienza, M.T.
Velardi, P.
Title
¬An empirical symbolic approach to natural language processing
Source
Artificial intelligence. 85(1996) nos.1/2, S.59-99
Year
1996
Abstract
Describes and evaluates the results of a large scale lexical learning system, ARISTO-LEX, that uses a combination of probabilisitc and knowledge based methods for the acquisition of selectional restrictions of words in sublanguages. Presents experimental data obtained from different corpora in different doamins and languages, and shows that the acquired lexical data not only have practical applications in natural language processing, but they are useful for a comparative analysis of sublanguages
Theme
Computerlinguistik

Similar documents (content)

  1. Stede, M.: Lexicalization in natural language generation (2002) 0.19
    0.18586189 = sum of:
      0.18586189 = product of:
        0.5808184 = sum of:
          0.02721164 = weight(abstract_txt:languages in 246) [ClassicSimilarity], result of:
            0.02721164 = score(doc=246,freq=1.0), product of:
              0.09584157 = queryWeight, product of:
                1.0786606 = boost
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.01711419 = queryNorm
              0.28392315 = fieldWeight in 246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.0546875 = fieldNorm(doc=246)
          0.051692907 = weight(abstract_txt:words in 246) [ClassicSimilarity], result of:
            0.051692907 = score(doc=246,freq=3.0), product of:
              0.10192897 = queryWeight, product of:
                1.112389 = boost
                5.354077 = idf(docFreq=555, maxDocs=43254)
                0.01711419 = queryNorm
              0.50714636 = fieldWeight in 246, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.354077 = idf(docFreq=555, maxDocs=43254)
                0.0546875 = fieldNorm(doc=246)
          0.032407776 = weight(abstract_txt:scale in 246) [ClassicSimilarity], result of:
            0.032407776 = score(doc=246,freq=1.0), product of:
              0.10768376 = queryWeight, product of:
                1.1433599 = boost
                5.5031443 = idf(docFreq=478, maxDocs=43254)
                0.01711419 = queryNorm
              0.3009532 = fieldWeight in 246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5031443 = idf(docFreq=478, maxDocs=43254)
                0.0546875 = fieldNorm(doc=246)
          0.019417003 = weight(abstract_txt:different in 246) [ClassicSimilarity], result of:
            0.019417003 = score(doc=246,freq=1.0), product of:
              0.09642335 = queryWeight, product of:
                1.5300794 = boost
                3.6822383 = idf(docFreq=2958, maxDocs=43254)
                0.01711419 = queryNorm
              0.20137241 = fieldWeight in 246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6822383 = idf(docFreq=2958, maxDocs=43254)
                0.0546875 = fieldNorm(doc=246)
          0.06406871 = weight(abstract_txt:language in 246) [ClassicSimilarity], result of:
            0.06406871 = score(doc=246,freq=5.0), product of:
              0.12497835 = queryWeight, product of:
                1.7419683 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.01711419 = queryNorm
              0.5126385 = fieldWeight in 246, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0546875 = fieldNorm(doc=246)
          0.04712991 = weight(abstract_txt:processing in 246) [ClassicSimilarity], result of:
            0.04712991 = score(doc=246,freq=1.0), product of:
              0.17415068 = queryWeight, product of:
                2.0562952 = boost
                4.9486117 = idf(docFreq=833, maxDocs=43254)
                0.01711419 = queryNorm
              0.2706272 = fieldWeight in 246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9486117 = idf(docFreq=833, maxDocs=43254)
                0.0546875 = fieldNorm(doc=246)
          0.073248655 = weight(abstract_txt:natural in 246) [ClassicSimilarity], result of:
            0.073248655 = score(doc=246,freq=2.0), product of:
              0.18546012 = queryWeight, product of:
                2.1220133 = boost
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.01711419 = queryNorm
              0.39495635 = fieldWeight in 246, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.0546875 = fieldNorm(doc=246)
          0.26564178 = weight(abstract_txt:lexical in 246) [ClassicSimilarity], result of:
            0.26564178 = score(doc=246,freq=6.0), product of:
              0.3035344 = queryWeight, product of:
                2.714731 = boost
                6.5331817 = idf(docFreq=170, maxDocs=43254)
                0.01711419 = queryNorm
              0.875162 = fieldWeight in 246, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.5331817 = idf(docFreq=170, maxDocs=43254)
                0.0546875 = fieldNorm(doc=246)
        0.32 = coord(8/25)
    
  2. Markó, K.G.: Foundation, implementation and evaluation of the MorphoSaurus system (2008) 0.16
    0.1589465 = sum of:
      0.1589465 = product of:
        0.44151804 = sum of:
          0.0434622 = weight(abstract_txt:languages in 880) [ClassicSimilarity], result of:
            0.0434622 = score(doc=880,freq=5.0), product of:
              0.09584157 = queryWeight, product of:
                1.0786606 = boost
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.01711419 = queryNorm
              0.45347962 = fieldWeight in 880, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.0390625 = fieldNorm(doc=880)
          0.023148408 = weight(abstract_txt:scale in 880) [ClassicSimilarity], result of:
            0.023148408 = score(doc=880,freq=1.0), product of:
              0.10768376 = queryWeight, product of:
                1.1433599 = boost
                5.5031443 = idf(docFreq=478, maxDocs=43254)
                0.01711419 = queryNorm
              0.21496657 = fieldWeight in 880, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5031443 = idf(docFreq=478, maxDocs=43254)
                0.0390625 = fieldNorm(doc=880)
          0.034052595 = weight(abstract_txt:acquisition in 880) [ClassicSimilarity], result of:
            0.034052595 = score(doc=880,freq=1.0), product of:
              0.13928455 = queryWeight, product of:
                1.3003472 = boost
                6.2587447 = idf(docFreq=224, maxDocs=43254)
                0.01711419 = queryNorm
              0.24448222 = fieldWeight in 880, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2587447 = idf(docFreq=224, maxDocs=43254)
                0.0390625 = fieldNorm(doc=880)
          0.050835073 = weight(abstract_txt:acquired in 880) [ClassicSimilarity], result of:
            0.050835073 = score(doc=880,freq=1.0), product of:
              0.18193312 = queryWeight, product of:
                1.4861537 = boost
                7.1530566 = idf(docFreq=91, maxDocs=43254)
                0.01711419 = queryNorm
              0.27941626 = fieldWeight in 880, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1530566 = idf(docFreq=91, maxDocs=43254)
                0.0390625 = fieldNorm(doc=880)
          0.019614134 = weight(abstract_txt:different in 880) [ClassicSimilarity], result of:
            0.019614134 = score(doc=880,freq=2.0), product of:
              0.09642335 = queryWeight, product of:
                1.5300794 = boost
                3.6822383 = idf(docFreq=2958, maxDocs=43254)
                0.01711419 = queryNorm
              0.20341685 = fieldWeight in 880, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6822383 = idf(docFreq=2958, maxDocs=43254)
                0.0390625 = fieldNorm(doc=880)
          0.040932 = weight(abstract_txt:language in 880) [ClassicSimilarity], result of:
            0.040932 = score(doc=880,freq=4.0), product of:
              0.12497835 = queryWeight, product of:
                1.7419683 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.01711419 = queryNorm
              0.32751274 = fieldWeight in 880, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0390625 = fieldNorm(doc=880)
          0.058308143 = weight(abstract_txt:processing in 880) [ClassicSimilarity], result of:
            0.058308143 = score(doc=880,freq=3.0), product of:
              0.17415068 = queryWeight, product of:
                2.0562952 = boost
                4.9486117 = idf(docFreq=833, maxDocs=43254)
                0.01711419 = queryNorm
              0.33481434 = fieldWeight in 880, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9486117 = idf(docFreq=833, maxDocs=43254)
                0.0390625 = fieldNorm(doc=880)
          0.03699616 = weight(abstract_txt:natural in 880) [ClassicSimilarity], result of:
            0.03699616 = score(doc=880,freq=1.0), product of:
              0.18546012 = queryWeight, product of:
                2.1220133 = boost
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.01711419 = queryNorm
              0.1994831 = fieldWeight in 880, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.0390625 = fieldNorm(doc=880)
          0.13416934 = weight(abstract_txt:lexical in 880) [ClassicSimilarity], result of:
            0.13416934 = score(doc=880,freq=3.0), product of:
              0.3035344 = queryWeight, product of:
                2.714731 = boost
                6.5331817 = idf(docFreq=170, maxDocs=43254)
                0.01711419 = queryNorm
              0.44202355 = fieldWeight in 880, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5331817 = idf(docFreq=170, maxDocs=43254)
                0.0390625 = fieldNorm(doc=880)
        0.36 = coord(9/25)
    
  3. Conlon, S.P.N.; Evens, M.; Ahlswede, T.: Developing a large lexical database for information retrieval, parsing, and text generation systems (1993) 0.15
    0.14609043 = sum of:
      0.14609043 = product of:
        0.60871017 = sum of:
          0.03739809 = weight(abstract_txt:shows in 5813) [ClassicSimilarity], result of:
            0.03739809 = score(doc=5813,freq=1.0), product of:
              0.09340047 = queryWeight, product of:
                1.0648352 = boost
                5.125194 = idf(docFreq=698, maxDocs=43254)
                0.01711419 = queryNorm
              0.4004058 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.125194 = idf(docFreq=698, maxDocs=43254)
                0.078125 = fieldNorm(doc=5813)
          0.04263559 = weight(abstract_txt:words in 5813) [ClassicSimilarity], result of:
            0.04263559 = score(doc=5813,freq=1.0), product of:
              0.10192897 = queryWeight, product of:
                1.112389 = boost
                5.354077 = idf(docFreq=555, maxDocs=43254)
                0.01711419 = queryNorm
              0.41828725 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.354077 = idf(docFreq=555, maxDocs=43254)
                0.078125 = fieldNorm(doc=5813)
          0.040932 = weight(abstract_txt:language in 5813) [ClassicSimilarity], result of:
            0.040932 = score(doc=5813,freq=1.0), product of:
              0.12497835 = queryWeight, product of:
                1.7419683 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.01711419 = queryNorm
              0.32751274 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.078125 = fieldNorm(doc=5813)
          0.067328446 = weight(abstract_txt:processing in 5813) [ClassicSimilarity], result of:
            0.067328446 = score(doc=5813,freq=1.0), product of:
              0.17415068 = queryWeight, product of:
                2.0562952 = boost
                4.9486117 = idf(docFreq=833, maxDocs=43254)
                0.01711419 = queryNorm
              0.3866103 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9486117 = idf(docFreq=833, maxDocs=43254)
                0.078125 = fieldNorm(doc=5813)
          0.07399232 = weight(abstract_txt:natural in 5813) [ClassicSimilarity], result of:
            0.07399232 = score(doc=5813,freq=1.0), product of:
              0.18546012 = queryWeight, product of:
                2.1220133 = boost
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.01711419 = queryNorm
              0.3989662 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.078125 = fieldNorm(doc=5813)
          0.34642377 = weight(abstract_txt:lexical in 5813) [ClassicSimilarity], result of:
            0.34642377 = score(doc=5813,freq=5.0), product of:
              0.3035344 = queryWeight, product of:
                2.714731 = boost
                6.5331817 = idf(docFreq=170, maxDocs=43254)
                0.01711419 = queryNorm
              1.1413 = fieldWeight in 5813, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.5331817 = idf(docFreq=170, maxDocs=43254)
                0.078125 = fieldNorm(doc=5813)
        0.24 = coord(6/25)
    
  4. Sánchez-de-Madariaga, R.; Fernández-del-Castillo, J.R.: ¬The bootstrapping of the Yarowsky algorithm in real corpora (2009) 0.15
    0.14502843 = sum of:
      0.14502843 = product of:
        0.6042851 = sum of:
          0.044877704 = weight(abstract_txt:shows in 4452) [ClassicSimilarity], result of:
            0.044877704 = score(doc=4452,freq=1.0), product of:
              0.09340047 = queryWeight, product of:
                1.0648352 = boost
                5.125194 = idf(docFreq=698, maxDocs=43254)
                0.01711419 = queryNorm
              0.48048693 = fieldWeight in 4452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.125194 = idf(docFreq=698, maxDocs=43254)
                0.09375 = fieldNorm(doc=4452)
          0.08172623 = weight(abstract_txt:acquisition in 4452) [ClassicSimilarity], result of:
            0.08172623 = score(doc=4452,freq=1.0), product of:
              0.13928455 = queryWeight, product of:
                1.3003472 = boost
                6.2587447 = idf(docFreq=224, maxDocs=43254)
                0.01711419 = queryNorm
              0.5867573 = fieldWeight in 4452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2587447 = idf(docFreq=224, maxDocs=43254)
                0.09375 = fieldNorm(doc=4452)
          0.23863234 = weight(abstract_txt:corpora in 4452) [ClassicSimilarity], result of:
            0.23863234 = score(doc=4452,freq=4.0), product of:
              0.17925096 = queryWeight, product of:
                1.4751582 = boost
                7.100134 = idf(docFreq=96, maxDocs=43254)
                0.01711419 = queryNorm
              1.3312751 = fieldWeight in 4452, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.100134 = idf(docFreq=96, maxDocs=43254)
                0.09375 = fieldNorm(doc=4452)
          0.06946391 = weight(abstract_txt:language in 4452) [ClassicSimilarity], result of:
            0.06946391 = score(doc=4452,freq=2.0), product of:
              0.12497835 = queryWeight, product of:
                1.7419683 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.01711419 = queryNorm
              0.55580753 = fieldWeight in 4452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.09375 = fieldNorm(doc=4452)
          0.08079413 = weight(abstract_txt:processing in 4452) [ClassicSimilarity], result of:
            0.08079413 = score(doc=4452,freq=1.0), product of:
              0.17415068 = queryWeight, product of:
                2.0562952 = boost
                4.9486117 = idf(docFreq=833, maxDocs=43254)
                0.01711419 = queryNorm
              0.46393234 = fieldWeight in 4452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9486117 = idf(docFreq=833, maxDocs=43254)
                0.09375 = fieldNorm(doc=4452)
          0.088790774 = weight(abstract_txt:natural in 4452) [ClassicSimilarity], result of:
            0.088790774 = score(doc=4452,freq=1.0), product of:
              0.18546012 = queryWeight, product of:
                2.1220133 = boost
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.01711419 = queryNorm
              0.4787594 = fieldWeight in 4452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.09375 = fieldNorm(doc=4452)
        0.24 = coord(6/25)
    
  5. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.14
    0.14117108 = sum of:
      0.14117108 = product of:
        0.44115967 = sum of:
          0.024779484 = weight(abstract_txt:practical in 602) [ClassicSimilarity], result of:
            0.024779484 = score(doc=602,freq=1.0), product of:
              0.08237289 = queryWeight, product of:
                4.8131337 = idf(docFreq=954, maxDocs=43254)
                0.01711419 = queryNorm
              0.30082086 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8131337 = idf(docFreq=954, maxDocs=43254)
                0.0625 = fieldNorm(doc=602)
          0.05386508 = weight(abstract_txt:languages in 602) [ClassicSimilarity], result of:
            0.05386508 = score(doc=602,freq=3.0), product of:
              0.09584157 = queryWeight, product of:
                1.0786606 = boost
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.01711419 = queryNorm
              0.5620221 = fieldWeight in 602, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.0625 = fieldNorm(doc=602)
          0.03410847 = weight(abstract_txt:words in 602) [ClassicSimilarity], result of:
            0.03410847 = score(doc=602,freq=1.0), product of:
              0.10192897 = queryWeight, product of:
                1.112389 = boost
                5.354077 = idf(docFreq=555, maxDocs=43254)
                0.01711419 = queryNorm
              0.3346298 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.354077 = idf(docFreq=555, maxDocs=43254)
                0.0625 = fieldNorm(doc=602)
          0.037037455 = weight(abstract_txt:scale in 602) [ClassicSimilarity], result of:
            0.037037455 = score(doc=602,freq=1.0), product of:
              0.10768376 = queryWeight, product of:
                1.1433599 = boost
                5.5031443 = idf(docFreq=478, maxDocs=43254)
                0.01711419 = queryNorm
              0.34394652 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5031443 = idf(docFreq=478, maxDocs=43254)
                0.0625 = fieldNorm(doc=602)
          0.11249236 = weight(abstract_txt:corpora in 602) [ClassicSimilarity], result of:
            0.11249236 = score(doc=602,freq=2.0), product of:
              0.17925096 = queryWeight, product of:
                1.4751582 = boost
                7.100134 = idf(docFreq=96, maxDocs=43254)
                0.01711419 = queryNorm
              0.6275691 = fieldWeight in 602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.100134 = idf(docFreq=96, maxDocs=43254)
                0.0625 = fieldNorm(doc=602)
          0.02219086 = weight(abstract_txt:different in 602) [ClassicSimilarity], result of:
            0.02219086 = score(doc=602,freq=1.0), product of:
              0.09642335 = queryWeight, product of:
                1.5300794 = boost
                3.6822383 = idf(docFreq=2958, maxDocs=43254)
                0.01711419 = queryNorm
              0.2301399 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6822383 = idf(docFreq=2958, maxDocs=43254)
                0.0625 = fieldNorm(doc=602)
          0.0327456 = weight(abstract_txt:language in 602) [ClassicSimilarity], result of:
            0.0327456 = score(doc=602,freq=1.0), product of:
              0.12497835 = queryWeight, product of:
                1.7419683 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.01711419 = queryNorm
              0.2620102 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0625 = fieldNorm(doc=602)
          0.12394033 = weight(abstract_txt:lexical in 602) [ClassicSimilarity], result of:
            0.12394033 = score(doc=602,freq=1.0), product of:
              0.3035344 = queryWeight, product of:
                2.714731 = boost
                6.5331817 = idf(docFreq=170, maxDocs=43254)
                0.01711419 = queryNorm
              0.40832385 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5331817 = idf(docFreq=170, maxDocs=43254)
                0.0625 = fieldNorm(doc=602)
        0.32 = coord(8/25)