Document (#15823)

Author
Basili, R.
Pazienza, M.T.
Velardi, P.
Title
¬An empirical symbolic approach to natural language processing
Source
Artificial intelligence. 85(1996) nos.1/2, S.59-99
Year
1996
Abstract
Describes and evaluates the results of a large scale lexical learning system, ARISTO-LEX, that uses a combination of probabilisitc and knowledge based methods for the acquisition of selectional restrictions of words in sublanguages. Presents experimental data obtained from different corpora in different doamins and languages, and shows that the acquired lexical data not only have practical applications in natural language processing, but they are useful for a comparative analysis of sublanguages
Theme
Computerlinguistik

Similar documents (content)

  1. Stede, M.: Lexicalization in natural language generation (2002) 0.18
    0.18481874 = sum of:
      0.18481874 = product of:
        0.5775586 = sum of:
          0.027178083 = weight(abstract_txt:languages in 4245) [ClassicSimilarity], result of:
            0.027178083 = score(doc=4245,freq=1.0), product of:
              0.09579016 = queryWeight, product of:
                1.0819511 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.017064886 = queryNorm
              0.2837252 = fieldWeight in 4245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4245)
          0.05170628 = weight(abstract_txt:words in 4245) [ClassicSimilarity], result of:
            0.05170628 = score(doc=4245,freq=3.0), product of:
              0.10197573 = queryWeight, product of:
                1.1163378 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.017064886 = queryNorm
              0.507045 = fieldWeight in 4245, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4245)
          0.031963203 = weight(abstract_txt:scale in 4245) [ClassicSimilarity], result of:
            0.031963203 = score(doc=4245,freq=1.0), product of:
              0.10672722 = queryWeight, product of:
                1.1420492 = boost
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.017064886 = queryNorm
              0.299485 = fieldWeight in 4245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4245)
          0.01916989 = weight(abstract_txt:different in 4245) [ClassicSimilarity], result of:
            0.01916989 = score(doc=4245,freq=1.0), product of:
              0.09563087 = queryWeight, product of:
                1.5288372 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.017064886 = queryNorm
              0.20045713 = fieldWeight in 4245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4245)
          0.063662484 = weight(abstract_txt:language in 4245) [ClassicSimilarity], result of:
            0.063662484 = score(doc=4245,freq=5.0), product of:
              0.12448511 = queryWeight, product of:
                1.7442989 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017064886 = queryNorm
              0.5114064 = fieldWeight in 4245, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4245)
          0.04669263 = weight(abstract_txt:processing in 4245) [ClassicSimilarity], result of:
            0.04669263 = score(doc=4245,freq=1.0), product of:
              0.17312132 = queryWeight, product of:
                2.0570152 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.017064886 = queryNorm
              0.26971045 = fieldWeight in 4245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4245)
          0.072142825 = weight(abstract_txt:natural in 4245) [ClassicSimilarity], result of:
            0.072142825 = score(doc=4245,freq=2.0), product of:
              0.18364134 = queryWeight, product of:
                2.1185925 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.017064886 = queryNorm
              0.39284632 = fieldWeight in 4245, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4245)
          0.2650432 = weight(abstract_txt:lexical in 4245) [ClassicSimilarity], result of:
            0.2650432 = score(doc=4245,freq=6.0), product of:
              0.303165 = queryWeight, product of:
                2.7220852 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.017064886 = queryNorm
              0.874254 = fieldWeight in 4245, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4245)
        0.32 = coord(8/25)
    
  2. Markó, K.G.: Foundation, implementation and evaluation of the MorphoSaurus system (2008) 0.16
    0.1577113 = sum of:
      0.1577113 = product of:
        0.43808693 = sum of:
          0.0434086 = weight(abstract_txt:languages in 4415) [ClassicSimilarity], result of:
            0.0434086 = score(doc=4415,freq=5.0), product of:
              0.09579016 = queryWeight, product of:
                1.0819511 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.017064886 = queryNorm
              0.45316344 = fieldWeight in 4415, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4415)
          0.022830857 = weight(abstract_txt:scale in 4415) [ClassicSimilarity], result of:
            0.022830857 = score(doc=4415,freq=1.0), product of:
              0.10672722 = queryWeight, product of:
                1.1420492 = boost
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.017064886 = queryNorm
              0.21391785 = fieldWeight in 4415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4415)
          0.033732735 = weight(abstract_txt:acquisition in 4415) [ClassicSimilarity], result of:
            0.033732735 = score(doc=4415,freq=1.0), product of:
              0.13845058 = queryWeight, product of:
                1.3007523 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.017064886 = queryNorm
              0.2436446 = fieldWeight in 4415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4415)
          0.050005965 = weight(abstract_txt:acquired in 4415) [ClassicSimilarity], result of:
            0.050005965 = score(doc=4415,freq=1.0), product of:
              0.18000099 = queryWeight, product of:
                1.4831486 = boost
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.017064886 = queryNorm
              0.27780938 = fieldWeight in 4415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4415)
          0.019364512 = weight(abstract_txt:different in 4415) [ClassicSimilarity], result of:
            0.019364512 = score(doc=4415,freq=2.0), product of:
              0.09563087 = queryWeight, product of:
                1.5288372 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.017064886 = queryNorm
              0.20249227 = fieldWeight in 4415, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4415)
          0.04067247 = weight(abstract_txt:language in 4415) [ClassicSimilarity], result of:
            0.04067247 = score(doc=4415,freq=4.0), product of:
              0.12448511 = queryWeight, product of:
                1.7442989 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017064886 = queryNorm
              0.32672557 = fieldWeight in 4415, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4415)
          0.05776715 = weight(abstract_txt:processing in 4415) [ClassicSimilarity], result of:
            0.05776715 = score(doc=4415,freq=3.0), product of:
              0.17312132 = queryWeight, product of:
                2.0570152 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.017064886 = queryNorm
              0.33368015 = fieldWeight in 4415, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4415)
          0.03643763 = weight(abstract_txt:natural in 4415) [ClassicSimilarity], result of:
            0.03643763 = score(doc=4415,freq=1.0), product of:
              0.18364134 = queryWeight, product of:
                2.1185925 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.017064886 = queryNorm
              0.19841737 = fieldWeight in 4415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4415)
          0.13386703 = weight(abstract_txt:lexical in 4415) [ClassicSimilarity], result of:
            0.13386703 = score(doc=4415,freq=3.0), product of:
              0.303165 = queryWeight, product of:
                2.7220852 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.017064886 = queryNorm
              0.44156492 = fieldWeight in 4415, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4415)
        0.36 = coord(9/25)
    
  3. Conlon, S.P.N.; Evens, M.; Ahlswede, T.: Developing a large lexical database for information retrieval, parsing, and text generation systems (1993) 0.15
    0.14535724 = sum of:
      0.14535724 = product of:
        0.60565513 = sum of:
          0.03711388 = weight(abstract_txt:shows in 5813) [ClassicSimilarity], result of:
            0.03711388 = score(doc=5813,freq=1.0), product of:
              0.09295326 = queryWeight, product of:
                1.0658094 = boost
                5.1107154 = idf(docFreq=724, maxDocs=44218)
                0.017064886 = queryNorm
              0.39927465 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1107154 = idf(docFreq=724, maxDocs=44218)
                0.078125 = fieldNorm(doc=5813)
          0.042646624 = weight(abstract_txt:words in 5813) [ClassicSimilarity], result of:
            0.042646624 = score(doc=5813,freq=1.0), product of:
              0.10197573 = queryWeight, product of:
                1.1163378 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.017064886 = queryNorm
              0.41820365 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.078125 = fieldNorm(doc=5813)
          0.04067247 = weight(abstract_txt:language in 5813) [ClassicSimilarity], result of:
            0.04067247 = score(doc=5813,freq=1.0), product of:
              0.12448511 = queryWeight, product of:
                1.7442989 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017064886 = queryNorm
              0.32672557 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=5813)
          0.06670375 = weight(abstract_txt:processing in 5813) [ClassicSimilarity], result of:
            0.06670375 = score(doc=5813,freq=1.0), product of:
              0.17312132 = queryWeight, product of:
                2.0570152 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.017064886 = queryNorm
              0.38530064 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=5813)
          0.07287526 = weight(abstract_txt:natural in 5813) [ClassicSimilarity], result of:
            0.07287526 = score(doc=5813,freq=1.0), product of:
              0.18364134 = queryWeight, product of:
                2.1185925 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.017064886 = queryNorm
              0.39683473 = fieldWeight in 5813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.078125 = fieldNorm(doc=5813)
          0.34564316 = weight(abstract_txt:lexical in 5813) [ClassicSimilarity], result of:
            0.34564316 = score(doc=5813,freq=5.0), product of:
              0.303165 = queryWeight, product of:
                2.7220852 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.017064886 = queryNorm
              1.1401157 = fieldWeight in 5813, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.078125 = fieldNorm(doc=5813)
        0.24 = coord(6/25)
    
  4. Sánchez-de-Madariaga, R.; Fernández-del-Castillo, J.R.: ¬The bootstrapping of the Yarowsky algorithm in real corpora (2009) 0.14
    0.14238134 = sum of:
      0.14238134 = product of:
        0.5932556 = sum of:
          0.044536654 = weight(abstract_txt:shows in 2451) [ClassicSimilarity], result of:
            0.044536654 = score(doc=2451,freq=1.0), product of:
              0.09295326 = queryWeight, product of:
                1.0658094 = boost
                5.1107154 = idf(docFreq=724, maxDocs=44218)
                0.017064886 = queryNorm
              0.47912955 = fieldWeight in 2451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1107154 = idf(docFreq=724, maxDocs=44218)
                0.09375 = fieldNorm(doc=2451)
          0.08095857 = weight(abstract_txt:acquisition in 2451) [ClassicSimilarity], result of:
            0.08095857 = score(doc=2451,freq=1.0), product of:
              0.13845058 = queryWeight, product of:
                1.3007523 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.017064886 = queryNorm
              0.5847471 = fieldWeight in 2451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.09375 = fieldNorm(doc=2451)
          0.23124205 = weight(abstract_txt:corpora in 2451) [ClassicSimilarity], result of:
            0.23124205 = score(doc=2451,freq=4.0), product of:
              0.17558096 = queryWeight, product of:
                1.4648256 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.017064886 = queryNorm
              1.3170109 = fieldWeight in 2451, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.09375 = fieldNorm(doc=2451)
          0.069023475 = weight(abstract_txt:language in 2451) [ClassicSimilarity], result of:
            0.069023475 = score(doc=2451,freq=2.0), product of:
              0.12448511 = queryWeight, product of:
                1.7442989 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017064886 = queryNorm
              0.55447173 = fieldWeight in 2451, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.09375 = fieldNorm(doc=2451)
          0.0800445 = weight(abstract_txt:processing in 2451) [ClassicSimilarity], result of:
            0.0800445 = score(doc=2451,freq=1.0), product of:
              0.17312132 = queryWeight, product of:
                2.0570152 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.017064886 = queryNorm
              0.46236074 = fieldWeight in 2451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.09375 = fieldNorm(doc=2451)
          0.08745031 = weight(abstract_txt:natural in 2451) [ClassicSimilarity], result of:
            0.08745031 = score(doc=2451,freq=1.0), product of:
              0.18364134 = queryWeight, product of:
                2.1185925 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.017064886 = queryNorm
              0.47620165 = fieldWeight in 2451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.09375 = fieldNorm(doc=2451)
        0.24 = coord(6/25)
    
  5. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.14
    0.13954724 = sum of:
      0.13954724 = product of:
        0.43608513 = sum of:
          0.0245238 = weight(abstract_txt:practical in 5601) [ClassicSimilarity], result of:
            0.0245238 = score(doc=5601,freq=1.0), product of:
              0.08182868 = queryWeight, product of:
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.017064886 = queryNorm
              0.29969686 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.053798653 = weight(abstract_txt:languages in 5601) [ClassicSimilarity], result of:
            0.053798653 = score(doc=5601,freq=3.0), product of:
              0.09579016 = queryWeight, product of:
                1.0819511 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.017064886 = queryNorm
              0.56163025 = fieldWeight in 5601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.0341173 = weight(abstract_txt:words in 5601) [ClassicSimilarity], result of:
            0.0341173 = score(doc=5601,freq=1.0), product of:
              0.10197573 = queryWeight, product of:
                1.1163378 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.017064886 = queryNorm
              0.33456293 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.036529373 = weight(abstract_txt:scale in 5601) [ClassicSimilarity], result of:
            0.036529373 = score(doc=5601,freq=1.0), product of:
              0.10672722 = queryWeight, product of:
                1.1420492 = boost
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.017064886 = queryNorm
              0.34226856 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.10900854 = weight(abstract_txt:corpora in 5601) [ClassicSimilarity], result of:
            0.10900854 = score(doc=5601,freq=2.0), product of:
              0.17558096 = queryWeight, product of:
                1.4648256 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.017064886 = queryNorm
              0.6208449 = fieldWeight in 5601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.021908445 = weight(abstract_txt:different in 5601) [ClassicSimilarity], result of:
            0.021908445 = score(doc=5601,freq=1.0), product of:
              0.09563087 = queryWeight, product of:
                1.5288372 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.017064886 = queryNorm
              0.22909386 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.03253798 = weight(abstract_txt:language in 5601) [ClassicSimilarity], result of:
            0.03253798 = score(doc=5601,freq=1.0), product of:
              0.12448511 = queryWeight, product of:
                1.7442989 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017064886 = queryNorm
              0.26138046 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.12366106 = weight(abstract_txt:lexical in 5601) [ClassicSimilarity], result of:
            0.12366106 = score(doc=5601,freq=1.0), product of:
              0.303165 = queryWeight, product of:
                2.7220852 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.017064886 = queryNorm
              0.4079002 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
        0.32 = coord(8/25)