Document (#16477)

Author
Lonsdale, D.
Mitamura, T.
Nyberg, E.
Title
Acquisition of large lexicons for practical knowledge-based MT
Source
Machine translation. 9(1994/95) nos.3/4, S.251-283
Year
1994/95
Abstract
Although knowledge based MT systems have the potential to achieve high translation accuracy, each successful application system requires a large amount of hand coded lexical knowledge. Systems like KBMT-89 and its descendants have demonstarted how knowledge based translation can produce good results in technical domains with tractable domain semantics. Nevertheless, the magnitude of the development task for large scale applications with 10s of 1000s of of domain concepts precludes a purely hand crafted approach. The current challenge for the next generation of knowledge based MT systems is to utilize online textual resources and corpus analysis software in order to automate the most laborious aspects of the knowledge acquisition process. This partial automation can in turn maximize the productivity of human knowledge engineers and help to make large scale applications of knowledge based MT an viable approach. Discusses the corpus based knowledge acquisition methodology used in KANT, a knowledge based translation system for multilingual document production. This methodology can be generalized beyond the KANT interlinhua approach for use with any system that requires similar kinds of knowledge
Theme
Computerlinguistik
Multilinguale Probleme
Object
KBMT-89

Similar documents (content)

  1. Knight, K.: Automatic knowledge acquisition for machine translation (1997) 0.25
    0.2520476 = sum of:
      0.2520476 = product of:
        1.2602379 = sum of:
          0.23790035 = weight(abstract_txt:automate in 4246) [ClassicSimilarity], result of:
            0.23790035 = score(doc=4246,freq=1.0), product of:
              0.15923257 = queryWeight, product of:
                1.045513 = boost
                7.9682307 = idf(docFreq=40, maxDocs=43556)
                0.019113516 = queryNorm
              1.4940432 = fieldWeight in 4246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9682307 = idf(docFreq=40, maxDocs=43556)
                0.1875 = fieldNorm(doc=4246)
          0.3330511 = weight(abstract_txt:translation in 4246) [ClassicSimilarity], result of:
            0.3330511 = score(doc=4246,freq=1.0), product of:
              0.2873974 = queryWeight, product of:
                2.432851 = boost
                6.1805444 = idf(docFreq=244, maxDocs=43556)
                0.019113516 = queryNorm
              1.1588521 = fieldWeight in 4246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1805444 = idf(docFreq=244, maxDocs=43556)
                0.1875 = fieldNorm(doc=4246)
          0.3448124 = weight(abstract_txt:acquisition in 4246) [ClassicSimilarity], result of:
            0.3448124 = score(doc=4246,freq=1.0), product of:
              0.29412428 = queryWeight, product of:
                2.4611583 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.019113516 = queryNorm
              1.1723357 = fieldWeight in 4246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.1875 = fieldNorm(doc=4246)
          0.1078413 = weight(abstract_txt:based in 4246) [ClassicSimilarity], result of:
            0.1078413 = score(doc=4246,freq=1.0), product of:
              0.17974547 = queryWeight, product of:
                2.938945 = boost
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.019113516 = queryNorm
              0.5999667 = fieldWeight in 4246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.1875 = fieldNorm(doc=4246)
          0.23663285 = weight(abstract_txt:knowledge in 4246) [ClassicSimilarity], result of:
            0.23663285 = score(doc=4246,freq=1.0), product of:
              0.35287112 = queryWeight, product of:
                5.1619983 = boost
                3.5764952 = idf(docFreq=3311, maxDocs=43556)
                0.019113516 = queryNorm
              0.67059284 = fieldWeight in 4246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5764952 = idf(docFreq=3311, maxDocs=43556)
                0.1875 = fieldNorm(doc=4246)
        0.2 = coord(5/25)
    
  2. Dorr, B.J.: Large-scale dictionary construction for foreign language tutoring and interlingual machine translation (1997) 0.23
    0.22998545 = sum of:
      0.22998545 = product of:
        0.7187045 = sum of:
          0.13313031 = weight(abstract_txt:lexicons in 4242) [ClassicSimilarity], result of:
            0.13313031 = score(doc=4242,freq=1.0), product of:
              0.19383283 = queryWeight, product of:
                1.1535254 = boost
                8.791431 = idf(docFreq=17, maxDocs=43556)
                0.019113516 = queryNorm
              0.6868306 = fieldWeight in 4242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.791431 = idf(docFreq=17, maxDocs=43556)
                0.078125 = fieldNorm(doc=4242)
          0.023422599 = weight(abstract_txt:systems in 4242) [ClassicSimilarity], result of:
            0.023422599 = score(doc=4242,freq=1.0), product of:
              0.08777547 = queryWeight, product of:
                1.3444996 = boost
                3.4156382 = idf(docFreq=3889, maxDocs=43556)
                0.019113516 = queryNorm
              0.26684675 = fieldWeight in 4242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4156382 = idf(docFreq=3889, maxDocs=43556)
                0.078125 = fieldNorm(doc=4242)
          0.09218587 = weight(abstract_txt:scale in 4242) [ClassicSimilarity], result of:
            0.09218587 = score(doc=4242,freq=2.0), product of:
              0.15171164 = queryWeight, product of:
                1.4432379 = boost
                5.4997177 = idf(docFreq=483, maxDocs=43556)
                0.019113516 = queryNorm
              0.6076387 = fieldWeight in 4242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4997177 = idf(docFreq=483, maxDocs=43556)
                0.078125 = fieldNorm(doc=4242)
          0.04406918 = weight(abstract_txt:approach in 4242) [ClassicSimilarity], result of:
            0.04406918 = score(doc=4242,freq=2.0), product of:
              0.10617683 = queryWeight, product of:
                1.4787303 = boost
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.019113516 = queryNorm
              0.41505456 = fieldWeight in 4242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.078125 = fieldNorm(doc=4242)
          0.098519586 = weight(abstract_txt:large in 4242) [ClassicSimilarity], result of:
            0.098519586 = score(doc=4242,freq=2.0), product of:
              0.19980258 = queryWeight, product of:
                2.342308 = boost
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.019113516 = queryNorm
              0.49308467 = fieldWeight in 4242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.078125 = fieldNorm(doc=4242)
          0.13877128 = weight(abstract_txt:translation in 4242) [ClassicSimilarity], result of:
            0.13877128 = score(doc=4242,freq=1.0), product of:
              0.2873974 = queryWeight, product of:
                2.432851 = boost
                6.1805444 = idf(docFreq=244, maxDocs=43556)
                0.019113516 = queryNorm
              0.48285502 = fieldWeight in 4242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1805444 = idf(docFreq=244, maxDocs=43556)
                0.078125 = fieldNorm(doc=4242)
          0.14367183 = weight(abstract_txt:acquisition in 4242) [ClassicSimilarity], result of:
            0.14367183 = score(doc=4242,freq=1.0), product of:
              0.29412428 = queryWeight, product of:
                2.4611583 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.019113516 = queryNorm
              0.4884732 = fieldWeight in 4242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.078125 = fieldNorm(doc=4242)
          0.044933874 = weight(abstract_txt:based in 4242) [ClassicSimilarity], result of:
            0.044933874 = score(doc=4242,freq=1.0), product of:
              0.17974547 = queryWeight, product of:
                2.938945 = boost
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.019113516 = queryNorm
              0.24998613 = fieldWeight in 4242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.078125 = fieldNorm(doc=4242)
        0.32 = coord(8/25)
    
  3. Li, L.X.; Xu, L.D.: Knowledge-based problem solving (2002) 0.22
    0.21867989 = sum of:
      0.21867989 = product of:
        0.6074441 = sum of:
          0.060714662 = weight(abstract_txt:coded in 257) [ClassicSimilarity], result of:
            0.060714662 = score(doc=257,freq=1.0), product of:
              0.14567098 = queryWeight, product of:
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.019113516 = queryNorm
              0.4167931 = fieldWeight in 257, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.0546875 = fieldNorm(doc=257)
          0.029277094 = weight(abstract_txt:applications in 257) [ClassicSimilarity], result of:
            0.029277094 = score(doc=257,freq=1.0), product of:
              0.11285981 = queryWeight, product of:
                1.2447958 = boost
                4.7435184 = idf(docFreq=1030, maxDocs=43556)
                0.019113516 = queryNorm
              0.25941116 = fieldWeight in 257, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7435184 = idf(docFreq=1030, maxDocs=43556)
                0.0546875 = fieldNorm(doc=257)
          0.05114914 = weight(abstract_txt:domain in 257) [ClassicSimilarity], result of:
            0.05114914 = score(doc=257,freq=3.0), product of:
              0.11351131 = queryWeight, product of:
                1.2483835 = boost
                4.75719 = idf(docFreq=1016, maxDocs=43556)
                0.019113516 = queryNorm
              0.4506083 = fieldWeight in 257, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.75719 = idf(docFreq=1016, maxDocs=43556)
                0.0546875 = fieldNorm(doc=257)
          0.015694637 = weight(abstract_txt:system in 257) [ClassicSimilarity], result of:
            0.015694637 = score(doc=257,freq=1.0), product of:
              0.08525475 = queryWeight, product of:
                1.3250535 = boost
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.019113516 = queryNorm
              0.18409105 = fieldWeight in 257, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.0546875 = fieldNorm(doc=257)
          0.032791637 = weight(abstract_txt:systems in 257) [ClassicSimilarity], result of:
            0.032791637 = score(doc=257,freq=4.0), product of:
              0.08777547 = queryWeight, product of:
                1.3444996 = boost
                3.4156382 = idf(docFreq=3889, maxDocs=43556)
                0.019113516 = queryNorm
              0.37358543 = fieldWeight in 257, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4156382 = idf(docFreq=3889, maxDocs=43556)
                0.0546875 = fieldNorm(doc=257)
          0.021813132 = weight(abstract_txt:approach in 257) [ClassicSimilarity], result of:
            0.021813132 = score(doc=257,freq=1.0), product of:
              0.10617683 = queryWeight, product of:
                1.4787303 = boost
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.019113516 = queryNorm
              0.20544153 = fieldWeight in 257, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.0546875 = fieldNorm(doc=257)
          0.05279506 = weight(abstract_txt:requires in 257) [ClassicSimilarity], result of:
            0.05279506 = score(doc=257,freq=1.0), product of:
              0.1672051 = queryWeight, product of:
                1.5151416 = boost
                5.77372 = idf(docFreq=367, maxDocs=43556)
                0.019113516 = queryNorm
              0.3157503 = fieldWeight in 257, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.77372 = idf(docFreq=367, maxDocs=43556)
                0.0546875 = fieldNorm(doc=257)
          0.094361134 = weight(abstract_txt:based in 257) [ClassicSimilarity], result of:
            0.094361134 = score(doc=257,freq=9.0), product of:
              0.17974547 = queryWeight, product of:
                2.938945 = boost
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.019113516 = queryNorm
              0.5249709 = fieldWeight in 257, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.0546875 = fieldNorm(doc=257)
          0.24884765 = weight(abstract_txt:knowledge in 257) [ClassicSimilarity], result of:
            0.24884765 = score(doc=257,freq=13.0), product of:
              0.35287112 = queryWeight, product of:
                5.1619983 = boost
                3.5764952 = idf(docFreq=3311, maxDocs=43556)
                0.019113516 = queryNorm
              0.7052083 = fieldWeight in 257, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                3.5764952 = idf(docFreq=3311, maxDocs=43556)
                0.0546875 = fieldNorm(doc=257)
        0.36 = coord(9/25)
    
  4. Xu, Y.; Li, G.; Mou, L.; Lu, Y.: Learning non-taxonomic relations on demand for ontology extension (2014) 0.20
    0.19692442 = sum of:
      0.19692442 = product of:
        0.6153888 = sum of:
          0.108595096 = weight(abstract_txt:crafted in 4959) [ClassicSimilarity], result of:
            0.108595096 = score(doc=4959,freq=1.0), product of:
              0.19636142 = queryWeight, product of:
                1.1610249 = boost
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.019113516 = queryNorm
              0.5530368 = fieldWeight in 4959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.0625 = fieldNorm(doc=4959)
          0.12932666 = weight(abstract_txt:laborious in 4959) [ClassicSimilarity], result of:
            0.12932666 = score(doc=4959,freq=1.0), product of:
              0.22061823 = queryWeight, product of:
                1.2306489 = boost
                9.379218 = idf(docFreq=9, maxDocs=43556)
                0.019113516 = queryNorm
              0.58620113 = fieldWeight in 4959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.379218 = idf(docFreq=9, maxDocs=43556)
                0.0625 = fieldNorm(doc=4959)
          0.03374968 = weight(abstract_txt:domain in 4959) [ClassicSimilarity], result of:
            0.03374968 = score(doc=4959,freq=1.0), product of:
              0.11351131 = queryWeight, product of:
                1.2483835 = boost
                4.75719 = idf(docFreq=1016, maxDocs=43556)
                0.019113516 = queryNorm
              0.2973244 = fieldWeight in 4959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.75719 = idf(docFreq=1016, maxDocs=43556)
                0.0625 = fieldNorm(doc=4959)
          0.035255343 = weight(abstract_txt:approach in 4959) [ClassicSimilarity], result of:
            0.035255343 = score(doc=4959,freq=2.0), product of:
              0.10617683 = queryWeight, product of:
                1.4787303 = boost
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.019113516 = queryNorm
              0.33204365 = fieldWeight in 4959, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.0625 = fieldNorm(doc=4959)
          0.059500556 = weight(abstract_txt:hand in 4959) [ClassicSimilarity], result of:
            0.059500556 = score(doc=4959,freq=1.0), product of:
              0.16565582 = queryWeight, product of:
                1.5081059 = boost
                5.7469087 = idf(docFreq=377, maxDocs=43556)
                0.019113516 = queryNorm
              0.3591818 = fieldWeight in 4959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7469087 = idf(docFreq=377, maxDocs=43556)
                0.0625 = fieldNorm(doc=4959)
          0.07176187 = weight(abstract_txt:corpus in 4959) [ClassicSimilarity], result of:
            0.07176187 = score(doc=4959,freq=1.0), product of:
              0.18769607 = queryWeight, product of:
                1.6052995 = boost
                6.1172824 = idf(docFreq=260, maxDocs=43556)
                0.019113516 = queryNorm
              0.38233015 = fieldWeight in 4959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1172824 = idf(docFreq=260, maxDocs=43556)
                0.0625 = fieldNorm(doc=4959)
          0.11493746 = weight(abstract_txt:acquisition in 4959) [ClassicSimilarity], result of:
            0.11493746 = score(doc=4959,freq=1.0), product of:
              0.29412428 = queryWeight, product of:
                2.4611583 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.019113516 = queryNorm
              0.39077857 = fieldWeight in 4959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.0625 = fieldNorm(doc=4959)
          0.0622622 = weight(abstract_txt:based in 4959) [ClassicSimilarity], result of:
            0.0622622 = score(doc=4959,freq=3.0), product of:
              0.17974547 = queryWeight, product of:
                2.938945 = boost
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.019113516 = queryNorm
              0.34639093 = fieldWeight in 4959, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.0625 = fieldNorm(doc=4959)
        0.32 = coord(8/25)
    
  5. Basili, R.; Pazienza, M.T.; Velardi, P.: ¬An empirical symbolic approach to natural language processing (1996) 0.19
    0.19062851 = sum of:
      0.19062851 = product of:
        0.6808161 = sum of:
          0.058554187 = weight(abstract_txt:applications in 6819) [ClassicSimilarity], result of:
            0.058554187 = score(doc=6819,freq=1.0), product of:
              0.11285981 = queryWeight, product of:
                1.2447958 = boost
                4.7435184 = idf(docFreq=1030, maxDocs=43556)
                0.019113516 = queryNorm
              0.5188223 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7435184 = idf(docFreq=1030, maxDocs=43556)
                0.109375 = fieldNorm(doc=6819)
          0.031389274 = weight(abstract_txt:system in 6819) [ClassicSimilarity], result of:
            0.031389274 = score(doc=6819,freq=1.0), product of:
              0.08525475 = queryWeight, product of:
                1.3250535 = boost
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.019113516 = queryNorm
              0.3681821 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3662362 = idf(docFreq=4086, maxDocs=43556)
                0.109375 = fieldNorm(doc=6819)
          0.09125935 = weight(abstract_txt:scale in 6819) [ClassicSimilarity], result of:
            0.09125935 = score(doc=6819,freq=1.0), product of:
              0.15171164 = queryWeight, product of:
                1.4432379 = boost
                5.4997177 = idf(docFreq=483, maxDocs=43556)
                0.019113516 = queryNorm
              0.6015316 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4997177 = idf(docFreq=483, maxDocs=43556)
                0.109375 = fieldNorm(doc=6819)
          0.09752942 = weight(abstract_txt:large in 6819) [ClassicSimilarity], result of:
            0.09752942 = score(doc=6819,freq=1.0), product of:
              0.19980258 = queryWeight, product of:
                2.342308 = boost
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.019113516 = queryNorm
              0.48812893 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.462893 = idf(docFreq=1364, maxDocs=43556)
                0.109375 = fieldNorm(doc=6819)
          0.20114057 = weight(abstract_txt:acquisition in 6819) [ClassicSimilarity], result of:
            0.20114057 = score(doc=6819,freq=1.0), product of:
              0.29412428 = queryWeight, product of:
                2.4611583 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.019113516 = queryNorm
              0.6838625 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.109375 = fieldNorm(doc=6819)
          0.06290743 = weight(abstract_txt:based in 6819) [ClassicSimilarity], result of:
            0.06290743 = score(doc=6819,freq=1.0), product of:
              0.17974547 = queryWeight, product of:
                2.938945 = boost
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.019113516 = queryNorm
              0.3499806 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1998224 = idf(docFreq=4826, maxDocs=43556)
                0.109375 = fieldNorm(doc=6819)
          0.13803582 = weight(abstract_txt:knowledge in 6819) [ClassicSimilarity], result of:
            0.13803582 = score(doc=6819,freq=1.0), product of:
              0.35287112 = queryWeight, product of:
                5.1619983 = boost
                3.5764952 = idf(docFreq=3311, maxDocs=43556)
                0.019113516 = queryNorm
              0.39117914 = fieldWeight in 6819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5764952 = idf(docFreq=3311, maxDocs=43556)
                0.109375 = fieldNorm(doc=6819)
        0.28 = coord(7/25)