Document (#16479)

Author
Lonsdale, D.
Mitamura, T.
Nyberg, E.
Title
Acquisition of large lexicons for practical knowledge-based MT
Source
Machine translation. 9(1994/95) nos.3/4, S.251-283
Year
1994/95
Abstract
Although knowledge based MT systems have the potential to achieve high translation accuracy, each successful application system requires a large amount of hand coded lexical knowledge. Systems like KBMT-89 and its descendants have demonstarted how knowledge based translation can produce good results in technical domains with tractable domain semantics. Nevertheless, the magnitude of the development task for large scale applications with 10s of 1000s of of domain concepts precludes a purely hand crafted approach. The current challenge for the next generation of knowledge based MT systems is to utilize online textual resources and corpus analysis software in order to automate the most laborious aspects of the knowledge acquisition process. This partial automation can in turn maximize the productivity of human knowledge engineers and help to make large scale applications of knowledge based MT an viable approach. Discusses the corpus based knowledge acquisition methodology used in KANT, a knowledge based translation system for multilingual document production. This methodology can be generalized beyond the KANT interlinhua approach for use with any system that requires similar kinds of knowledge
Theme
Computerlinguistik
Multilinguale Probleme
Object
KBMT-89

Similar documents (content)

  1. Knight, K.: Automatic knowledge acquisition for machine translation (1997) 0.25
    0.2516899 = sum of:
      0.2516899 = product of:
        1.2584496 = sum of:
          0.2404114 = weight(abstract_txt:automate in 3248) [ClassicSimilarity], result of:
            0.2404114 = score(doc=3248,freq=1.0), product of:
              0.16060923 = queryWeight, product of:
                1.0477685 = boost
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.019200914 = queryNorm
              1.4968716 = fieldWeight in 3248, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.1875 = fieldNorm(doc=3248)
          0.33383155 = weight(abstract_txt:translation in 3248) [ClassicSimilarity], result of:
            0.33383155 = score(doc=3248,freq=1.0), product of:
              0.28830963 = queryWeight, product of:
                2.4314778 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019200914 = queryNorm
              1.1578925 = fieldWeight in 3248, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.1875 = fieldNorm(doc=3248)
          0.34396702 = weight(abstract_txt:acquisition in 3248) [ClassicSimilarity], result of:
            0.34396702 = score(doc=3248,freq=1.0), product of:
              0.29411608 = queryWeight, product of:
                2.4558403 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.019200914 = queryNorm
              1.1694942 = fieldWeight in 3248, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.1875 = fieldNorm(doc=3248)
          0.10715901 = weight(abstract_txt:based in 3248) [ClassicSimilarity], result of:
            0.10715901 = score(doc=3248,freq=1.0), product of:
              0.17927468 = queryWeight, product of:
                2.9287925 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019200914 = queryNorm
              0.5977365 = fieldWeight in 3248, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.1875 = fieldNorm(doc=3248)
          0.23308058 = weight(abstract_txt:knowledge in 3248) [ClassicSimilarity], result of:
            0.23308058 = score(doc=3248,freq=1.0), product of:
              0.34989315 = queryWeight, product of:
                5.129135 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.019200914 = queryNorm
              0.6661479 = fieldWeight in 3248, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.1875 = fieldNorm(doc=3248)
        0.2 = coord(5/25)
    
  2. Dorr, B.J.: Large-scale dictionary construction for foreign language tutoring and interlingual machine translation (1997) 0.23
    0.22999561 = sum of:
      0.22999561 = product of:
        0.7187363 = sum of:
          0.13446409 = weight(abstract_txt:lexicons in 3244) [ClassicSimilarity], result of:
            0.13446409 = score(doc=3244,freq=1.0), product of:
              0.19543943 = queryWeight, product of:
                1.1558093 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.019200914 = queryNorm
              0.688009 = fieldWeight in 3244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.078125 = fieldNorm(doc=3244)
          0.023458263 = weight(abstract_txt:systems in 3244) [ClassicSimilarity], result of:
            0.023458263 = score(doc=3244,freq=1.0), product of:
              0.088006005 = queryWeight, product of:
                1.3433738 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.019200914 = queryNorm
              0.26655298 = fieldWeight in 3244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.078125 = fieldNorm(doc=3244)
          0.09145342 = weight(abstract_txt:scale in 3244) [ClassicSimilarity], result of:
            0.09145342 = score(doc=3244,freq=2.0), product of:
              0.15114993 = queryWeight, product of:
                1.4374709 = boost
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.019200914 = queryNorm
              0.60505104 = fieldWeight in 3244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.078125 = fieldNorm(doc=3244)
          0.04388335 = weight(abstract_txt:approach in 3244) [ClassicSimilarity], result of:
            0.04388335 = score(doc=3244,freq=2.0), product of:
              0.10604859 = queryWeight, product of:
                1.4746643 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.019200914 = queryNorm
              0.41380417 = fieldWeight in 3244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=3244)
          0.09841146 = weight(abstract_txt:large in 3244) [ClassicSimilarity], result of:
            0.09841146 = score(doc=3244,freq=2.0), product of:
              0.19997779 = queryWeight, product of:
                2.338304 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.019200914 = queryNorm
              0.49211198 = fieldWeight in 3244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=3244)
          0.13909648 = weight(abstract_txt:translation in 3244) [ClassicSimilarity], result of:
            0.13909648 = score(doc=3244,freq=1.0), product of:
              0.28830963 = queryWeight, product of:
                2.4314778 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019200914 = queryNorm
              0.4824552 = fieldWeight in 3244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=3244)
          0.14331959 = weight(abstract_txt:acquisition in 3244) [ClassicSimilarity], result of:
            0.14331959 = score(doc=3244,freq=1.0), product of:
              0.29411608 = queryWeight, product of:
                2.4558403 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.019200914 = queryNorm
              0.4872892 = fieldWeight in 3244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.078125 = fieldNorm(doc=3244)
          0.044649586 = weight(abstract_txt:based in 3244) [ClassicSimilarity], result of:
            0.044649586 = score(doc=3244,freq=1.0), product of:
              0.17927468 = queryWeight, product of:
                2.9287925 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019200914 = queryNorm
              0.24905685 = fieldWeight in 3244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.078125 = fieldNorm(doc=3244)
        0.32 = coord(8/25)
    
  3. Li, L.X.; Xu, L.D.: Knowledge-based problem solving (2002) 0.22
    0.21716046 = sum of:
      0.21716046 = product of:
        0.6032235 = sum of:
          0.06096012 = weight(abstract_txt:coded in 4259) [ClassicSimilarity], result of:
            0.06096012 = score(doc=4259,freq=1.0), product of:
              0.14629848 = queryWeight, product of:
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.019200914 = queryNorm
              0.4166832 = fieldWeight in 4259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4259)
          0.05069969 = weight(abstract_txt:domain in 4259) [ClassicSimilarity], result of:
            0.05069969 = score(doc=4259,freq=3.0), product of:
              0.11302705 = queryWeight, product of:
                1.2430434 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.019200914 = queryNorm
              0.44856244 = fieldWeight in 4259, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4259)
          0.0293951 = weight(abstract_txt:applications in 4259) [ClassicSimilarity], result of:
            0.0293951 = score(doc=4259,freq=1.0), product of:
              0.11334505 = queryWeight, product of:
                1.2447908 = boost
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.019200914 = queryNorm
              0.25934172 = fieldWeight in 4259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4259)
          0.015856056 = weight(abstract_txt:system in 4259) [ClassicSimilarity], result of:
            0.015856056 = score(doc=4259,freq=1.0), product of:
              0.085976504 = queryWeight, product of:
                1.3277937 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.019200914 = queryNorm
              0.18442312 = fieldWeight in 4259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4259)
          0.03284157 = weight(abstract_txt:systems in 4259) [ClassicSimilarity], result of:
            0.03284157 = score(doc=4259,freq=4.0), product of:
              0.088006005 = queryWeight, product of:
                1.3433738 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.019200914 = queryNorm
              0.3731742 = fieldWeight in 4259, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4259)
          0.02172115 = weight(abstract_txt:approach in 4259) [ClassicSimilarity], result of:
            0.02172115 = score(doc=4259,freq=1.0), product of:
              0.10604859 = queryWeight, product of:
                1.4746643 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.019200914 = queryNorm
              0.20482263 = fieldWeight in 4259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4259)
          0.052873645 = weight(abstract_txt:requires in 4259) [ClassicSimilarity], result of:
            0.052873645 = score(doc=4259,freq=1.0), product of:
              0.16764043 = queryWeight, product of:
                1.5138557 = boost
                5.767298 = idf(docFreq=375, maxDocs=44218)
                0.019200914 = queryNorm
              0.3153991 = fieldWeight in 4259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.767298 = idf(docFreq=375, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4259)
          0.09376414 = weight(abstract_txt:based in 4259) [ClassicSimilarity], result of:
            0.09376414 = score(doc=4259,freq=9.0), product of:
              0.17927468 = queryWeight, product of:
                2.9287925 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019200914 = queryNorm
              0.52301943 = fieldWeight in 4259, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4259)
          0.245112 = weight(abstract_txt:knowledge in 4259) [ClassicSimilarity], result of:
            0.245112 = score(doc=4259,freq=13.0), product of:
              0.34989315 = queryWeight, product of:
                5.129135 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.019200914 = queryNorm
              0.70053387 = fieldWeight in 4259, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4259)
        0.36 = coord(9/25)
    
  4. Xu, Y.; Li, G.; Mou, L.; Lu, Y.: Learning non-taxonomic relations on demand for ontology extension (2014) 0.20
    0.19664963 = sum of:
      0.19664963 = product of:
        0.6145301 = sum of:
          0.107571274 = weight(abstract_txt:crafted in 2961) [ClassicSimilarity], result of:
            0.107571274 = score(doc=2961,freq=1.0), product of:
              0.19543943 = queryWeight, product of:
                1.1558093 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.019200914 = queryNorm
              0.55040723 = fieldWeight in 2961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0625 = fieldNorm(doc=2961)
          0.13058028 = weight(abstract_txt:laborious in 2961) [ClassicSimilarity], result of:
            0.13058028 = score(doc=2961,freq=1.0), product of:
              0.22239912 = queryWeight, product of:
                1.2329533 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.019200914 = queryNorm
              0.5871439 = fieldWeight in 2961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=2961)
          0.03345312 = weight(abstract_txt:domain in 2961) [ClassicSimilarity], result of:
            0.03345312 = score(doc=2961,freq=1.0), product of:
              0.11302705 = queryWeight, product of:
                1.2430434 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.019200914 = queryNorm
              0.29597446 = fieldWeight in 2961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=2961)
          0.035106678 = weight(abstract_txt:approach in 2961) [ClassicSimilarity], result of:
            0.035106678 = score(doc=2961,freq=2.0), product of:
              0.10604859 = queryWeight, product of:
                1.4746643 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.019200914 = queryNorm
              0.33104333 = fieldWeight in 2961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=2961)
          0.059849072 = weight(abstract_txt:hand in 2961) [ClassicSimilarity], result of:
            0.059849072 = score(doc=2961,freq=1.0), product of:
              0.1665698 = queryWeight, product of:
                1.5090139 = boost
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.019200914 = queryNorm
              0.35930327 = fieldWeight in 2961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.0625 = fieldNorm(doc=2961)
          0.07144564 = weight(abstract_txt:corpus in 2961) [ClassicSimilarity], result of:
            0.07144564 = score(doc=2961,freq=1.0), product of:
              0.18744555 = queryWeight, product of:
                1.6007837 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.019200914 = queryNorm
              0.3811541 = fieldWeight in 2961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.0625 = fieldNorm(doc=2961)
          0.11465567 = weight(abstract_txt:acquisition in 2961) [ClassicSimilarity], result of:
            0.11465567 = score(doc=2961,freq=1.0), product of:
              0.29411608 = queryWeight, product of:
                2.4558403 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.019200914 = queryNorm
              0.38983136 = fieldWeight in 2961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.0625 = fieldNorm(doc=2961)
          0.06186828 = weight(abstract_txt:based in 2961) [ClassicSimilarity], result of:
            0.06186828 = score(doc=2961,freq=3.0), product of:
              0.17927468 = queryWeight, product of:
                2.9287925 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019200914 = queryNorm
              0.3451033 = fieldWeight in 2961, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=2961)
        0.32 = coord(8/25)
    
  5. Basili, R.; Pazienza, M.T.; Velardi, P.: ¬An empirical symbolic approach to natural language processing (1996) 0.19
    0.18972225 = sum of:
      0.18972225 = product of:
        0.67757946 = sum of:
          0.0587902 = weight(abstract_txt:applications in 6753) [ClassicSimilarity], result of:
            0.0587902 = score(doc=6753,freq=1.0), product of:
              0.11334505 = queryWeight, product of:
                1.2447908 = boost
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.019200914 = queryNorm
              0.51868343 = fieldWeight in 6753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.109375 = fieldNorm(doc=6753)
          0.03171211 = weight(abstract_txt:system in 6753) [ClassicSimilarity], result of:
            0.03171211 = score(doc=6753,freq=1.0), product of:
              0.085976504 = queryWeight, product of:
                1.3277937 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.019200914 = queryNorm
              0.36884624 = fieldWeight in 6753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.109375 = fieldNorm(doc=6753)
          0.09053427 = weight(abstract_txt:scale in 6753) [ClassicSimilarity], result of:
            0.09053427 = score(doc=6753,freq=1.0), product of:
              0.15114993 = queryWeight, product of:
                1.4374709 = boost
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.019200914 = queryNorm
              0.59897 = fieldWeight in 6753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.109375 = fieldNorm(doc=6753)
          0.09742238 = weight(abstract_txt:large in 6753) [ClassicSimilarity], result of:
            0.09742238 = score(doc=6753,freq=1.0), product of:
              0.19997779 = queryWeight, product of:
                2.338304 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.019200914 = queryNorm
              0.487166 = fieldWeight in 6753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.109375 = fieldNorm(doc=6753)
          0.20064743 = weight(abstract_txt:acquisition in 6753) [ClassicSimilarity], result of:
            0.20064743 = score(doc=6753,freq=1.0), product of:
              0.29411608 = queryWeight, product of:
                2.4558403 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.019200914 = queryNorm
              0.6822049 = fieldWeight in 6753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.109375 = fieldNorm(doc=6753)
          0.062509425 = weight(abstract_txt:based in 6753) [ClassicSimilarity], result of:
            0.062509425 = score(doc=6753,freq=1.0), product of:
              0.17927468 = queryWeight, product of:
                2.9287925 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019200914 = queryNorm
              0.3486796 = fieldWeight in 6753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.109375 = fieldNorm(doc=6753)
          0.13596368 = weight(abstract_txt:knowledge in 6753) [ClassicSimilarity], result of:
            0.13596368 = score(doc=6753,freq=1.0), product of:
              0.34989315 = queryWeight, product of:
                5.129135 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.019200914 = queryNorm
              0.38858628 = fieldWeight in 6753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.109375 = fieldNorm(doc=6753)
        0.28 = coord(7/25)