Document (#20874)

Author
Chandrasekar, R.
Srinivas, B.
Title
Automatic induction of rules for text simplification
Source
Knowledge-based systems. 10(1997) no.3, pp.183-190
Year
1997
Abstract
Explores methods to automatically transform sentences in order to make them simpler. These methods involve the use of a rule-based system, driven by the syntax of the text in the domain of interest. Hand-crafting rules for every domain is time-consuming and impractical. Describes an algorithm and an implementation by which generalized rules for simplification are automatically induced from annotated training materials using a novel partial parsing technique, which combines constituent structure and dependency information. The algorithm employs example-based generalizations on linguistically motivated structures.
Footnote
Contribution to an issue devoted to papers from the International Conference on Knowledge Based Computer Systems, 16-18 Dec 1996, Mumbai, India
Theme
Computational linguistics

Similar documents (content)

  1. Jiang, X.; Tan, A.-H.: CRCTOL: a semantic-based domain ontology learning system (2009) 0.17
    0.16537413 = sum of:
      0.16537413 = product of:
        0.51679415 = sum of:
          0.0585451 = weight(abstract_txt:generalized in 3320) [ClassicSimilarity], result of:
            0.0585451 = score(doc=3320,freq=1.0), product of:
              0.15137896 = queryWeight, product of:
                1.0378376 = boost
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.020625245 = queryNorm
              0.3867453 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.06114532 = weight(abstract_txt:employs in 3320) [ClassicSimilarity], result of:
            0.06114532 = score(doc=3320,freq=1.0), product of:
              0.15582864 = queryWeight, product of:
                1.0529804 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.020625245 = queryNorm
              0.3923882 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.076922275 = weight(abstract_txt:parsing in 3320) [ClassicSimilarity], result of:
            0.076922275 = score(doc=3320,freq=1.0), product of:
              0.18159612 = queryWeight, product of:
                1.1367106 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.020625245 = queryNorm
              0.4235899 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.03791933 = weight(abstract_txt:text in 3320) [ClassicSimilarity], result of:
            0.03791933 = score(doc=3320,freq=3.0), product of:
              0.09899543 = queryWeight, product of:
                1.186914 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020625245 = queryNorm
              0.38304123 = fieldWeight in 3320, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.0333848 = weight(abstract_txt:methods in 3320) [ClassicSimilarity], result of:
            0.0333848 = score(doc=3320,freq=2.0), product of:
              0.10409687 = queryWeight, product of:
                1.2171118 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020625245 = queryNorm
              0.320709 = fieldWeight in 3320, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.07031731 = weight(abstract_txt:domain in 3320) [ClassicSimilarity], result of:
            0.07031731 = score(doc=3320,freq=4.0), product of:
              0.13575943 = queryWeight, product of:
                1.3899419 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.020625245 = queryNorm
              0.5179553 = fieldWeight in 3320, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.055588886 = weight(abstract_txt:automatically in 3320) [ClassicSimilarity], result of:
            0.055588886 = score(doc=3320,freq=1.0), product of:
              0.18424983 = queryWeight, product of:
                1.6192548 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.020625245 = queryNorm
              0.30170387 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.12297112 = weight(abstract_txt:algorithm in 3320) [ClassicSimilarity], result of:
            0.12297112 = score(doc=3320,freq=4.0), product of:
              0.19705942 = queryWeight, product of:
                1.6745968 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.020625245 = queryNorm
              0.62403065 = fieldWeight in 3320, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
        0.32 = coord(8/25)
    
  2. Finegan-Dollak, C.; Radev, D.R.: Sentence simplification, compression, and disaggregation for summarization of sophisticated documents (2016) 0.16
    0.16047171 = sum of:
      0.16047171 = product of:
        0.8023585 = sum of:
          0.18324906 = weight(abstract_txt:sentences in 3122) [ClassicSimilarity], result of:
            0.18324906 = score(doc=3122,freq=8.0), product of:
              0.14816365 = queryWeight, product of:
                1.0267565 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.020625245 = queryNorm
              1.2368017 = fieldWeight in 3122, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0625 = fieldNorm(doc=3122)
          0.026978992 = weight(abstract_txt:methods in 3122) [ClassicSimilarity], result of:
            0.026978992 = score(doc=3122,freq=1.0), product of:
              0.10409687 = queryWeight, product of:
                1.2171118 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020625245 = queryNorm
              0.259172 = fieldWeight in 3122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=3122)
          0.099375665 = weight(abstract_txt:algorithm in 3122) [ClassicSimilarity], result of:
            0.099375665 = score(doc=3122,freq=2.0), product of:
              0.19705942 = queryWeight, product of:
                1.6745968 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.020625245 = queryNorm
              0.5042929 = fieldWeight in 3122, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=3122)
          0.08151475 = weight(abstract_txt:rules in 3122) [ClassicSimilarity], result of:
            0.08151475 = score(doc=3122,freq=1.0), product of:
              0.24904342 = queryWeight, product of:
                2.3056576 = boost
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.020625245 = queryNorm
              0.32731143 = fieldWeight in 3122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.0625 = fieldNorm(doc=3122)
          0.41124004 = weight(abstract_txt:simplification in 3122) [ClassicSimilarity], result of:
            0.41124004 = score(doc=3122,freq=3.0), product of:
              0.44372135 = queryWeight, product of:
                2.5128515 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.020625245 = queryNorm
              0.9267979 = fieldWeight in 3122, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.0625 = fieldNorm(doc=3122)
        0.2 = coord(5/25)
    
  3. Kauchak, D.; Leroy, G.; Hogue, A.: Measuring text difficulty using parse-tree frequency (2017) 0.11
    0.11057385 = sum of:
      0.11057385 = product of:
        0.55286926 = sum of:
          0.12957665 = weight(abstract_txt:sentences in 3786) [ClassicSimilarity], result of:
            0.12957665 = score(doc=3786,freq=4.0), product of:
              0.14816365 = queryWeight, product of:
                1.0267565 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.020625245 = queryNorm
              0.8745509 = fieldWeight in 3786, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0625 = fieldNorm(doc=3786)
          0.072931655 = weight(abstract_txt:motivated in 3786) [ClassicSimilarity], result of:
            0.072931655 = score(doc=3786,freq=1.0), product of:
              0.16033237 = queryWeight, product of:
                1.0680885 = boost
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.020625245 = queryNorm
              0.4548779 = fieldWeight in 3786, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.0625 = fieldNorm(doc=3786)
          0.08791117 = weight(abstract_txt:parsing in 3786) [ClassicSimilarity], result of:
            0.08791117 = score(doc=3786,freq=1.0), product of:
              0.18159612 = queryWeight, product of:
                1.1367106 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.020625245 = queryNorm
              0.48410273 = fieldWeight in 3786, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.0625 = fieldNorm(doc=3786)
          0.02502027 = weight(abstract_txt:text in 3786) [ClassicSimilarity], result of:
            0.02502027 = score(doc=3786,freq=1.0), product of:
              0.09899543 = queryWeight, product of:
                1.186914 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020625245 = queryNorm
              0.25274166 = fieldWeight in 3786, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=3786)
          0.23742954 = weight(abstract_txt:simplification in 3786) [ClassicSimilarity], result of:
            0.23742954 = score(doc=3786,freq=1.0), product of:
              0.44372135 = queryWeight, product of:
                2.5128515 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.020625245 = queryNorm
              0.53508705 = fieldWeight in 3786, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.0625 = fieldNorm(doc=3786)
        0.2 = coord(5/25)
    
  4. Cui, H.; Heidorn, P.B.: The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions (2007) 0.09
    0.08833539 = sum of:
      0.08833539 = product of:
        0.4416769 = sum of:
          0.14919984 = weight(abstract_txt:induced in 84) [ClassicSimilarity], result of:
            0.14919984 = score(doc=84,freq=2.0), product of:
              0.20507501 = queryWeight, product of:
                1.2079613 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.020625245 = queryNorm
              0.7275379 = fieldWeight in 84, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0625 = fieldNorm(doc=84)
          0.026978992 = weight(abstract_txt:methods in 84) [ClassicSimilarity], result of:
            0.026978992 = score(doc=84,freq=1.0), product of:
              0.10409687 = queryWeight, product of:
                1.2171118 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020625245 = queryNorm
              0.259172 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=84)
          0.040181324 = weight(abstract_txt:domain in 84) [ClassicSimilarity], result of:
            0.040181324 = score(doc=84,freq=1.0), product of:
              0.13575943 = queryWeight, product of:
                1.3899419 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.020625245 = queryNorm
              0.29597446 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=84)
          0.11003745 = weight(abstract_txt:automatically in 84) [ClassicSimilarity], result of:
            0.11003745 = score(doc=84,freq=3.0), product of:
              0.18424983 = queryWeight, product of:
                1.6192548 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.020625245 = queryNorm
              0.59721875 = fieldWeight in 84, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=84)
          0.11527927 = weight(abstract_txt:rules in 84) [ClassicSimilarity], result of:
            0.11527927 = score(doc=84,freq=2.0), product of:
              0.24904342 = queryWeight, product of:
                2.3056576 = boost
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.020625245 = queryNorm
              0.46288824 = fieldWeight in 84, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.0625 = fieldNorm(doc=84)
        0.2 = coord(5/25)
    
  5. Ku, C.-H.; Leroy, G.: A crime reports analysis system to identify related crimes (2011) 0.08
    0.07737296 = sum of:
      0.07737296 = product of:
        0.38686478 = sum of:
          0.07221818 = weight(abstract_txt:consuming in 4629) [ClassicSimilarity], result of:
            0.07221818 = score(doc=4629,freq=1.0), product of:
              0.159285 = queryWeight, product of:
                1.0645941 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.020625245 = queryNorm
              0.45338973 = fieldWeight in 4629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.02502027 = weight(abstract_txt:text in 4629) [ClassicSimilarity], result of:
            0.02502027 = score(doc=4629,freq=1.0), product of:
              0.09899543 = queryWeight, product of:
                1.186914 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020625245 = queryNorm
              0.25274166 = fieldWeight in 4629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.040181324 = weight(abstract_txt:domain in 4629) [ClassicSimilarity], result of:
            0.040181324 = score(doc=4629,freq=1.0), product of:
              0.13575943 = queryWeight, product of:
                1.3899419 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.020625245 = queryNorm
              0.29597446 = fieldWeight in 4629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.063530155 = weight(abstract_txt:automatically in 4629) [ClassicSimilarity], result of:
            0.063530155 = score(doc=4629,freq=1.0), product of:
              0.18424983 = queryWeight, product of:
                1.6192548 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.020625245 = queryNorm
              0.3448044 = fieldWeight in 4629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
          0.18591484 = weight(abstract_txt:algorithm in 4629) [ClassicSimilarity], result of:
            0.18591484 = score(doc=4629,freq=7.0), product of:
              0.19705942 = queryWeight, product of:
                1.6745968 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.020625245 = queryNorm
              0.9434456 = fieldWeight in 4629, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=4629)
        0.2 = coord(5/25)
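
Note on reading the score breakdowns

The score trees above follow one arithmetic pattern, which can be checked by hand. The sketch below recomputes the score of the first similar document (doc 3320) from the values printed in its tree. It assumes the usual TF-IDF form that the figures are consistent with: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final multiplication by the coord factor. The function and variable names are illustrative only and are not part of any library.

```python
import math

# Constants copied from the score tree for doc 3320.
QUERY_NORM = 0.020625245
MAX_DOCS = 44218
FIELD_NORM_3320 = 0.0546875


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form the printed values suggest."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(freq: float, doc_freq: int, boost: float,
               field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, MAX_DOCS)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) as printed for doc 3320.
TERMS_3320 = [
    ("generalized", 1.0, 101, 1.0378376),
    ("employs", 1.0, 91, 1.0529804),
    ("parsing", 1.0, 51, 1.1367106),
    ("text", 3.0, 2106, 1.186914),
    ("methods", 2.0, 1900, 1.2171118),
    ("domain", 4.0, 1054, 1.3899419),
    ("automatically", 1.0, 482, 1.6192548),
    ("algorithm", 4.0, 399, 1.6745968),
]

per_term = {
    term: term_score(freq, doc_freq, boost, FIELD_NORM_3320)
    for term, freq, doc_freq, boost in TERMS_3320
}

# coord(8/25): 8 of the 25 query terms matched this document.
coord = 8 / 25
total_3320 = coord * sum(per_term.values())

for term, value in per_term.items():
    print(f"{term:15s} {value:.6f}")
print(f"total           {total_3320:.6f}")
```

The recomputed total should agree with the listed 0.16537413 to within rounding, since the engine works in single precision and the sketch uses double precision. The same function applies to the other four entries once their freq, docFreq, boost, fieldNorm, and coord values are substituted.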