Document (#13138)

Author
Losee, R.M.
Title
Learning syntactic rules and tags with genetic algorithms for information retrieval and filtering : an empirical basis for grammatical rules
Source
Information processing and management. 32(1996) no.2, S.185-197
Year
1996
Abstract
The grammars of natural languages may be learned by using genetic algorithms that reproduce and mutate grammatical rules and parts of speech tags, improving the quality of later generations of grammatical components. Syntactic rules are randomly generated and then evolve; those rules resulting in improved parsing and occasionally improved filtering performance are allowed to further propagate. The LUST system learns the characteristics of the language or subkanguage used in document abstracts by learning from the document rankings obtained from the parsed abstracts. Unlike the application of traditional linguistic rules to retrieval and filtering applications, LUST develops grammatical structures and tags without the prior imposition of some common grammatical assumptions (e.g. part of speech assumptions), producing grammars that are empirically based and are optimized for this particular application
Theme
Computerlinguistik

Similar documents (author)

  1. Losee, R.M.: ¬A Gray code based ordering for documents on shelves : classification for browsing and retrieval (1992) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 2335) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 2335, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=2335)
    
  2. Losee, R.M.: ¬The relative shelf location of circulated books : a study of classification, users, and browsing (1993) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 4485) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 4485, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=4485)
    
  3. Losee, R.M.: Seven fundamental questions for the science of library classification (1993) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 4508) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 4508, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=4508)
    
  4. Losee, R.M.: Term dependence : truncating the Bahadur Lazarsfeld expansion (1994) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 7390) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 7390, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=7390)
    
  5. Losee, R.M.: Upper bounds for retrieval performance and their user measuring performance and generating optimal queries : can it get any better than this? (1994) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 7418) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 7418, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=7418)
    

Similar documents (content)

  1. Fang, L.; Tuan, L.A.; Hui, S.C.; Wu, L.: Syntactic based approach for grammar question retrieval (2018) 0.13
    0.12730247 = sum of:
      0.12730247 = product of:
        0.7956404 = sum of:
          0.023564663 = weight(abstract_txt:learning in 5086) [ClassicSimilarity], result of:
            0.023564663 = score(doc=5086,freq=1.0), product of:
              0.07936112 = queryWeight, product of:
                1.2267213 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.013617219 = queryNorm
              0.29692957 = fieldWeight in 5086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=5086)
          0.13659905 = weight(abstract_txt:syntactic in 5086) [ClassicSimilarity], result of:
            0.13659905 = score(doc=5086,freq=5.0), product of:
              0.14976445 = queryWeight, product of:
                1.6851804 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.013617219 = queryNorm
              0.9120926 = fieldWeight in 5086, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0625 = fieldNorm(doc=5086)
          0.071209945 = weight(abstract_txt:speech in 5086) [ClassicSimilarity], result of:
            0.071209945 = score(doc=5086,freq=1.0), product of:
              0.16588001 = queryWeight, product of:
                1.773532 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.013617219 = queryNorm
              0.42928585 = fieldWeight in 5086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0625 = fieldNorm(doc=5086)
          0.56426674 = weight(abstract_txt:grammatical in 5086) [ClassicSimilarity], result of:
            0.56426674 = score(doc=5086,freq=4.0), product of:
              0.56370246 = queryWeight, product of:
                5.1693625 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.013617219 = queryNorm
              1.001001 = fieldWeight in 5086, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=5086)
        0.16 = coord(4/25)
    
  2. Marcu, D.: Automatic abstracting and summarization (2009) 0.09
    0.092877656 = sum of:
      0.092877656 = product of:
        0.58048534 = sum of:
          0.0260733 = weight(abstract_txt:document in 3748) [ClassicSimilarity], result of:
            0.0260733 = score(doc=3748,freq=1.0), product of:
              0.064789325 = queryWeight, product of:
                1.108393 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.013617219 = queryNorm
              0.40243202 = fieldWeight in 3748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.09375 = fieldNorm(doc=3748)
          0.06130094 = weight(abstract_txt:algorithms in 3748) [ClassicSimilarity], result of:
            0.06130094 = score(doc=3748,freq=1.0), product of:
              0.114555925 = queryWeight, product of:
                1.4738415 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.013617219 = queryNorm
              0.53511804 = fieldWeight in 3748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.09375 = fieldNorm(doc=3748)
          0.06991105 = weight(abstract_txt:abstracts in 3748) [ClassicSimilarity], result of:
            0.06991105 = score(doc=3748,freq=1.0), product of:
              0.12504606 = queryWeight, product of:
                1.5398451 = boost
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.013617219 = queryNorm
              0.5590824 = fieldWeight in 3748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.09375 = fieldNorm(doc=3748)
          0.42320007 = weight(abstract_txt:grammatical in 3748) [ClassicSimilarity], result of:
            0.42320007 = score(doc=3748,freq=1.0), product of:
              0.56370246 = queryWeight, product of:
                5.1693625 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.013617219 = queryNorm
              0.7507508 = fieldWeight in 3748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.09375 = fieldNorm(doc=3748)
        0.16 = coord(4/25)
    
  3. Losee, R.M.: Text windows and phrases differing by discipline, location in document, and syntactic structure (1996) 0.09
    0.09152903 = sum of:
      0.09152903 = product of:
        0.7627419 = sum of:
          0.0260733 = weight(abstract_txt:document in 6962) [ClassicSimilarity], result of:
            0.0260733 = score(doc=6962,freq=1.0), product of:
              0.064789325 = queryWeight, product of:
                1.108393 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.013617219 = queryNorm
              0.40243202 = fieldWeight in 6962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.09375 = fieldNorm(doc=6962)
          0.13817348 = weight(abstract_txt:filtering in 6962) [ClassicSimilarity], result of:
            0.13817348 = score(doc=6962,freq=1.0), product of:
              0.22543414 = queryWeight, product of:
                2.532197 = boost
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.013617219 = queryNorm
              0.6129217 = fieldWeight in 6962, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.09375 = fieldNorm(doc=6962)
          0.5984952 = weight(abstract_txt:grammatical in 6962) [ClassicSimilarity], result of:
            0.5984952 = score(doc=6962,freq=2.0), product of:
              0.56370246 = queryWeight, product of:
                5.1693625 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.013617219 = queryNorm
              1.0617218 = fieldWeight in 6962, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.09375 = fieldNorm(doc=6962)
        0.12 = coord(3/25)
    
  4. Svenonius, E.: Facets as semantic categories (1979) 0.08
    0.07827823 = sum of:
      0.07827823 = product of:
        0.6523186 = sum of:
          0.13226146 = weight(abstract_txt:syntactic in 1427) [ClassicSimilarity], result of:
            0.13226146 = score(doc=1427,freq=3.0), product of:
              0.14976445 = queryWeight, product of:
                1.6851804 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.013617219 = queryNorm
              0.88312984 = fieldWeight in 1427, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.078125 = fieldNorm(doc=1427)
          0.16739044 = weight(abstract_txt:rules in 1427) [ClassicSimilarity], result of:
            0.16739044 = score(doc=1427,freq=2.0), product of:
              0.28929737 = queryWeight, product of:
                4.056719 = boost
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.013617219 = queryNorm
              0.5786103 = fieldWeight in 1427, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.078125 = fieldNorm(doc=1427)
          0.3526667 = weight(abstract_txt:grammatical in 1427) [ClassicSimilarity], result of:
            0.3526667 = score(doc=1427,freq=1.0), product of:
              0.56370246 = queryWeight, product of:
                5.1693625 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.013617219 = queryNorm
              0.6256256 = fieldWeight in 1427, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.078125 = fieldNorm(doc=1427)
        0.12 = coord(3/25)
    
  5. Brill, E.: ¬An overview of empirical natural language processing (1997) 0.07
    0.066215634 = sum of:
      0.066215634 = product of:
        0.41384774 = sum of:
          0.10212063 = weight(abstract_txt:parsing in 3249) [ClassicSimilarity], result of:
            0.10212063 = score(doc=3249,freq=1.0), product of:
              0.10547413 = queryWeight, product of:
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.013617219 = queryNorm
              0.96820545 = fieldWeight in 3249, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.125 = fieldNorm(doc=3249)
          0.047129326 = weight(abstract_txt:learning in 3249) [ClassicSimilarity], result of:
            0.047129326 = score(doc=3249,freq=1.0), product of:
              0.07936112 = queryWeight, product of:
                1.2267213 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.013617219 = queryNorm
              0.59385914 = fieldWeight in 3249, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.125 = fieldNorm(doc=3249)
          0.1221779 = weight(abstract_txt:syntactic in 3249) [ClassicSimilarity], result of:
            0.1221779 = score(doc=3249,freq=1.0), product of:
              0.14976445 = queryWeight, product of:
                1.6851804 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.013617219 = queryNorm
              0.8158004 = fieldWeight in 3249, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.125 = fieldNorm(doc=3249)
          0.14241989 = weight(abstract_txt:speech in 3249) [ClassicSimilarity], result of:
            0.14241989 = score(doc=3249,freq=1.0), product of:
              0.16588001 = queryWeight, product of:
                1.773532 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.013617219 = queryNorm
              0.8585717 = fieldWeight in 3249, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.125 = fieldNorm(doc=3249)
        0.16 = coord(4/25)