Document (#6682)

Editor
Thomas, D.W.
Author
Driscoll, J.R.
Rajala, D.A.
Shaffer, W.H.
Title
¬The operation and performance of an artificially intelligent keywording system
Source
Information processing and management. 27(1991) no.1, S.43-54
Year
1991
Abstract
Presents a new approach to text analysis for automating the key phrase indexing process, using artificial intelligence techniques. This mimics the behaviour of human experts by using a rule base consisting of insertion and deletion rules generated by subject-matter experts. The insertion rules are based on the idea that some phrases found in a text imply or trigger other phrases. The deletion rules apply to semantically ambiguous phrases where text presence alone does not determine appropriateness as a key phrase. The insertion and deletion rules are used to transform a list of found phrases to a list of key phrases for indexing a document. Statistical data are provided to demonstrate the performance of this expert rule based system
Theme
Automatisches Indexieren
Computerlinguistik

Similar documents (content)

  1. Vlachidis, A.; Tudhope, D.: ¬A knowledge-based approach to information extraction for semantic interoperability in the archaeology domain (2016) 0.21
    0.20716096 = sum of:
      0.20716096 = product of:
        0.5754471 = sum of:
          0.009150384 = weight(abstract_txt:system in 4896) [ClassicSimilarity], result of:
            0.009150384 = score(doc=4896,freq=1.0), product of:
              0.04350975 = queryWeight, product of:
                1.0317384 = boost
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.01253269 = queryNorm
              0.21030651 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.017633762 = weight(abstract_txt:using in 4896) [ClassicSimilarity], result of:
            0.017633762 = score(doc=4896,freq=3.0), product of:
              0.046717897 = queryWeight, product of:
                1.0690991 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.01253269 = queryNorm
              0.377452 = fieldWeight in 4896, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.027729645 = weight(abstract_txt:indexing in 4896) [ClassicSimilarity], result of:
            0.027729645 = score(doc=4896,freq=2.0), product of:
              0.07231803 = queryWeight, product of:
                1.3301469 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.01253269 = queryNorm
              0.38344026 = fieldWeight in 4896, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.024127848 = weight(abstract_txt:performance in 4896) [ClassicSimilarity], result of:
            0.024127848 = score(doc=4896,freq=1.0), product of:
              0.08304359 = queryWeight, product of:
                1.4253757 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.01253269 = queryNorm
              0.2905444 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.023963515 = weight(abstract_txt:text in 4896) [ClassicSimilarity], result of:
            0.023963515 = score(doc=4896,freq=1.0), product of:
              0.09462905 = queryWeight, product of:
                1.8635205 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.01253269 = queryNorm
              0.25323635 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.09806059 = weight(abstract_txt:rule in 4896) [ClassicSimilarity], result of:
            0.09806059 = score(doc=4896,freq=2.0), product of:
              0.16785981 = queryWeight, product of:
                2.0265143 = boost
                6.609259 = idf(docFreq=154, maxDocs=42306)
                0.01253269 = queryNorm
              0.5841815 = fieldWeight in 4896, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.609259 = idf(docFreq=154, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.08553701 = weight(abstract_txt:phrase in 4896) [ClassicSimilarity], result of:
            0.08553701 = score(doc=4896,freq=1.0), product of:
              0.19307664 = queryWeight, product of:
                2.1734076 = boost
                7.088336 = idf(docFreq=95, maxDocs=42306)
                0.01253269 = queryNorm
              0.443021 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.088336 = idf(docFreq=95, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.097056985 = weight(abstract_txt:rules in 4896) [ClassicSimilarity], result of:
            0.097056985 = score(doc=4896,freq=2.0), product of:
              0.21004462 = queryWeight, product of:
                3.2058787 = boost
                5.227815 = idf(docFreq=616, maxDocs=42306)
                0.01253269 = queryNorm
              0.46207795 = fieldWeight in 4896, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.227815 = idf(docFreq=616, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.19218731 = weight(abstract_txt:phrases in 4896) [ClassicSimilarity], result of:
            0.19218731 = score(doc=4896,freq=1.0), product of:
              0.4495281 = queryWeight, product of:
                5.2435417 = boost
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.01253269 = queryNorm
              0.42753124 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
        0.36 = coord(9/25)
    
  2. Craven, T.C.: Adapting of string indexing systems for retrieval using proximity operators (1988) 0.16
    0.16291663 = sum of:
      0.16291663 = product of:
        0.8145831 = sum of:
          0.016013172 = weight(abstract_txt:system in 705) [ClassicSimilarity], result of:
            0.016013172 = score(doc=705,freq=1.0), product of:
              0.04350975 = queryWeight, product of:
                1.0317384 = boost
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.01253269 = queryNorm
              0.3680364 = fieldWeight in 705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.109375 = fieldNorm(doc=705)
          0.0178165 = weight(abstract_txt:using in 705) [ClassicSimilarity], result of:
            0.0178165 = score(doc=705,freq=1.0), product of:
              0.046717897 = queryWeight, product of:
                1.0690991 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.01253269 = queryNorm
              0.3813635 = fieldWeight in 705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.109375 = fieldNorm(doc=705)
          0.04852688 = weight(abstract_txt:indexing in 705) [ClassicSimilarity], result of:
            0.04852688 = score(doc=705,freq=2.0), product of:
              0.07231803 = queryWeight, product of:
                1.3301469 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.01253269 = queryNorm
              0.67102045 = fieldWeight in 705, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.109375 = fieldNorm(doc=705)
          0.14968976 = weight(abstract_txt:phrase in 705) [ClassicSimilarity], result of:
            0.14968976 = score(doc=705,freq=1.0), product of:
              0.19307664 = queryWeight, product of:
                2.1734076 = boost
                7.088336 = idf(docFreq=95, maxDocs=42306)
                0.01253269 = queryNorm
              0.77528673 = fieldWeight in 705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.088336 = idf(docFreq=95, maxDocs=42306)
                0.109375 = fieldNorm(doc=705)
          0.5825368 = weight(abstract_txt:phrases in 705) [ClassicSimilarity], result of:
            0.5825368 = score(doc=705,freq=3.0), product of:
              0.4495281 = queryWeight, product of:
                5.2435417 = boost
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.01253269 = queryNorm
              1.2958852 = fieldWeight in 705, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.109375 = fieldNorm(doc=705)
        0.2 = coord(5/25)
    
  3. Losee, R.: ¬A performance model of the length and number of subject headings and index phrases (2004) 0.16
    0.15889776 = sum of:
      0.15889776 = product of:
        0.7944888 = sum of:
          0.034662057 = weight(abstract_txt:indexing in 4726) [ClassicSimilarity], result of:
            0.034662057 = score(doc=4726,freq=2.0), product of:
              0.07231803 = queryWeight, product of:
                1.3301469 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.01253269 = queryNorm
              0.47930032 = fieldWeight in 4726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.078125 = fieldNorm(doc=4726)
          0.042361908 = weight(abstract_txt:text in 4726) [ClassicSimilarity], result of:
            0.042361908 = score(doc=4726,freq=2.0), product of:
              0.09462905 = queryWeight, product of:
                1.8635205 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.01253269 = queryNorm
              0.44766283 = fieldWeight in 4726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.078125 = fieldNorm(doc=4726)
          0.15120949 = weight(abstract_txt:phrase in 4726) [ClassicSimilarity], result of:
            0.15120949 = score(doc=4726,freq=2.0), product of:
              0.19307664 = queryWeight, product of:
                2.1734076 = boost
                7.088336 = idf(docFreq=95, maxDocs=42306)
                0.01253269 = queryNorm
              0.7831579 = fieldWeight in 4726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.088336 = idf(docFreq=95, maxDocs=42306)
                0.078125 = fieldNorm(doc=4726)
          0.085787065 = weight(abstract_txt:rules in 4726) [ClassicSimilarity], result of:
            0.085787065 = score(doc=4726,freq=1.0), product of:
              0.21004462 = queryWeight, product of:
                3.2058787 = boost
                5.227815 = idf(docFreq=616, maxDocs=42306)
                0.01253269 = queryNorm
              0.40842307 = fieldWeight in 4726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.227815 = idf(docFreq=616, maxDocs=42306)
                0.078125 = fieldNorm(doc=4726)
          0.48046827 = weight(abstract_txt:phrases in 4726) [ClassicSimilarity], result of:
            0.48046827 = score(doc=4726,freq=4.0), product of:
              0.4495281 = queryWeight, product of:
                5.2435417 = boost
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.01253269 = queryNorm
              1.0688281 = fieldWeight in 4726, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.078125 = fieldNorm(doc=4726)
        0.2 = coord(5/25)
    
  4. Fagan, J.L.: ¬The effectiveness of a nonsyntactic approach to automatic phrase indexing for document retrieval (1989) 0.16
    0.15595537 = sum of:
      0.15595537 = product of:
        0.64981407 = sum of:
          0.010180858 = weight(abstract_txt:using in 3846) [ClassicSimilarity], result of:
            0.010180858 = score(doc=3846,freq=1.0), product of:
              0.046717897 = queryWeight, product of:
                1.0690991 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.01253269 = queryNorm
              0.217922 = fieldWeight in 3846, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.0625 = fieldNorm(doc=3846)
          0.039215643 = weight(abstract_txt:indexing in 3846) [ClassicSimilarity], result of:
            0.039215643 = score(doc=3846,freq=4.0), product of:
              0.07231803 = queryWeight, product of:
                1.3301469 = boost
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.01253269 = queryNorm
              0.5422664 = fieldWeight in 3846, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3381314 = idf(docFreq=1501, maxDocs=42306)
                0.0625 = fieldNorm(doc=3846)
          0.024127848 = weight(abstract_txt:performance in 3846) [ClassicSimilarity], result of:
            0.024127848 = score(doc=3846,freq=1.0), product of:
              0.08304359 = queryWeight, product of:
                1.4253757 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.01253269 = queryNorm
              0.2905444 = fieldWeight in 3846, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.0625 = fieldNorm(doc=3846)
          0.03388953 = weight(abstract_txt:text in 3846) [ClassicSimilarity], result of:
            0.03388953 = score(doc=3846,freq=2.0), product of:
              0.09462905 = queryWeight, product of:
                1.8635205 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.01253269 = queryNorm
              0.35813028 = fieldWeight in 3846, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=3846)
          0.20952202 = weight(abstract_txt:phrase in 3846) [ClassicSimilarity], result of:
            0.20952202 = score(doc=3846,freq=6.0), product of:
              0.19307664 = queryWeight, product of:
                2.1734076 = boost
                7.088336 = idf(docFreq=95, maxDocs=42306)
                0.01253269 = queryNorm
              1.0851754 = fieldWeight in 3846, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.088336 = idf(docFreq=95, maxDocs=42306)
                0.0625 = fieldNorm(doc=3846)
          0.33287817 = weight(abstract_txt:phrases in 3846) [ClassicSimilarity], result of:
            0.33287817 = score(doc=3846,freq=3.0), product of:
              0.4495281 = queryWeight, product of:
                5.2435417 = boost
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.01253269 = queryNorm
              0.7405058 = fieldWeight in 3846, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.0625 = fieldNorm(doc=3846)
        0.24 = coord(6/25)
    
  5. Cox, K.: ¬An experiment to test the utility of repeating phrases for information retrieval systems (1994) 0.11
    0.11231784 = sum of:
      0.11231784 = product of:
        0.7019865 = sum of:
          0.06291821 = weight(abstract_txt:alone in 896) [ClassicSimilarity], result of:
            0.06291821 = score(doc=896,freq=1.0), product of:
              0.08598894 = queryWeight, product of:
                1.0256108 = boost
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.01253269 = queryNorm
              0.7317012 = fieldWeight in 896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.109375 = fieldNorm(doc=896)
          0.0178165 = weight(abstract_txt:using in 896) [ClassicSimilarity], result of:
            0.0178165 = score(doc=896,freq=1.0), product of:
              0.046717897 = queryWeight, product of:
                1.0690991 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.01253269 = queryNorm
              0.3813635 = fieldWeight in 896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.109375 = fieldNorm(doc=896)
          0.03871498 = weight(abstract_txt:found in 896) [ClassicSimilarity], result of:
            0.03871498 = score(doc=896,freq=1.0), product of:
              0.07837683 = queryWeight, product of:
                1.3847461 = boost
                4.516201 = idf(docFreq=1256, maxDocs=42306)
                0.01253269 = queryNorm
              0.4939595 = fieldWeight in 896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.516201 = idf(docFreq=1256, maxDocs=42306)
                0.109375 = fieldNorm(doc=896)
          0.5825368 = weight(abstract_txt:phrases in 896) [ClassicSimilarity], result of:
            0.5825368 = score(doc=896,freq=3.0), product of:
              0.4495281 = queryWeight, product of:
                5.2435417 = boost
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.01253269 = queryNorm
              1.2958852 = fieldWeight in 896, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.109375 = fieldNorm(doc=896)
        0.16 = coord(4/25)