Document (#11289)

Author
Savic, D.
Title
Automatic classification of office documents : review of available methods and techniques
Source
Records management quarterly. 29(1995) no.4, pp.3-18
Year
1995
Abstract
Classification of office documents is one of the administrative functions carried out by almost every organization and institution that sends and receives correspondence. Processing of this increasing amount of information (incoming and outgoing mail), in particular its classification, is time-consuming and expensive. More and more organizations are seeking to meet this challenge by designing computer-based systems for automatic classification. Examines the present status of available knowledge and methodology that can be used for automatic classification of office documents. Besides a review of classic methods and techniques, the focus is also placed on the application of artificial intelligence.
Theme
Document management
Automatic classification

Similar documents (content)

  1. Savic, D.: Designing an expert system for classifying office documents (1994) 0.17
    0.17357425 = sum of:
      0.17357425 = product of:
        0.8678712 = sum of:
          0.09457064 = weight(abstract_txt:artificial in 2655) [ClassicSimilarity], result of:
            0.09457064 = score(doc=2655,freq=1.0), product of:
              0.12472892 = queryWeight, product of:
                6.0656753 = idf(docFreq=278, maxDocs=44218)
                0.020563072 = queryNorm
              0.7582094 = fieldWeight in 2655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0656753 = idf(docFreq=278, maxDocs=44218)
                0.125 = fieldNorm(doc=2655)
          0.088990495 = weight(abstract_txt:documents in 2655) [ClassicSimilarity], result of:
            0.088990495 = score(doc=2655,freq=1.0), product of:
              0.17274246 = queryWeight, product of:
                2.0383399 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020563072 = queryNorm
              0.5151628 = fieldWeight in 2655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.125 = fieldNorm(doc=2655)
          0.17829807 = weight(abstract_txt:automatic in 2655) [ClassicSimilarity], result of:
            0.17829807 = score(doc=2655,freq=1.0), product of:
              0.27453715 = queryWeight, product of:
                2.5696716 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.020563072 = queryNorm
              0.6494497 = fieldWeight in 2655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.125 = fieldNorm(doc=2655)
          0.3712136 = weight(abstract_txt:office in 2655) [ClassicSimilarity], result of:
            0.3712136 = score(doc=2655,freq=1.0), product of:
              0.44763008 = queryWeight, product of:
                3.28123 = boost
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.020563072 = queryNorm
              0.8292865 = fieldWeight in 2655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.125 = fieldNorm(doc=2655)
          0.13479848 = weight(abstract_txt:classification in 2655) [ClassicSimilarity], result of:
            0.13479848 = score(doc=2655,freq=1.0), product of:
              0.27013215 = queryWeight, product of:
                3.2907097 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.020563072 = queryNorm
              0.4990094 = fieldWeight in 2655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.125 = fieldNorm(doc=2655)
        0.2 = coord(5/25)
    
  2. Pong, J.Y.-H.; Kwok, R.C.-W.; Lau, R.Y.-K.; Hao, J.-X.; Wong, P.C.-C.: A comparative study of two automatic document classification methods in a library setting (2008) 0.15
    0.14737183 = sum of:
      0.14737183 = product of:
        0.52632797 = sum of:
          0.016686123 = weight(abstract_txt:more in 2532) [ClassicSimilarity], result of:
            0.016686123 = score(doc=2532,freq=1.0), product of:
              0.07847474 = queryWeight, product of:
                1.1217507 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.020563072 = queryNorm
              0.2126305 = fieldWeight in 2532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.0625 = fieldNorm(doc=2532)
          0.03021642 = weight(abstract_txt:methods in 2532) [ClassicSimilarity], result of:
            0.03021642 = score(doc=2532,freq=1.0), product of:
              0.116588295 = queryWeight, product of:
                1.3672845 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020563072 = queryNorm
              0.259172 = fieldWeight in 2532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=2532)
          0.033719208 = weight(abstract_txt:available in 2532) [ClassicSimilarity], result of:
            0.033719208 = score(doc=2532,freq=1.0), product of:
              0.12543282 = queryWeight, product of:
                1.4181985 = boost
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.020563072 = queryNorm
              0.26882285 = fieldWeight in 2532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.0625 = fieldNorm(doc=2532)
          0.039388567 = weight(abstract_txt:techniques in 2532) [ClassicSimilarity], result of:
            0.039388567 = score(doc=2532,freq=1.0), product of:
              0.13912539 = queryWeight, product of:
                1.4936011 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.020563072 = queryNorm
              0.2831156 = fieldWeight in 2532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=2532)
          0.06292578 = weight(abstract_txt:documents in 2532) [ClassicSimilarity], result of:
            0.06292578 = score(doc=2532,freq=2.0), product of:
              0.17274246 = queryWeight, product of:
                2.0383399 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020563072 = queryNorm
              0.36427513 = fieldWeight in 2532, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2532)
          0.17829807 = weight(abstract_txt:automatic in 2532) [ClassicSimilarity], result of:
            0.17829807 = score(doc=2532,freq=4.0), product of:
              0.27453715 = queryWeight, product of:
                2.5696716 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.020563072 = queryNorm
              0.6494497 = fieldWeight in 2532, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=2532)
          0.16509375 = weight(abstract_txt:classification in 2532) [ClassicSimilarity], result of:
            0.16509375 = score(doc=2532,freq=6.0), product of:
              0.27013215 = queryWeight, product of:
                3.2907097 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.020563072 = queryNorm
              0.6111592 = fieldWeight in 2532, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2532)
        0.28 = coord(7/25)
    
  3. Malak, P.: Is the Artificial Intelligence applicable for the libraries purposes? (2005) 0.12
    0.12381576 = sum of:
      0.12381576 = product of:
        0.38692427 = sum of:
          0.03546399 = weight(abstract_txt:artificial in 3006) [ClassicSimilarity], result of:
            0.03546399 = score(doc=3006,freq=1.0), product of:
              0.12472892 = queryWeight, product of:
                6.0656753 = idf(docFreq=278, maxDocs=44218)
                0.020563072 = queryNorm
              0.28432852 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0656753 = idf(docFreq=278, maxDocs=44218)
                0.046875 = fieldNorm(doc=3006)
          0.061514474 = weight(abstract_txt:meeting in 3006) [ClassicSimilarity], result of:
            0.061514474 = score(doc=3006,freq=2.0), product of:
              0.14291692 = queryWeight, product of:
                1.0704299 = boost
                6.4928803 = idf(docFreq=181, maxDocs=44218)
                0.020563072 = queryNorm
              0.43042123 = fieldWeight in 3006, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4928803 = idf(docFreq=181, maxDocs=44218)
                0.046875 = fieldNorm(doc=3006)
          0.012514591 = weight(abstract_txt:more in 3006) [ClassicSimilarity], result of:
            0.012514591 = score(doc=3006,freq=1.0), product of:
              0.07847474 = queryWeight, product of:
                1.1217507 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.020563072 = queryNorm
              0.15947287 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.046875 = fieldNorm(doc=3006)
          0.032049354 = weight(abstract_txt:methods in 3006) [ClassicSimilarity], result of:
            0.032049354 = score(doc=3006,freq=2.0), product of:
              0.116588295 = queryWeight, product of:
                1.3672845 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020563072 = queryNorm
              0.2748934 = fieldWeight in 3006, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.046875 = fieldNorm(doc=3006)
          0.025289405 = weight(abstract_txt:available in 3006) [ClassicSimilarity], result of:
            0.025289405 = score(doc=3006,freq=1.0), product of:
              0.12543282 = queryWeight, product of:
                1.4181985 = boost
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.020563072 = queryNorm
              0.20161714 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.046875 = fieldNorm(doc=3006)
          0.08174299 = weight(abstract_txt:documents in 3006) [ClassicSimilarity], result of:
            0.08174299 = score(doc=3006,freq=6.0), product of:
              0.17274246 = queryWeight, product of:
                2.0383399 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020563072 = queryNorm
              0.4732073 = fieldWeight in 3006, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.046875 = fieldNorm(doc=3006)
          0.06686178 = weight(abstract_txt:automatic in 3006) [ClassicSimilarity], result of:
            0.06686178 = score(doc=3006,freq=1.0), product of:
              0.27453715 = queryWeight, product of:
                2.5696716 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.020563072 = queryNorm
              0.24354364 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.046875 = fieldNorm(doc=3006)
          0.07148769 = weight(abstract_txt:classification in 3006) [ClassicSimilarity], result of:
            0.07148769 = score(doc=3006,freq=2.0), product of:
              0.27013215 = queryWeight, product of:
                3.2907097 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.020563072 = queryNorm
              0.26463968 = fieldWeight in 3006, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.046875 = fieldNorm(doc=3006)
        0.32 = coord(8/25)
    
  4. Desale, S.K.; Kumbhar, R.: Research on automatic classification of documents in library environment : a literature review (2013) 0.11
    0.10541218 = sum of:
      0.10541218 = product of:
        0.6588261 = sum of:
          0.10521829 = weight(abstract_txt:review in 1071) [ClassicSimilarity], result of:
            0.10521829 = score(doc=1071,freq=3.0), product of:
              0.16004422 = queryWeight, product of:
                1.6019591 = boost
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.020563072 = queryNorm
              0.6574326 = fieldWeight in 1071, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.124368005 = weight(abstract_txt:documents in 1071) [ClassicSimilarity], result of:
            0.124368005 = score(doc=1071,freq=5.0), product of:
              0.17274246 = queryWeight, product of:
                2.0383399 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020563072 = queryNorm
              0.719962 = fieldWeight in 1071, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.2228726 = weight(abstract_txt:automatic in 1071) [ClassicSimilarity], result of:
            0.2228726 = score(doc=1071,freq=4.0), product of:
              0.27453715 = queryWeight, product of:
                2.5696716 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.020563072 = queryNorm
              0.81181216 = fieldWeight in 1071, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
          0.2063672 = weight(abstract_txt:classification in 1071) [ClassicSimilarity], result of:
            0.2063672 = score(doc=1071,freq=6.0), product of:
              0.27013215 = queryWeight, product of:
                3.2907097 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.020563072 = queryNorm
              0.76394904 = fieldWeight in 1071, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=1071)
        0.16 = coord(4/25)
    
  5. Hendley, T.: Document image processing : going beyond the black-and-white barrier (1995) 0.10
    0.10327303 = sum of:
      0.10327303 = product of:
        0.5163651 = sum of:
          0.020857653 = weight(abstract_txt:more in 2434) [ClassicSimilarity], result of:
            0.020857653 = score(doc=2434,freq=1.0), product of:
              0.07847474 = queryWeight, product of:
                1.1217507 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.020563072 = queryNorm
              0.2657881 = fieldWeight in 2434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.078125 = fieldNorm(doc=2434)
          0.04214901 = weight(abstract_txt:available in 2434) [ClassicSimilarity], result of:
            0.04214901 = score(doc=2434,freq=1.0), product of:
              0.12543282 = queryWeight, product of:
                1.4181985 = boost
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.020563072 = queryNorm
              0.33602858 = fieldWeight in 2434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3011656 = idf(docFreq=1628, maxDocs=44218)
                0.078125 = fieldNorm(doc=2434)
          0.06962981 = weight(abstract_txt:techniques in 2434) [ClassicSimilarity], result of:
            0.06962981 = score(doc=2434,freq=2.0), product of:
              0.13912539 = queryWeight, product of:
                1.4936011 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.020563072 = queryNorm
              0.5004824 = fieldWeight in 2434, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.078125 = fieldNorm(doc=2434)
          0.05561906 = weight(abstract_txt:documents in 2434) [ClassicSimilarity], result of:
            0.05561906 = score(doc=2434,freq=1.0), product of:
              0.17274246 = queryWeight, product of:
                2.0383399 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020563072 = queryNorm
              0.32197678 = fieldWeight in 2434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=2434)
          0.32810956 = weight(abstract_txt:office in 2434) [ClassicSimilarity], result of:
            0.32810956 = score(doc=2434,freq=2.0), product of:
              0.44763008 = queryWeight, product of:
                3.28123 = boost
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.020563072 = queryNorm
              0.73299265 = fieldWeight in 2434, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.078125 = fieldNorm(doc=2434)
        0.2 = coord(5/25)
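
The score trees above all follow the same arithmetic, so a short sketch may help in reading them. The Python below recomputes the score of the first similar document (doc=2655) from the values printed in its tree. It assumes the usual ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the printed numbers. The function and constant names are illustrative only, and the boost of 1.0 for "artificial" is an assumption, since its tree shows no boost line. Python floats are double precision while the listing was produced in single precision, so agreement is only to about six decimal places.

```python
import math

# Values copied from the explain tree for doc=2655.
MAX_DOCS = 44218
QUERY_NORM = 0.020563072
FIELD_NORM = 0.125   # fieldNorm(doc=2655), taken as given
COORD = 5 / 25       # 5 of the 25 query terms matched

# (term, boost, docFreq, termFreq); boost 1.0 for "artificial" is assumed
# because its tree lists no boost factor.
TERMS = [
    ("artificial", 1.0, 278, 1.0),
    ("documents", 2.0383399, 1949, 1.0),
    ("automatic", 2.5696716, 665, 1.0),
    ("office", 3.28123, 157, 1.0),
    ("classification", 3.2907097, 2218, 1.0),
]


def idf(doc_freq, max_docs):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, term_freq):
    """queryWeight * fieldWeight for a single matching term."""
    term_idf = idf(doc_freq, MAX_DOCS)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(term_freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


def document_score():
    """Sum of the per-term scores, scaled by the coordination factor."""
    return COORD * sum(term_score(b, df, tf) for _, b, df, tf in TERMS)


if __name__ == "__main__":
    print(round(document_score(), 4))
```

The coordination factor explains why document 4 ranks below document 3 despite a larger sum of term weights: it matches only 4 of the 25 query terms, so its sum is scaled by 0.16 rather than 0.32.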