Document (#17138)

Author
Spitz, A.L.
Wilcox, L.D.
Title
Classification techniques applied to the recognition of office documents
Source
Wissensorganisation im Wandel: Dezimalklassifikation - Thesaurusfragen - Warenklassifikation. Proc. 11. Jahrestagung der Gesellschaft für Klassifikation, Aachen, 29.6.-1.7.1987. Hrsg.: H.-J. Hermes u. J. Hölzl
Imprint
Frankfurt : Indeks
Year
1988
Pages
pp. 115-122
Series
Studien zur Klassifikation; vol. 18
Abstract
In the process of developing a document recognition network service, techniques were developed for the segmentation and classification of text, line-drawing graphics, and pictures.
Theme
Document management

Similar documents (content)

  1. Peng, F.; Huang, X.: Machine learning for Asian language text classification (2007) 0.29
    0.2881276 = sum of:
      0.2881276 = product of:
        0.9124041 = sum of:
          0.03806979 = weight(abstract_txt:were in 831) [ClassicSimilarity], result of:
            0.03806979 = score(doc=831,freq=4.0), product of:
              0.08298448 = queryWeight, product of:
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.022611182 = queryNorm
              0.45875797 = fieldWeight in 831, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0625 = fieldNorm(doc=831)
          0.056938395 = weight(abstract_txt:text in 831) [ClassicSimilarity], result of:
            0.056938395 = score(doc=831,freq=5.0), product of:
              0.100749604 = queryWeight, product of:
                1.1018519 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022611182 = queryNorm
              0.5651476 = fieldWeight in 831, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=831)
          0.060041454 = weight(abstract_txt:applied in 831) [ClassicSimilarity], result of:
            0.060041454 = score(doc=831,freq=2.0), product of:
              0.14166221 = queryWeight, product of:
                1.3065577 = boost
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.022611182 = queryNorm
              0.42383537 = fieldWeight in 831, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.0625 = fieldNorm(doc=831)
          0.50913423 = weight(abstract_txt:segmentation in 831) [ClassicSimilarity], result of:
            0.50913423 = score(doc=831,freq=7.0), product of:
              0.3879884 = queryWeight, product of:
                2.162275 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.022611182 = queryNorm
              1.3122408 = fieldWeight in 831, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=831)
          0.14698638 = weight(abstract_txt:classification in 831) [ClassicSimilarity], result of:
            0.14698638 = score(doc=831,freq=9.0), product of:
              0.19637088 = queryWeight, product of:
                2.1754801 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022611182 = queryNorm
              0.7485141 = fieldWeight in 831, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=831)
          0.1012339 = weight(abstract_txt:techniques in 831) [ClassicSimilarity], result of:
            0.1012339 = score(doc=831,freq=2.0), product of:
              0.25284082 = queryWeight, product of:
                2.46854 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.022611182 = queryNorm
              0.40038592 = fieldWeight in 831, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=831)
        0.31578946 = coord(6/19)
    
  2. Tseng, Y.-H.; Lin, C.-J.; Lin, Y.-I.: Text mining techniques for patent analysis (2007) 0.26
    0.256897 = sum of:
      0.256897 = product of:
        0.61013037 = sum of:
          0.036011003 = weight(abstract_txt:text in 935) [ClassicSimilarity], result of:
            0.036011003 = score(doc=935,freq=2.0), product of:
              0.100749604 = queryWeight, product of:
                1.1018519 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022611182 = queryNorm
              0.3574307 = fieldWeight in 935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.03620221 = weight(abstract_txt:process in 935) [ClassicSimilarity], result of:
            0.03620221 = score(doc=935,freq=2.0), product of:
              0.10110593 = queryWeight, product of:
                1.1037987 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.022611182 = queryNorm
              0.3580622 = fieldWeight in 935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.02695462 = weight(abstract_txt:documents in 935) [ClassicSimilarity], result of:
            0.02695462 = score(doc=935,freq=1.0), product of:
              0.10464505 = queryWeight, product of:
                1.1229513 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.022611182 = queryNorm
              0.2575814 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.030457443 = weight(abstract_txt:document in 935) [ClassicSimilarity], result of:
            0.030457443 = score(doc=935,freq=1.0), product of:
              0.11352517 = queryWeight, product of:
                1.1696277 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.022611182 = queryNorm
              0.26828802 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.060041454 = weight(abstract_txt:applied in 935) [ClassicSimilarity], result of:
            0.060041454 = score(doc=935,freq=2.0), product of:
              0.14166221 = queryWeight, product of:
                1.3065577 = boost
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.022611182 = queryNorm
              0.42383537 = fieldWeight in 935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.19243465 = weight(abstract_txt:segmentation in 935) [ClassicSimilarity], result of:
            0.19243465 = score(doc=935,freq=1.0), product of:
              0.3879884 = queryWeight, product of:
                2.162275 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.022611182 = queryNorm
              0.49598044 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.08486262 = weight(abstract_txt:classification in 935) [ClassicSimilarity], result of:
            0.08486262 = score(doc=935,freq=3.0), product of:
              0.19637088 = queryWeight, product of:
                2.1754801 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022611182 = queryNorm
              0.4321548 = fieldWeight in 935, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.14316636 = weight(abstract_txt:techniques in 935) [ClassicSimilarity], result of:
            0.14316636 = score(doc=935,freq=4.0), product of:
              0.25284082 = queryWeight, product of:
                2.46854 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.022611182 = queryNorm
              0.5662312 = fieldWeight in 935, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
        0.42105263 = coord(8/19)
    
  3. Steinmetz, R.: Data compression in multimedia computing : principles and techniques (1994) 0.22
    0.2169338 = sum of:
      0.2169338 = product of:
        0.58882034 = sum of:
          0.025463622 = weight(abstract_txt:text in 8182) [ClassicSimilarity], result of:
            0.025463622 = score(doc=8182,freq=1.0), product of:
              0.100749604 = queryWeight, product of:
                1.1018519 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022611182 = queryNorm
              0.25274166 = fieldWeight in 8182, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=8182)
          0.02559883 = weight(abstract_txt:process in 8182) [ClassicSimilarity], result of:
            0.02559883 = score(doc=8182,freq=1.0), product of:
              0.10110593 = queryWeight, product of:
                1.1037987 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.022611182 = queryNorm
              0.25318822 = fieldWeight in 8182, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.0625 = fieldNorm(doc=8182)
          0.028443014 = weight(abstract_txt:developed in 8182) [ClassicSimilarity], result of:
            0.028443014 = score(doc=8182,freq=1.0), product of:
              0.10846267 = queryWeight, product of:
                1.1432513 = boost
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.022611182 = queryNorm
              0.26223782 = fieldWeight in 8182, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.0625 = fieldNorm(doc=8182)
          0.03806058 = weight(abstract_txt:network in 8182) [ClassicSimilarity], result of:
            0.03806058 = score(doc=8182,freq=1.0), product of:
              0.1317084 = queryWeight, product of:
                1.2598194 = boost
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.022611182 = queryNorm
              0.2889761 = fieldWeight in 8182, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.0625 = fieldNorm(doc=8182)
          0.1872274 = weight(abstract_txt:graphics in 8182) [ClassicSimilarity], result of:
            0.1872274 = score(doc=8182,freq=2.0), product of:
              0.3023659 = queryWeight, product of:
                1.9088331 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.022611182 = queryNorm
              0.61920804 = fieldWeight in 8182, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.0625 = fieldNorm(doc=8182)
          0.16004118 = weight(abstract_txt:pictures in 8182) [ClassicSimilarity], result of:
            0.16004118 = score(doc=8182,freq=1.0), product of:
              0.34312397 = queryWeight, product of:
                2.03342 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.022611182 = queryNorm
              0.4664238 = fieldWeight in 8182, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=8182)
          0.1239857 = weight(abstract_txt:techniques in 8182) [ClassicSimilarity], result of:
            0.1239857 = score(doc=8182,freq=3.0), product of:
              0.25284082 = queryWeight, product of:
                2.46854 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.022611182 = queryNorm
              0.4903706 = fieldWeight in 8182, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=8182)
        0.36842105 = coord(7/19)
    
  4. Multilingual information management : current levels and future abilities. A report Commissioned by the US National Science Foundation and also delivered to the European Commission's Language Engineering Office and the US Defense Advanced Research Projects Agency, April 1999 (1999) 0.21
    0.21101442 = sum of:
      0.21101442 = product of:
        0.5727534 = sum of:
          0.016655535 = weight(abstract_txt:were in 6068) [ClassicSimilarity], result of:
            0.016655535 = score(doc=6068,freq=1.0), product of:
              0.08298448 = queryWeight, product of:
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.022611182 = queryNorm
              0.20070662 = fieldWeight in 6068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.031509627 = weight(abstract_txt:text in 6068) [ClassicSimilarity], result of:
            0.031509627 = score(doc=6068,freq=2.0), product of:
              0.100749604 = queryWeight, product of:
                1.1018519 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022611182 = queryNorm
              0.31275186 = fieldWeight in 6068, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.022398977 = weight(abstract_txt:process in 6068) [ClassicSimilarity], result of:
            0.022398977 = score(doc=6068,freq=1.0), product of:
              0.10110593 = queryWeight, product of:
                1.1037987 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.022611182 = queryNorm
              0.22153969 = fieldWeight in 6068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.035196435 = weight(abstract_txt:developed in 6068) [ClassicSimilarity], result of:
            0.035196435 = score(doc=6068,freq=2.0), product of:
              0.10846267 = queryWeight, product of:
                1.1432513 = boost
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.022611182 = queryNorm
              0.32450274 = fieldWeight in 6068, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.045909036 = weight(abstract_txt:developing in 6068) [ClassicSimilarity], result of:
            0.045909036 = score(doc=6068,freq=1.0), product of:
              0.16313855 = queryWeight, product of:
                1.4021028 = boost
                5.145807 = idf(docFreq=699, maxDocs=44218)
                0.022611182 = queryNorm
              0.28141132 = fieldWeight in 6068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.145807 = idf(docFreq=699, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.15342449 = weight(abstract_txt:techniques in 6068) [ClassicSimilarity], result of:
            0.15342449 = score(doc=6068,freq=6.0), product of:
              0.25284082 = queryWeight, product of:
                2.46854 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.022611182 = queryNorm
              0.6068027 = fieldWeight in 6068, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
          0.2676593 = weight(abstract_txt:recognition in 6068) [ClassicSimilarity], result of:
            0.2676593 = score(doc=6068,freq=3.0), product of:
              0.46165296 = queryWeight, product of:
                3.3356032 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.022611182 = queryNorm
              0.57978463 = fieldWeight in 6068, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6068)
        0.36842105 = coord(7/19)
    
  5. Saeed, K.; Dardzinska, A.: Natural language processing : word recognition without segmentation (2001) 0.19
    0.18937743 = sum of:
      0.18937743 = product of:
        0.89954275 = sum of:
          0.038398243 = weight(abstract_txt:process in 7707) [ClassicSimilarity], result of:
            0.038398243 = score(doc=7707,freq=1.0), product of:
              0.10110593 = queryWeight, product of:
                1.1037987 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.022611182 = queryNorm
              0.37978232 = fieldWeight in 7707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.09375 = fieldNorm(doc=7707)
          0.04266452 = weight(abstract_txt:developed in 7707) [ClassicSimilarity], result of:
            0.04266452 = score(doc=7707,freq=1.0), product of:
              0.10846267 = queryWeight, product of:
                1.1432513 = boost
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.022611182 = queryNorm
              0.39335674 = fieldWeight in 7707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.09375 = fieldNorm(doc=7707)
          0.28865197 = weight(abstract_txt:segmentation in 7707) [ClassicSimilarity], result of:
            0.28865197 = score(doc=7707,freq=1.0), product of:
              0.3879884 = queryWeight, product of:
                2.162275 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.022611182 = queryNorm
              0.74397063 = fieldWeight in 7707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.09375 = fieldNorm(doc=7707)
          0.529828 = weight(abstract_txt:recognition in 7707) [ClassicSimilarity], result of:
            0.529828 = score(doc=7707,freq=4.0), product of:
              0.46165296 = queryWeight, product of:
                3.3356032 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.022611182 = queryNorm
              1.147676 = fieldWeight in 7707, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.09375 = fieldNorm(doc=7707)
        0.21052632 = coord(4/19)
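
The score breakdowns above follow the pattern printed in each tree: a term's score is queryWeight (boost x idf x queryNorm) times fieldWeight (tf x idf x fieldNorm), and a hit's final score is the sum of its term scores times coord (matched terms / 19 query terms). The sketch below recomputes those figures from the listed inputs. It is a minimal illustration, not the search engine's own code: the function names are hypothetical, and the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are assumed because they reproduce the printed values. Small differences in the last digits are expected, since the listing appears to use single-precision arithmetic.

```python
import math

QUERY_TERMS = 19  # denominator of coord(n/19) in the listing above


def classic_idf(doc_freq, max_docs):
    # Assumed form: 1 + ln(maxDocs / (docFreq + 1)); matches the printed idf values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # Hypothetical helper: queryWeight * fieldWeight for one matched term.
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf assumed to be sqrt(freq)
    return query_weight * field_weight


def final_score(term_scores, query_terms=QUERY_TERMS):
    # Sum of the term scores, scaled by the fraction of query terms that matched.
    coord = len(term_scores) / query_terms
    return sum(term_scores) * coord


# Inputs copied from the "segmentation in 831" entry.
segmentation_831 = classic_term_score(
    freq=7.0, doc_freq=42, max_docs=44218,
    query_norm=0.022611182, field_norm=0.0625, boost=2.162275,
)

# Per-term scores copied from the five trees above, keyed by document id.
term_scores_by_doc = {
    831: [0.03806979, 0.056938395, 0.060041454, 0.50913423, 0.14698638, 0.1012339],
    935: [0.036011003, 0.03620221, 0.02695462, 0.030457443,
          0.060041454, 0.19243465, 0.08486262, 0.14316636],
    8182: [0.025463622, 0.02559883, 0.028443014, 0.03806058,
           0.1872274, 0.16004118, 0.1239857],
    6068: [0.016655535, 0.031509627, 0.022398977, 0.035196435,
           0.045909036, 0.15342449, 0.2676593],
    7707: [0.038398243, 0.04266452, 0.28865197, 0.529828],
}

final_scores = {doc: final_score(ts) for doc, ts in term_scores_by_doc.items()}
ranking = sorted(final_scores, key=final_scores.get, reverse=True)

if __name__ == "__main__":
    print(f"segmentation in 831: {segmentation_831:.6f}")
    for doc in ranking:
        print(f"doc {doc}: {final_scores[doc]:.6f}")
```

The coord factor explains why document 7707 ranks last despite a large term sum (0.8995): only 4 of the 19 query terms match, so its sum is scaled by 4/19, whereas document 935 matches 8 terms and keeps a larger share of a smaller sum.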