Search (66 results, page 1 of 4)

  • × theme_ss:"Automatisches Klassifizieren"
  1. Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.09
    0.08985087 = product of:
      0.2156421 = sum of:
        0.053954795 = weight(_text_:23 in 1046) [ClassicSimilarity], result of:
          0.053954795 = score(doc=1046,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.47518367 = fieldWeight in 1046, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.09375 = fieldNorm(doc=1046)
        0.053954795 = weight(_text_:23 in 1046) [ClassicSimilarity], result of:
          0.053954795 = score(doc=1046,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.47518367 = fieldWeight in 1046, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.09375 = fieldNorm(doc=1046)
        0.053954795 = weight(_text_:23 in 1046) [ClassicSimilarity], result of:
          0.053954795 = score(doc=1046,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.47518367 = fieldWeight in 1046, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.09375 = fieldNorm(doc=1046)
        0.036608573 = weight(_text_:internet in 1046) [ClassicSimilarity], result of:
          0.036608573 = score(doc=1046,freq=2.0), product of:
            0.0935287 = queryWeight, product of:
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.03168059 = queryNorm
            0.3914154 = fieldWeight in 1046, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.09375 = fieldNorm(doc=1046)
        0.017169131 = product of:
          0.05150739 = sum of:
            0.05150739 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
              0.05150739 = score(doc=1046,freq=2.0), product of:
                0.11094003 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03168059 = queryNorm
                0.46428138 = fieldWeight in 1046, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1046)
          0.33333334 = coord(1/3)
      0.41666666 = coord(5/12)
    
    Date
    11. 2.1997 20:11:23
    5. 5.2003 14:17:22
    Footnote
    Teil eines Themenheftes: OCLC and the Internet: An Historical Overview of Research Activities, 1990-1999 - Part II
  2. Shafer, K.E.: Automatic Subject Assignment via the Scorpion System (2001) 0.07
    0.07121225 = product of:
      0.21363673 = sum of:
        0.053954795 = weight(_text_:23 in 1043) [ClassicSimilarity], result of:
          0.053954795 = score(doc=1043,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.47518367 = fieldWeight in 1043, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.09375 = fieldNorm(doc=1043)
        0.053954795 = weight(_text_:23 in 1043) [ClassicSimilarity], result of:
          0.053954795 = score(doc=1043,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.47518367 = fieldWeight in 1043, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.09375 = fieldNorm(doc=1043)
        0.053954795 = weight(_text_:23 in 1043) [ClassicSimilarity], result of:
          0.053954795 = score(doc=1043,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.47518367 = fieldWeight in 1043, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.09375 = fieldNorm(doc=1043)
        0.05177234 = weight(_text_:internet in 1043) [ClassicSimilarity], result of:
          0.05177234 = score(doc=1043,freq=4.0), product of:
            0.0935287 = queryWeight, product of:
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.03168059 = queryNorm
            0.55354494 = fieldWeight in 1043, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.09375 = fieldNorm(doc=1043)
      0.33333334 = coord(4/12)
    
    Date
    11. 2.1997 20:11:23
    Footnote
    Teil eines Themenheftes: OCLC and the Internet: An Historical Overview of Research Activities, 1990-1999 - Part I
    Theme
    Internet
  3. Shafer, K.E.: Evaluating Scorpion Results (2001) 0.06
    0.06257564 = product of:
      0.18772691 = sum of:
        0.04496233 = weight(_text_:23 in 4085) [ClassicSimilarity], result of:
          0.04496233 = score(doc=4085,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3959864 = fieldWeight in 4085, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.078125 = fieldNorm(doc=4085)
        0.04496233 = weight(_text_:23 in 4085) [ClassicSimilarity], result of:
          0.04496233 = score(doc=4085,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3959864 = fieldWeight in 4085, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.078125 = fieldNorm(doc=4085)
        0.04496233 = weight(_text_:23 in 4085) [ClassicSimilarity], result of:
          0.04496233 = score(doc=4085,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3959864 = fieldWeight in 4085, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.078125 = fieldNorm(doc=4085)
        0.052839927 = weight(_text_:internet in 4085) [ClassicSimilarity], result of:
          0.052839927 = score(doc=4085,freq=6.0), product of:
            0.0935287 = queryWeight, product of:
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.03168059 = queryNorm
            0.56495947 = fieldWeight in 4085, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.078125 = fieldNorm(doc=4085)
      0.33333334 = coord(4/12)
    
    Abstract
    Using DDC for automatic indexing and classifying of Internet resources
    Date
    11. 2.1997 20:11:23
    Footnote
    Teil eines Themenheftes: OCLC and the Internet: An Historical Overview of Research Activities, 1990-1999 - Part II
    Theme
    Internet
  4. Subramanian, S.; Shafer, K.E.: Clustering (1998) 0.06
    0.055131383 = product of:
      0.16539414 = sum of:
        0.04496233 = weight(_text_:23 in 1103) [ClassicSimilarity], result of:
          0.04496233 = score(doc=1103,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3959864 = fieldWeight in 1103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.078125 = fieldNorm(doc=1103)
        0.04496233 = weight(_text_:23 in 1103) [ClassicSimilarity], result of:
          0.04496233 = score(doc=1103,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3959864 = fieldWeight in 1103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.078125 = fieldNorm(doc=1103)
        0.04496233 = weight(_text_:23 in 1103) [ClassicSimilarity], result of:
          0.04496233 = score(doc=1103,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3959864 = fieldWeight in 1103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.078125 = fieldNorm(doc=1103)
        0.030507145 = weight(_text_:internet in 1103) [ClassicSimilarity], result of:
          0.030507145 = score(doc=1103,freq=2.0), product of:
            0.0935287 = queryWeight, product of:
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.03168059 = queryNorm
            0.3261795 = fieldWeight in 1103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.078125 = fieldNorm(doc=1103)
      0.33333334 = coord(4/12)
    
    Date
    11. 2.1997 20:11:23
    Theme
    Internet
  5. Shafer, K.E.: Evaluating Scorpion results (1998) 0.06
    0.055131383 = product of:
      0.16539414 = sum of:
        0.04496233 = weight(_text_:23 in 1569) [ClassicSimilarity], result of:
          0.04496233 = score(doc=1569,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3959864 = fieldWeight in 1569, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.078125 = fieldNorm(doc=1569)
        0.04496233 = weight(_text_:23 in 1569) [ClassicSimilarity], result of:
          0.04496233 = score(doc=1569,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3959864 = fieldWeight in 1569, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.078125 = fieldNorm(doc=1569)
        0.04496233 = weight(_text_:23 in 1569) [ClassicSimilarity], result of:
          0.04496233 = score(doc=1569,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3959864 = fieldWeight in 1569, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.078125 = fieldNorm(doc=1569)
        0.030507145 = weight(_text_:internet in 1569) [ClassicSimilarity], result of:
          0.030507145 = score(doc=1569,freq=2.0), product of:
            0.0935287 = queryWeight, product of:
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.03168059 = queryNorm
            0.3261795 = fieldWeight in 1569, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.078125 = fieldNorm(doc=1569)
      0.33333334 = coord(4/12)
    
    Date
    11. 2.1997 20:11:23
    Theme
    Internet
  6. Montesi, M.; Navarrete, T.: Classifying web genres in context : A case study documenting the web genres used by a software engineer (2008) 0.04
    0.04425323 = product of:
      0.13275969 = sum of:
        0.0381518 = weight(_text_:23 in 2100) [ClassicSimilarity], result of:
          0.0381518 = score(doc=2100,freq=4.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3360056 = fieldWeight in 2100, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=2100)
        0.0381518 = weight(_text_:23 in 2100) [ClassicSimilarity], result of:
          0.0381518 = score(doc=2100,freq=4.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3360056 = fieldWeight in 2100, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=2100)
        0.0381518 = weight(_text_:23 in 2100) [ClassicSimilarity], result of:
          0.0381518 = score(doc=2100,freq=4.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.3360056 = fieldWeight in 2100, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=2100)
        0.018304287 = weight(_text_:internet in 2100) [ClassicSimilarity], result of:
          0.018304287 = score(doc=2100,freq=2.0), product of:
            0.0935287 = queryWeight, product of:
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.03168059 = queryNorm
            0.1957077 = fieldWeight in 2100, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.046875 = fieldNorm(doc=2100)
      0.33333334 = coord(4/12)
    
    Abstract
    This case study analyzes the Internet-based resources that a software engineer uses in his daily work. Methodologically, we studied the web browser history of the participant, classifying all the web pages he had seen over a period of 12 days into web genres. We interviewed him before and after the analysis of the web browser history. In the first interview, he spoke about his general information behavior; in the second, he commented on each web genre, explaining why and how he used them. As a result, three approaches allow us to describe the set of 23 web genres obtained: (a) the purposes they serve for the participant; (b) the role they play in the various work and search phases; (c) and the way they are used in combination with each other. Further observations concern the way the participant assesses quality of web-based resources, and his information behavior as a software engineer.
    Date
    1. 8.2008 12:17:23
  7. Pfeffer, M.: Automatische Vergabe von RVK-Notationen mittels fallbasiertem Schließen (2009) 0.03
    0.02983892 = product of:
      0.08951676 = sum of:
        0.026977398 = weight(_text_:23 in 3051) [ClassicSimilarity], result of:
          0.026977398 = score(doc=3051,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 3051, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=3051)
        0.026977398 = weight(_text_:23 in 3051) [ClassicSimilarity], result of:
          0.026977398 = score(doc=3051,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 3051, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=3051)
        0.026977398 = weight(_text_:23 in 3051) [ClassicSimilarity], result of:
          0.026977398 = score(doc=3051,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 3051, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=3051)
        0.0085845655 = product of:
          0.025753696 = sum of:
            0.025753696 = weight(_text_:22 in 3051) [ClassicSimilarity], result of:
              0.025753696 = score(doc=3051,freq=2.0), product of:
                0.11094003 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03168059 = queryNorm
                0.23214069 = fieldWeight in 3051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3051)
          0.33333334 = coord(1/3)
      0.33333334 = coord(4/12)
    
    Date
    22. 8.2009 19:51:28
    23. 8.2009 9:46:44
  8. Zhu, W.Z.; Allen, R.B.: Document clustering using the LSI subspace signature model (2013) 0.03
    0.02983892 = product of:
      0.08951676 = sum of:
        0.026977398 = weight(_text_:23 in 690) [ClassicSimilarity], result of:
          0.026977398 = score(doc=690,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 690, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=690)
        0.026977398 = weight(_text_:23 in 690) [ClassicSimilarity], result of:
          0.026977398 = score(doc=690,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 690, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=690)
        0.026977398 = weight(_text_:23 in 690) [ClassicSimilarity], result of:
          0.026977398 = score(doc=690,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 690, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=690)
        0.0085845655 = product of:
          0.025753696 = sum of:
            0.025753696 = weight(_text_:22 in 690) [ClassicSimilarity], result of:
              0.025753696 = score(doc=690,freq=2.0), product of:
                0.11094003 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03168059 = queryNorm
                0.23214069 = fieldWeight in 690, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=690)
          0.33333334 = coord(1/3)
      0.33333334 = coord(4/12)
    
    Date
    23. 3.2013 13:22:36
  9. Illing, S.: Automatisiertes klinisches Codieren (2021) 0.03
    0.026977398 = product of:
      0.10790959 = sum of:
        0.035969865 = weight(_text_:23 in 419) [ClassicSimilarity], result of:
          0.035969865 = score(doc=419,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.31678912 = fieldWeight in 419, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0625 = fieldNorm(doc=419)
        0.035969865 = weight(_text_:23 in 419) [ClassicSimilarity], result of:
          0.035969865 = score(doc=419,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.31678912 = fieldWeight in 419, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0625 = fieldNorm(doc=419)
        0.035969865 = weight(_text_:23 in 419) [ClassicSimilarity], result of:
          0.035969865 = score(doc=419,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.31678912 = fieldWeight in 419, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0625 = fieldNorm(doc=419)
      0.25 = coord(3/12)
    
    Date
    10.11.2021 19:11:23
  10. Savic, D.: Automatic classification of office documents : review of available methods and techniques (1995) 0.02
    0.023605222 = product of:
      0.09442089 = sum of:
        0.03147363 = weight(_text_:23 in 2219) [ClassicSimilarity], result of:
          0.03147363 = score(doc=2219,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.27719048 = fieldWeight in 2219, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2219)
        0.03147363 = weight(_text_:23 in 2219) [ClassicSimilarity], result of:
          0.03147363 = score(doc=2219,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.27719048 = fieldWeight in 2219, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2219)
        0.03147363 = weight(_text_:23 in 2219) [ClassicSimilarity], result of:
          0.03147363 = score(doc=2219,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.27719048 = fieldWeight in 2219, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2219)
      0.25 = coord(3/12)
    
    Date
    23. 7.1996 10:28:09
  11. Wille, J.: Automatisches Klassifizieren bibliographischer Beschreibungsdaten : Vorgehensweise und Ergebnisse (2006) 0.02
    0.023605222 = product of:
      0.09442089 = sum of:
        0.03147363 = weight(_text_:23 in 6090) [ClassicSimilarity], result of:
          0.03147363 = score(doc=6090,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.27719048 = fieldWeight in 6090, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6090)
        0.03147363 = weight(_text_:23 in 6090) [ClassicSimilarity], result of:
          0.03147363 = score(doc=6090,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.27719048 = fieldWeight in 6090, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6090)
        0.03147363 = weight(_text_:23 in 6090) [ClassicSimilarity], result of:
          0.03147363 = score(doc=6090,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.27719048 = fieldWeight in 6090, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6090)
      0.25 = coord(3/12)
    
    Date
    10. 9.2006 19:23:31
  12. Mukhopadhyay, S.; Peng, S.; Raje, R.; Palakal, M.; Mostafa, J.: Multi-agent information classification using dynamic acquaintance lists (2003) 0.02
    0.020233048 = product of:
      0.08093219 = sum of:
        0.026977398 = weight(_text_:23 in 1755) [ClassicSimilarity], result of:
          0.026977398 = score(doc=1755,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 1755, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=1755)
        0.026977398 = weight(_text_:23 in 1755) [ClassicSimilarity], result of:
          0.026977398 = score(doc=1755,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 1755, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=1755)
        0.026977398 = weight(_text_:23 in 1755) [ClassicSimilarity], result of:
          0.026977398 = score(doc=1755,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 1755, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=1755)
      0.25 = coord(3/12)
    
    Date
    17. 8.2003 14:17:23
  13. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.02
    0.020233048 = product of:
      0.08093219 = sum of:
        0.026977398 = weight(_text_:23 in 1808) [ClassicSimilarity], result of:
          0.026977398 = score(doc=1808,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 1808, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=1808)
        0.026977398 = weight(_text_:23 in 1808) [ClassicSimilarity], result of:
          0.026977398 = score(doc=1808,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 1808, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=1808)
        0.026977398 = weight(_text_:23 in 1808) [ClassicSimilarity], result of:
          0.026977398 = score(doc=1808,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 1808, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=1808)
      0.25 = coord(3/12)
    
    Date
    20. 8.2003 20:23:39
  14. Gauch, S.; Chandramouli, A.; Ranganathan, S.: Training a hierarchical classifier using inter document relationships (2009) 0.02
    0.020233048 = product of:
      0.08093219 = sum of:
        0.026977398 = weight(_text_:23 in 2697) [ClassicSimilarity], result of:
          0.026977398 = score(doc=2697,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 2697, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=2697)
        0.026977398 = weight(_text_:23 in 2697) [ClassicSimilarity], result of:
          0.026977398 = score(doc=2697,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 2697, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=2697)
        0.026977398 = weight(_text_:23 in 2697) [ClassicSimilarity], result of:
          0.026977398 = score(doc=2697,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 2697, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=2697)
      0.25 = coord(3/12)
    
    Date
    23. 2.2009 18:34:38
  15. Ko, Y.: ¬A new term-weighting scheme for text classification using the odds of positive and negative class probabilities (2015) 0.02
    0.020233048 = product of:
      0.08093219 = sum of:
        0.026977398 = weight(_text_:23 in 2339) [ClassicSimilarity], result of:
          0.026977398 = score(doc=2339,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 2339, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=2339)
        0.026977398 = weight(_text_:23 in 2339) [ClassicSimilarity], result of:
          0.026977398 = score(doc=2339,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 2339, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=2339)
        0.026977398 = weight(_text_:23 in 2339) [ClassicSimilarity], result of:
          0.026977398 = score(doc=2339,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 2339, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=2339)
      0.25 = coord(3/12)
    
    Date
    24.11.2015 14:23:26
  16. Teich, E.; Degaetano-Ortlieb, S.; Fankhauser, P.; Kermes, H.; Lapshinova-Koltunski, E.: ¬The linguistic construal of disciplinarity : a data-mining approach using register features (2016) 0.02
    0.020233048 = product of:
      0.08093219 = sum of:
        0.026977398 = weight(_text_:23 in 3015) [ClassicSimilarity], result of:
          0.026977398 = score(doc=3015,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 3015, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=3015)
        0.026977398 = weight(_text_:23 in 3015) [ClassicSimilarity], result of:
          0.026977398 = score(doc=3015,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 3015, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=3015)
        0.026977398 = weight(_text_:23 in 3015) [ClassicSimilarity], result of:
          0.026977398 = score(doc=3015,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.23759183 = fieldWeight in 3015, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.046875 = fieldNorm(doc=3015)
      0.25 = coord(3/12)
    
    Date
    12. 6.2016 20:23:08
  17. Schek, M.: Automatische Klassifizierung und Visualisierung im Archiv der Süddeutschen Zeitung (2005) 0.02
    0.019295983 = product of:
      0.057887945 = sum of:
        0.015736815 = weight(_text_:23 in 4884) [ClassicSimilarity], result of:
          0.015736815 = score(doc=4884,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.13859524 = fieldWeight in 4884, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.02734375 = fieldNorm(doc=4884)
        0.015736815 = weight(_text_:23 in 4884) [ClassicSimilarity], result of:
          0.015736815 = score(doc=4884,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.13859524 = fieldWeight in 4884, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.02734375 = fieldNorm(doc=4884)
        0.015736815 = weight(_text_:23 in 4884) [ClassicSimilarity], result of:
          0.015736815 = score(doc=4884,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.13859524 = fieldWeight in 4884, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.02734375 = fieldNorm(doc=4884)
        0.010677501 = weight(_text_:internet in 4884) [ClassicSimilarity], result of:
          0.010677501 = score(doc=4884,freq=2.0), product of:
            0.0935287 = queryWeight, product of:
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.03168059 = queryNorm
            0.11416282 = fieldWeight in 4884, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.02734375 = fieldNorm(doc=4884)
      0.33333334 = coord(4/12)
    
    Abstract
    Die Süddeutsche Zeitung (SZ) verfügt seit ihrer Gründung 1945 über ein Pressearchiv, das die Texte der eigenen Redakteure und zahlreicher nationaler und internationaler Publikationen dokumentiert und auf Anfrage für Recherchezwecke bereitstellt. Die Einführung der EDV begann Anfang der 90er Jahre mit der digitalen Speicherung zunächst der SZ-Daten. Die technische Weiterentwicklung ab Mitte der 90er Jahre diente zwei Zielen: (1) dem vollständigen Wechsel von der Papierablage zur digitalen Speicherung und (2) dem Wandel von einer verlagsinternen Dokumentations- und Auskunftsstelle zu einem auch auf dem Markt vertretenen Informationsdienstleister. Um die dabei entstehenden Aufwände zu verteilen und gleichzeitig Synergieeffekte zwischen inhaltlich verwandten Archiven zu erschließen, gründeten der Süddeutsche Verlag und der Bayerische Rundfunk im Jahr 1998 die Dokumentations- und Informationszentrum (DIZ) München GmbH, in der die Pressearchive der beiden Gesellschafter und das Bildarchiv des Süddeutschen Verlags zusammengeführt wurden. Die gemeinsam entwickelte Pressedatenbank ermöglichte das standortübergreifende Lektorat, die browserbasierte Recherche für Redakteure und externe Kunden im Intraund Internet und die kundenspezifischen Content Feeds für Verlage, Rundfunkanstalten und Portale. Die DIZPressedatenbank enthält zur Zeit 6,9 Millionen Artikel, die jeweils als HTML oder PDF abrufbar sind. Täglich kommen ca. 3.500 Artikel hinzu, von denen ca. 1.000 lektoriert werden. Das Lektorat erfolgt im DIZ nicht durch die Vergabe von Schlagwörtern am Dokument, sondern durch die Verlinkung der Artikel mit "virtuellen Mappen", den Dossiers. Diese stellen die elektronische Repräsentation einer Papiermappe dar und sind das zentrale Erschließungsobjekt. Im Gegensatz zu statischen Klassifikationssystemen ist die Dossierstruktur dynamisch und aufkommensabhängig, d.h. neue Dossiers werden hauptsächlich anhand der aktuellen Berichterstattung erstellt. Insgesamt enthält die DIZ-Pressedatenbank ca. 90.000 Dossiers, davon sind 68.000 Sachthemen (Topics), Personen und Institutionen. Die Dossiers sind untereinander zum "DIZ-Wissensnetz" verlinkt.
    Date
    27. 1.2006 13:23:26
  18. Reiner, U.: VZG-Projekt Colibri : Bewertung von automatisch DDC-klassifizierten Titeldatensätzen der Deutschen Nationalbibliothek (DNB) (2009) 0.02
    0.016860874 = product of:
      0.0674435 = sum of:
        0.022481166 = weight(_text_:23 in 2675) [ClassicSimilarity], result of:
          0.022481166 = score(doc=2675,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.1979932 = fieldWeight in 2675, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2675)
        0.022481166 = weight(_text_:23 in 2675) [ClassicSimilarity], result of:
          0.022481166 = score(doc=2675,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.1979932 = fieldWeight in 2675, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2675)
        0.022481166 = weight(_text_:23 in 2675) [ClassicSimilarity], result of:
          0.022481166 = score(doc=2675,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.1979932 = fieldWeight in 2675, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2675)
      0.25 = coord(3/12)
    
    Date
    23. 2.2009 16:52:13
  19. HaCohen-Kerner, Y.; Beck, H.; Yehudai, E.; Rosenstein, M.; Mughaz, D.: Cuisine : classification using stylistic feature sets and/or name-based feature sets (2010) 0.02
    0.016860874 = product of:
      0.0674435 = sum of:
        0.022481166 = weight(_text_:23 in 3706) [ClassicSimilarity], result of:
          0.022481166 = score(doc=3706,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.1979932 = fieldWeight in 3706, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3706)
        0.022481166 = weight(_text_:23 in 3706) [ClassicSimilarity], result of:
          0.022481166 = score(doc=3706,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.1979932 = fieldWeight in 3706, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3706)
        0.022481166 = weight(_text_:23 in 3706) [ClassicSimilarity], result of:
          0.022481166 = score(doc=3706,freq=2.0), product of:
            0.113545135 = queryWeight, product of:
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.03168059 = queryNorm
            0.1979932 = fieldWeight in 3706, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5840597 = idf(docFreq=3336, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3706)
      0.25 = coord(3/12)
    
    Date
    23. 7.2010 13:16:35
  20. Hoffmann, R.: Entwicklung einer benutzerunterstützten automatisierten Klassifikation von Web - Dokumenten : Untersuchung gegenwärtiger Methoden zur automatisierten Dokumentklassifikation und Implementierung eines Prototyps zum verbesserten Information Retrieval für das xFIND System (2002) 0.01
    0.012972681 = product of:
      0.07783608 = sum of:
        0.056700107 = weight(_text_:systeme in 4197) [ClassicSimilarity], result of:
          0.056700107 = score(doc=4197,freq=4.0), product of:
            0.16953078 = queryWeight, product of:
              5.3512506 = idf(docFreq=569, maxDocs=44218)
              0.03168059 = queryNorm
            0.33445317 = fieldWeight in 4197, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.3512506 = idf(docFreq=569, maxDocs=44218)
              0.03125 = fieldNorm(doc=4197)
        0.021135971 = weight(_text_:internet in 4197) [ClassicSimilarity], result of:
          0.021135971 = score(doc=4197,freq=6.0), product of:
            0.0935287 = queryWeight, product of:
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.03168059 = queryNorm
            0.22598378 = fieldWeight in 4197, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.9522398 = idf(docFreq=6276, maxDocs=44218)
              0.03125 = fieldNorm(doc=4197)
      0.16666667 = coord(2/12)
    
    Abstract
    Das unüberschaubare und permanent wachsende Angebot von Informationen im Internet ermöglicht es den Menschen nicht mehr, dieses inhaltlich zu erfassen oder gezielt nach Informationen zu suchen. Einen Lösungsweg zur verbesserten Informationsauffindung stellt hierbei die Kategorisierung bzw. Klassifikation der Informationen auf Basis ihres thematischen Inhaltes dar. Diese thematische Klassifikation kann sowohl anhand manueller (intellektueller) Methoden als auch durch automatisierte Verfahren erfolgen. Doch beide Ansätze für sich konnten die an sie gestellten Erwartungen bis zum heutigen Tag nur unzureichend erfüllen. Im Rahmen dieser Arbeit soll daher der naheliegende Ansatz, die beiden Methoden sinnvoll zu verknüpfen, untersucht werden. Im ersten Teil dieser Arbeit, dem Untersuchungsbereich, wird einleitend das Problem des Informationsüberangebots in unserer Gesellschaft erläutert und gezeigt, dass die Kategorisierung bzw. Klassifikation dieser Informationen speziell im Internet sinnvoll erscheint. Die prinzipiellen Möglichkeiten der Themenzuordnung von Dokumenten zur Verbesserung der Wissensverwaltung und Wissensauffindung werden beschrieben. Dabei werden unter anderem verschiedene Klassifikationsschemata, Topic Maps und semantische Netze vorgestellt. Schwerpunkt des Untersuchungsbereiches ist die Beschreibung automatisierter Methoden zur Themenzuordnung. Neben einem Überblick über die gebräuchlichsten Klassifikations-Algorithmen werden sowohl am Markt existierende Systeme sowie Forschungsansätze und frei verfügbare Module zur automatischen Klassifikation vorgestellt. Berücksichtigt werden auch Systeme, die zumindest teilweise den erwähnten Ansatz der Kombination von manuellen und automatischen Methoden unterstützen. Auch die in Zusammenhang mit der Klassifikation von Dokumenten im Internet auftretenden Probleme werden aufgezeigt. Die im Untersuchungsbereich gewonnenen Erkenntnisse fließen in die Entwicklung eines Moduls zur benutzerunterstützten, automatischen Dokumentklassifikation im Rahmen des xFIND Systems (extended Framework for Information Discovery) ein. Dieses an der technischen Universität Graz konzipierte Framework stellt die Basis für eine Vielzahl neuer Ideen zur Verbesserung des Information Retrieval dar. Der im Gestaltungsbereich entwickelte Lösungsansatz sieht zunächst die Verwendung bereits im System vorhandener, manuell klassifizierter Dokumente, Server oder Serverbereiche als Grundlage für die automatische Klassifikation vor. Nach erfolgter automatischer Klassifikation können in einem nächsten Schritt dann Autoren und Administratoren die Ergebnisse im Rahmen einer Benutzerunterstützung anpassen. Dabei kann das kollektive Benutzerverhalten durch die Möglichkeit eines Votings - mittels Zustimmung bzw. Ablehnung der Klassifikationsergebnisse - Einfluss finden. Das Wissen von Fachexperten und Benutzern trägt somit letztendlich zur Verbesserung der automatischen Klassifikation bei. Im Gestaltungsbereich werden die grundlegenden Konzepte, der Aufbau und die Funktionsweise des entwickelten Moduls beschrieben, sowie eine Reihe von Vorschlägen und Ideen zur Weiterentwicklung der benutzerunterstützten automatischen Dokumentklassifikation präsentiert.

Years

Languages

  • e 46
  • d 20

Types

  • a 46
  • el 17
  • r 4
  • x 4
  • m 2
  • s 1
  • More… Less…