Search (84 results, page 1 of 5)

  • theme_ss:"Automatisches Klassifizieren"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.07
    0.066568166 = sum of:
      0.05423162 = product of:
        0.21692649 = sum of:
          0.21692649 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
            0.21692649 = score(doc=562,freq=2.0), product of:
              0.3859778 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.04552693 = queryNorm
              0.56201804 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.25 = coord(1/4)
      0.012336541 = product of:
        0.037009623 = sum of:
          0.037009623 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
            0.037009623 = score(doc=562,freq=2.0), product of:
              0.15942755 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04552693 = queryNorm
              0.23214069 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.33333334 = coord(1/3)
    
    Content
    Cf.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
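The explain tree for result 1 can be recomputed by hand. The sketch below is only an illustration, not the search engine's own code: it assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), formulas that reproduce the tf and idf values printed above, and the function names `idf` and `term_score` are hypothetical. The engine works in 32-bit floats, so the last digits may differ slightly.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed idf formula; it matches the idf values printed in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in the explain tree.
    tf = math.sqrt(freq)                      # tf(freq=2.0) = 1.4142135
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm      # idf * queryNorm
    field_weight = tf * term_idf * field_norm # tf * idf * fieldNorm
    return query_weight * field_weight

# Values copied from the explain output of result 1 (doc 562).
QUERY_NORM = 0.04552693
MAX_DOCS = 44218

# Each clause is scaled by its coord factor: coord(1/4) and coord(1/3).
clause_3a = term_score(2.0, 24, MAX_DOCS, QUERY_NORM, 0.046875) * (1 / 4)
clause_22 = term_score(2.0, 3622, MAX_DOCS, QUERY_NORM, 0.046875) * (1 / 3)
total = clause_3a + clause_22

print(f"{clause_3a:.6f} + {clause_22:.6f} = {total:.6f}")
```

The rare term `3a` (docFreq=24) contributes most of the score even with the smaller coord factor, because idf enters both queryWeight and fieldWeight and therefore counts quadratically.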
  2. Bock, H.-H.: Automatische Klassifikation : theoretische und praktische Methoden zur Gruppierung und Strukturierung von Daten (Cluster-Analyse) (1974) 0.04
    0.04003016 = product of:
      0.08006032 = sum of:
        0.08006032 = product of:
          0.12009047 = sum of:
            0.049836867 = weight(_text_:m in 7693) [ClassicSimilarity], result of:
              0.049836867 = score(doc=7693,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.4398997 = fieldWeight in 7693, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.125 = fieldNorm(doc=7693)
            0.0702536 = weight(_text_:h in 7693) [ClassicSimilarity], result of:
              0.0702536 = score(doc=7693,freq=4.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.6211123 = fieldWeight in 7693, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.125 = fieldNorm(doc=7693)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Type
    m
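The nesting in result 2 (a sum of term weights, scaled by coord(2/3), then by coord(1/2)) can be read as a small expression tree. The following is a minimal sketch under that reading; the tuple encoding and the `evaluate` helper are hypothetical conveniences, and the leaf values are copied from the explain output for doc 7693.

```python
from math import prod

def evaluate(node):
    # A leaf is a number; an inner node is (operation, children).
    if isinstance(node, (int, float)):
        return float(node)
    op, children = node
    values = [evaluate(child) for child in children]
    return sum(values) if op == "sum" else prod(values)

# Explain tree of result 2, with the per-term weights as leaves.
tree = ("product", [
    ("sum", [
        ("product", [
            ("sum", [0.049836867, 0.0702536]),  # weights of terms "m" and "h"
            2 / 3,                              # coord(2/3)
        ]),
    ]),
    0.5,                                        # coord(1/2)
])

score = evaluate(tree)
print(f"{score:.6f}")
```

Every other result in the list has this same shape; only the leaf weights and the coord fractions change.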
  3. Bock, H.-H.: Datenanalyse zur Strukturierung und Ordnung von Information (1989) 0.02
    0.024637949 = product of:
      0.049275897 = sum of:
        0.049275897 = product of:
          0.07391384 = sum of:
            0.03073595 = weight(_text_:h in 141) [ClassicSimilarity], result of:
              0.03073595 = score(doc=141,freq=4.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.27173662 = fieldWeight in 141, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=141)
            0.043177895 = weight(_text_:22 in 141) [ClassicSimilarity], result of:
              0.043177895 = score(doc=141,freq=2.0), product of:
                0.15942755 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04552693 = queryNorm
                0.2708308 = fieldWeight in 141, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=141)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Pages
    S.1-22
  4. Pfeffer, M.: Automatische Vergabe von RVK-Notationen mittels fallbasiertem Schließen (2009) 0.02
    0.02114654 = product of:
      0.04229308 = sum of:
        0.04229308 = product of:
          0.063439615 = sum of:
            0.02642999 = weight(_text_:m in 3051) [ClassicSimilarity], result of:
              0.02642999 = score(doc=3051,freq=4.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.23329206 = fieldWeight in 3051, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3051)
            0.037009623 = weight(_text_:22 in 3051) [ClassicSimilarity], result of:
              0.037009623 = score(doc=3051,freq=2.0), product of:
                0.15942755 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04552693 = queryNorm
                0.23214069 = fieldWeight in 3051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3051)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    22. 8.2009 19:51:28
    Imprint
    Frankfurt, M. : Klostermann
  5. Egbert, J.; Biber, D.; Davies, M.: Developing a bottom-up, user-based method of web register classification (2015) 0.02
    0.01856615 = product of:
      0.0371323 = sum of:
        0.0371323 = product of:
          0.055698447 = sum of:
            0.018688826 = weight(_text_:m in 2158) [ClassicSimilarity], result of:
              0.018688826 = score(doc=2158,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.1649624 = fieldWeight in 2158, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2158)
            0.037009623 = weight(_text_:22 in 2158) [ClassicSimilarity], result of:
              0.037009623 = score(doc=2158,freq=2.0), product of:
                0.15942755 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04552693 = queryNorm
                0.23214069 = fieldWeight in 2158, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2158)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    4. 8.2015 19:22:04
  6. Chung, Y.-M.; Noh, Y.-H.: Developing a specialized directory system by automatically classifying Web documents (2003) 0.01
    0.012439209 = product of:
      0.024878418 = sum of:
        0.024878418 = product of:
          0.037317626 = sum of:
            0.018688826 = weight(_text_:m in 1566) [ClassicSimilarity], result of:
              0.018688826 = score(doc=1566,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.1649624 = fieldWeight in 1566, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1566)
            0.018628798 = weight(_text_:h in 1566) [ClassicSimilarity], result of:
              0.018628798 = score(doc=1566,freq=2.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.16469726 = fieldWeight in 1566, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1566)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  7. Prabowo, R.; Jackson, M.; Burden, P.; Knoell, H.-D.: Ontology-based automatic classification for the Web pages : design, implementation and evaluation (2002) 0.01
    0.012439209 = product of:
      0.024878418 = sum of:
        0.024878418 = product of:
          0.037317626 = sum of:
            0.018688826 = weight(_text_:m in 3383) [ClassicSimilarity], result of:
              0.018688826 = score(doc=3383,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.1649624 = fieldWeight in 3383, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3383)
            0.018628798 = weight(_text_:h in 3383) [ClassicSimilarity], result of:
              0.018628798 = score(doc=3383,freq=2.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.16469726 = fieldWeight in 3383, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3383)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  8. Wu, M.; Liu, Y.-H.; Brownlee, R.; Zhang, X.: Evaluating utility and automatic classification of subject metadata from Research Data Australia (2021) 0.01
    0.012439209 = product of:
      0.024878418 = sum of:
        0.024878418 = product of:
          0.037317626 = sum of:
            0.018688826 = weight(_text_:m in 453) [ClassicSimilarity], result of:
              0.018688826 = score(doc=453,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.1649624 = fieldWeight in 453, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.046875 = fieldNorm(doc=453)
            0.018628798 = weight(_text_:h in 453) [ClassicSimilarity], result of:
              0.018628798 = score(doc=453,freq=2.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.16469726 = fieldWeight in 453, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.046875 = fieldNorm(doc=453)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  9. Khoo, C.S.G.; Ng, K.; Ou, S.: An exploratory study of human clustering of Web pages (2003) 0.01
    0.0123774335 = product of:
      0.024754867 = sum of:
        0.024754867 = product of:
          0.0371323 = sum of:
            0.012459217 = weight(_text_:m in 2741) [ClassicSimilarity], result of:
              0.012459217 = score(doc=2741,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.10997493 = fieldWeight in 2741, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2741)
            0.024673082 = weight(_text_:22 in 2741) [ClassicSimilarity], result of:
              0.024673082 = score(doc=2741,freq=2.0), product of:
                0.15942755 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04552693 = queryNorm
                0.15476047 = fieldWeight in 2741, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2741)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    12. 9.2004 9:56:22
    Source
    Challenges in knowledge representation and organization for the 21st century: Integration of knowledge across boundaries. Proceedings of the 7th ISKO International Conference Granada, Spain, July 10-13, 2002. Ed.: M. López-Huertas
  10. Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.01
    0.012336541 = product of:
      0.024673082 = sum of:
        0.024673082 = product of:
          0.074019246 = sum of:
            0.074019246 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
              0.074019246 = score(doc=1046,freq=2.0), product of:
                0.15942755 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04552693 = queryNorm
                0.46428138 = fieldWeight in 1046, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1046)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    5. 5.2003 14:17:22
  11. HaCohen-Kerner, Y.; Beck, H.; Yehudai, E.; Rosenstein, M.; Mughaz, D.: Cuisine : classification using stylistic feature sets and/or name-based feature sets (2010) 0.01
    0.010366008 = product of:
      0.020732015 = sum of:
        0.020732015 = product of:
          0.031098021 = sum of:
            0.015574021 = weight(_text_:m in 3706) [ClassicSimilarity], result of:
              0.015574021 = score(doc=3706,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.13746867 = fieldWeight in 3706, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3706)
            0.015524 = weight(_text_:h in 3706) [ClassicSimilarity], result of:
              0.015524 = score(doc=3706,freq=2.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.13724773 = fieldWeight in 3706, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3706)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  12. Wang, H.; Hong, M.: Supervised Hebb rule based feature selection for text classification (2019) 0.01
    0.010366008 = product of:
      0.020732015 = sum of:
        0.020732015 = product of:
          0.031098021 = sum of:
            0.015574021 = weight(_text_:m in 5036) [ClassicSimilarity], result of:
              0.015574021 = score(doc=5036,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.13746867 = fieldWeight in 5036, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5036)
            0.015524 = weight(_text_:h in 5036) [ClassicSimilarity], result of:
              0.015524 = score(doc=5036,freq=2.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.13724773 = fieldWeight in 5036, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5036)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  13. Reiner, U.: Automatische DDC-Klassifizierung von bibliografischen Titeldatensätzen (2009) 0.01
    0.010280452 = product of:
      0.020560903 = sum of:
        0.020560903 = product of:
          0.06168271 = sum of:
            0.06168271 = weight(_text_:22 in 611) [ClassicSimilarity], result of:
              0.06168271 = score(doc=611,freq=2.0), product of:
                0.15942755 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04552693 = queryNorm
                0.38690117 = fieldWeight in 611, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=611)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    22. 8.2009 12:54:24
  14. HaCohen-Kerner, Y. et al.: Classification using various machine learning methods and combinations of key-phrases and visual features (2016) 0.01
    0.010280452 = product of:
      0.020560903 = sum of:
        0.020560903 = product of:
          0.06168271 = sum of:
            0.06168271 = weight(_text_:22 in 2748) [ClassicSimilarity], result of:
              0.06168271 = score(doc=2748,freq=2.0), product of:
                0.15942755 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04552693 = queryNorm
                0.38690117 = fieldWeight in 2748, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2748)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  15. Wu, M.; Fuller, M.; Wilkinson, R.: Using clustering and classification approaches in interactive retrieval (2001) 0.01
    0.010278329 = product of:
      0.020556659 = sum of:
        0.020556659 = product of:
          0.061669976 = sum of:
            0.061669976 = weight(_text_:m in 2666) [ClassicSimilarity], result of:
              0.061669976 = score(doc=2666,freq=4.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.5443481 = fieldWeight in 2666, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.109375 = fieldNorm(doc=2666)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  16. Groß, T.; Faden, M.: Automatische Indexierung elektronischer Dokumente an der Deutschen Zentralbibliothek für Wirtschaftswissenschaften : Bericht über die Jahrestagung der Internationalen Buchwissenschaftlichen Gesellschaft (2010) 0.01
    0.008292805 = product of:
      0.01658561 = sum of:
        0.01658561 = product of:
          0.024878416 = sum of:
            0.012459217 = weight(_text_:m in 4051) [ClassicSimilarity], result of:
              0.012459217 = score(doc=4051,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.10997493 = fieldWeight in 4051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4051)
            0.0124192 = weight(_text_:h in 4051) [ClassicSimilarity], result of:
              0.0124192 = score(doc=4051,freq=2.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.10979818 = fieldWeight in 4051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4051)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    Bibliotheksdienst. 44(2010) H.12, S.1120-1135
  17. Borko, H.: Research in computer based classification systems (1985) 0.01
    0.007256205 = product of:
      0.01451241 = sum of:
        0.01451241 = product of:
          0.021768615 = sum of:
            0.010901814 = weight(_text_:m in 3647) [ClassicSimilarity], result of:
              0.010901814 = score(doc=3647,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.09622806 = fieldWeight in 3647, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=3647)
            0.010866799 = weight(_text_:h in 3647) [ClassicSimilarity], result of:
              0.010866799 = score(doc=3647,freq=2.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.096073404 = fieldWeight in 3647, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=3647)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    The selection in this reader by R. M. Needham and K. Sparck Jones reports an early approach to automatic classification that was taken in England. The following selection reviews various approaches that were being pursued in the United States at about the same time. It then discusses a particular approach initiated in the early 1960s by Harold Borko, at that time Head of the Language Processing and Retrieval Research Staff at the System Development Corporation, Santa Monica, California and, since 1966, a member of the faculty at the Graduate School of Library and Information Science, University of California, Los Angeles. As was described earlier, there are two steps in automatic classification, the first being to identify pairs of terms that are similar by virtue of co-occurring as index terms in the same documents, and the second being to form equivalence classes of intersubstitutable terms. To compute similarities, Borko and his associates used a standard correlation formula; to derive classification categories, where Needham and Sparck Jones used clumping, the Borko team used the statistical technique of factor analysis. The fact that documents can be classified automatically, and in any number of ways, is worthy of passing notice. Worthy of serious attention would be a demonstration that a computer-based classification system was effective in the organization and retrieval of documents. One reason for the inclusion of the following selection in the reader is that it addresses the question of evaluation. To evaluate the effectiveness of their automatically derived classification, Borko and his team asked three questions. The first was: Is the classification reliable? In other words, could the categories derived from one sample of texts be used to classify other texts? Reliability was assessed by a case-study comparison of the classes derived from three different samples of abstracts. The not-so-surprising conclusion reached was that automatically derived classes were reliable only to the extent that the sample from which they were derived was representative of the total document collection. The second evaluation question asked whether the classification was reasonable, in the sense of adequately describing the content of the document collection. The answer was sought by comparing the automatically derived categories with categories in a related classification system that was manually constructed. Here the conclusion was that the automatic method yielded categories that fairly accurately reflected the major area of interest in the sample collection of texts; however, since there were only eleven such categories and they were quite broad, they could not be regarded as suitable for use in a university or any large general library. The third evaluation question asked whether automatic classification was accurate, in the sense of producing results similar to those obtainable by human classifiers. When using human classification as a criterion, automatic classification was found to be 50 percent accurate.
  18. Schek, M.: Automatische Klassifizierung und Visualisierung im Archiv der Süddeutschen Zeitung (2005) 0.01
    0.007256205 = product of:
      0.01451241 = sum of:
        0.01451241 = product of:
          0.021768615 = sum of:
            0.010901814 = weight(_text_:m in 4884) [ClassicSimilarity], result of:
              0.010901814 = score(doc=4884,freq=2.0), product of:
                0.11329143 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.04552693 = queryNorm
                0.09622806 = fieldWeight in 4884, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=4884)
            0.010866799 = weight(_text_:h in 4884) [ClassicSimilarity], result of:
              0.010866799 = score(doc=4884,freq=2.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.096073404 = fieldWeight in 4884, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=4884)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    Medienwirtschaft. 2(2005) H.1, S.20-24
  19. Kleinoeder, H.H.; Puzicha, J.: Automatische Katalogisierung am Beispiel einer Pilotanwendung (2002) 0.01
    0.007244533 = product of:
      0.014489066 = sum of:
        0.014489066 = product of:
          0.043467198 = sum of:
            0.043467198 = weight(_text_:h in 1154) [ClassicSimilarity], result of:
              0.043467198 = score(doc=1154,freq=2.0), product of:
                0.11310934 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.04552693 = queryNorm
                0.38429362 = fieldWeight in 1154, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1154)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Info 7. 17(2002) H.1, S.19-21
  20. Dubin, D.: Dimensions and discriminability (1998) 0.01
    0.007196316 = product of:
      0.014392632 = sum of:
        0.014392632 = product of:
          0.043177895 = sum of:
            0.043177895 = weight(_text_:22 in 2338) [ClassicSimilarity], result of:
              0.043177895 = score(doc=2338,freq=2.0), product of:
                0.15942755 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04552693 = queryNorm
                0.2708308 = fieldWeight in 2338, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2338)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    22. 9.1997 19:16:05

Languages

  • e 57
  • d 26

Types

  • a 67
  • el 11
  • m 5
  • r 2
  • s 2
  • x 2
  • d 1