Document (#34561)

Author
Yi, K.
Title
Automatic text classification using library classification schemes : trends, issues and challenges
Source
International cataloguing and bibliographic control. 36(2007) no.4, S.78-82
Year
2007
Abstract
The proliferation of digital resources and their integration into a traditional library setting has created a pressing need for an automated tool that organizes textual information based on library classification schemes. Automated text classification is a research field of developing tools, methods, and models to automate text classification. This article describes the current popular approach for text classification and major text classification projects and applications that are based on library classification schemes. Related issues and challenges are discussed, and a number of considerations for the challenges are examined.
Theme
Automatisches Klassifizieren

Similar documents (content)

  1. Yi, K.: Challenges in automated classification using library classification schemes (2006) 0.43
    0.43484533 = sum of:
      0.43484533 = product of:
        1.3588917 = sum of:
          0.055833273 = weight(abstract_txt:tool in 5810) [ClassicSimilarity], result of:
            0.055833273 = score(doc=5810,freq=1.0), product of:
              0.090205505 = queryWeight, product of:
                1.0441568 = boost
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.01744686 = queryNorm
              0.6189564 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.125 = fieldNorm(doc=5810)
          0.0682065 = weight(abstract_txt:projects in 5810) [ClassicSimilarity], result of:
            0.0682065 = score(doc=5810,freq=1.0), product of:
              0.10308326 = queryWeight, product of:
                1.1162032 = boost
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.01744686 = queryNorm
              0.6616641 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.293313 = idf(docFreq=603, maxDocs=44218)
                0.125 = fieldNorm(doc=5810)
          0.0887103 = weight(abstract_txt:popular in 5810) [ClassicSimilarity], result of:
            0.0887103 = score(doc=5810,freq=1.0), product of:
              0.122825064 = queryWeight, product of:
                1.2184079 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.01744686 = queryNorm
              0.72224915 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.125 = fieldNorm(doc=5810)
          0.10310857 = weight(abstract_txt:library in 5810) [ClassicSimilarity], result of:
            0.10310857 = score(doc=5810,freq=3.0), product of:
              0.14944467 = queryWeight, product of:
                2.687939 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.01744686 = queryNorm
              0.6899448 = fieldWeight in 5810, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.125 = fieldNorm(doc=5810)
          0.18908925 = weight(abstract_txt:challenges in 5810) [ClassicSimilarity], result of:
            0.18908925 = score(doc=5810,freq=1.0), product of:
              0.2933972 = queryWeight, product of:
                3.2616534 = boost
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.01744686 = queryNorm
              0.64448214 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.125 = fieldNorm(doc=5810)
          0.23376578 = weight(abstract_txt:schemes in 5810) [ClassicSimilarity], result of:
            0.23376578 = score(doc=5810,freq=1.0), product of:
              0.33796003 = queryWeight, product of:
                3.5006 = boost
                5.533572 = idf(docFreq=474, maxDocs=44218)
                0.01744686 = queryNorm
              0.6916965 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.533572 = idf(docFreq=474, maxDocs=44218)
                0.125 = fieldNorm(doc=5810)
          0.15205596 = weight(abstract_txt:text in 5810) [ClassicSimilarity], result of:
            0.15205596 = score(doc=5810,freq=1.0), product of:
              0.30081302 = queryWeight, product of:
                4.2636595 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01744686 = queryNorm
              0.5054833 = fieldWeight in 5810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=5810)
          0.46812207 = weight(abstract_txt:classification in 5810) [ClassicSimilarity], result of:
            0.46812207 = score(doc=5810,freq=4.0), product of:
              0.46905136 = queryWeight, product of:
                6.734485 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01744686 = queryNorm
              0.9980188 = fieldWeight in 5810, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.125 = fieldNorm(doc=5810)
        0.32 = coord(8/25)
    
  2. Kumbhar, R.: Library classification trends in the 21st century (2012) 0.24
    0.23552786 = sum of:
      0.23552786 = product of:
        0.8411709 = sum of:
          0.030653208 = weight(abstract_txt:applications in 736) [ClassicSimilarity], result of:
            0.030653208 = score(doc=736,freq=1.0), product of:
              0.08273735 = queryWeight, product of:
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.01744686 = queryNorm
              0.37048817 = fieldWeight in 736, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.078125 = fieldNorm(doc=736)
          0.034895796 = weight(abstract_txt:tool in 736) [ClassicSimilarity], result of:
            0.034895796 = score(doc=736,freq=1.0), product of:
              0.090205505 = queryWeight, product of:
                1.0441568 = boost
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.01744686 = queryNorm
              0.38684773 = fieldWeight in 736, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.078125 = fieldNorm(doc=736)
          0.04031156 = weight(abstract_txt:automatic in 736) [ClassicSimilarity], result of:
            0.04031156 = score(doc=736,freq=1.0), product of:
              0.09931253 = queryWeight, product of:
                1.095598 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.01744686 = queryNorm
              0.40590608 = fieldWeight in 736, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.078125 = fieldNorm(doc=736)
          0.05032304 = weight(abstract_txt:trends in 736) [ClassicSimilarity], result of:
            0.05032304 = score(doc=736,freq=1.0), product of:
              0.11514071 = queryWeight, product of:
                1.1796784 = boost
                5.5943284 = idf(docFreq=446, maxDocs=44218)
                0.01744686 = queryNorm
              0.4370569 = fieldWeight in 736, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5943284 = idf(docFreq=446, maxDocs=44218)
                0.078125 = fieldNorm(doc=736)
          0.08319537 = weight(abstract_txt:library in 736) [ClassicSimilarity], result of:
            0.08319537 = score(doc=736,freq=5.0), product of:
              0.14944467 = queryWeight, product of:
                2.687939 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.01744686 = queryNorm
              0.55669683 = fieldWeight in 736, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.078125 = fieldNorm(doc=736)
          0.09503498 = weight(abstract_txt:text in 736) [ClassicSimilarity], result of:
            0.09503498 = score(doc=736,freq=1.0), product of:
              0.30081302 = queryWeight, product of:
                4.2636595 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01744686 = queryNorm
              0.3159271 = fieldWeight in 736, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=736)
          0.50675696 = weight(abstract_txt:classification in 736) [ClassicSimilarity], result of:
            0.50675696 = score(doc=736,freq=12.0), product of:
              0.46905136 = queryWeight, product of:
                6.734485 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01744686 = queryNorm
              1.080387 = fieldWeight in 736, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=736)
        0.28 = coord(7/25)
    
  3. Wang, J.: ¬An extensive study on automated Dewey Decimal Classification (2009) 0.23
    0.23208117 = sum of:
      0.23208117 = product of:
        0.7252537 = sum of:
          0.024522567 = weight(abstract_txt:applications in 3172) [ClassicSimilarity], result of:
            0.024522567 = score(doc=3172,freq=1.0), product of:
              0.08273735 = queryWeight, product of:
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.01744686 = queryNorm
              0.29639053 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.032249246 = weight(abstract_txt:automatic in 3172) [ClassicSimilarity], result of:
            0.032249246 = score(doc=3172,freq=1.0), product of:
              0.09931253 = queryWeight, product of:
                1.095598 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.01744686 = queryNorm
              0.32472485 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.03347748 = weight(abstract_txt:created in 3172) [ClassicSimilarity], result of:
            0.03347748 = score(doc=3172,freq=1.0), product of:
              0.101818375 = queryWeight, product of:
                1.1093339 = boost
                5.260737 = idf(docFreq=623, maxDocs=44218)
                0.01744686 = queryNorm
              0.32879606 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.260737 = idf(docFreq=623, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.08090562 = weight(abstract_txt:automated in 3172) [ClassicSimilarity], result of:
            0.08090562 = score(doc=3172,freq=1.0), product of:
              0.23102205 = queryWeight, product of:
                2.363148 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.01744686 = queryNorm
              0.35020733 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.051554285 = weight(abstract_txt:library in 3172) [ClassicSimilarity], result of:
            0.051554285 = score(doc=3172,freq=3.0), product of:
              0.14944467 = queryWeight, product of:
                2.687939 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.01744686 = queryNorm
              0.3449724 = fieldWeight in 3172, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.11688289 = weight(abstract_txt:schemes in 3172) [ClassicSimilarity], result of:
            0.11688289 = score(doc=3172,freq=1.0), product of:
              0.33796003 = queryWeight, product of:
                3.5006 = boost
                5.533572 = idf(docFreq=474, maxDocs=44218)
                0.01744686 = queryNorm
              0.34584826 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.533572 = idf(docFreq=474, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.07602798 = weight(abstract_txt:text in 3172) [ClassicSimilarity], result of:
            0.07602798 = score(doc=3172,freq=1.0), product of:
              0.30081302 = queryWeight, product of:
                4.2636595 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01744686 = queryNorm
              0.25274166 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.30963364 = weight(abstract_txt:classification in 3172) [ClassicSimilarity], result of:
            0.30963364 = score(doc=3172,freq=7.0), product of:
              0.46905136 = queryWeight, product of:
                6.734485 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01744686 = queryNorm
              0.66012734 = fieldWeight in 3172, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
        0.32 = coord(8/25)
    
  4. Batley, S.: Classification in theory and practice (2005) 0.22
    0.22153048 = sum of:
      0.22153048 = product of:
        0.7911803 = sum of:
          0.03488059 = weight(abstract_txt:examined in 1170) [ClassicSimilarity], result of:
            0.03488059 = score(doc=1170,freq=3.0), product of:
              0.09925516 = queryWeight, product of:
                1.0952815 = boost
                5.194097 = idf(docFreq=666, maxDocs=44218)
                0.01744686 = queryNorm
              0.35142344 = fieldWeight in 1170, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.194097 = idf(docFreq=666, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1170)
          0.027721968 = weight(abstract_txt:popular in 1170) [ClassicSimilarity], result of:
            0.027721968 = score(doc=1170,freq=1.0), product of:
              0.122825064 = queryWeight, product of:
                1.2184079 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.01744686 = queryNorm
              0.22570285 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1170)
          0.022880606 = weight(abstract_txt:issues in 1170) [ClassicSimilarity], result of:
            0.022880606 = score(doc=1170,freq=1.0), product of:
              0.13616306 = queryWeight, product of:
                1.8142363 = boost
                4.3017797 = idf(docFreq=1627, maxDocs=44218)
                0.01744686 = queryNorm
              0.16803828 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3017797 = idf(docFreq=1627, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1170)
          0.041597687 = weight(abstract_txt:library in 1170) [ClassicSimilarity], result of:
            0.041597687 = score(doc=1170,freq=5.0), product of:
              0.14944467 = queryWeight, product of:
                2.687939 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.01744686 = queryNorm
              0.27834842 = fieldWeight in 1170, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1170)
          0.23101008 = weight(abstract_txt:schemes in 1170) [ClassicSimilarity], result of:
            0.23101008 = score(doc=1170,freq=10.0), product of:
              0.33796003 = queryWeight, product of:
                3.5006 = boost
                5.533572 = idf(docFreq=474, maxDocs=44218)
                0.01744686 = queryNorm
              0.6835426 = fieldWeight in 1170, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.533572 = idf(docFreq=474, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1170)
          0.082302704 = weight(abstract_txt:text in 1170) [ClassicSimilarity], result of:
            0.082302704 = score(doc=1170,freq=3.0), product of:
              0.30081302 = queryWeight, product of:
                4.2636595 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01744686 = queryNorm
              0.27360088 = fieldWeight in 1170, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1170)
          0.3507867 = weight(abstract_txt:classification in 1170) [ClassicSimilarity], result of:
            0.3507867 = score(doc=1170,freq=23.0), product of:
              0.46905136 = queryWeight, product of:
                6.734485 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01744686 = queryNorm
              0.7478641 = fieldWeight in 1170, product of:
                4.7958317 = tf(freq=23.0), with freq of:
                  23.0 = termFreq=23.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1170)
        0.28 = coord(7/25)
    
  5. Hurt, C.D.: Classification and subject analysis : looking to the future at a distance (1997) 0.20
    0.20231035 = sum of:
      0.20231035 = product of:
        1.0115517 = sum of:
          0.1500098 = weight(abstract_txt:proliferation in 6929) [ClassicSimilarity], result of:
            0.1500098 = score(doc=6929,freq=1.0), product of:
              0.19056597 = queryWeight, product of:
                1.5176508 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.01744686 = queryNorm
              0.78718036 = fieldWeight in 6929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.109375 = fieldNorm(doc=6929)
          0.052088547 = weight(abstract_txt:library in 6929) [ClassicSimilarity], result of:
            0.052088547 = score(doc=6929,freq=1.0), product of:
              0.14944467 = queryWeight, product of:
                2.687939 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.01744686 = queryNorm
              0.34854737 = fieldWeight in 6929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.109375 = fieldNorm(doc=6929)
          0.16545309 = weight(abstract_txt:challenges in 6929) [ClassicSimilarity], result of:
            0.16545309 = score(doc=6929,freq=1.0), product of:
              0.2933972 = queryWeight, product of:
                3.2616534 = boost
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.01744686 = queryNorm
              0.56392187 = fieldWeight in 6929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.109375 = fieldNorm(doc=6929)
          0.28927037 = weight(abstract_txt:schemes in 6929) [ClassicSimilarity], result of:
            0.28927037 = score(doc=6929,freq=2.0), product of:
              0.33796003 = queryWeight, product of:
                3.5006 = boost
                5.533572 = idf(docFreq=474, maxDocs=44218)
                0.01744686 = queryNorm
              0.85593075 = fieldWeight in 6929, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.533572 = idf(docFreq=474, maxDocs=44218)
                0.109375 = fieldNorm(doc=6929)
          0.3547299 = weight(abstract_txt:classification in 6929) [ClassicSimilarity], result of:
            0.3547299 = score(doc=6929,freq=3.0), product of:
              0.46905136 = queryWeight, product of:
                6.734485 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01744686 = queryNorm
              0.7562709 = fieldWeight in 6929, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.109375 = fieldNorm(doc=6929)
        0.2 = coord(5/25)