Document (#42052)

Author
Altinel, B.
Ganiz, M.C.
Title
Semantic text classification : a survey of past and recent advances
Source
Information processing and management. 54(2018) no.6, S.1129-1153
Year
2018
Abstract
Automatic text classification is the task of organizing documents into pre-determined classes, generally using machine learning algorithms. Generally speaking, it is one of the most important methods to organize and make use of the gigantic amounts of information that exist in unstructured textual format. Text classification is a widely studied research area of language processing and text mining. In traditional text classification, a document is represented as a bag of words where the words in other words terms are cut from their finer context i.e. their location in a sentence or in a document. Only the broader context of document is used with some type of term frequency information in the vector space. Consequently, semantics of words that can be inferred from the finer context of its location in a sentence and its relations with neighboring words are usually ignored. However, meaning of words, semantic connections between words, documents and even classes are obviously important since methods that capture semantics generally reach better classification performances. Several surveys have been published to analyze diverse approaches for the traditional text classification methods. Most of these surveys cover application of different semantic term relatedness methods in text classification up to a certain degree. However, they do not specifically target semantic text classification algorithms and their advantages over the traditional text classification. In order to fill this gap, we undertake a comprehensive discussion of semantic text classification vs. traditional text classification. This survey explores the past and recent advancements in semantic text classification and attempts to organize existing approaches under five fundamental categories; domain knowledge-based approaches, corpus-based approaches, deep learning based approaches, word/character sequence enhanced approaches and linguistic enriched approaches. Furthermore, this survey highlights the advantages of semantic text classification algorithms over the traditional text classification algorithms.
Content
Vgl.: https://doi.org/10.1016/j.ipm.2018.08.001.
Theme
Automatisches Klassifizieren

Similar documents (content)

  1. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: Beyond bag-of-words : bigram-enhanced context-dependent term weights (2014) 0.25
    0.25397265 = sum of:
      0.25397265 = product of:
        0.57721055 = sum of:
          0.016119441 = weight(abstract_txt:based in 1283) [ClassicSimilarity], result of:
            0.016119441 = score(doc=1283,freq=3.0), product of:
              0.053381752 = queryWeight, product of:
                1.0065291 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.01663635 = queryNorm
              0.3019654 = fieldWeight in 1283, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.05994712 = weight(abstract_txt:term in 1283) [ClassicSimilarity], result of:
            0.05994712 = score(doc=1283,freq=8.0), product of:
              0.08072072 = queryWeight, product of:
                1.0105941 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.01663635 = queryNorm
              0.7426484 = fieldWeight in 1283, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.021991247 = weight(abstract_txt:recent in 1283) [ClassicSimilarity], result of:
            0.021991247 = score(doc=1283,freq=1.0), product of:
              0.08273121 = queryWeight, product of:
                1.023102 = boost
                4.860628 = idf(docFreq=930, maxDocs=44218)
                0.01663635 = queryNorm
              0.26581562 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.860628 = idf(docFreq=930, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.03301588 = weight(abstract_txt:past in 1283) [ClassicSimilarity], result of:
            0.03301588 = score(doc=1283,freq=1.0), product of:
              0.10847211 = queryWeight, product of:
                1.1715027 = boost
                5.565661 = idf(docFreq=459, maxDocs=44218)
                0.01663635 = queryNorm
              0.30437207 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.565661 = idf(docFreq=459, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.03935395 = weight(abstract_txt:document in 1283) [ClassicSimilarity], result of:
            0.03935395 = score(doc=1283,freq=3.0), product of:
              0.0967873 = queryWeight, product of:
                1.3553114 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01663635 = queryNorm
              0.4066024 = fieldWeight in 1283, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.046962783 = weight(abstract_txt:context in 1283) [ClassicSimilarity], result of:
            0.046962783 = score(doc=1283,freq=4.0), product of:
              0.09893481 = queryWeight, product of:
                1.3702646 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.01663635 = queryNorm
              0.47468412 = fieldWeight in 1283, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.027310323 = weight(abstract_txt:methods in 1283) [ClassicSimilarity], result of:
            0.027310323 = score(doc=1283,freq=1.0), product of:
              0.1204289 = queryWeight, product of:
                1.74568 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.01663635 = queryNorm
              0.2267755 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.0482792 = weight(abstract_txt:traditional in 1283) [ClassicSimilarity], result of:
            0.0482792 = score(doc=1283,freq=1.0), product of:
              0.18966602 = queryWeight, product of:
                2.4493399 = boost
                4.654601 = idf(docFreq=1143, maxDocs=44218)
                0.01663635 = queryNorm
              0.2545485 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.654601 = idf(docFreq=1143, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.09277445 = weight(abstract_txt:approaches in 1283) [ClassicSimilarity], result of:
            0.09277445 = score(doc=1283,freq=2.0), product of:
              0.26029617 = queryWeight, product of:
                3.395097 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.01663635 = queryNorm
              0.35641882 = fieldWeight in 1283, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.1028097 = weight(abstract_txt:words in 1283) [ClassicSimilarity], result of:
            0.1028097 = score(doc=1283,freq=1.0), product of:
              0.35119492 = queryWeight, product of:
                3.943596 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01663635 = queryNorm
              0.29274255 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
          0.08864642 = weight(abstract_txt:text in 1283) [ClassicSimilarity], result of:
            0.08864642 = score(doc=1283,freq=1.0), product of:
              0.40084484 = queryWeight, product of:
                5.9582872 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01663635 = queryNorm
              0.22114895 = fieldWeight in 1283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1283)
        0.44 = coord(11/25)
    
  2. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 0.25
    0.24959143 = sum of:
      0.24959143 = product of:
        0.62397856 = sum of:
          0.03318944 = weight(abstract_txt:learning in 4558) [ClassicSimilarity], result of:
            0.03318944 = score(doc=4558,freq=2.0), product of:
              0.07903718 = queryWeight, product of:
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.01663635 = queryNorm
              0.41992182 = fieldWeight in 4558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.010636073 = weight(abstract_txt:based in 4558) [ClassicSimilarity], result of:
            0.010636073 = score(doc=4558,freq=1.0), product of:
              0.053381752 = queryWeight, product of:
                1.0065291 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.01663635 = queryNorm
              0.19924548 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.043251466 = weight(abstract_txt:classes in 4558) [ClassicSimilarity], result of:
            0.043251466 = score(doc=4558,freq=1.0), product of:
              0.11880701 = queryWeight, product of:
                1.2260419 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.01663635 = queryNorm
              0.3640481 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.054225907 = weight(abstract_txt:organize in 4558) [ClassicSimilarity], result of:
            0.054225907 = score(doc=4558,freq=1.0), product of:
              0.13813786 = queryWeight, product of:
                1.3220279 = boost
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.01663635 = queryNorm
              0.3925492 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.036722705 = weight(abstract_txt:document in 4558) [ClassicSimilarity], result of:
            0.036722705 = score(doc=4558,freq=2.0), product of:
              0.0967873 = queryWeight, product of:
                1.3553114 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01663635 = queryNorm
              0.37941656 = fieldWeight in 4558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.031211797 = weight(abstract_txt:methods in 4558) [ClassicSimilarity], result of:
            0.031211797 = score(doc=4558,freq=1.0), product of:
              0.1204289 = queryWeight, product of:
                1.74568 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.01663635 = queryNorm
              0.259172 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.1409906 = weight(abstract_txt:algorithms in 4558) [ClassicSimilarity], result of:
            0.1409906 = score(doc=4558,freq=3.0), product of:
              0.22817665 = queryWeight, product of:
                2.4028955 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.01663635 = queryNorm
              0.6179011 = fieldWeight in 4558, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.074973084 = weight(abstract_txt:approaches in 4558) [ClassicSimilarity], result of:
            0.074973084 = score(doc=4558,freq=1.0), product of:
              0.26029617 = queryWeight, product of:
                3.395097 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.01663635 = queryNorm
              0.2880299 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.09746727 = weight(abstract_txt:classification in 4558) [ClassicSimilarity], result of:
            0.09746727 = score(doc=4558,freq=1.0), product of:
              0.39064303 = queryWeight, product of:
                5.881977 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01663635 = queryNorm
              0.2495047 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.10131019 = weight(abstract_txt:text in 4558) [ClassicSimilarity], result of:
            0.10131019 = score(doc=4558,freq=1.0), product of:
              0.40084484 = queryWeight, product of:
                5.9582872 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01663635 = queryNorm
              0.25274166 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
        0.4 = coord(10/25)
    
  3. Golub, K.: Automatic subject indexing of text (2019) 0.25
    0.24838345 = sum of:
      0.24838345 = product of:
        0.689954 = sum of:
          0.023468476 = weight(abstract_txt:learning in 5268) [ClassicSimilarity], result of:
            0.023468476 = score(doc=5268,freq=1.0), product of:
              0.07903718 = queryWeight, product of:
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.01663635 = queryNorm
              0.29692957 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.038880404 = weight(abstract_txt:advantages in 5268) [ClassicSimilarity], result of:
            0.038880404 = score(doc=5268,freq=1.0), product of:
              0.1106612 = queryWeight, product of:
                1.1832649 = boost
                5.621541 = idf(docFreq=434, maxDocs=44218)
                0.01663635 = queryNorm
              0.3513463 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.621541 = idf(docFreq=434, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.043251466 = weight(abstract_txt:classes in 5268) [ClassicSimilarity], result of:
            0.043251466 = score(doc=5268,freq=1.0), product of:
              0.11880701 = queryWeight, product of:
                1.2260419 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.01663635 = queryNorm
              0.3640481 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.051933747 = weight(abstract_txt:document in 5268) [ClassicSimilarity], result of:
            0.051933747 = score(doc=5268,freq=4.0), product of:
              0.0967873 = queryWeight, product of:
                1.3553114 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01663635 = queryNorm
              0.53657603 = fieldWeight in 5268, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.06395338 = weight(abstract_txt:generally in 5268) [ClassicSimilarity], result of:
            0.06395338 = score(doc=5268,freq=1.0), product of:
              0.17651471 = queryWeight, product of:
                1.830292 = boost
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.01663635 = queryNorm
              0.36231187 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.08140096 = weight(abstract_txt:algorithms in 5268) [ClassicSimilarity], result of:
            0.08140096 = score(doc=5268,freq=1.0), product of:
              0.22817665 = queryWeight, product of:
                2.4028955 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.01663635 = queryNorm
              0.35674536 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.074973084 = weight(abstract_txt:approaches in 5268) [ClassicSimilarity], result of:
            0.074973084 = score(doc=5268,freq=1.0), product of:
              0.26029617 = queryWeight, product of:
                3.395097 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.01663635 = queryNorm
              0.2880299 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.16881827 = weight(abstract_txt:classification in 5268) [ClassicSimilarity], result of:
            0.16881827 = score(doc=5268,freq=3.0), product of:
              0.39064303 = queryWeight, product of:
                5.881977 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01663635 = queryNorm
              0.4321548 = fieldWeight in 5268, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.14327425 = weight(abstract_txt:text in 5268) [ClassicSimilarity], result of:
            0.14327425 = score(doc=5268,freq=2.0), product of:
              0.40084484 = queryWeight, product of:
                5.9582872 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01663635 = queryNorm
              0.3574307 = fieldWeight in 5268, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
        0.36 = coord(9/25)
    
  4. Goller, C.; Löning, J.; Will, T.; Wolff, W.: Automatic document classification : a thourough evaluation of various methods (2000) 0.24
    0.23711549 = sum of:
      0.23711549 = product of:
        0.74098593 = sum of:
          0.041486796 = weight(abstract_txt:learning in 5480) [ClassicSimilarity], result of:
            0.041486796 = score(doc=5480,freq=2.0), product of:
              0.07903718 = queryWeight, product of:
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.01663635 = queryNorm
              0.5249023 = fieldWeight in 5480, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.013295091 = weight(abstract_txt:based in 5480) [ClassicSimilarity], result of:
            0.013295091 = score(doc=5480,freq=1.0), product of:
              0.053381752 = queryWeight, product of:
                1.0065291 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.01663635 = queryNorm
              0.24905685 = fieldWeight in 5480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.04590338 = weight(abstract_txt:document in 5480) [ClassicSimilarity], result of:
            0.04590338 = score(doc=5480,freq=2.0), product of:
              0.0967873 = queryWeight, product of:
                1.3553114 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01663635 = queryNorm
              0.4742707 = fieldWeight in 5480, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.06757553 = weight(abstract_txt:methods in 5480) [ClassicSimilarity], result of:
            0.06757553 = score(doc=5480,freq=3.0), product of:
              0.1204289 = queryWeight, product of:
                1.74568 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.01663635 = queryNorm
              0.56112385 = fieldWeight in 5480, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.07994172 = weight(abstract_txt:generally in 5480) [ClassicSimilarity], result of:
            0.07994172 = score(doc=5480,freq=1.0), product of:
              0.17651471 = queryWeight, product of:
                1.830292 = boost
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.01663635 = queryNorm
              0.45288983 = fieldWeight in 5480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.09371635 = weight(abstract_txt:approaches in 5480) [ClassicSimilarity], result of:
            0.09371635 = score(doc=5480,freq=1.0), product of:
              0.26029617 = queryWeight, product of:
                3.395097 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.01663635 = queryNorm
              0.3600374 = fieldWeight in 5480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.27242932 = weight(abstract_txt:classification in 5480) [ClassicSimilarity], result of:
            0.27242932 = score(doc=5480,freq=5.0), product of:
              0.39064303 = queryWeight, product of:
                5.881977 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01663635 = queryNorm
              0.69738686 = fieldWeight in 5480, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
          0.12663774 = weight(abstract_txt:text in 5480) [ClassicSimilarity], result of:
            0.12663774 = score(doc=5480,freq=1.0), product of:
              0.40084484 = queryWeight, product of:
                5.9582872 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01663635 = queryNorm
              0.3159271 = fieldWeight in 5480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=5480)
        0.32 = coord(8/25)
    
  5. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.24
    0.23656599 = sum of:
      0.23656599 = product of:
        0.98569167 = sum of:
          0.018613128 = weight(abstract_txt:based in 562) [ClassicSimilarity], result of:
            0.018613128 = score(doc=562,freq=1.0), product of:
              0.053381752 = queryWeight, product of:
                1.0065291 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.01663635 = queryNorm
              0.3486796 = fieldWeight in 562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.109375 = fieldNorm(doc=562)
          0.06426474 = weight(abstract_txt:document in 562) [ClassicSimilarity], result of:
            0.06426474 = score(doc=562,freq=2.0), product of:
              0.0967873 = queryWeight, product of:
                1.3553114 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01663635 = queryNorm
              0.663979 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.109375 = fieldNorm(doc=562)
          0.120074905 = weight(abstract_txt:semantic in 562) [ClassicSimilarity], result of:
            0.120074905 = score(doc=562,freq=1.0), product of:
              0.24536183 = queryWeight, product of:
                3.2962625 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01663635 = queryNorm
              0.4893789 = fieldWeight in 562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.109375 = fieldNorm(doc=562)
          0.29078975 = weight(abstract_txt:words in 562) [ClassicSimilarity], result of:
            0.29078975 = score(doc=562,freq=2.0), product of:
              0.35119492 = queryWeight, product of:
                3.943596 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01663635 = queryNorm
              0.828001 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.109375 = fieldNorm(doc=562)
          0.2412192 = weight(abstract_txt:classification in 562) [ClassicSimilarity], result of:
            0.2412192 = score(doc=562,freq=2.0), product of:
              0.39064303 = queryWeight, product of:
                5.881977 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01663635 = queryNorm
              0.6174926 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.109375 = fieldNorm(doc=562)
          0.25072995 = weight(abstract_txt:text in 562) [ClassicSimilarity], result of:
            0.25072995 = score(doc=562,freq=2.0), product of:
              0.40084484 = queryWeight, product of:
                5.9582872 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01663635 = queryNorm
              0.6255037 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.109375 = fieldNorm(doc=562)
        0.24 = coord(6/25)