Document (#41097)

Author
Billal, B.
Fonseca, A.
Sadat, F.
Lounis, H.
Title
Semi-supervised learning and social media text analysis towards multi-labeling categorization
Source
IEEE International Conference on Big Data (Big Data) (2017)
Year
2017
Pages
pp. 1907-1916
Abstract
In traditional text classification, classes are mutually exclusive, i.e. a text or text fragment cannot be assigned to more than one class. In multi-label classification, by contrast, an individual text may belong to several classes simultaneously. This type of classification is required by a large number of current applications, such as big data classification and image and video annotation. Supervised learning is the most widely used type of machine learning for classification tasks. It requires large quantities of labeled data and the intervention of a human tagger to create the training sets. When the data sets become very large or heavily noisy, this operation can be tedious, error-prone, and time-consuming. In this case, semi-supervised learning, which requires only a few labels, is a better choice. In this paper, we study and evaluate several methods to address the problem of multi-label classification using semi-supervised learning and data from social networks. First, we propose a linguistic pre-processing step involving tokenisation, named entity recognition and hashtag segmentation in order to decrease the noise in this type of massive and unstructured real data, and then we perform word sense disambiguation using WordNet. Second, several experiments related to multi-label classification and semi-supervised learning are carried out on these data sets and compared to each other. These evaluations compare the results of the approaches considered. This paper proposes a method for combining semi-supervised methods with a graph method for the extraction of subjects in social networks using a multi-label classification approach. Experiments show that the proposed model improves classification precision by 4 percentage points compared to a baseline.
Footnote
Cf.: doi:10.1109/BigData.2017.8258136
Theme
Automatic classification

Similar documents (author)

  1. Fonseca, F.: The double role of ontologies in information science research (2007) 5.74
    5.7437115 = sum of:
      5.7437115 = weight(author_txt:fonseca in 2278) [ClassicSimilarity], result of:
        5.7437115 = fieldWeight in 2278, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.189939 = idf(docFreq=11, maxDocs=43254)
          0.625 = fieldNorm(doc=2278)
    
  2. Scott, M.; Fonseca, F.: Methodology for functional appraisal of records and creation of a functional thesaurus (1992) 4.59
    4.5949693 = sum of:
      4.5949693 = weight(author_txt:fonseca in 2096) [ClassicSimilarity], result of:
        4.5949693 = fieldWeight in 2096, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.189939 = idf(docFreq=11, maxDocs=43254)
          0.5 = fieldNorm(doc=2096)
    
  3. Fonseca, F.T.; Martin, J.E.: Toward an alternative notion of information systems ontologies : information engineering as a hermeneutic enterprise (2005) 4.59
    4.5949693 = sum of:
      4.5949693 = weight(author_txt:fonseca in 5267) [ClassicSimilarity], result of:
        4.5949693 = fieldWeight in 5267, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.189939 = idf(docFreq=11, maxDocs=43254)
          0.5 = fieldNorm(doc=5267)
    
  4. Câmara, G.; Fonseca, F.: Information policies and open source software in developing countries (2007) 4.59
    4.5949693 = sum of:
      4.5949693 = weight(author_txt:fonseca in 2091) [ClassicSimilarity], result of:
        4.5949693 = fieldWeight in 2091, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.189939 = idf(docFreq=11, maxDocs=43254)
          0.5 = fieldNorm(doc=2091)
    
  5. Marcinkowski, M.; Fonseca, F.: The conditions of peak empiricism in big data and interaction design (2016) 4.59
    4.5949693 = sum of:
      4.5949693 = weight(author_txt:fonseca in 4389) [ClassicSimilarity], result of:
        4.5949693 = fieldWeight in 4389, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.189939 = idf(docFreq=11, maxDocs=43254)
          0.5 = fieldNorm(doc=4389)
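
The author-match scores above can be reproduced by hand. The sketch below is not the catalog's own code; it assumes the classic TF-IDF formulas that are consistent with the values shown (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), and the function names are hypothetical.

```python
import math

def classic_tf(freq):
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # Inverse document frequency; this form is inferred from the
    # values in the listing (docFreq=11, maxDocs=43254 -> 9.189939).
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in each explanation tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Entry 1 (fieldNorm 0.625) and entries 2-5 (fieldNorm 0.5).
print(round(field_weight(1.0, 11, 43254, 0.625), 4))
print(round(field_weight(1.0, 11, 43254, 0.5), 4))
```

Since tf and idf are identical for all five entries, the ranking is decided by fieldNorm alone: the first entry has a single author, so its shorter author field receives the larger norm.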
    

Similar documents (content)

  1. Ko, Y.; Seo, J.: Text classification from unlabeled documents with bootstrapping and feature projection techniques (2009) 0.55
    0.5513864 = sum of:
      0.5513864 = product of:
        1.2531509 = sum of:
          0.042426668 = weight(abstract_txt:method in 4453) [ClassicSimilarity], result of:
            0.042426668 = score(doc=4453,freq=5.0), product of:
              0.06727625 = queryWeight, product of:
                1.0337315 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.014422543 = queryNorm
              0.6306337 = fieldWeight in 4453, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
          0.026114037 = weight(abstract_txt:compared in 4453) [ClassicSimilarity], result of:
            0.026114037 = score(doc=4453,freq=1.0), product of:
              0.083241865 = queryWeight, product of:
                1.1498674 = boost
                5.0194044 = idf(docFreq=776, maxDocs=43254)
                0.014422543 = queryNorm
              0.31371278 = fieldWeight in 4453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0194044 = idf(docFreq=776, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
          0.0129775405 = weight(abstract_txt:using in 4453) [ClassicSimilarity], result of:
            0.0129775405 = score(doc=4453,freq=1.0), product of:
              0.05978393 = queryWeight, product of:
                1.1934788 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.014422543 = queryNorm
              0.21707405 = fieldWeight in 4453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
          0.031534854 = weight(abstract_txt:experiments in 4453) [ClassicSimilarity], result of:
            0.031534854 = score(doc=4453,freq=1.0), product of:
              0.094395876 = queryWeight, product of:
                1.2244847 = boost
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.014422543 = queryNorm
              0.33407027 = fieldWeight in 4453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
          0.027609099 = weight(abstract_txt:large in 4453) [ClassicSimilarity], result of:
            0.027609099 = score(doc=4453,freq=1.0), product of:
              0.09889121 = queryWeight, product of:
                1.534975 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.014422543 = queryNorm
              0.27918658 = fieldWeight in 4453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
          0.008905091 = weight(abstract_txt:this in 4453) [ClassicSimilarity], result of:
            0.008905091 = score(doc=4453,freq=1.0), product of:
              0.058599215 = queryWeight, product of:
                1.6710267 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.014422543 = queryNorm
              0.15196605 = fieldWeight in 4453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
          0.09071669 = weight(abstract_txt:text in 4453) [ClassicSimilarity], result of:
            0.09071669 = score(doc=4453,freq=7.0), product of:
              0.13546629 = queryWeight, product of:
                2.3193297 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.014422543 = queryNorm
              0.6696625 = fieldWeight in 4453, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
          0.19323352 = weight(abstract_txt:learning in 4453) [ClassicSimilarity], result of:
            0.19323352 = score(doc=4453,freq=8.0), product of:
              0.2279421 = queryWeight, product of:
                3.2957158 = boost
                4.7954893 = idf(docFreq=971, maxDocs=43254)
                0.014422543 = queryNorm
              0.84773076 = fieldWeight in 4453, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.7954893 = idf(docFreq=971, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
          0.14714715 = weight(abstract_txt:semi in 4453) [ClassicSimilarity], result of:
            0.14714715 = score(doc=4453,freq=1.0), product of:
              0.3577451 = queryWeight, product of:
                3.7690656 = boost
                6.5810947 = idf(docFreq=162, maxDocs=43254)
                0.014422543 = queryNorm
              0.41131842 = fieldWeight in 4453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5810947 = idf(docFreq=162, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
          0.13277407 = weight(abstract_txt:classification in 4453) [ClassicSimilarity], result of:
            0.13277407 = score(doc=4453,freq=5.0), product of:
              0.23763776 = queryWeight, product of:
                4.1213627 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.014422543 = queryNorm
              0.55872464 = fieldWeight in 4453, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
          0.5397123 = weight(abstract_txt:supervised in 4453) [ClassicSimilarity], result of:
            0.5397123 = score(doc=4453,freq=4.0), product of:
              0.5695796 = queryWeight, product of:
                5.2097235 = boost
                7.5805006 = idf(docFreq=59, maxDocs=43254)
                0.014422543 = queryNorm
              0.9475626 = fieldWeight in 4453, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5805006 = idf(docFreq=59, maxDocs=43254)
                0.0625 = fieldNorm(doc=4453)
        0.44 = coord(11/25)
    
  2. Rodríguez-Vidal, J.; Gonzalo, J.; Plaza, L.; Anaya Sánchez, H.: Automatic detection of influencers in social networks : authority versus domain signals (2019) 0.26
    0.25985822 = sum of:
      0.25985822 = product of:
        0.72182834 = sum of:
          0.0129775405 = weight(abstract_txt:using in 302) [ClassicSimilarity], result of:
            0.0129775405 = score(doc=302,freq=1.0), product of:
              0.05978393 = queryWeight, product of:
                1.1934788 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.014422543 = queryNorm
              0.21707405 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.031534854 = weight(abstract_txt:experiments in 302) [ClassicSimilarity], result of:
            0.031534854 = score(doc=302,freq=1.0), product of:
              0.094395876 = queryWeight, product of:
                1.2244847 = boost
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.014422543 = queryNorm
              0.33407027 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.023879934 = weight(abstract_txt:social in 302) [ClassicSimilarity], result of:
            0.023879934 = score(doc=302,freq=1.0), product of:
              0.08977284 = queryWeight, product of:
                1.462497 = boost
                4.256064 = idf(docFreq=1666, maxDocs=43254)
                0.014422543 = queryNorm
              0.266004 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.256064 = idf(docFreq=1666, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.027609099 = weight(abstract_txt:large in 302) [ClassicSimilarity], result of:
            0.027609099 = score(doc=302,freq=1.0), product of:
              0.09889121 = queryWeight, product of:
                1.534975 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.014422543 = queryNorm
              0.27918658 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.043004356 = weight(abstract_txt:sets in 302) [ClassicSimilarity], result of:
            0.043004356 = score(doc=302,freq=1.0), product of:
              0.13288149 = queryWeight, product of:
                1.7793229 = boost
                5.17807 = idf(docFreq=662, maxDocs=43254)
                0.014422543 = queryNorm
              0.32362938 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.17807 = idf(docFreq=662, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.023479119 = weight(abstract_txt:data in 302) [ClassicSimilarity], result of:
            0.023479119 = score(doc=302,freq=1.0), product of:
              0.11183749 = queryWeight, product of:
                2.308507 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.014422543 = queryNorm
              0.20993961 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.11833088 = weight(abstract_txt:learning in 302) [ClassicSimilarity], result of:
            0.11833088 = score(doc=302,freq=3.0), product of:
              0.2279421 = queryWeight, product of:
                3.2957158 = boost
                4.7954893 = idf(docFreq=971, maxDocs=43254)
                0.014422543 = queryNorm
              0.51912695 = fieldWeight in 302, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7954893 = idf(docFreq=971, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.059378367 = weight(abstract_txt:classification in 302) [ClassicSimilarity], result of:
            0.059378367 = score(doc=302,freq=1.0), product of:
              0.23763776 = queryWeight, product of:
                4.1213627 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.014422543 = queryNorm
              0.24986924 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
          0.38163424 = weight(abstract_txt:supervised in 302) [ClassicSimilarity], result of:
            0.38163424 = score(doc=302,freq=2.0), product of:
              0.5695796 = queryWeight, product of:
                5.2097235 = boost
                7.5805006 = idf(docFreq=59, maxDocs=43254)
                0.014422543 = queryNorm
              0.6700279 = fieldWeight in 302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5805006 = idf(docFreq=59, maxDocs=43254)
                0.0625 = fieldNorm(doc=302)
        0.36 = coord(9/25)
    
  3. Stamatatos, E.: Author identification : using text sampling to handle the class imbalance problem (2008) 0.26
    0.25668165 = sum of:
      0.25668165 = product of:
        0.7130045 = sum of:
          0.031534854 = weight(abstract_txt:experiments in 4064) [ClassicSimilarity], result of:
            0.031534854 = score(doc=4064,freq=1.0), product of:
              0.094395876 = queryWeight, product of:
                1.2244847 = boost
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.014422543 = queryNorm
              0.33407027 = fieldWeight in 4064, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.0625 = fieldNorm(doc=4064)
          0.08201477 = weight(abstract_txt:classes in 4064) [ClassicSimilarity], result of:
            0.08201477 = score(doc=4064,freq=4.0), product of:
              0.11246063 = queryWeight, product of:
                1.3365251 = boost
                5.8342032 = idf(docFreq=343, maxDocs=43254)
                0.014422543 = queryNorm
              0.7292754 = fieldWeight in 4064, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8342032 = idf(docFreq=343, maxDocs=43254)
                0.0625 = fieldNorm(doc=4064)
          0.012593701 = weight(abstract_txt:this in 4064) [ClassicSimilarity], result of:
            0.012593701 = score(doc=4064,freq=2.0), product of:
              0.058599215 = queryWeight, product of:
                1.6710267 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.014422543 = queryNorm
              0.21491244 = fieldWeight in 4064, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=4064)
          0.023479119 = weight(abstract_txt:data in 4064) [ClassicSimilarity], result of:
            0.023479119 = score(doc=4064,freq=1.0), product of:
              0.11183749 = queryWeight, product of:
                2.308507 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.014422543 = queryNorm
              0.20993961 = fieldWeight in 4064, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.0625 = fieldNorm(doc=4064)
          0.08398734 = weight(abstract_txt:text in 4064) [ClassicSimilarity], result of:
            0.08398734 = score(doc=4064,freq=6.0), product of:
              0.13546629 = queryWeight, product of:
                2.3193297 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.014422543 = queryNorm
              0.619987 = fieldWeight in 4064, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=4064)
          0.06831837 = weight(abstract_txt:learning in 4064) [ClassicSimilarity], result of:
            0.06831837 = score(doc=4064,freq=1.0), product of:
              0.2279421 = queryWeight, product of:
                3.2957158 = boost
                4.7954893 = idf(docFreq=971, maxDocs=43254)
                0.014422543 = queryNorm
              0.29971808 = fieldWeight in 4064, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7954893 = idf(docFreq=971, maxDocs=43254)
                0.0625 = fieldNorm(doc=4064)
          0.16101967 = weight(abstract_txt:label in 4064) [ClassicSimilarity], result of:
            0.16101967 = score(doc=4064,freq=1.0), product of:
              0.35265908 = queryWeight, product of:
                3.3471053 = boost
                7.305397 = idf(docFreq=78, maxDocs=43254)
                0.014422543 = queryNorm
              0.4565873 = fieldWeight in 4064, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.305397 = idf(docFreq=78, maxDocs=43254)
                0.0625 = fieldNorm(doc=4064)
          0.1906783 = weight(abstract_txt:multi in 4064) [ClassicSimilarity], result of:
            0.1906783 = score(doc=4064,freq=3.0), product of:
              0.2948263 = queryWeight, product of:
                3.4216058 = boost
                5.9744015 = idf(docFreq=298, maxDocs=43254)
                0.014422543 = queryNorm
              0.64674795 = fieldWeight in 4064, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9744015 = idf(docFreq=298, maxDocs=43254)
                0.0625 = fieldNorm(doc=4064)
          0.059378367 = weight(abstract_txt:classification in 4064) [ClassicSimilarity], result of:
            0.059378367 = score(doc=4064,freq=1.0), product of:
              0.23763776 = queryWeight, product of:
                4.1213627 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.014422543 = queryNorm
              0.24986924 = fieldWeight in 4064, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.0625 = fieldNorm(doc=4064)
        0.36 = coord(9/25)
    
  4. Xu, L.; Qiu, J.: Unsupervised multi-class sentiment classification approach (2019) 0.25
    0.24982257 = sum of:
      0.24982257 = product of:
        0.8922235 = sum of:
          0.032863554 = weight(abstract_txt:method in 4) [ClassicSimilarity], result of:
            0.032863554 = score(doc=4,freq=3.0), product of:
              0.06727625 = queryWeight, product of:
                1.0337315 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.014422543 = queryNorm
              0.48848674 = fieldWeight in 4, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
          0.026114037 = weight(abstract_txt:compared in 4) [ClassicSimilarity], result of:
            0.026114037 = score(doc=4,freq=1.0), product of:
              0.083241865 = queryWeight, product of:
                1.1498674 = boost
                5.0194044 = idf(docFreq=776, maxDocs=43254)
                0.014422543 = queryNorm
              0.31371278 = fieldWeight in 4, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0194044 = idf(docFreq=776, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
          0.0129775405 = weight(abstract_txt:using in 4) [ClassicSimilarity], result of:
            0.0129775405 = score(doc=4,freq=1.0), product of:
              0.05978393 = queryWeight, product of:
                1.1934788 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.014422543 = queryNorm
              0.21707405 = fieldWeight in 4, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
          0.24616463 = weight(abstract_txt:multi in 4) [ClassicSimilarity], result of:
            0.24616463 = score(doc=4,freq=5.0), product of:
              0.2948263 = queryWeight, product of:
                3.4216058 = boost
                5.9744015 = idf(docFreq=298, maxDocs=43254)
                0.014422543 = queryNorm
              0.834948 = fieldWeight in 4, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9744015 = idf(docFreq=298, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
          0.14714715 = weight(abstract_txt:semi in 4) [ClassicSimilarity], result of:
            0.14714715 = score(doc=4,freq=1.0), product of:
              0.3577451 = queryWeight, product of:
                3.7690656 = boost
                6.5810947 = idf(docFreq=162, maxDocs=43254)
                0.014422543 = queryNorm
              0.41131842 = fieldWeight in 4, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5810947 = idf(docFreq=162, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
          0.1571004 = weight(abstract_txt:classification in 4) [ClassicSimilarity], result of:
            0.1571004 = score(doc=4,freq=7.0), product of:
              0.23763776 = queryWeight, product of:
                4.1213627 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.014422543 = queryNorm
              0.66109186 = fieldWeight in 4, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
          0.26985615 = weight(abstract_txt:supervised in 4) [ClassicSimilarity], result of:
            0.26985615 = score(doc=4,freq=1.0), product of:
              0.5695796 = queryWeight, product of:
                5.2097235 = boost
                7.5805006 = idf(docFreq=59, maxDocs=43254)
                0.014422543 = queryNorm
              0.4737813 = fieldWeight in 4, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5805006 = idf(docFreq=59, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
        0.28 = coord(7/25)
    
  5. Wang, J.: An extensive study on automated Dewey Decimal Classification (2009) 0.25
    0.24655886 = sum of:
      0.24655886 = product of:
        0.6848857 = sum of:
          0.04459702 = weight(abstract_txt:experiments in 173) [ClassicSimilarity], result of:
            0.04459702 = score(doc=173,freq=2.0), product of:
              0.094395876 = queryWeight, product of:
                1.2244847 = boost
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.014422543 = queryNorm
              0.47244668 = fieldWeight in 173, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.0625 = fieldNorm(doc=173)
          0.041007385 = weight(abstract_txt:classes in 173) [ClassicSimilarity], result of:
            0.041007385 = score(doc=173,freq=1.0), product of:
              0.11246063 = queryWeight, product of:
                1.3365251 = boost
                5.8342032 = idf(docFreq=343, maxDocs=43254)
                0.014422543 = queryNorm
              0.3646377 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8342032 = idf(docFreq=343, maxDocs=43254)
                0.0625 = fieldNorm(doc=173)
          0.027609099 = weight(abstract_txt:large in 173) [ClassicSimilarity], result of:
            0.027609099 = score(doc=173,freq=1.0), product of:
              0.09889121 = queryWeight, product of:
                1.534975 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.014422543 = queryNorm
              0.27918658 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.0625 = fieldNorm(doc=173)
          0.008905091 = weight(abstract_txt:this in 173) [ClassicSimilarity], result of:
            0.008905091 = score(doc=173,freq=1.0), product of:
              0.058599215 = queryWeight, product of:
                1.6710267 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.014422543 = queryNorm
              0.15196605 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=173)
          0.03320449 = weight(abstract_txt:data in 173) [ClassicSimilarity], result of:
            0.03320449 = score(doc=173,freq=2.0), product of:
              0.11183749 = queryWeight, product of:
                2.308507 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.014422543 = queryNorm
              0.29689944 = fieldWeight in 173, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.0625 = fieldNorm(doc=173)
          0.034287687 = weight(abstract_txt:text in 173) [ClassicSimilarity], result of:
            0.034287687 = score(doc=173,freq=1.0), product of:
              0.13546629 = queryWeight, product of:
                2.3193297 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.014422543 = queryNorm
              0.25310862 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=173)
          0.06831837 = weight(abstract_txt:learning in 173) [ClassicSimilarity], result of:
            0.06831837 = score(doc=173,freq=1.0), product of:
              0.2279421 = queryWeight, product of:
                3.2957158 = boost
                4.7954893 = idf(docFreq=971, maxDocs=43254)
                0.014422543 = queryNorm
              0.29971808 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7954893 = idf(docFreq=971, maxDocs=43254)
                0.0625 = fieldNorm(doc=173)
          0.1571004 = weight(abstract_txt:classification in 173) [ClassicSimilarity], result of:
            0.1571004 = score(doc=173,freq=7.0), product of:
              0.23763776 = queryWeight, product of:
                4.1213627 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.014422543 = queryNorm
              0.66109186 = fieldWeight in 173, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.0625 = fieldNorm(doc=173)
          0.26985615 = weight(abstract_txt:supervised in 173) [ClassicSimilarity], result of:
            0.26985615 = score(doc=173,freq=1.0), product of:
              0.5695796 = queryWeight, product of:
                5.2097235 = boost
                7.5805006 = idf(docFreq=59, maxDocs=43254)
                0.014422543 = queryNorm
              0.4737813 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5805006 = idf(docFreq=59, maxDocs=43254)
                0.0625 = fieldNorm(doc=173)
        0.36 = coord(9/25)
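
The content-similarity scores above combine three steps: a per-term score (queryWeight times fieldWeight), a sum over the matching terms, and a coord factor for the share of query terms matched. The following is a minimal sketch of that arithmetic, assuming the same tf and idf formulas the listed values are consistent with; the function names are hypothetical and the inputs are copied from the first entry (doc 4453).

```python
import math

def classic_idf(doc_freq, max_docs):
    # Assumed idf form, consistent with the values in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm            # queryWeight
    field_weight = math.sqrt(freq) * idf * field_norm  # fieldWeight
    return query_weight * field_weight

def final_score(sum_of_term_scores, matched_terms, query_terms):
    # coord(matched/total) scales the summed term scores.
    return sum_of_term_scores * (matched_terms / query_terms)

# "supervised" in doc 4453: freq=4, docFreq=59, boost=5.2097235
s = term_score(4.0, 59, 43254, 5.2097235, 0.014422543, 0.0625)
print(round(s, 5))

# Entry 1: summed term scores 1.2531509 with coord(11/25)
print(round(final_score(1.2531509, 11, 25), 5))
```

Because idf enters both queryWeight and fieldWeight, rare terms such as "supervised" (docFreq=59) dominate the sum, while common terms such as "this" contribute very little.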