Document (#41096)

Author
Billal, B.
Fonseca, A.
Sadat, F.
Lounis, H.
Title
Semi-supervised learning and social media text analysis towards multi-labeling categorization
Source
IEEE International Conference on Big Data (Big Data) (2017)
Year
2017
Pages
S.1907-1916
Abstract
In traditional text classification, classes are mutually exclusive, i.e. it is not possible to have one text or text fragment classified into more than one class. On the other hand, in multi-label classification an individual text may belong to several classes simultaneously. This type of classification is required by a large number of current applications such as big data classification, images and video annotation. Supervised learning is the most used type of machine learning in the classification task. It requires large quantities of labeled data and the intervention of a human tagger in the creation of the training sets. When the data sets become very large or heavily noisy, this operation can be tedious, prone to error and time consuming. In this case, semi-supervised learning, which requires only few labels, is a better choice. In this paper, we study and evaluate several methods to address the problem of multi-label classification using semi-supervised learning and data from social networks. First, we propose a linguistic pre-processing involving tokeni-sation, recognition of named entities and hashtag segmentation in order to decrease the noise in this type of massive and unstructured real data and then we perform a word sense disambiguation using WordNet. Second, several experiments related to multi-label classification and semi-supervised learning are carried out on these data sets and compared to each other. These evaluations compare the results of the approaches considered. This paper proposes a method for combining semi-supervised methods with a graph method for the extraction of subjects in social networks using a multi-label classification approach. Experiments show that the performance of the proposed model increases in 4 p.p. the precision of the classification when compared to a baseline.
Footnote
Vgl.: doi:10.1109/BigData.2017.8258136
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Fonseca, F.: ¬The double role of ontologies in information science research (2007) 5.66
    5.661144 = sum of:
      5.661144 = weight(author_txt:fonseca in 277) [ClassicSimilarity], result of:
        5.661144 = fieldWeight in 277, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.625 = fieldNorm(doc=277)
    
  2. Fonseca, F.: Whether or when : the question on the use of theories in data science (2021) 5.66
    5.661144 = sum of:
      5.661144 = weight(author_txt:fonseca in 409) [ClassicSimilarity], result of:
        5.661144 = fieldWeight in 409, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.625 = fieldNorm(doc=409)
    
  3. Scott, M.; Fonseca, F.: Methodology for functional appraisal of records and creation of a functional thesaurus (1992) 4.53
    4.528915 = sum of:
      4.528915 = weight(author_txt:fonseca in 2096) [ClassicSimilarity], result of:
        4.528915 = fieldWeight in 2096, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.5 = fieldNorm(doc=2096)
    
  4. Fonseca, F.T.; Martin, J.E.: Toward an alternative notion of information systems ontologies : information engineering as a hermeneutic enterprise (2005) 4.53
    4.528915 = sum of:
      4.528915 = weight(author_txt:fonseca in 3266) [ClassicSimilarity], result of:
        4.528915 = fieldWeight in 3266, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.5 = fieldNorm(doc=3266)
    
  5. Câmara, G.; Fonseca, F.: Information policies and open source software in developing countries (2007) 4.53
    4.528915 = sum of:
      4.528915 = weight(author_txt:fonseca in 90) [ClassicSimilarity], result of:
        4.528915 = fieldWeight in 90, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.5 = fieldNorm(doc=90)
    

Similar documents (content)

  1. Ko, Y.; Seo, J.: Text classification from unlabeled documents with bootstrapping and feature projection techniques (2009) 0.54
    0.5449648 = sum of:
      0.5449648 = product of:
        1.2385564 = sum of:
          0.042826846 = weight(abstract_txt:method in 2452) [ClassicSimilarity], result of:
            0.042826846 = score(doc=2452,freq=5.0), product of:
              0.068084285 = queryWeight, product of:
                1.0345638 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.01462128 = queryNorm
              0.6290269 = fieldWeight in 2452, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.026509382 = weight(abstract_txt:compared in 2452) [ClassicSimilarity], result of:
            0.026509382 = score(doc=2452,freq=1.0), product of:
              0.08455888 = queryWeight, product of:
                1.1529579 = boost
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.01462128 = queryNorm
              0.31350204 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.0130861495 = weight(abstract_txt:using in 2452) [ClassicSimilarity], result of:
            0.0130861495 = score(doc=2452,freq=1.0), product of:
              0.06045949 = queryWeight, product of:
                1.1940204 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.01462128 = queryNorm
              0.21644491 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.03181281 = weight(abstract_txt:experiments in 2452) [ClassicSimilarity], result of:
            0.03181281 = score(doc=2452,freq=1.0), product of:
              0.09549065 = queryWeight, product of:
                1.2252206 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01462128 = queryNorm
              0.33315104 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.027841106 = weight(abstract_txt:large in 2452) [ClassicSimilarity], result of:
            0.027841106 = score(doc=2452,freq=1.0), product of:
              0.100010954 = queryWeight, product of:
                1.535689 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.01462128 = queryNorm
              0.27838057 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.008853627 = weight(abstract_txt:this in 2452) [ClassicSimilarity], result of:
            0.008853627 = score(doc=2452,freq=1.0), product of:
              0.058705762 = queryWeight, product of:
                1.6639292 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.01462128 = queryNorm
              0.1508136 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.09187514 = weight(abstract_txt:text in 2452) [ClassicSimilarity], result of:
            0.09187514 = score(doc=2452,freq=7.0), product of:
              0.1373954 = queryWeight, product of:
                2.3237529 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01462128 = queryNorm
              0.6686916 = fieldWeight in 2452, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.19111952 = weight(abstract_txt:learning in 2452) [ClassicSimilarity], result of:
            0.19111952 = score(doc=2452,freq=8.0), product of:
              0.2275656 = queryWeight, product of:
                3.2760293 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.01462128 = queryNorm
              0.83984363 = fieldWeight in 2452, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.14635828 = weight(abstract_txt:semi in 2452) [ClassicSimilarity], result of:
            0.14635828 = score(doc=2452,freq=1.0), product of:
              0.35849604 = queryWeight, product of:
                3.7535832 = boost
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.01462128 = queryNorm
              0.40825632 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.1344659 = weight(abstract_txt:classification in 2452) [ClassicSimilarity], result of:
            0.1344659 = score(doc=2452,freq=5.0), product of:
              0.24101742 = queryWeight, product of:
                4.1291847 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01462128 = queryNorm
              0.5579095 = fieldWeight in 2452, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.5238077 = weight(abstract_txt:supervised in 2452) [ClassicSimilarity], result of:
            0.5238077 = score(doc=2452,freq=4.0), product of:
              0.56151474 = queryWeight, product of:
                5.146063 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.01462128 = queryNorm
              0.9328476 = fieldWeight in 2452, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
        0.44 = coord(11/25)
    
  2. Stamatatos, E.: Author identification : using text sampling to handle the class imbalance problem (2008) 0.26
    0.25682208 = sum of:
      0.25682208 = product of:
        0.71339464 = sum of:
          0.03181281 = weight(abstract_txt:experiments in 2063) [ClassicSimilarity], result of:
            0.03181281 = score(doc=2063,freq=1.0), product of:
              0.09549065 = queryWeight, product of:
                1.2252206 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01462128 = queryNorm
              0.33315104 = fieldWeight in 2063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=2063)
          0.08302039 = weight(abstract_txt:classes in 2063) [ClassicSimilarity], result of:
            0.08302039 = score(doc=2063,freq=4.0), product of:
              0.11402393 = queryWeight, product of:
                1.3388498 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.01462128 = queryNorm
              0.7280962 = fieldWeight in 2063, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=2063)
          0.01252092 = weight(abstract_txt:this in 2063) [ClassicSimilarity], result of:
            0.01252092 = score(doc=2063,freq=2.0), product of:
              0.058705762 = queryWeight, product of:
                1.6639292 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.01462128 = queryNorm
              0.21328263 = fieldWeight in 2063, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=2063)
          0.023402039 = weight(abstract_txt:data in 2063) [ClassicSimilarity], result of:
            0.023402039 = score(doc=2063,freq=1.0), product of:
              0.1122283 = queryWeight, product of:
                2.3006241 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01462128 = queryNorm
              0.20852174 = fieldWeight in 2063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=2063)
          0.08505986 = weight(abstract_txt:text in 2063) [ClassicSimilarity], result of:
            0.08505986 = score(doc=2063,freq=6.0), product of:
              0.1373954 = queryWeight, product of:
                2.3237529 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01462128 = queryNorm
              0.6190881 = fieldWeight in 2063, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2063)
          0.067570955 = weight(abstract_txt:learning in 2063) [ClassicSimilarity], result of:
            0.067570955 = score(doc=2063,freq=1.0), product of:
              0.2275656 = queryWeight, product of:
                3.2760293 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.01462128 = queryNorm
              0.29692957 = fieldWeight in 2063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=2063)
          0.15883265 = weight(abstract_txt:label in 2063) [ClassicSimilarity], result of:
            0.15883265 = score(doc=2063,freq=1.0), product of:
              0.35144928 = queryWeight, product of:
                3.3241467 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.01462128 = queryNorm
              0.4519362 = fieldWeight in 2063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.0625 = fieldNorm(doc=2063)
          0.19104008 = weight(abstract_txt:multi in 2063) [ClassicSimilarity], result of:
            0.19104008 = score(doc=2063,freq=3.0), product of:
              0.29688078 = queryWeight, product of:
                3.4158194 = boost
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.01462128 = queryNorm
              0.6434909 = fieldWeight in 2063, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.0625 = fieldNorm(doc=2063)
          0.060134977 = weight(abstract_txt:classification in 2063) [ClassicSimilarity], result of:
            0.060134977 = score(doc=2063,freq=1.0), product of:
              0.24101742 = queryWeight, product of:
                4.1291847 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01462128 = queryNorm
              0.2495047 = fieldWeight in 2063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2063)
        0.36 = coord(9/25)
    
  3. Rodríguez-Vidal, J.; Gonzalo, J.; Plaza, L.; Anaya Sánchez, H.: Automatic detection of influencers in social networks : authority versus domain signals (2019) 0.26
    0.25605428 = sum of:
      0.25605428 = product of:
        0.71126187 = sum of:
          0.0130861495 = weight(abstract_txt:using in 5301) [ClassicSimilarity], result of:
            0.0130861495 = score(doc=5301,freq=1.0), product of:
              0.06045949 = queryWeight, product of:
                1.1940204 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.01462128 = queryNorm
              0.21644491 = fieldWeight in 5301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=5301)
          0.03181281 = weight(abstract_txt:experiments in 5301) [ClassicSimilarity], result of:
            0.03181281 = score(doc=5301,freq=1.0), product of:
              0.09549065 = queryWeight, product of:
                1.2252206 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01462128 = queryNorm
              0.33315104 = fieldWeight in 5301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=5301)
          0.023637528 = weight(abstract_txt:social in 5301) [ClassicSimilarity], result of:
            0.023637528 = score(doc=5301,freq=1.0), product of:
              0.08967222 = queryWeight, product of:
                1.4541475 = boost
                4.2175875 = idf(docFreq=1770, maxDocs=44218)
                0.01462128 = queryNorm
              0.26359922 = fieldWeight in 5301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2175875 = idf(docFreq=1770, maxDocs=44218)
                0.0625 = fieldNorm(doc=5301)
          0.027841106 = weight(abstract_txt:large in 5301) [ClassicSimilarity], result of:
            0.027841106 = score(doc=5301,freq=1.0), product of:
              0.100010954 = queryWeight, product of:
                1.535689 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.01462128 = queryNorm
              0.27838057 = fieldWeight in 5301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=5301)
          0.043922976 = weight(abstract_txt:sets in 5301) [ClassicSimilarity], result of:
            0.043922976 = score(doc=5301,freq=1.0), product of:
              0.13553488 = queryWeight, product of:
                1.7877427 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.01462128 = queryNorm
              0.32407138 = fieldWeight in 5301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.0625 = fieldNorm(doc=5301)
          0.023402039 = weight(abstract_txt:data in 5301) [ClassicSimilarity], result of:
            0.023402039 = score(doc=5301,freq=1.0), product of:
              0.1122283 = queryWeight, product of:
                2.3006241 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01462128 = queryNorm
              0.20852174 = fieldWeight in 5301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=5301)
          0.11703632 = weight(abstract_txt:learning in 5301) [ClassicSimilarity], result of:
            0.11703632 = score(doc=5301,freq=3.0), product of:
              0.2275656 = queryWeight, product of:
                3.2760293 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.01462128 = queryNorm
              0.51429707 = fieldWeight in 5301, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=5301)
          0.060134977 = weight(abstract_txt:classification in 5301) [ClassicSimilarity], result of:
            0.060134977 = score(doc=5301,freq=1.0), product of:
              0.24101742 = queryWeight, product of:
                4.1291847 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01462128 = queryNorm
              0.2495047 = fieldWeight in 5301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=5301)
          0.37038794 = weight(abstract_txt:supervised in 5301) [ClassicSimilarity], result of:
            0.37038794 = score(doc=5301,freq=2.0), product of:
              0.56151474 = queryWeight, product of:
                5.146063 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.01462128 = queryNorm
              0.65962285 = fieldWeight in 5301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=5301)
        0.36 = coord(9/25)
    
  4. Xu, L.; Qiu, J.: Unsupervised multi-class sentiment classification approach (2019) 0.25
    0.2482942 = sum of:
      0.2482942 = product of:
        0.886765 = sum of:
          0.03317353 = weight(abstract_txt:method in 5003) [ClassicSimilarity], result of:
            0.03317353 = score(doc=5003,freq=3.0), product of:
              0.068084285 = queryWeight, product of:
                1.0345638 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.01462128 = queryNorm
              0.4872421 = fieldWeight in 5003, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.026509382 = weight(abstract_txt:compared in 5003) [ClassicSimilarity], result of:
            0.026509382 = score(doc=5003,freq=1.0), product of:
              0.08455888 = queryWeight, product of:
                1.1529579 = boost
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.01462128 = queryNorm
              0.31350204 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0160327 = idf(docFreq=796, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.0130861495 = weight(abstract_txt:using in 5003) [ClassicSimilarity], result of:
            0.0130861495 = score(doc=5003,freq=1.0), product of:
              0.06045949 = queryWeight, product of:
                1.1940204 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.01462128 = queryNorm
              0.21644491 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.24663168 = weight(abstract_txt:multi in 5003) [ClassicSimilarity], result of:
            0.24663168 = score(doc=5003,freq=5.0), product of:
              0.29688078 = queryWeight, product of:
                3.4158194 = boost
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.01462128 = queryNorm
              0.8307432 = fieldWeight in 5003, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.14635828 = weight(abstract_txt:semi in 5003) [ClassicSimilarity], result of:
            0.14635828 = score(doc=5003,freq=1.0), product of:
              0.35849604 = queryWeight, product of:
                3.7535832 = boost
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.01462128 = queryNorm
              0.40825632 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.15910219 = weight(abstract_txt:classification in 5003) [ClassicSimilarity], result of:
            0.15910219 = score(doc=5003,freq=7.0), product of:
              0.24101742 = queryWeight, product of:
                4.1291847 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01462128 = queryNorm
              0.66012734 = fieldWeight in 5003, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.26190385 = weight(abstract_txt:supervised in 5003) [ClassicSimilarity], result of:
            0.26190385 = score(doc=5003,freq=1.0), product of:
              0.56151474 = queryWeight, product of:
                5.146063 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.01462128 = queryNorm
              0.4664238 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
        0.28 = coord(7/25)
    
  5. Wang, J.: ¬An extensive study on automated Dewey Decimal Classification (2009) 0.24
    0.24465352 = sum of:
      0.24465352 = product of:
        0.6795931 = sum of:
          0.044990104 = weight(abstract_txt:experiments in 3172) [ClassicSimilarity], result of:
            0.044990104 = score(doc=3172,freq=2.0), product of:
              0.09549065 = queryWeight, product of:
                1.2252206 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01462128 = queryNorm
              0.4711467 = fieldWeight in 3172, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.041510195 = weight(abstract_txt:classes in 3172) [ClassicSimilarity], result of:
            0.041510195 = score(doc=3172,freq=1.0), product of:
              0.11402393 = queryWeight, product of:
                1.3388498 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.01462128 = queryNorm
              0.3640481 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.027841106 = weight(abstract_txt:large in 3172) [ClassicSimilarity], result of:
            0.027841106 = score(doc=3172,freq=1.0), product of:
              0.100010954 = queryWeight, product of:
                1.535689 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.01462128 = queryNorm
              0.27838057 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.008853627 = weight(abstract_txt:this in 3172) [ClassicSimilarity], result of:
            0.008853627 = score(doc=3172,freq=1.0), product of:
              0.058705762 = queryWeight, product of:
                1.6639292 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.01462128 = queryNorm
              0.1508136 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.033095483 = weight(abstract_txt:data in 3172) [ClassicSimilarity], result of:
            0.033095483 = score(doc=3172,freq=2.0), product of:
              0.1122283 = queryWeight, product of:
                2.3006241 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01462128 = queryNorm
              0.29489428 = fieldWeight in 3172, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.034725543 = weight(abstract_txt:text in 3172) [ClassicSimilarity], result of:
            0.034725543 = score(doc=3172,freq=1.0), product of:
              0.1373954 = queryWeight, product of:
                2.3237529 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01462128 = queryNorm
              0.25274166 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.067570955 = weight(abstract_txt:learning in 3172) [ClassicSimilarity], result of:
            0.067570955 = score(doc=3172,freq=1.0), product of:
              0.2275656 = queryWeight, product of:
                3.2760293 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.01462128 = queryNorm
              0.29692957 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.15910219 = weight(abstract_txt:classification in 3172) [ClassicSimilarity], result of:
            0.15910219 = score(doc=3172,freq=7.0), product of:
              0.24101742 = queryWeight, product of:
                4.1291847 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01462128 = queryNorm
              0.66012734 = fieldWeight in 3172, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
          0.26190385 = weight(abstract_txt:supervised in 3172) [ClassicSimilarity], result of:
            0.26190385 = score(doc=3172,freq=1.0), product of:
              0.56151474 = queryWeight, product of:
                5.146063 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.01462128 = queryNorm
              0.4664238 = fieldWeight in 3172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=3172)
        0.36 = coord(9/25)