Document (#38185)

Author
Silva, R.M.
Gonçalves, M.A.
Veloso, A.
Title
¬A Two-stage active learning method for learning to rank
Source
Journal of the Association for Information Science and Technology. 65(2014) no.1, S.109-128
Year
2014
Abstract
Learning to rank (L2R) algorithms use a labeled training set to generate a ranking model that can later be used to rank new query results. These training sets are costly and laborious to produce, requiring human annotators to assess the relevance or order of the documents in relation to a query. Active learning algorithms are able to reduce the labeling effort by selectively sampling an unlabeled set and choosing data instances that maximize a learning function's effectiveness. In this article, we propose a novel two-stage active learning method for L2R that combines and exploits interesting properties of its constituent parts, thus being effective and practical. In the first stage, an association rule active sampling algorithm is used to select a very small but effective initial training set. In the second stage, a query-by-committee strategy trained with the first-stage set is used to iteratively select more examples until a preset labeling budget is met or a target effectiveness is achieved. We test our method with various LETOR benchmarking data sets and compare it with several baselines to show that it achieves good results using only a small portion of the original training sets.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.22958/abstract.
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Cortez, E.; Silva, A.S. da; Gonçalves, M.A.; Mesquita, F.; Moura, E.S. de: ¬A flexible approach for extracting metadata from bibliographic citations (2009) 2.84
    2.8407636 = sum of:
      2.8407636 = sum of:
        1.1440027 = weight(author_txt:silva in 2848) [ClassicSimilarity], result of:
          1.1440027 = score(doc=2848,freq=1.0), product of:
            0.60954696 = queryWeight, product of:
              7.5072327 = idf(docFreq=65, maxDocs=44218)
              0.081194624 = queryNorm
            1.8768082 = fieldWeight in 2848, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5072327 = idf(docFreq=65, maxDocs=44218)
              0.25 = fieldNorm(doc=2848)
        1.696761 = weight(author_txt:gonçalves in 2848) [ClassicSimilarity], result of:
          1.696761 = score(doc=2848,freq=1.0), product of:
            0.79275 = queryWeight, product of:
              1.1404192 = boost
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.081194624 = queryNorm
            2.1403482 = fieldWeight in 2848, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.25 = fieldNorm(doc=2848)
    
  2. Moura, E.S. de; Fernandes, D.; Ribeiro-Neto, B.; Silva, A.S. da; Gonçalves, M.A.: Using structural information to improve search in Web collections (2010) 2.84
    2.8407636 = sum of:
      2.8407636 = sum of:
        1.1440027 = weight(author_txt:silva in 4119) [ClassicSimilarity], result of:
          1.1440027 = score(doc=4119,freq=1.0), product of:
            0.60954696 = queryWeight, product of:
              7.5072327 = idf(docFreq=65, maxDocs=44218)
              0.081194624 = queryNorm
            1.8768082 = fieldWeight in 4119, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5072327 = idf(docFreq=65, maxDocs=44218)
              0.25 = fieldNorm(doc=4119)
        1.696761 = weight(author_txt:gonçalves in 4119) [ClassicSimilarity], result of:
          1.696761 = score(doc=4119,freq=1.0), product of:
            0.79275 = queryWeight, product of:
              1.1404192 = boost
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.081194624 = queryNorm
            2.1403482 = fieldWeight in 4119, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.25 = fieldNorm(doc=4119)
    
  3. Silva, A.J.C.; Gonçalves, M.A.; Laender, A.H.F.; Modesto, M.A.B.; Cristo, M.; Ziviani, N.: Finding what is missing from a digital library : a case study in the computer science field (2009) 2.84
    2.8407636 = sum of:
      2.8407636 = sum of:
        1.1440027 = weight(author_txt:silva in 4219) [ClassicSimilarity], result of:
          1.1440027 = score(doc=4219,freq=1.0), product of:
            0.60954696 = queryWeight, product of:
              7.5072327 = idf(docFreq=65, maxDocs=44218)
              0.081194624 = queryNorm
            1.8768082 = fieldWeight in 4219, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5072327 = idf(docFreq=65, maxDocs=44218)
              0.25 = fieldNorm(doc=4219)
        1.696761 = weight(author_txt:gonçalves in 4219) [ClassicSimilarity], result of:
          1.696761 = score(doc=4219,freq=1.0), product of:
            0.79275 = queryWeight, product of:
              1.1404192 = boost
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.081194624 = queryNorm
            2.1403482 = fieldWeight in 4219, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.25 = fieldNorm(doc=4219)
    
  4. Cavalcante Dourado, Í.; Galante, R.; Gonçalves, M.A.; Silva Torres, R. de: Bag of textual graphs (BoTG) : a general graph-based text representation model (2019) 2.84
    2.8407636 = sum of:
      2.8407636 = sum of:
        1.1440027 = weight(author_txt:silva in 5291) [ClassicSimilarity], result of:
          1.1440027 = score(doc=5291,freq=1.0), product of:
            0.60954696 = queryWeight, product of:
              7.5072327 = idf(docFreq=65, maxDocs=44218)
              0.081194624 = queryNorm
            1.8768082 = fieldWeight in 5291, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5072327 = idf(docFreq=65, maxDocs=44218)
              0.25 = fieldNorm(doc=5291)
        1.696761 = weight(author_txt:gonçalves in 5291) [ClassicSimilarity], result of:
          1.696761 = score(doc=5291,freq=1.0), product of:
            0.79275 = queryWeight, product of:
              1.1404192 = boost
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.081194624 = queryNorm
            2.1403482 = fieldWeight in 5291, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.25 = fieldNorm(doc=5291)
    
  5. Sant'Ana, R.C. Gonçalves => Gonçalves Sant'Ana, R.C.: 1.80
    1.7996869 = sum of:
      1.7996869 = product of:
        3.5993738 = sum of:
          3.5993738 = weight(author_txt:gonçalves in 4732) [ClassicSimilarity], result of:
            3.5993738 = score(doc=4732,freq=2.0), product of:
              0.79275 = queryWeight, product of:
                1.1404192 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.081194624 = queryNorm
              4.5403643 = fieldWeight in 4732, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.375 = fieldNorm(doc=4732)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Ko, Y.; Seo, J.: Text classification from unlabeled documents with bootstrapping and feature projection techniques (2009) 0.24
    0.2431578 = sum of:
      0.2431578 = product of:
        0.75986814 = sum of:
          0.15747201 = weight(abstract_txt:unlabeled in 2452) [ClassicSimilarity], result of:
            0.15747201 = score(doc=2452,freq=2.0), product of:
              0.18964608 = queryWeight, product of:
                1.1413108 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.017687865 = queryNorm
              0.8303468 = fieldWeight in 2452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.010107093 = weight(abstract_txt:that in 2452) [ClassicSimilarity], result of:
            0.010107093 = score(doc=2452,freq=2.0), product of:
              0.048259083 = queryWeight, product of:
                1.1514671 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017687865 = queryNorm
              0.20943399 = fieldWeight in 2452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.015274329 = weight(abstract_txt:used in 2452) [ClassicSimilarity], result of:
            0.015274329 = score(doc=2452,freq=1.0), product of:
              0.07275008 = queryWeight, product of:
                1.2243606 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.017687865 = queryNorm
              0.2099562 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.049952794 = weight(abstract_txt:algorithms in 2452) [ClassicSimilarity], result of:
            0.049952794 = score(doc=2452,freq=1.0), product of:
              0.14002366 = queryWeight, product of:
                1.3869082 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.017687865 = queryNorm
              0.35674536 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.08215115 = weight(abstract_txt:method in 2452) [ClassicSimilarity], result of:
            0.08215115 = score(doc=2452,freq=5.0), product of:
              0.13060038 = queryWeight, product of:
                1.6404569 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.017687865 = queryNorm
              0.6290269 = fieldWeight in 2452, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.12873365 = weight(abstract_txt:labeling in 2452) [ClassicSimilarity], result of:
            0.12873365 = score(doc=2452,freq=1.0), product of:
              0.26320228 = queryWeight, product of:
                1.9014802 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.017687865 = queryNorm
              0.48910537 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.071771465 = weight(abstract_txt:training in 2452) [ClassicSimilarity], result of:
            0.071771465 = score(doc=2452,freq=1.0), product of:
              0.2246326 = queryWeight, product of:
                2.4842677 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.017687865 = queryNorm
              0.319506 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.24440572 = weight(abstract_txt:learning in 2452) [ClassicSimilarity], result of:
            0.24440572 = score(doc=2452,freq=8.0), product of:
              0.29101336 = queryWeight, product of:
                3.4630923 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.017687865 = queryNorm
              0.83984363 = fieldWeight in 2452, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
        0.32 = coord(8/25)
    
  2. Xu, B.; Lin, H.; Lin, Y.: Assessment of learning to rank methods for query expansion (2016) 0.23
    0.22852248 = sum of:
      0.22852248 = product of:
        0.7141328 = sum of:
          0.0071467934 = weight(abstract_txt:that in 2929) [ClassicSimilarity], result of:
            0.0071467934 = score(doc=2929,freq=1.0), product of:
              0.048259083 = queryWeight, product of:
                1.1514671 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017687865 = queryNorm
              0.1480922 = fieldWeight in 2929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2929)
          0.042969365 = weight(abstract_txt:effective in 2929) [ClassicSimilarity], result of:
            0.042969365 = score(doc=2929,freq=2.0), product of:
              0.10052118 = queryWeight, product of:
                1.1751026 = boost
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.017687865 = queryNorm
              0.4274658 = fieldWeight in 2929, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.0625 = fieldNorm(doc=2929)
          0.015274329 = weight(abstract_txt:used in 2929) [ClassicSimilarity], result of:
            0.015274329 = score(doc=2929,freq=1.0), product of:
              0.07275008 = queryWeight, product of:
                1.2243606 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.017687865 = queryNorm
              0.2099562 = fieldWeight in 2929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=2929)
          0.051956944 = weight(abstract_txt:method in 2929) [ClassicSimilarity], result of:
            0.051956944 = score(doc=2929,freq=2.0), product of:
              0.13060038 = queryWeight, product of:
                1.6404569 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.017687865 = queryNorm
              0.3978315 = fieldWeight in 2929, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=2929)
          0.061213065 = weight(abstract_txt:query in 2929) [ClassicSimilarity], result of:
            0.061213065 = score(doc=2929,freq=2.0), product of:
              0.14568385 = queryWeight, product of:
                1.7326 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.017687865 = queryNorm
              0.4201774 = fieldWeight in 2929, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=2929)
          0.12873365 = weight(abstract_txt:labeling in 2929) [ClassicSimilarity], result of:
            0.12873365 = score(doc=2929,freq=1.0), product of:
              0.26320228 = queryWeight, product of:
                1.9014802 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.017687865 = queryNorm
              0.48910537 = fieldWeight in 2929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.0625 = fieldNorm(doc=2929)
          0.21361901 = weight(abstract_txt:rank in 2929) [ClassicSimilarity], result of:
            0.21361901 = score(doc=2929,freq=4.0), product of:
              0.26603082 = queryWeight, product of:
                2.3413084 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.017687865 = queryNorm
              0.80298597 = fieldWeight in 2929, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.0625 = fieldNorm(doc=2929)
          0.19321969 = weight(abstract_txt:learning in 2929) [ClassicSimilarity], result of:
            0.19321969 = score(doc=2929,freq=5.0), product of:
              0.29101336 = queryWeight, product of:
                3.4630923 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.017687865 = queryNorm
              0.66395473 = fieldWeight in 2929, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=2929)
        0.32 = coord(8/25)
    
  3. Kuo, J.-S.; Li, H.; Yang, Y.-K.: Active learning for constructing transliteration lexicons from the Web (2008) 0.22
    0.21832986 = sum of:
      0.21832986 = product of:
        0.9097078 = sum of:
          0.130494 = weight(abstract_txt:iteratively in 1345) [ClassicSimilarity], result of:
            0.130494 = score(doc=1345,freq=1.0), product of:
              0.16087347 = queryWeight, product of:
                1.0511731 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.017687865 = queryNorm
              0.8111592 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.018567912 = weight(abstract_txt:that in 1345) [ClassicSimilarity], result of:
            0.018567912 = score(doc=1345,freq=3.0), product of:
              0.048259083 = queryWeight, product of:
                1.1514671 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017687865 = queryNorm
              0.38475478 = fieldWeight in 1345, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.045575894 = weight(abstract_txt:effective in 1345) [ClassicSimilarity], result of:
            0.045575894 = score(doc=1345,freq=1.0), product of:
              0.10052118 = queryWeight, product of:
                1.1751026 = boost
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.017687865 = queryNorm
              0.45339596 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.19310048 = weight(abstract_txt:labeling in 1345) [ClassicSimilarity], result of:
            0.19310048 = score(doc=1345,freq=1.0), product of:
              0.26320228 = queryWeight, product of:
                1.9014802 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.017687865 = queryNorm
              0.7336581 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.20447712 = weight(abstract_txt:active in 1345) [ClassicSimilarity], result of:
            0.20447712 = score(doc=1345,freq=1.0), product of:
              0.34451428 = queryWeight, product of:
                3.0765617 = boost
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.017687865 = queryNorm
              0.5935229 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.31749237 = weight(abstract_txt:learning in 1345) [ClassicSimilarity], result of:
            0.31749237 = score(doc=1345,freq=6.0), product of:
              0.29101336 = queryWeight, product of:
                3.4630923 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.017687865 = queryNorm
              1.090989 = fieldWeight in 1345, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
        0.24 = coord(6/25)
    
  4. Lin, Y.; Lin, H.; Xu, K.; Sun, X.: Learning to rank using smoothing methods for language modeling (2013) 0.21
    0.205844 = sum of:
      0.205844 = product of:
        0.6432625 = sum of:
          0.0071467934 = weight(abstract_txt:that in 687) [ClassicSimilarity], result of:
            0.0071467934 = score(doc=687,freq=1.0), product of:
              0.048259083 = queryWeight, product of:
                1.1514671 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017687865 = queryNorm
              0.1480922 = fieldWeight in 687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=687)
          0.042969365 = weight(abstract_txt:effective in 687) [ClassicSimilarity], result of:
            0.042969365 = score(doc=687,freq=2.0), product of:
              0.10052118 = queryWeight, product of:
                1.1751026 = boost
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.017687865 = queryNorm
              0.4274658 = fieldWeight in 687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.0625 = fieldNorm(doc=687)
          0.050342634 = weight(abstract_txt:effectiveness in 687) [ClassicSimilarity], result of:
            0.050342634 = score(doc=687,freq=2.0), product of:
              0.111714326 = queryWeight, product of:
                1.2388006 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.017687865 = queryNorm
              0.45063722 = fieldWeight in 687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.0625 = fieldNorm(doc=687)
          0.06518011 = weight(abstract_txt:select in 687) [ClassicSimilarity], result of:
            0.06518011 = score(doc=687,freq=1.0), product of:
              0.16720079 = queryWeight, product of:
                1.5155356 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.017687865 = queryNorm
              0.38983136 = fieldWeight in 687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.0625 = fieldNorm(doc=687)
          0.063634 = weight(abstract_txt:method in 687) [ClassicSimilarity], result of:
            0.063634 = score(doc=687,freq=3.0), product of:
              0.13060038 = queryWeight, product of:
                1.6404569 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.017687865 = queryNorm
              0.4872421 = fieldWeight in 687, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=687)
          0.056169175 = weight(abstract_txt:sets in 687) [ClassicSimilarity], result of:
            0.056169175 = score(doc=687,freq=1.0), product of:
              0.17332347 = queryWeight, product of:
                1.8898238 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.017687865 = queryNorm
              0.32407138 = fieldWeight in 687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.0625 = fieldNorm(doc=687)
          0.1849995 = weight(abstract_txt:rank in 687) [ClassicSimilarity], result of:
            0.1849995 = score(doc=687,freq=3.0), product of:
              0.26603082 = queryWeight, product of:
                2.3413084 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.017687865 = queryNorm
              0.69540626 = fieldWeight in 687, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.0625 = fieldNorm(doc=687)
          0.17282094 = weight(abstract_txt:learning in 687) [ClassicSimilarity], result of:
            0.17282094 = score(doc=687,freq=4.0), product of:
              0.29101336 = queryWeight, product of:
                3.4630923 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.017687865 = queryNorm
              0.59385914 = fieldWeight in 687, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=687)
        0.32 = coord(8/25)
    
  5. Wu, T.; Pottenger, W.M.: ¬A semi-supervised active learning algorithm for information extraction from textual data (2005) 0.19
    0.19288561 = sum of:
      0.19288561 = product of:
        0.8036901 = sum of:
          0.0071467934 = weight(abstract_txt:that in 3237) [ClassicSimilarity], result of:
            0.0071467934 = score(doc=3237,freq=1.0), product of:
              0.048259083 = queryWeight, product of:
                1.1514671 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017687865 = queryNorm
              0.1480922 = fieldWeight in 3237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
          0.026455915 = weight(abstract_txt:used in 3237) [ClassicSimilarity], result of:
            0.026455915 = score(doc=3237,freq=3.0), product of:
              0.07275008 = queryWeight, product of:
                1.2243606 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.017687865 = queryNorm
              0.3636548 = fieldWeight in 3237, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
          0.12873365 = weight(abstract_txt:labeling in 3237) [ClassicSimilarity], result of:
            0.12873365 = score(doc=3237,freq=1.0), product of:
              0.26320228 = queryWeight, product of:
                1.9014802 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.017687865 = queryNorm
              0.48910537 = fieldWeight in 3237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
          0.12431181 = weight(abstract_txt:training in 3237) [ClassicSimilarity], result of:
            0.12431181 = score(doc=3237,freq=3.0), product of:
              0.2246326 = queryWeight, product of:
                2.4842677 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.017687865 = queryNorm
              0.5534006 = fieldWeight in 3237, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
          0.27263618 = weight(abstract_txt:active in 3237) [ClassicSimilarity], result of:
            0.27263618 = score(doc=3237,freq=4.0), product of:
              0.34451428 = queryWeight, product of:
                3.0765617 = boost
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.017687865 = queryNorm
              0.7913639 = fieldWeight in 3237, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
          0.24440572 = weight(abstract_txt:learning in 3237) [ClassicSimilarity], result of:
            0.24440572 = score(doc=3237,freq=8.0), product of:
              0.29101336 = queryWeight, product of:
                3.4630923 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.017687865 = queryNorm
              0.83984363 = fieldWeight in 3237, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
        0.24 = coord(6/25)