Document (#36121)

Author
Moura, E.S. de
Fernandes, D.
Ribeiro-Neto, B.
Silva, A.S. da
Gonçalves, M.A.
Title
Using structural information to improve search in Web collections
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.12, pp. 2503-2513
Year
2010
Abstract
In this work, we investigate the problem of using the block structure of Web pages to improve ranking results. Starting with basic intuitions provided by the concepts of term frequency (TF) and inverse document frequency (IDF), we propose nine block-weight functions to distinguish the impact of term occurrences inside page blocks, instead of inside whole pages. These are then used to compute a modified BM25 ranking function. Using four distinct Web collections, we ran extensive experiments to compare our block-weight ranking formulas with two baselines: (a) a BM25 ranking applied to full pages, and (b) a BM25 ranking that takes into account best blocks. Our results suggest that our block-weighting ranking method is superior to both baselines across all collections we used, with average gains in precision figures ranging from 5 to 20%.
Theme
Retrieval algorithms

Similar documents (author)

  1. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 2.40
    2.3950434 = sum of:
      2.3950434 = product of:
        3.592565 = sum of:
          0.81040424 = weight(author_txt:gonçalves in 922) [ClassicSimilarity], result of:
            0.81040424 = score(doc=922,freq=1.0), product of:
              0.37862095 = queryWeight, product of:
                1.1302278 = boost
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.039127458 = queryNorm
              2.1404104 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.25 = fieldNorm(doc=922)
          0.8377715 = weight(author_txt:moura in 922) [ClassicSimilarity], result of:
            0.8377715 = score(doc=922,freq=1.0), product of:
              0.38709766 = queryWeight, product of:
                1.1428097 = boost
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.039127458 = queryNorm
              2.164238 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.25 = fieldNorm(doc=922)
          0.8527516 = weight(author_txt:ribeiro in 922) [ClassicSimilarity], result of:
            0.8527516 = score(doc=922,freq=1.0), product of:
              0.39169848 = queryWeight, product of:
                1.1495811 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.039127458 = queryNorm
              2.1770613 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.25 = fieldNorm(doc=922)
          1.0916376 = weight(author_txt:neto in 922) [ClassicSimilarity], result of:
            1.0916376 = score(doc=922,freq=1.0), product of:
              0.46180204 = queryWeight, product of:
                1.2482213 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.039127458 = queryNorm
              2.363865 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.25 = fieldNorm(doc=922)
        0.6666667 = coord(4/6)
    
  2. Couto, T.; Cristo, M.; Gonçalves, M.A.; Calado, P.; Ziviani, N.; Moura, E.; Ribeiro-Neto, B.: A comparative study of citations and links in document classification (2006) 2.40
    2.3950434 = sum of:
      2.3950434 = product of:
        3.592565 = sum of:
          0.81040424 = weight(author_txt:gonçalves in 351) [ClassicSimilarity], result of:
            0.81040424 = score(doc=351,freq=1.0), product of:
              0.37862095 = queryWeight, product of:
                1.1302278 = boost
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.039127458 = queryNorm
              2.1404104 = fieldWeight in 351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.25 = fieldNorm(doc=351)
          0.8377715 = weight(author_txt:moura in 351) [ClassicSimilarity], result of:
            0.8377715 = score(doc=351,freq=1.0), product of:
              0.38709766 = queryWeight, product of:
                1.1428097 = boost
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.039127458 = queryNorm
              2.164238 = fieldWeight in 351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.25 = fieldNorm(doc=351)
          0.8527516 = weight(author_txt:ribeiro in 351) [ClassicSimilarity], result of:
            0.8527516 = score(doc=351,freq=1.0), product of:
              0.39169848 = queryWeight, product of:
                1.1495811 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.039127458 = queryNorm
              2.1770613 = fieldWeight in 351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.25 = fieldNorm(doc=351)
          1.0916376 = weight(author_txt:neto in 351) [ClassicSimilarity], result of:
            1.0916376 = score(doc=351,freq=1.0), product of:
              0.46180204 = queryWeight, product of:
                1.2482213 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.039127458 = queryNorm
              2.363865 = fieldWeight in 351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.25 = fieldNorm(doc=351)
        0.6666667 = coord(4/6)
    
  3. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: A generic Web-based entity resolution framework (2011) 1.38
    1.3773967 = sum of:
      1.3773967 = product of:
        2.7547934 = sum of:
          0.81040424 = weight(author_txt:gonçalves in 1451) [ClassicSimilarity], result of:
            0.81040424 = score(doc=1451,freq=1.0), product of:
              0.37862095 = queryWeight, product of:
                1.1302278 = boost
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.039127458 = queryNorm
              2.1404104 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.25 = fieldNorm(doc=1451)
          0.8527516 = weight(author_txt:ribeiro in 1451) [ClassicSimilarity], result of:
            0.8527516 = score(doc=1451,freq=1.0), product of:
              0.39169848 = queryWeight, product of:
                1.1495811 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.039127458 = queryNorm
              2.1770613 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.25 = fieldNorm(doc=1451)
          1.0916376 = weight(author_txt:neto in 1451) [ClassicSimilarity], result of:
            1.0916376 = score(doc=1451,freq=1.0), product of:
              0.46180204 = queryWeight, product of:
                1.2482213 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.039127458 = queryNorm
              2.363865 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.25 = fieldNorm(doc=1451)
        0.5 = coord(3/6)
    
  4. Costa Carvalho, A. da; Rossi, C.; Moura, E.S. de; Silva, A.S. da; Fernandes, D.: LePrEF: Learn to precompute evidence fusion for efficient query evaluation (2012) 1.32
    1.3186309 = sum of:
      1.3186309 = product of:
        2.6372619 = sum of:
          0.56131124 = weight(author_txt:silva in 2279) [ClassicSimilarity], result of:
            0.56131124 = score(doc=2279,freq=1.0), product of:
              0.29639623 = queryWeight, product of:
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.039127458 = queryNorm
              1.8937867 = fieldWeight in 2279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.25 = fieldNorm(doc=2279)
          0.8377715 = weight(author_txt:moura in 2279) [ClassicSimilarity], result of:
            0.8377715 = score(doc=2279,freq=1.0), product of:
              0.38709766 = queryWeight, product of:
                1.1428097 = boost
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.039127458 = queryNorm
              2.164238 = fieldWeight in 2279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.25 = fieldNorm(doc=2279)
          1.2381793 = weight(author_txt:fernandes in 2279) [ClassicSimilarity], result of:
            1.2381793 = score(doc=2279,freq=1.0), product of:
              0.5022569 = queryWeight, product of:
                1.3017471 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.039127458 = queryNorm
              2.4652312 = fieldWeight in 2279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.25 = fieldNorm(doc=2279)
        0.5 = coord(3/6)
    
  5. Silveira, M.; Ribeiro-Neto, B.: Concept-based ranking : a case study in the juridical domain (2004) 1.13
    1.134227 = sum of:
      1.134227 = product of:
        3.402681 = sum of:
          1.4923153 = weight(author_txt:ribeiro in 3340) [ClassicSimilarity], result of:
            1.4923153 = score(doc=3340,freq=1.0), product of:
              0.39169848 = queryWeight, product of:
                1.1495811 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.039127458 = queryNorm
              3.8098574 = fieldWeight in 3340, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.4375 = fieldNorm(doc=3340)
          1.9103658 = weight(author_txt:neto in 3340) [ClassicSimilarity], result of:
            1.9103658 = score(doc=3340,freq=1.0), product of:
              0.46180204 = queryWeight, product of:
                1.2482213 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.039127458 = queryNorm
              4.1367636 = fieldWeight in 3340, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.4375 = fieldNorm(doc=3340)
        0.33333334 = coord(2/6)
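
The score trees above follow one arithmetic pattern throughout, and the first author match (doc 922) can be reproduced by hand. The following is a minimal sketch, not the search engine's own code: the function names are hypothetical, the boost and queryNorm values are copied from the listing rather than derived, and the idf formula 1 + ln(maxDocs / (docFreq + 1)) is an assumption that happens to agree with the idf values shown.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed formula; it matches the listing, e.g. idf(21, 42306) ~ 8.561642.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # tf is the square root of the raw term frequency, as in "tf(freq=...)".
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * term_idf * query_norm
    # fieldWeight = tf * idf * fieldNorm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


MAX_DOCS = 42306
QUERY_NORM = 0.039127458  # copied from the listing, not derived here
FIELD_NORM = 0.25         # fieldNorm(doc=922)

# (term, docFreq, boost) for the four author terms matched in doc 922.
matched_terms = [
    ("gonçalves", 21, 1.1302278),
    ("moura", 19, 1.1428097),
    ("ribeiro", 18, 1.1495811),
    ("neto", 8, 1.2482213),
]

partial_scores = [
    term_score(1.0, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for _, doc_freq, boost in matched_terms
]

# coord(4/6): four of the six query terms matched this document.
coord = 4 / 6
total_score = sum(partial_scores) * coord

for (term, _, _), score in zip(matched_terms, partial_scores):
    print(f"{term}: {score:.6f}")
print(f"total: {total_score:.6f}")
```

The listing's values are single-precision floats, so a double-precision recomputation like this one is expected to agree only to about six significant digits.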
    

Similar documents (content)

  1. Fersini, E.; Messina, E.; Archetti, F.: Enhancing web page classification through image-block importance analysis (2008) 0.41
    0.4119069 = sum of:
      0.4119069 = product of:
        1.1441858 = sum of:
          0.046753604 = weight(abstract_txt:modified in 4103) [ClassicSimilarity], result of:
            0.046753604 = score(doc=4103,freq=1.0), product of:
              0.0876925 = queryWeight, product of:
                6.8243704 = idf(docFreq=124, maxDocs=42306)
                0.012849903 = queryNorm
              0.53315395 = fieldWeight in 4103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8243704 = idf(docFreq=124, maxDocs=42306)
                0.078125 = fieldNorm(doc=4103)
          0.0498229 = weight(abstract_txt:weighting in 4103) [ClassicSimilarity], result of:
            0.0498229 = score(doc=4103,freq=1.0), product of:
              0.091489606 = queryWeight, product of:
                1.0214207 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.012849903 = queryNorm
              0.54457444 = fieldWeight in 4103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.078125 = fieldNorm(doc=4103)
          0.008680214 = weight(abstract_txt:that in 4103) [ClassicSimilarity], result of:
            0.008680214 = score(doc=4103,freq=2.0), product of:
              0.032669045 = queryWeight, product of:
                1.0571768 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.012849903 = queryNorm
              0.2657015 = fieldWeight in 4103, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=4103)
          0.065738454 = weight(abstract_txt:inverse in 4103) [ClassicSimilarity], result of:
            0.065738454 = score(doc=4103,freq=1.0), product of:
              0.11006065 = queryWeight, product of:
                1.1203012 = boost
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.012849903 = queryNorm
              0.597293 = fieldWeight in 4103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.078125 = fieldNorm(doc=4103)
          0.06654758 = weight(abstract_txt:term in 4103) [ClassicSimilarity], result of:
            0.06654758 = score(doc=4103,freq=4.0), product of:
              0.08807052 = queryWeight, product of:
                1.4172585 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.012849903 = queryNorm
              0.75561696 = fieldWeight in 4103, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.078125 = fieldNorm(doc=4103)
          0.018707328 = weight(abstract_txt:using in 4103) [ClassicSimilarity], result of:
            0.018707328 = score(doc=4103,freq=1.0), product of:
              0.06867532 = queryWeight, product of:
                1.5327797 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.012849903 = queryNorm
              0.2724025 = fieldWeight in 4103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.078125 = fieldNorm(doc=4103)
          0.12155553 = weight(abstract_txt:weight in 4103) [ClassicSimilarity], result of:
            0.12155553 = score(doc=4103,freq=1.0), product of:
              0.20890342 = queryWeight, product of:
                2.1827629 = boost
                7.4479914 = idf(docFreq=66, maxDocs=42306)
                0.012849903 = queryNorm
              0.5818743 = fieldWeight in 4103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4479914 = idf(docFreq=66, maxDocs=42306)
                0.078125 = fieldNorm(doc=4103)
          0.23634805 = weight(abstract_txt:blocks in 4103) [ClassicSimilarity], result of:
            0.23634805 = score(doc=4103,freq=3.0), product of:
              0.22564377 = queryWeight, product of:
                2.268535 = boost
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.012849903 = queryNorm
              1.0474389 = fieldWeight in 4103, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.078125 = fieldNorm(doc=4103)
          0.53003216 = weight(abstract_txt:block in 4103) [ClassicSimilarity], result of:
            0.53003216 = score(doc=4103,freq=3.0), product of:
              0.4870798 = queryWeight, product of:
                4.7135577 = boost
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.012849903 = queryNorm
              1.0881834 = fieldWeight in 4103, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.078125 = fieldNorm(doc=4103)
        0.36 = coord(9/25)
    
  2. Wan, X.; Yang, J.; Xiao, J.: Towards a unified approach to document similarity search using manifold-ranking of blocks (2008) 0.36
    0.36336467 = sum of:
      0.36336467 = product of:
        1.1355146 = sum of:
          0.0049102707 = weight(abstract_txt:that in 4082) [ClassicSimilarity], result of:
            0.0049102707 = score(doc=4082,freq=1.0), product of:
              0.032669045 = queryWeight, product of:
                1.0571768 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.012849903 = queryNorm
              0.15030347 = fieldWeight in 4082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=4082)
          0.05259077 = weight(abstract_txt:compute in 4082) [ClassicSimilarity], result of:
            0.05259077 = score(doc=4082,freq=1.0), product of:
              0.11006065 = queryWeight, product of:
                1.1203012 = boost
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.012849903 = queryNorm
              0.47783443 = fieldWeight in 4082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0625 = fieldNorm(doc=4082)
          0.029601363 = weight(abstract_txt:improve in 4082) [ClassicSimilarity], result of:
            0.029601363 = score(doc=4082,freq=1.0), product of:
              0.09453157 = queryWeight, product of:
                1.4683251 = boost
                5.010197 = idf(docFreq=766, maxDocs=42306)
                0.012849903 = queryNorm
              0.31313732 = fieldWeight in 4082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.010197 = idf(docFreq=766, maxDocs=42306)
                0.0625 = fieldNorm(doc=4082)
          0.014965863 = weight(abstract_txt:using in 4082) [ClassicSimilarity], result of:
            0.014965863 = score(doc=4082,freq=1.0), product of:
              0.06867532 = queryWeight, product of:
                1.5327797 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.012849903 = queryNorm
              0.217922 = fieldWeight in 4082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.0625 = fieldNorm(doc=4082)
          0.24409924 = weight(abstract_txt:blocks in 4082) [ClassicSimilarity], result of:
            0.24409924 = score(doc=4082,freq=5.0), product of:
              0.22564377 = queryWeight, product of:
                2.268535 = boost
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.012849903 = queryNorm
              1.0817903 = fieldWeight in 4082, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.740661 = idf(docFreq=49, maxDocs=42306)
                0.0625 = fieldNorm(doc=4082)
          0.06148017 = weight(abstract_txt:pages in 4082) [ClassicSimilarity], result of:
            0.06148017 = score(doc=4082,freq=1.0), product of:
              0.17615278 = queryWeight, product of:
                2.4548454 = boost
                5.5842586 = idf(docFreq=431, maxDocs=42306)
                0.012849903 = queryNorm
              0.34901616 = fieldWeight in 4082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5842586 = idf(docFreq=431, maxDocs=42306)
                0.0625 = fieldNorm(doc=4082)
          0.4240257 = weight(abstract_txt:block in 4082) [ClassicSimilarity], result of:
            0.4240257 = score(doc=4082,freq=3.0), product of:
              0.4870798 = queryWeight, product of:
                4.7135577 = boost
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.012849903 = queryNorm
              0.8705467 = fieldWeight in 4082, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.0625 = fieldNorm(doc=4082)
          0.3038412 = weight(abstract_txt:ranking in 4082) [ClassicSimilarity], result of:
            0.3038412 = score(doc=4082,freq=6.0), product of:
              0.3543699 = queryWeight, product of:
                4.924054 = boost
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.012849903 = queryNorm
              0.8574125 = fieldWeight in 4082, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.0625 = fieldNorm(doc=4082)
        0.32 = coord(8/25)
    
  3. Dang, E.K.F.; Luk, R.W.P.; Allan, J.; Ho, K.S.; Chung, K.F.L.; Lee, D.L.: A new context-dependent term weight computed by boost and discount using relevance information (2010) 0.23
    0.23451126 = sum of:
      0.23451126 = product of:
        0.7328477 = sum of:
          0.039858323 = weight(abstract_txt:weighting in 1121) [ClassicSimilarity], result of:
            0.039858323 = score(doc=1121,freq=1.0), product of:
              0.091489606 = queryWeight, product of:
                1.0214207 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.012849903 = queryNorm
              0.43565956 = fieldWeight in 1121, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.0625 = fieldNorm(doc=1121)
          0.05259077 = weight(abstract_txt:inverse in 1121) [ClassicSimilarity], result of:
            0.05259077 = score(doc=1121,freq=1.0), product of:
              0.11006065 = queryWeight, product of:
                1.1203012 = boost
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.012849903 = queryNorm
              0.47783443 = fieldWeight in 1121, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0625 = fieldNorm(doc=1121)
          0.05259077 = weight(abstract_txt:compute in 1121) [ClassicSimilarity], result of:
            0.05259077 = score(doc=1121,freq=1.0), product of:
              0.11006065 = queryWeight, product of:
                1.1203012 = boost
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.012849903 = queryNorm
              0.47783443 = fieldWeight in 1121, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0625 = fieldNorm(doc=1121)
          0.070427336 = weight(abstract_txt:term in 1121) [ClassicSimilarity], result of:
            0.070427336 = score(doc=1121,freq=7.0), product of:
              0.08807052 = queryWeight, product of:
                1.4172585 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.012849903 = queryNorm
              0.7996698 = fieldWeight in 1121, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.0625 = fieldNorm(doc=1121)
          0.025921635 = weight(abstract_txt:using in 1121) [ClassicSimilarity], result of:
            0.025921635 = score(doc=1121,freq=3.0), product of:
              0.06867532 = queryWeight, product of:
                1.5327797 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.012849903 = queryNorm
              0.377452 = fieldWeight in 1121, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.0625 = fieldNorm(doc=1121)
          0.07043231 = weight(abstract_txt:frequency in 1121) [ClassicSimilarity], result of:
            0.07043231 = score(doc=1121,freq=2.0), product of:
              0.13372329 = queryWeight, product of:
                1.7463741 = boost
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.012849903 = queryNorm
              0.5267019 = fieldWeight in 1121, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.0625 = fieldNorm(doc=1121)
          0.06371976 = weight(abstract_txt:collections in 1121) [ClassicSimilarity], result of:
            0.06371976 = score(doc=1121,freq=3.0), product of:
              0.12508593 = queryWeight, product of:
                2.068634 = boost
                4.705708 = idf(docFreq=1039, maxDocs=42306)
                0.012849903 = queryNorm
              0.5094078 = fieldWeight in 1121, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.705708 = idf(docFreq=1039, maxDocs=42306)
                0.0625 = fieldNorm(doc=1121)
          0.35730684 = weight(abstract_txt:bm25 in 1121) [ClassicSimilarity], result of:
            0.35730684 = score(doc=1121,freq=2.0), product of:
              0.45194307 = queryWeight, product of:
                3.9320703 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.012849903 = queryNorm
              0.79060143 = fieldWeight in 1121, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.0625 = fieldNorm(doc=1121)
        0.32 = coord(8/25)
    
  4. Trotman, A.: Choosing document structure weights (2005) 0.23
    0.22956993 = sum of:
      0.22956993 = product of:
        0.95654136 = sum of:
          0.070460215 = weight(abstract_txt:weighting in 3017) [ClassicSimilarity], result of:
            0.070460215 = score(doc=3017,freq=2.0), product of:
              0.091489606 = queryWeight, product of:
                1.0214207 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.012849903 = queryNorm
              0.7701445 = fieldWeight in 3017, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.078125 = fieldNorm(doc=3017)
          0.060060542 = weight(abstract_txt:occurrences in 3017) [ClassicSimilarity], result of:
            0.060060542 = score(doc=3017,freq=1.0), product of:
              0.103628345 = queryWeight, product of:
                1.0870714 = boost
                7.4185777 = idf(docFreq=68, maxDocs=42306)
                0.012849903 = queryNorm
              0.5795764 = fieldWeight in 3017, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4185777 = idf(docFreq=68, maxDocs=42306)
                0.078125 = fieldNorm(doc=3017)
          0.03327379 = weight(abstract_txt:term in 3017) [ClassicSimilarity], result of:
            0.03327379 = score(doc=3017,freq=1.0), product of:
              0.08807052 = queryWeight, product of:
                1.4172585 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.012849903 = queryNorm
              0.37780848 = fieldWeight in 3017, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.078125 = fieldNorm(doc=3017)
          0.026456157 = weight(abstract_txt:using in 3017) [ClassicSimilarity], result of:
            0.026456157 = score(doc=3017,freq=2.0), product of:
              0.06867532 = queryWeight, product of:
                1.5327797 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.012849903 = queryNorm
              0.3852353 = fieldWeight in 3017, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.078125 = fieldNorm(doc=3017)
          0.54701215 = weight(abstract_txt:bm25 in 3017) [ClassicSimilarity], result of:
            0.54701215 = score(doc=3017,freq=3.0), product of:
              0.45194307 = queryWeight, product of:
                3.9320703 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.012849903 = queryNorm
              1.2103564 = fieldWeight in 3017, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.078125 = fieldNorm(doc=3017)
          0.2192785 = weight(abstract_txt:ranking in 3017) [ClassicSimilarity], result of:
            0.2192785 = score(doc=3017,freq=2.0), product of:
              0.3543699 = queryWeight, product of:
                4.924054 = boost
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.012849903 = queryNorm
              0.6187842 = fieldWeight in 3017, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.078125 = fieldNorm(doc=3017)
        0.24 = coord(6/25)
    
  5. Alzahrani, S.; Palade, V.; Salim, N.; Abraham, A.: Using structural information and citation evidence to detect significant plagiarism cases in scientific publications (2012) 0.22
    0.22260986 = sum of:
      0.22260986 = product of:
        0.6183607 = sum of:
          0.049322154 = weight(abstract_txt:weighting in 1983) [ClassicSimilarity], result of:
            0.049322154 = score(doc=1983,freq=2.0), product of:
              0.091489606 = queryWeight, product of:
                1.0214207 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.012849903 = queryNorm
              0.5391012 = fieldWeight in 1983, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1983)
          0.00607615 = weight(abstract_txt:that in 1983) [ClassicSimilarity], result of:
            0.00607615 = score(doc=1983,freq=2.0), product of:
              0.032669045 = queryWeight, product of:
                1.0571768 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.012849903 = queryNorm
              0.18599105 = fieldWeight in 1983, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1983)
          0.04601692 = weight(abstract_txt:inverse in 1983) [ClassicSimilarity], result of:
            0.04601692 = score(doc=1983,freq=1.0), product of:
              0.11006065 = queryWeight, product of:
                1.1203012 = boost
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.012849903 = queryNorm
              0.41810513 = fieldWeight in 1983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.645351 = idf(docFreq=54, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1983)
          0.023291651 = weight(abstract_txt:term in 1983) [ClassicSimilarity], result of:
            0.023291651 = score(doc=1983,freq=1.0), product of:
              0.08807052 = queryWeight, product of:
                1.4172585 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.012849903 = queryNorm
              0.26446593 = fieldWeight in 1983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1983)
          0.03662982 = weight(abstract_txt:improve in 1983) [ClassicSimilarity], result of:
            0.03662982 = score(doc=1983,freq=2.0), product of:
              0.09453157 = queryWeight, product of:
                1.4683251 = boost
                5.010197 = idf(docFreq=766, maxDocs=42306)
                0.012849903 = queryNorm
              0.38748768 = fieldWeight in 1983, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.010197 = idf(docFreq=766, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1983)
          0.0292816 = weight(abstract_txt:using in 1983) [ClassicSimilarity], result of:
            0.0292816 = score(doc=1983,freq=5.0), product of:
              0.06867532 = queryWeight, product of:
                1.5327797 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.012849903 = queryNorm
              0.42637736 = fieldWeight in 1983, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1983)
          0.043577768 = weight(abstract_txt:frequency in 1983) [ClassicSimilarity], result of:
            0.043577768 = score(doc=1983,freq=1.0), product of:
              0.13372329 = queryWeight, product of:
                1.7463741 = boost
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.012849903 = queryNorm
              0.32588017 = fieldWeight in 1983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1983)
          0.17017776 = weight(abstract_txt:weight in 1983) [ClassicSimilarity], result of:
            0.17017776 = score(doc=1983,freq=4.0), product of:
              0.20890342 = queryWeight, product of:
                2.1827629 = boost
                7.4479914 = idf(docFreq=66, maxDocs=42306)
                0.012849903 = queryNorm
              0.8146241 = fieldWeight in 1983, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.4479914 = idf(docFreq=66, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1983)
          0.21398687 = weight(abstract_txt:baselines in 1983) [ClassicSimilarity], result of:
            0.21398687 = score(doc=1983,freq=3.0), product of:
              0.26786423 = queryWeight, product of:
                2.4716737 = boost
                8.433808 = idf(docFreq=24, maxDocs=42306)
                0.012849903 = queryNorm
              0.7988632 = fieldWeight in 1983, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.433808 = idf(docFreq=24, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1983)
        0.36 = coord(9/25)
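
The arithmetic behind the tree for entry 5 can be checked by hand. The following is a minimal sketch, not the catalogue's own code: it assumes the standard ClassicSimilarity definitions, tf = sqrt(termFreq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and takes boost, queryNorm, and fieldNorm directly from the listing above. The names `classic_term_score` and `ENTRY_5_TERMS` are hypothetical helpers introduced for illustration.

```python
import math

MAX_DOCS = 42306          # maxDocs, as listed
QUERY_NORM = 0.012849903  # queryNorm, as listed
FIELD_NORM = 0.0546875    # fieldNorm(doc=1983), as listed


def classic_term_score(freq, doc_freq, boost,
                       max_docs=MAX_DOCS,
                       query_norm=QUERY_NORM,
                       field_norm=FIELD_NORM):
    """Recompute one weight(...) node: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                             # tf(freq)
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))  # idf(docFreq, maxDocs)
    query_weight = boost * idf * query_norm
    field_weight = tf * idf * field_norm
    return query_weight * field_weight


# (term, termFreq, docFreq, boost) copied from the entry for doc 1983.
ENTRY_5_TERMS = [
    ("weighting", 2.0, 107, 1.0214207),
    ("that", 2.0, 10381, 1.0571768),
    ("inverse", 1.0, 54, 1.1203012),
    ("term", 1.0, 912, 1.4172585),
    ("improve", 2.0, 766, 1.4683251),
    ("using", 5.0, 3518, 1.5327797),
    ("frequency", 1.0, 296, 1.7463741),
    ("weight", 4.0, 66, 2.1827629),
    ("baselines", 3.0, 24, 2.4716737),
]

# Sum the per-term scores, then scale by coord = matched terms / query terms.
term_sum = sum(classic_term_score(f, df, b) for _, f, df, b in ENTRY_5_TERMS)
coord = 9 / 25
total = term_sum * coord
print(round(total, 4))
```

The result agrees with the listed 0.22260986 up to small rounding differences, since Lucene computes these values in single precision while the sketch uses Python floats.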