Document (#34082)

Author
Wan, X.
Yang, J.
Xiao, J.
Title
Towards a unified approach to document similarity search using manifold-ranking of blocks
Source
Information processing and management. 44(2008) no.3, S.1032-1048
Year
2008
Abstract
Document similarity search (i.e. query by example) aims to retrieve a ranked list of documents similar to a query document in a text corpus or on the Web. Most existing approaches to similarity search first compute the pairwise similarity score between each document and the query using a retrieval function or similarity measure (e.g. Cosine), and then rank the documents by the similarity scores. In this paper, we propose a novel retrieval approach based on manifold-ranking of document blocks (i.e. a block of coherent text about a subtopic) to re-rank a small set of documents initially retrieved by some existing retrieval function. The proposed approach can make full use of the intrinsic global manifold structure of the document blocks by propagating the ranking scores between the blocks on a weighted graph. First, the TextTiling algorithm and the VIPS algorithm are respectively employed to segment text documents and web pages into blocks. Then, each block is assigned with a ranking score by the manifold-ranking algorithm. Lastly, a document gets its final ranking score by fusing the scores of its blocks. Experimental results on the TDT data and the ODP data demonstrate that the proposed approach can significantly improve the retrieval performances over baseline approaches. Document block is validated to be a better unit than the whole document in the manifold-ranking process.
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Wan, X.; Yang, J.; Xiao, J.: Incorporating cross-document relationships between sentences for single document summarizations (2006) 4.26
    4.260605 = sum of:
      4.260605 = sum of:
        1.4855756 = weight(author_txt:yang in 2421) [ClassicSimilarity], result of:
          1.4855756 = score(doc=2421,freq=1.0), product of:
            0.5504366 = queryWeight, product of:
              7.1970778 = idf(docFreq=89, maxDocs=44218)
              0.076480575 = queryNorm
            2.698904 = fieldWeight in 2421, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.1970778 = idf(docFreq=89, maxDocs=44218)
              0.375 = fieldNorm(doc=2421)
        2.7750292 = weight(author_txt:xiao in 2421) [ClassicSimilarity], result of:
          2.7750292 = score(doc=2421,freq=1.0), product of:
            0.834877 = queryWeight, product of:
              1.2315657 = boost
              8.863674 = idf(docFreq=16, maxDocs=44218)
              0.076480575 = queryNorm
            3.3238778 = fieldWeight in 2421, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.863674 = idf(docFreq=16, maxDocs=44218)
              0.375 = fieldNorm(doc=2421)
    
  2. Xiao, Y.: Modern development of classification : research and practice in the People's Republic of China (1992) 2.31
    2.3125243 = sum of:
      2.3125243 = product of:
        4.6250486 = sum of:
          4.6250486 = weight(author_txt:xiao in 1909) [ClassicSimilarity], result of:
            4.6250486 = score(doc=1909,freq=1.0), product of:
              0.834877 = queryWeight, product of:
                1.2315657 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.076480575 = queryNorm
              5.5397964 = fieldWeight in 1909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.625 = fieldNorm(doc=1909)
        0.5 = coord(1/2)
    
  3. Xiao, Y.: Faceted classification : a consideration of its features as a paradigm of knowledge organization (1994) 2.31
    2.3125243 = sum of:
      2.3125243 = product of:
        4.6250486 = sum of:
          4.6250486 = weight(author_txt:xiao in 7547) [ClassicSimilarity], result of:
            4.6250486 = score(doc=7547,freq=1.0), product of:
              0.834877 = queryWeight, product of:
                1.2315657 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.076480575 = queryNorm
              5.5397964 = fieldWeight in 7547, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.625 = fieldNorm(doc=7547)
        0.5 = coord(1/2)
    
  4. Xiao, G.: ¬A knowledge classification model based on the relationship between science and human needs (2013) 2.31
    2.3125243 = sum of:
      2.3125243 = product of:
        4.6250486 = sum of:
          4.6250486 = weight(author_txt:xiao in 138) [ClassicSimilarity], result of:
            4.6250486 = score(doc=138,freq=1.0), product of:
              0.834877 = queryWeight, product of:
                1.2315657 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.076480575 = queryNorm
              5.5397964 = fieldWeight in 138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.625 = fieldNorm(doc=138)
        0.5 = coord(1/2)
    
  5. Xiao, L.: Effects of rationale awareness in online ideation crowdsourcing tasks (2014) 2.31
    2.3125243 = sum of:
      2.3125243 = product of:
        4.6250486 = sum of:
          4.6250486 = weight(author_txt:xiao in 1329) [ClassicSimilarity], result of:
            4.6250486 = score(doc=1329,freq=1.0), product of:
              0.834877 = queryWeight, product of:
                1.2315657 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.076480575 = queryNorm
              5.5397964 = fieldWeight in 1329, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.625 = fieldNorm(doc=1329)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Qin, T.; Zhang, X.-D.; Tsai, M.-F.; Wang, D.-S.; Liu, T.-Y.; Li, H.: Query-level loss functions for information retrieval (2008) 0.32
    0.3164573 = sum of:
      0.3164573 = product of:
        0.7192211 = sum of:
          0.022696862 = weight(abstract_txt:proposed in 2066) [ClassicSimilarity], result of:
            0.022696862 = score(doc=2066,freq=2.0), product of:
              0.055710178 = queryWeight, product of:
                1.0924109 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.011064002 = queryNorm
              0.4074096 = fieldWeight in 2066, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
          0.016489673 = weight(abstract_txt:existing in 2066) [ClassicSimilarity], result of:
            0.016489673 = score(doc=2066,freq=1.0), product of:
              0.05672511 = queryWeight, product of:
                1.1023169 = boost
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.011064002 = queryNorm
              0.29069442 = fieldWeight in 2066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
          0.012033255 = weight(abstract_txt:search in 2066) [ClassicSimilarity], result of:
            0.012033255 = score(doc=2066,freq=1.0), product of:
              0.0526324 = queryWeight, product of:
                1.300442 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.011064002 = queryNorm
              0.22862828 = fieldWeight in 2066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
          0.056843147 = weight(abstract_txt:function in 2066) [ClassicSimilarity], result of:
            0.056843147 = score(doc=2066,freq=4.0), product of:
              0.08154539 = queryWeight, product of:
                1.3216561 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.011064002 = queryNorm
              0.6970737 = fieldWeight in 2066, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
          0.013755892 = weight(abstract_txt:retrieval in 2066) [ClassicSimilarity], result of:
            0.013755892 = score(doc=2066,freq=1.0), product of:
              0.06333394 = queryWeight, product of:
                1.6472216 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011064002 = queryNorm
              0.21719621 = fieldWeight in 2066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
          0.064687744 = weight(abstract_txt:query in 2066) [ClassicSimilarity], result of:
            0.064687744 = score(doc=2066,freq=6.0), product of:
              0.08888503 = queryWeight, product of:
                1.689969 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.011064002 = queryNorm
              0.72776866 = fieldWeight in 2066, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
          0.022944339 = weight(abstract_txt:documents in 2066) [ClassicSimilarity], result of:
            0.022944339 = score(doc=2066,freq=1.0), product of:
              0.089076065 = queryWeight, product of:
                1.953504 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.011064002 = queryNorm
              0.2575814 = fieldWeight in 2066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
          0.045655895 = weight(abstract_txt:algorithm in 2066) [ClassicSimilarity], result of:
            0.045655895 = score(doc=2066,freq=1.0), product of:
              0.12803508 = queryWeight, product of:
                2.0282845 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.011064002 = queryNorm
              0.35658893 = fieldWeight in 2066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
          0.09688183 = weight(abstract_txt:similarity in 2066) [ClassicSimilarity], result of:
            0.09688183 = score(doc=2066,freq=1.0), product of:
              0.26638064 = queryWeight, product of:
                4.1374307 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.011064002 = queryNorm
              0.36369696 = fieldWeight in 2066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
          0.082496084 = weight(abstract_txt:document in 2066) [ClassicSimilarity], result of:
            0.082496084 = score(doc=2066,freq=2.0), product of:
              0.2174288 = queryWeight, product of:
                4.578082 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.011064002 = queryNorm
              0.37941656 = fieldWeight in 2066, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
          0.28473642 = weight(abstract_txt:ranking in 2066) [ClassicSimilarity], result of:
            0.28473642 = score(doc=2066,freq=8.0), product of:
              0.2876882 = queryWeight, product of:
                4.6442313 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.011064002 = queryNorm
              0.98973966 = fieldWeight in 2066, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.0625 = fieldNorm(doc=2066)
        0.44 = coord(11/25)
    
  2. Luo, Z.; Yu, Y.; Osborne, M.; Wang, T.: Structuring tweets for improving Twitter search (2015) 0.29
    0.29460242 = sum of:
      0.29460242 = product of:
        0.81834 = sum of:
          0.012033255 = weight(abstract_txt:search in 2335) [ClassicSimilarity], result of:
            0.012033255 = score(doc=2335,freq=1.0), product of:
              0.0526324 = queryWeight, product of:
                1.300442 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.011064002 = queryNorm
              0.22862828 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.02298999 = weight(abstract_txt:text in 2335) [ClassicSimilarity], result of:
            0.02298999 = score(doc=2335,freq=2.0), product of:
              0.06432013 = queryWeight, product of:
                1.4375993 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.011064002 = queryNorm
              0.3574307 = fieldWeight in 2335, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.04344461 = weight(abstract_txt:rank in 2335) [ClassicSimilarity], result of:
            0.04344461 = score(doc=2335,freq=1.0), product of:
              0.10820765 = queryWeight, product of:
                1.5224665 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.011064002 = queryNorm
              0.40149298 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.03639467 = weight(abstract_txt:retrieval in 2335) [ClassicSimilarity], result of:
            0.03639467 = score(doc=2335,freq=7.0), product of:
              0.06333394 = queryWeight, product of:
                1.6472216 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011064002 = queryNorm
              0.5746471 = fieldWeight in 2335, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.026408657 = weight(abstract_txt:query in 2335) [ClassicSimilarity], result of:
            0.026408657 = score(doc=2335,freq=1.0), product of:
              0.08888503 = queryWeight, product of:
                1.689969 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.011064002 = queryNorm
              0.2971103 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.017220337 = weight(abstract_txt:approach in 2335) [ClassicSimilarity], result of:
            0.017220337 = score(doc=2335,freq=1.0), product of:
              0.07356509 = queryWeight, product of:
                1.7752914 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.011064002 = queryNorm
              0.234083 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.032448195 = weight(abstract_txt:documents in 2335) [ClassicSimilarity], result of:
            0.032448195 = score(doc=2335,freq=2.0), product of:
              0.089076065 = queryWeight, product of:
                1.953504 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.011064002 = queryNorm
              0.36427513 = fieldWeight in 2335, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.12394941 = weight(abstract_txt:block in 2335) [ClassicSimilarity], result of:
            0.12394941 = score(doc=2335,freq=1.0), product of:
              0.24916904 = queryWeight, product of:
                2.8295114 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.011064002 = queryNorm
              0.4974511 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.5034509 = weight(abstract_txt:blocks in 2335) [ClassicSimilarity], result of:
            0.5034509 = score(doc=2335,freq=5.0), product of:
              0.4673646 = queryWeight, product of:
                5.4803376 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.011064002 = queryNorm
              1.0772122 = fieldWeight in 2335, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
        0.36 = coord(9/25)
    
  3. Moura, E.S. de; Fernandes, D.; Ribeiro-Neto, B.; Silva, A.S. da; Gonçalves, M.A.: Using structural information to improve search in Web collections (2010) 0.27
    0.27473438 = sum of:
      0.27473438 = product of:
        1.1447266 = sum of:
          0.020160088 = weight(abstract_txt:then in 4119) [ClassicSimilarity], result of:
            0.020160088 = score(doc=4119,freq=1.0), product of:
              0.055892766 = queryWeight, product of:
                1.0941997 = boost
                4.616861 = idf(docFreq=1187, maxDocs=44218)
                0.011064002 = queryNorm
              0.36069226 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.616861 = idf(docFreq=1187, maxDocs=44218)
                0.078125 = fieldNorm(doc=4119)
          0.03552697 = weight(abstract_txt:function in 4119) [ClassicSimilarity], result of:
            0.03552697 = score(doc=4119,freq=1.0), product of:
              0.08154539 = queryWeight, product of:
                1.3216561 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.011064002 = queryNorm
              0.43567106 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.078125 = fieldNorm(doc=4119)
          0.30987355 = weight(abstract_txt:block in 4119) [ClassicSimilarity], result of:
            0.30987355 = score(doc=4119,freq=4.0), product of:
              0.24916904 = queryWeight, product of:
                2.8295114 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.011064002 = queryNorm
              1.2436278 = fieldWeight in 4119, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.078125 = fieldNorm(doc=4119)
          0.072916925 = weight(abstract_txt:document in 4119) [ClassicSimilarity], result of:
            0.072916925 = score(doc=4119,freq=1.0), product of:
              0.2174288 = queryWeight, product of:
                4.578082 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.011064002 = queryNorm
              0.33536002 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=4119)
          0.3082362 = weight(abstract_txt:ranking in 4119) [ClassicSimilarity], result of:
            0.3082362 = score(doc=4119,freq=6.0), product of:
              0.2876882 = queryWeight, product of:
                4.6442313 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.011064002 = queryNorm
              1.0714246 = fieldWeight in 4119, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.078125 = fieldNorm(doc=4119)
          0.39801285 = weight(abstract_txt:blocks in 4119) [ClassicSimilarity], result of:
            0.39801285 = score(doc=4119,freq=2.0), product of:
              0.4673646 = queryWeight, product of:
                5.4803376 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.011064002 = queryNorm
              0.851611 = fieldWeight in 4119, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.078125 = fieldNorm(doc=4119)
        0.24 = coord(6/25)
    
  4. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 0.27
    0.2668983 = sum of:
      0.2668983 = product of:
        0.60658705 = sum of:
          0.014035336 = weight(abstract_txt:approaches in 2765) [ClassicSimilarity], result of:
            0.014035336 = score(doc=2765,freq=1.0), product of:
              0.055689994 = queryWeight, product of:
                1.092213 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.011064002 = queryNorm
              0.25202617 = fieldWeight in 2765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.010529098 = weight(abstract_txt:search in 2765) [ClassicSimilarity], result of:
            0.010529098 = score(doc=2765,freq=1.0), product of:
              0.0526324 = queryWeight, product of:
                1.300442 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.011064002 = queryNorm
              0.20004974 = fieldWeight in 2765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.02844866 = weight(abstract_txt:text in 2765) [ClassicSimilarity], result of:
            0.02844866 = score(doc=2765,freq=4.0), product of:
              0.06432013 = queryWeight, product of:
                1.4375993 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.011064002 = queryNorm
              0.4422979 = fieldWeight in 2765, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.026914222 = weight(abstract_txt:retrieval in 2765) [ClassicSimilarity], result of:
            0.026914222 = score(doc=2765,freq=5.0), product of:
              0.06333394 = queryWeight, product of:
                1.6472216 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011064002 = queryNorm
              0.4249573 = fieldWeight in 2765, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.032679047 = weight(abstract_txt:query in 2765) [ClassicSimilarity], result of:
            0.032679047 = score(doc=2765,freq=2.0), product of:
              0.08888503 = queryWeight, product of:
                1.689969 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.011064002 = queryNorm
              0.36765522 = fieldWeight in 2765, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.015067794 = weight(abstract_txt:approach in 2765) [ClassicSimilarity], result of:
            0.015067794 = score(doc=2765,freq=1.0), product of:
              0.07356509 = queryWeight, product of:
                1.7752914 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.011064002 = queryNorm
              0.20482263 = fieldWeight in 2765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.034773163 = weight(abstract_txt:documents in 2765) [ClassicSimilarity], result of:
            0.034773163 = score(doc=2765,freq=3.0), product of:
              0.089076065 = queryWeight, product of:
                1.953504 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.011064002 = queryNorm
              0.39037606 = fieldWeight in 2765, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.07395475 = weight(abstract_txt:score in 2765) [ClassicSimilarity], result of:
            0.07395475 = score(doc=2765,freq=1.0), product of:
              0.19303517 = queryWeight, product of:
                2.4904776 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.011064002 = queryNorm
              0.38311544 = fieldWeight in 2765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.084771596 = weight(abstract_txt:similarity in 2765) [ClassicSimilarity], result of:
            0.084771596 = score(doc=2765,freq=1.0), product of:
              0.26638064 = queryWeight, product of:
                4.1374307 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.011064002 = queryNorm
              0.31823483 = fieldWeight in 2765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.08840708 = weight(abstract_txt:document in 2765) [ClassicSimilarity], result of:
            0.08840708 = score(doc=2765,freq=3.0), product of:
              0.2174288 = queryWeight, product of:
                4.578082 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.011064002 = queryNorm
              0.4066024 = fieldWeight in 2765, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.19700631 = weight(abstract_txt:blocks in 2765) [ClassicSimilarity], result of:
            0.19700631 = score(doc=2765,freq=1.0), product of:
              0.4673646 = queryWeight, product of:
                5.4803376 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.011064002 = queryNorm
              0.42152596 = fieldWeight in 2765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
        0.44 = coord(11/25)
    
  5. Keikha, M.; Crestani, F.; Carman, M.J.: Employing document dependency in blog search (2012) 0.23
    0.23388882 = sum of:
      0.23388882 = product of:
        0.5315655 = sum of:
          0.03208077 = weight(abstract_txt:approaches in 4987) [ClassicSimilarity], result of:
            0.03208077 = score(doc=4987,freq=4.0), product of:
              0.055689994 = queryWeight, product of:
                1.092213 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.011064002 = queryNorm
              0.5760598 = fieldWeight in 4987, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
          0.022808535 = weight(abstract_txt:then in 4987) [ClassicSimilarity], result of:
            0.022808535 = score(doc=4987,freq=2.0), product of:
              0.055892766 = queryWeight, product of:
                1.0941997 = boost
                4.616861 = idf(docFreq=1187, maxDocs=44218)
                0.011064002 = queryNorm
              0.4080767 = fieldWeight in 4987, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.616861 = idf(docFreq=1187, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
          0.02084221 = weight(abstract_txt:search in 4987) [ClassicSimilarity], result of:
            0.02084221 = score(doc=4987,freq=3.0), product of:
              0.0526324 = queryWeight, product of:
                1.300442 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.011064002 = queryNorm
              0.3959958 = fieldWeight in 4987, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
          0.04344461 = weight(abstract_txt:rank in 4987) [ClassicSimilarity], result of:
            0.04344461 = score(doc=4987,freq=1.0), product of:
              0.10820765 = queryWeight, product of:
                1.5224665 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.011064002 = queryNorm
              0.40149298 = fieldWeight in 4987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
          0.013755892 = weight(abstract_txt:retrieval in 4987) [ClassicSimilarity], result of:
            0.013755892 = score(doc=4987,freq=1.0), product of:
              0.06333394 = queryWeight, product of:
                1.6472216 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011064002 = queryNorm
              0.21719621 = fieldWeight in 4987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
          0.026408657 = weight(abstract_txt:query in 4987) [ClassicSimilarity], result of:
            0.026408657 = score(doc=4987,freq=1.0), product of:
              0.08888503 = queryWeight, product of:
                1.689969 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.011064002 = queryNorm
              0.2971103 = fieldWeight in 4987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
          0.017220337 = weight(abstract_txt:approach in 4987) [ClassicSimilarity], result of:
            0.017220337 = score(doc=4987,freq=1.0), product of:
              0.07356509 = queryWeight, product of:
                1.7752914 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.011064002 = queryNorm
              0.234083 = fieldWeight in 4987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
          0.022944339 = weight(abstract_txt:documents in 4987) [ClassicSimilarity], result of:
            0.022944339 = score(doc=4987,freq=1.0), product of:
              0.089076065 = queryWeight, product of:
                1.953504 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.011064002 = queryNorm
              0.2575814 = fieldWeight in 4987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
          0.17684473 = weight(abstract_txt:scores in 4987) [ClassicSimilarity], result of:
            0.17684473 = score(doc=4987,freq=6.0), product of:
              0.17378359 = queryWeight, product of:
                2.3630276 = boost
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.011064002 = queryNorm
              1.0176147 = fieldWeight in 4987, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
          0.09688183 = weight(abstract_txt:similarity in 4987) [ClassicSimilarity], result of:
            0.09688183 = score(doc=4987,freq=1.0), product of:
              0.26638064 = queryWeight, product of:
                4.1374307 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.011064002 = queryNorm
              0.36369696 = fieldWeight in 4987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
          0.058333542 = weight(abstract_txt:document in 4987) [ClassicSimilarity], result of:
            0.058333542 = score(doc=4987,freq=1.0), product of:
              0.2174288 = queryWeight, product of:
                4.578082 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.011064002 = queryNorm
              0.26828802 = fieldWeight in 4987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=4987)
        0.44 = coord(11/25)