Document (#33036)

Author
Desai, M.
Spink, A.
Title
¬A algorithm to cluster documents based on relevance
Source
Information processing and management. 41(2005) no.5, S.1035-1050
Year
2005
Abstract
Search engines fail to make a clear distinction between items of varying relevance when presenting search results to users. Instead, they rely on the user of the system to estimate which items are relevant, partially relevant, or not relevant. The user of the system is given the task of distinguishing between documents that are relevant to different degrees. This process often hinders the accessibility of relevant or partially relevant documents, particularly when the results set is large and documents of varying relevance are scattered throughout the set. In this paper, we present a clustering scheme that groups documents within relevant, partially relevant, and not relevant regions for a given search. A clustering algorithm accomplishes the task of clustering documents based on relevance. The clusters were evaluated by end-users issuing categorical, interval, and descriptive relevance judgments for the documents returned from a search. The degree of overlap between users and the system for each of the clustered regions was measured to determine the overall effectiveness of the algorithm. This research showed that clustering documents on the Web by regions of relevance is highly necessary and quite feasible.
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Desai, B.C.: Supporting discovery in virtual libraries (1997) 2.51
    2.5056796 = sum of:
      2.5056796 = product of:
        5.011359 = sum of:
          5.011359 = weight(author_txt:desai in 543) [ClassicSimilarity], result of:
            5.011359 = score(doc=543,freq=1.0), product of:
              0.8535147 = queryWeight, product of:
                1.2798465 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.07098859 = queryNorm
              5.871439 = fieldWeight in 543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.625 = fieldNorm(doc=543)
        0.5 = coord(1/2)
    
  2. Desai, B.C.: CINDI: a virtual library indexing and discovery system (1999) 2.51
    2.5056796 = sum of:
      2.5056796 = product of:
        5.011359 = sum of:
          5.011359 = weight(author_txt:desai in 4578) [ClassicSimilarity], result of:
            5.011359 = score(doc=4578,freq=1.0), product of:
              0.8535147 = queryWeight, product of:
                1.2798465 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.07098859 = queryNorm
              5.871439 = fieldWeight in 4578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.625 = fieldNorm(doc=4578)
        0.5 = coord(1/2)
    
  3. Shah, G.A.; Desai, A.T.; Nagarkar, S.A.: Search strategies : their importance in IR process (1992) 1.50
    1.5034078 = sum of:
      1.5034078 = product of:
        3.0068157 = sum of:
          3.0068157 = weight(author_txt:desai in 3806) [ClassicSimilarity], result of:
            3.0068157 = score(doc=3806,freq=1.0), product of:
              0.8535147 = queryWeight, product of:
                1.2798465 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.07098859 = queryNorm
              3.5228634 = fieldWeight in 3806, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.375 = fieldNorm(doc=3806)
        0.5 = coord(1/2)
    
  4. Eschenfelder, K.R.; Howard, R.G.; Desai, A.C.: Who posts DeCSS and why? : a content analysis of Web sites posting DVD circumvention software (2005) 1.50
    1.5034078 = sum of:
      1.5034078 = product of:
        3.0068157 = sum of:
          3.0068157 = weight(author_txt:desai in 4576) [ClassicSimilarity], result of:
            3.0068157 = score(doc=4576,freq=1.0), product of:
              0.8535147 = queryWeight, product of:
                1.2798465 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.07098859 = queryNorm
              3.5228634 = fieldWeight in 4576, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.375 = fieldNorm(doc=4576)
        0.5 = coord(1/2)
    
  5. Bhansali, D.; Desai, H.; Deulkar, K.: ¬A study of different ranking approaches for semantic search (2015) 1.50
    1.5034078 = sum of:
      1.5034078 = product of:
        3.0068157 = sum of:
          3.0068157 = weight(author_txt:desai in 2696) [ClassicSimilarity], result of:
            3.0068157 = score(doc=2696,freq=1.0), product of:
              0.8535147 = queryWeight, product of:
                1.2798465 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.07098859 = queryNorm
              3.5228634 = fieldWeight in 2696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.375 = fieldNorm(doc=2696)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Chen, L.-C.: Next generation search engine for the result clustering technology (2012) 0.31
    0.31463137 = sum of:
      0.31463137 = product of:
        0.983223 = sum of:
          0.09726787 = weight(abstract_txt:returned in 105) [ClassicSimilarity], result of:
            0.09726787 = score(doc=105,freq=2.0), product of:
              0.11750473 = queryWeight, product of:
                1.0169818 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.0154217305 = queryNorm
              0.82777834 = fieldWeight in 105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.078125 = fieldNorm(doc=105)
          0.15893058 = weight(abstract_txt:accomplishes in 105) [ClassicSimilarity], result of:
            0.15893058 = score(doc=105,freq=1.0), product of:
              0.20537964 = queryWeight, product of:
                1.3445104 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0154217305 = queryNorm
              0.7738381 = fieldWeight in 105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.078125 = fieldNorm(doc=105)
          0.026610034 = weight(abstract_txt:system in 105) [ClassicSimilarity], result of:
            0.026610034 = score(doc=105,freq=2.0), product of:
              0.07141889 = queryWeight, product of:
                1.3732597 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0154217305 = queryNorm
              0.372591 = fieldWeight in 105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=105)
          0.022318864 = weight(abstract_txt:users in 105) [ClassicSimilarity], result of:
            0.022318864 = score(doc=105,freq=1.0), product of:
              0.0800278 = queryWeight, product of:
                1.4536724 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0154217305 = queryNorm
              0.2788889 = fieldWeight in 105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.078125 = fieldNorm(doc=105)
          0.05546228 = weight(abstract_txt:search in 105) [ClassicSimilarity], result of:
            0.05546228 = score(doc=105,freq=3.0), product of:
              0.11204619 = queryWeight, product of:
                1.9861591 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0154217305 = queryNorm
              0.49499476 = fieldWeight in 105, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.078125 = fieldNorm(doc=105)
          0.09111965 = weight(abstract_txt:algorithm in 105) [ClassicSimilarity], result of:
            0.09111965 = score(doc=105,freq=1.0), product of:
              0.20442507 = queryWeight, product of:
                2.3233423 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0154217305 = queryNorm
              0.44573617 = fieldWeight in 105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.078125 = fieldNorm(doc=105)
          0.3849001 = weight(abstract_txt:clustering in 105) [ClassicSimilarity], result of:
            0.3849001 = score(doc=105,freq=6.0), product of:
              0.32355937 = queryWeight, product of:
                3.3751454 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0154217305 = queryNorm
              1.189581 = fieldWeight in 105, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=105)
          0.14661364 = weight(abstract_txt:relevant in 105) [ClassicSimilarity], result of:
            0.14661364 = score(doc=105,freq=1.0), product of:
              0.40483943 = queryWeight, product of:
                5.6630206 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0154217305 = queryNorm
              0.36215258 = fieldWeight in 105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=105)
        0.32 = coord(8/25)
    
  2. Spink, A.; Greisdorf, H.: Users' partial relevance judgements during online searching (1997) 0.28
    0.27575207 = sum of:
      0.27575207 = product of:
        0.8617252 = sum of:
          0.016347634 = weight(abstract_txt:user in 623) [ClassicSimilarity], result of:
            0.016347634 = score(doc=623,freq=1.0), product of:
              0.056806624 = queryWeight, product of:
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0154217305 = queryNorm
              0.2877769 = fieldWeight in 623, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=623)
          0.020382173 = weight(abstract_txt:between in 623) [ClassicSimilarity], result of:
            0.020382173 = score(doc=623,freq=1.0), product of:
              0.07532858 = queryWeight, product of:
                1.4103471 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0154217305 = queryNorm
              0.2705769 = fieldWeight in 623, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.078125 = fieldNorm(doc=623)
          0.05466983 = weight(abstract_txt:users in 623) [ClassicSimilarity], result of:
            0.05466983 = score(doc=623,freq=6.0), product of:
              0.0800278 = queryWeight, product of:
                1.4536724 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0154217305 = queryNorm
              0.6831355 = fieldWeight in 623, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.078125 = fieldNorm(doc=623)
          0.11358143 = weight(abstract_txt:items in 623) [ClassicSimilarity], result of:
            0.11358143 = score(doc=623,freq=4.0), product of:
              0.13030085 = queryWeight, product of:
                1.514517 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.0154217305 = queryNorm
              0.871686 = fieldWeight in 623, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.078125 = fieldNorm(doc=623)
          0.04528476 = weight(abstract_txt:search in 623) [ClassicSimilarity], result of:
            0.04528476 = score(doc=623,freq=2.0), product of:
              0.11204619 = queryWeight, product of:
                1.9861591 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0154217305 = queryNorm
              0.4041615 = fieldWeight in 623, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.078125 = fieldNorm(doc=623)
          0.19105227 = weight(abstract_txt:partially in 623) [ClassicSimilarity], result of:
            0.19105227 = score(doc=623,freq=1.0), product of:
              0.3348839 = queryWeight, product of:
                2.9736733 = boost
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.0154217305 = queryNorm
              0.570503 = fieldWeight in 623, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.078125 = fieldNorm(doc=623)
          0.16646485 = weight(abstract_txt:relevance in 623) [ClassicSimilarity], result of:
            0.16646485 = score(doc=623,freq=2.0), product of:
              0.30549762 = queryWeight, product of:
                4.0166597 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0154217305 = queryNorm
              0.5448974 = fieldWeight in 623, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=623)
          0.25394228 = weight(abstract_txt:relevant in 623) [ClassicSimilarity], result of:
            0.25394228 = score(doc=623,freq=3.0), product of:
              0.40483943 = queryWeight, product of:
                5.6630206 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0154217305 = queryNorm
              0.62726665 = fieldWeight in 623, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=623)
        0.32 = coord(8/25)
    
  3. Cosijn, E.: Relevance judgments and measurements (2009) 0.27
    0.26973113 = sum of:
      0.26973113 = product of:
        0.8429098 = sum of:
          0.023119045 = weight(abstract_txt:user in 3855) [ClassicSimilarity], result of:
            0.023119045 = score(doc=3855,freq=2.0), product of:
              0.056806624 = queryWeight, product of:
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0154217305 = queryNorm
              0.40697798 = fieldWeight in 3855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.023349496 = weight(abstract_txt:when in 3855) [ClassicSimilarity], result of:
            0.023349496 = score(doc=3855,freq=1.0), product of:
              0.0720467 = queryWeight, product of:
                1.1261793 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.0154217305 = queryNorm
              0.32408836 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.018816134 = weight(abstract_txt:system in 3855) [ClassicSimilarity], result of:
            0.018816134 = score(doc=3855,freq=1.0), product of:
              0.07141889 = queryWeight, product of:
                1.3732597 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0154217305 = queryNorm
              0.2634616 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.03530296 = weight(abstract_txt:between in 3855) [ClassicSimilarity], result of:
            0.03530296 = score(doc=3855,freq=3.0), product of:
              0.07532858 = queryWeight, product of:
                1.4103471 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0154217305 = queryNorm
              0.4686529 = fieldWeight in 3855, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.022318864 = weight(abstract_txt:users in 3855) [ClassicSimilarity], result of:
            0.022318864 = score(doc=3855,freq=1.0), product of:
              0.0800278 = queryWeight, product of:
                1.4536724 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0154217305 = queryNorm
              0.2788889 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.2883256 = weight(abstract_txt:relevance in 3855) [ClassicSimilarity], result of:
            0.2883256 = score(doc=3855,freq=6.0), product of:
              0.30549762 = queryWeight, product of:
                4.0166597 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0154217305 = queryNorm
              0.94379 = fieldWeight in 3855, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.22433469 = weight(abstract_txt:documents in 3855) [ClassicSimilarity], result of:
            0.22433469 = score(doc=3855,freq=6.0), product of:
              0.28444365 = queryWeight, product of:
                4.4753666 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0154217305 = queryNorm
              0.7886788 = fieldWeight in 3855, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.207343 = weight(abstract_txt:relevant in 3855) [ClassicSimilarity], result of:
            0.207343 = score(doc=3855,freq=2.0), product of:
              0.40483943 = queryWeight, product of:
                5.6630206 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0154217305 = queryNorm
              0.5121611 = fieldWeight in 3855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
        0.32 = coord(8/25)
    
  4. Smith, M.P.; Pollitt, A.S.: Ranking and relevance feedback extensions to a view-based searching system (1995) 0.26
    0.25621894 = sum of:
      0.25621894 = product of:
        0.8006842 = sum of:
          0.023119045 = weight(abstract_txt:user in 3855) [ClassicSimilarity], result of:
            0.023119045 = score(doc=3855,freq=2.0), product of:
              0.056806624 = queryWeight, product of:
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0154217305 = queryNorm
              0.40697798 = fieldWeight in 3855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.018816134 = weight(abstract_txt:system in 3855) [ClassicSimilarity], result of:
            0.018816134 = score(doc=3855,freq=1.0), product of:
              0.07141889 = queryWeight, product of:
                1.3732597 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0154217305 = queryNorm
              0.2634616 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.020382173 = weight(abstract_txt:between in 3855) [ClassicSimilarity], result of:
            0.020382173 = score(doc=3855,freq=1.0), product of:
              0.07532858 = queryWeight, product of:
                1.4103471 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0154217305 = queryNorm
              0.2705769 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.022318864 = weight(abstract_txt:users in 3855) [ClassicSimilarity], result of:
            0.022318864 = score(doc=3855,freq=1.0), product of:
              0.0800278 = queryWeight, product of:
                1.4536724 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0154217305 = queryNorm
              0.2788889 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.032021157 = weight(abstract_txt:search in 3855) [ClassicSimilarity], result of:
            0.032021157 = score(doc=3855,freq=1.0), product of:
              0.11204619 = queryWeight, product of:
                1.9861591 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0154217305 = queryNorm
              0.28578535 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.16646485 = weight(abstract_txt:relevance in 3855) [ClassicSimilarity], result of:
            0.16646485 = score(doc=3855,freq=2.0), product of:
              0.30549762 = queryWeight, product of:
                4.0166597 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0154217305 = queryNorm
              0.5448974 = fieldWeight in 3855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.22433469 = weight(abstract_txt:documents in 3855) [ClassicSimilarity], result of:
            0.22433469 = score(doc=3855,freq=6.0), product of:
              0.28444365 = queryWeight, product of:
                4.4753666 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0154217305 = queryNorm
              0.7886788 = fieldWeight in 3855, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
          0.2932273 = weight(abstract_txt:relevant in 3855) [ClassicSimilarity], result of:
            0.2932273 = score(doc=3855,freq=4.0), product of:
              0.40483943 = queryWeight, product of:
                5.6630206 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0154217305 = queryNorm
              0.72430515 = fieldWeight in 3855, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=3855)
        0.32 = coord(8/25)
    
  5. Lihui, C.; Lian, C.W.: Using Web structure and summarisation techniques for Web content mining (2005) 0.25
    0.253813 = sum of:
      0.253813 = product of:
        0.70503604 = sum of:
          0.013078107 = weight(abstract_txt:user in 1046) [ClassicSimilarity], result of:
            0.013078107 = score(doc=1046,freq=1.0), product of:
              0.056806624 = queryWeight, product of:
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0154217305 = queryNorm
              0.23022151 = fieldWeight in 1046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=1046)
          0.027186386 = weight(abstract_txt:given in 1046) [ClassicSimilarity], result of:
            0.027186386 = score(doc=1046,freq=1.0), product of:
              0.09252734 = queryWeight, product of:
                1.2762494 = boost
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.0154217305 = queryNorm
              0.29382005 = fieldWeight in 1046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.0625 = fieldNorm(doc=1046)
          0.030925926 = weight(abstract_txt:users in 1046) [ClassicSimilarity], result of:
            0.030925926 = score(doc=1046,freq=3.0), product of:
              0.0800278 = queryWeight, product of:
                1.4536724 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0154217305 = queryNorm
              0.3864398 = fieldWeight in 1046, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=1046)
          0.051233854 = weight(abstract_txt:search in 1046) [ClassicSimilarity], result of:
            0.051233854 = score(doc=1046,freq=4.0), product of:
              0.11204619 = queryWeight, product of:
                1.9861591 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0154217305 = queryNorm
              0.45725656 = fieldWeight in 1046, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=1046)
          0.07289571 = weight(abstract_txt:algorithm in 1046) [ClassicSimilarity], result of:
            0.07289571 = score(doc=1046,freq=1.0), product of:
              0.20442507 = queryWeight, product of:
                2.3233423 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0154217305 = queryNorm
              0.35658893 = fieldWeight in 1046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=1046)
          0.12570783 = weight(abstract_txt:clustering in 1046) [ClassicSimilarity], result of:
            0.12570783 = score(doc=1046,freq=1.0), product of:
              0.32355937 = queryWeight, product of:
                3.3751454 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0154217305 = queryNorm
              0.38851553 = fieldWeight in 1046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=1046)
          0.16310158 = weight(abstract_txt:relevance in 1046) [ClassicSimilarity], result of:
            0.16310158 = score(doc=1046,freq=3.0), product of:
              0.30549762 = queryWeight, product of:
                4.0166597 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0154217305 = queryNorm
              0.5338882 = fieldWeight in 1046, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=1046)
          0.103615746 = weight(abstract_txt:documents in 1046) [ClassicSimilarity], result of:
            0.103615746 = score(doc=1046,freq=2.0), product of:
              0.28444365 = queryWeight, product of:
                4.4753666 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0154217305 = queryNorm
              0.36427513 = fieldWeight in 1046, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1046)
          0.117290914 = weight(abstract_txt:relevant in 1046) [ClassicSimilarity], result of:
            0.117290914 = score(doc=1046,freq=1.0), product of:
              0.40483943 = queryWeight, product of:
                5.6630206 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0154217305 = queryNorm
              0.28972206 = fieldWeight in 1046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0625 = fieldNorm(doc=1046)
        0.36 = coord(9/25)