Document (#30210)

Author
Ruthven, I.
Lalmas, M.
Rijsbergen, K. van
Title
Combining and selecting characteristics of information use
Source
Journal of the American Society for Information Science and technology. 53(2002) no.5, S.378-396
Year
2002
Abstract
Ruthven, Lalmas, and van Rijsbergen use traditional term importance measures like inverse document frequency, noise, based upon in-document frequency, and term frequency supplemented by theme value which is calculated from differences of expected positions of words in a text from their actual positions, on the assumption that even distribution indicates term association with a main topic, and context, which is based on a query term's distance from the nearest other query term relative to the average expected distribution of all query terms in the document. They then define document characteristics like specificity, the sum of all idf values in a document over the total terms in the document, or document complexity, measured by the documents average idf value; and information to noise ratio, info-noise, tokens after stopping and stemming over tokens before these processes, measuring the ratio of useful and non-useful information in a document. Retrieval tests are then carried out using each characteristic, combinations of the characteristics, and relevance feedback to determine the correct combination of characteristics. A file ranks independently of query terms by both specificity and info-noise, but if presence of a query term is required unique rankings are generated. Tested on five standard collections the traditional characteristics out preformed the new characteristics, which did, however, out preform random retrieval. All possible combinations of characteristics were also tested both with and without a set of scaling weights applied. All characteristics can benefit by combination with another characteristic or set of characteristics and performance as a single characteristic is a good indicator of performance in combination. Larger combinations tended to be more effective than smaller ones and weighting increased precision measures of middle ranking combinations but decreased the ranking of poorer combinations. The best combinations vary for each collection, and in some collections with the addition of weighting. Finally, with all documents ranked by the all characteristics combination, they take the top 30 documents and calculate the characteristic scores for each term in both the relevant and the non-relevant sets. Then taking for each query term the characteristics whose average was higher for relevant than non-relevant documents the documents are re-ranked. The relevance feedback method of selecting characteristics can select a good set of characteristics for query terms.
Theme
Retrievalstudien

Similar documents (author)

  1. Ruthven, T.; Lalmas, M.; Rijsbergen, K.van: Incorporating user research behavior into relevance feedback (2003) 5.44
    5.4367194 = sum of:
      5.4367194 = sum of:
        1.7798518 = weight(author_txt:ruthven in 170) [ClassicSimilarity], result of:
          1.7798518 = score(doc=170,freq=1.0), product of:
            0.5704325 = queryWeight, product of:
              8.320479 = idf(docFreq=27, maxDocs=42306)
              0.06855765 = queryNorm
            3.1201797 = fieldWeight in 170, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.320479 = idf(docFreq=27, maxDocs=42306)
              0.375 = fieldNorm(doc=170)
        1.8032931 = weight(author_txt:rijsbergen in 170) [ClassicSimilarity], result of:
          1.8032931 = score(doc=170,freq=1.0), product of:
            0.5754301 = queryWeight, product of:
              1.0043709 = boost
              8.356848 = idf(docFreq=26, maxDocs=42306)
              0.06855765 = queryNorm
            3.133818 = fieldWeight in 170, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.356848 = idf(docFreq=26, maxDocs=42306)
              0.375 = fieldNorm(doc=170)
        1.8535746 = weight(author_txt:lalmas in 170) [ClassicSimilarity], result of:
          1.8535746 = score(doc=170,freq=1.0), product of:
            0.5860775 = queryWeight, product of:
              1.0136205 = boost
              8.433808 = idf(docFreq=24, maxDocs=42306)
              0.06855765 = queryNorm
            3.1626782 = fieldWeight in 170, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.433808 = idf(docFreq=24, maxDocs=42306)
              0.375 = fieldNorm(doc=170)
    
  2. Lalmas, M.; Ruthven, I.: ¬A model for structured document retrieval : empirical investigations (1997) 3.23
    3.2297122 = sum of:
      3.2297122 = product of:
        4.8445683 = sum of:
          2.3731358 = weight(author_txt:ruthven in 1728) [ClassicSimilarity], result of:
            2.3731358 = score(doc=1728,freq=1.0), product of:
              0.5704325 = queryWeight, product of:
                8.320479 = idf(docFreq=27, maxDocs=42306)
                0.06855765 = queryNorm
              4.1602397 = fieldWeight in 1728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.320479 = idf(docFreq=27, maxDocs=42306)
                0.5 = fieldNorm(doc=1728)
          2.4714327 = weight(author_txt:lalmas in 1728) [ClassicSimilarity], result of:
            2.4714327 = score(doc=1728,freq=1.0), product of:
              0.5860775 = queryWeight, product of:
                1.0136205 = boost
                8.433808 = idf(docFreq=24, maxDocs=42306)
                0.06855765 = queryNorm
              4.216904 = fieldWeight in 1728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.433808 = idf(docFreq=24, maxDocs=42306)
                0.5 = fieldNorm(doc=1728)
        0.6666667 = coord(2/3)
    
  3. Lalmas, M.; Ruthven, I.: Representing and retrieving structured documents using the Dempster-Shafer theory of evidence : modelling and evaluation (1998) 3.23
    3.2297122 = sum of:
      3.2297122 = product of:
        4.8445683 = sum of:
          2.3731358 = weight(author_txt:ruthven in 2077) [ClassicSimilarity], result of:
            2.3731358 = score(doc=2077,freq=1.0), product of:
              0.5704325 = queryWeight, product of:
                8.320479 = idf(docFreq=27, maxDocs=42306)
                0.06855765 = queryNorm
              4.1602397 = fieldWeight in 2077, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.320479 = idf(docFreq=27, maxDocs=42306)
                0.5 = fieldNorm(doc=2077)
          2.4714327 = weight(author_txt:lalmas in 2077) [ClassicSimilarity], result of:
            2.4714327 = score(doc=2077,freq=1.0), product of:
              0.5860775 = queryWeight, product of:
                1.0136205 = boost
                8.433808 = idf(docFreq=24, maxDocs=42306)
                0.06855765 = queryNorm
              4.216904 = fieldWeight in 2077, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.433808 = idf(docFreq=24, maxDocs=42306)
                0.5 = fieldNorm(doc=2077)
        0.6666667 = coord(2/3)
    
  4. Ruthven, I.; Lalmas, M.: Selective relevance feedback using term characteristics (1999) 3.23
    3.2297122 = sum of:
      3.2297122 = product of:
        4.8445683 = sum of:
          2.3731358 = weight(author_txt:ruthven in 4825) [ClassicSimilarity], result of:
            2.3731358 = score(doc=4825,freq=1.0), product of:
              0.5704325 = queryWeight, product of:
                8.320479 = idf(docFreq=27, maxDocs=42306)
                0.06855765 = queryNorm
              4.1602397 = fieldWeight in 4825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.320479 = idf(docFreq=27, maxDocs=42306)
                0.5 = fieldNorm(doc=4825)
          2.4714327 = weight(author_txt:lalmas in 4825) [ClassicSimilarity], result of:
            2.4714327 = score(doc=4825,freq=1.0), product of:
              0.5860775 = queryWeight, product of:
                1.0136205 = boost
                8.433808 = idf(docFreq=24, maxDocs=42306)
                0.06855765 = queryNorm
              4.216904 = fieldWeight in 4825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.433808 = idf(docFreq=24, maxDocs=42306)
                0.5 = fieldNorm(doc=4825)
        0.6666667 = coord(2/3)
    
  5. Rijsbergen, C.J. van; Lalmas, M.: Information calculus for information retrieval (1996) 2.84
    2.8442307 = sum of:
      2.8442307 = product of:
        4.266346 = sum of:
          2.103842 = weight(author_txt:rijsbergen in 4270) [ClassicSimilarity], result of:
            2.103842 = score(doc=4270,freq=1.0), product of:
              0.5754301 = queryWeight, product of:
                1.0043709 = boost
                8.356848 = idf(docFreq=26, maxDocs=42306)
                0.06855765 = queryNorm
              3.6561208 = fieldWeight in 4270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.356848 = idf(docFreq=26, maxDocs=42306)
                0.4375 = fieldNorm(doc=4270)
          2.1625037 = weight(author_txt:lalmas in 4270) [ClassicSimilarity], result of:
            2.1625037 = score(doc=4270,freq=1.0), product of:
              0.5860775 = queryWeight, product of:
                1.0136205 = boost
                8.433808 = idf(docFreq=24, maxDocs=42306)
                0.06855765 = queryNorm
              3.6897912 = fieldWeight in 4270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.433808 = idf(docFreq=24, maxDocs=42306)
                0.4375 = fieldNorm(doc=4270)
        0.6666667 = coord(2/3)
    

Similar documents (content)

  1. Smith, M.P.; Pollitt, S.A.: ¬A comparison of ranking formulae and their ranks (1995) 0.46
    0.4604067 = sum of:
      0.4604067 = product of:
        0.95918065 = sum of:
          0.049856026 = weight(abstract_txt:expected in 5871) [ClassicSimilarity], result of:
            0.049856026 = score(doc=5871,freq=1.0), product of:
              0.10346446 = queryWeight, product of:
                6.167887 = idf(docFreq=240, maxDocs=42306)
                0.016774701 = queryNorm
              0.48186618 = fieldWeight in 5871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.167887 = idf(docFreq=240, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.008567527 = weight(abstract_txt:with in 5871) [ClassicSimilarity], result of:
            0.008567527 = score(doc=5871,freq=1.0), product of:
              0.043404 = queryWeight, product of:
                1.0240927 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.016774701 = queryNorm
              0.19739026 = fieldWeight in 5871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.07859294 = weight(abstract_txt:ranked in 5871) [ClassicSimilarity], result of:
            0.07859294 = score(doc=5871,freq=2.0), product of:
              0.11123082 = queryWeight, product of:
                1.0368525 = boost
                6.395189 = idf(docFreq=191, maxDocs=42306)
                0.016774701 = queryNorm
              0.7065752 = fieldWeight in 5871, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.395189 = idf(docFreq=191, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.07196313 = weight(abstract_txt:weighting in 5871) [ClassicSimilarity], result of:
            0.07196313 = score(doc=5871,freq=1.0), product of:
              0.13214563 = queryWeight, product of:
                1.1301363 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.016774701 = queryNorm
              0.54457444 = fieldWeight in 5871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.030250857 = weight(abstract_txt:each in 5871) [ClassicSimilarity], result of:
            0.030250857 = score(doc=5871,freq=1.0), product of:
              0.093428895 = queryWeight, product of:
                1.3438785 = boost
                4.1444454 = idf(docFreq=1822, maxDocs=42306)
                0.016774701 = queryNorm
              0.3237848 = fieldWeight in 5871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1444454 = idf(docFreq=1822, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.06743872 = weight(abstract_txt:frequency in 5871) [ClassicSimilarity], result of:
            0.06743872 = score(doc=5871,freq=1.0), product of:
              0.14486031 = queryWeight, product of:
                1.449188 = boost
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.016774701 = queryNorm
              0.46554312 = fieldWeight in 5871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.07459941 = weight(abstract_txt:relevant in 5871) [ClassicSimilarity], result of:
            0.07459941 = score(doc=5871,freq=3.0), product of:
              0.118242234 = queryWeight, product of:
                1.5118395 = boost
                4.662428 = idf(docFreq=1085, maxDocs=42306)
                0.016774701 = queryNorm
              0.63090324 = fieldWeight in 5871, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.662428 = idf(docFreq=1085, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.09071576 = weight(abstract_txt:documents in 5871) [ClassicSimilarity], result of:
            0.09071576 = score(doc=5871,freq=6.0), product of:
              0.11517658 = queryWeight, product of:
                1.6682322 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.016774701 = queryNorm
              0.7876233 = fieldWeight in 5871, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.111576594 = weight(abstract_txt:query in 5871) [ClassicSimilarity], result of:
            0.111576594 = score(doc=5871,freq=2.0), product of:
              0.21332456 = queryWeight, product of:
                2.6863267 = boost
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.016774701 = queryNorm
              0.5230368 = fieldWeight in 5871, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.14567399 = weight(abstract_txt:term in 5871) [ClassicSimilarity], result of:
            0.14567399 = score(doc=5871,freq=3.0), product of:
              0.22261259 = queryWeight, product of:
                2.7441843 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.016774701 = queryNorm
              0.6543834 = fieldWeight in 5871, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.06660235 = weight(abstract_txt:document in 5871) [ClassicSimilarity], result of:
            0.06660235 = score(doc=5871,freq=1.0), product of:
              0.19921672 = queryWeight, product of:
                2.7752192 = boost
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.016774701 = queryNorm
              0.33432108 = fieldWeight in 5871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
          0.1633433 = weight(abstract_txt:characteristics in 5871) [ClassicSimilarity], result of:
            0.1633433 = score(doc=5871,freq=1.0), product of:
              0.42594296 = queryWeight, product of:
                5.172932 = boost
                4.908625 = idf(docFreq=848, maxDocs=42306)
                0.016774701 = queryNorm
              0.38348633 = fieldWeight in 5871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.908625 = idf(docFreq=848, maxDocs=42306)
                0.078125 = fieldNorm(doc=5871)
        0.48 = coord(12/25)
    
  2. Wong, S.K.M.; Yao, Y.Y.: ¬An information-theoretic measure of term specifics (1992) 0.32
    0.31944576 = sum of:
      0.31944576 = product of:
        0.998268 = sum of:
          0.122125484 = weight(abstract_txt:weighting in 4807) [ClassicSimilarity], result of:
            0.122125484 = score(doc=4807,freq=2.0), product of:
              0.13214563 = queryWeight, product of:
                1.1301363 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.016774701 = queryNorm
              0.9241735 = fieldWeight in 4807, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.09375 = fieldNorm(doc=4807)
          0.11158342 = weight(abstract_txt:ratio in 4807) [ClassicSimilarity], result of:
            0.11158342 = score(doc=4807,freq=1.0), product of:
              0.15676835 = queryWeight, product of:
                1.2309307 = boost
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.016774701 = queryNorm
              0.7117726 = fieldWeight in 4807, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.09375 = fieldNorm(doc=4807)
          0.036301028 = weight(abstract_txt:each in 4807) [ClassicSimilarity], result of:
            0.036301028 = score(doc=4807,freq=1.0), product of:
              0.093428895 = queryWeight, product of:
                1.3438785 = boost
                4.1444454 = idf(docFreq=1822, maxDocs=42306)
                0.016774701 = queryNorm
              0.38854176 = fieldWeight in 4807, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1444454 = idf(docFreq=1822, maxDocs=42306)
                0.09375 = fieldNorm(doc=4807)
          0.11444731 = weight(abstract_txt:frequency in 4807) [ClassicSimilarity], result of:
            0.11444731 = score(doc=4807,freq=2.0), product of:
              0.14486031 = queryWeight, product of:
                1.449188 = boost
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.016774701 = queryNorm
              0.7900529 = fieldWeight in 4807, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.09375 = fieldNorm(doc=4807)
          0.04444146 = weight(abstract_txt:documents in 4807) [ClassicSimilarity], result of:
            0.04444146 = score(doc=4807,freq=1.0), product of:
              0.11517658 = queryWeight, product of:
                1.6682322 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.016774701 = queryNorm
              0.38585502 = fieldWeight in 4807, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.09375 = fieldNorm(doc=4807)
          0.24222952 = weight(abstract_txt:noise in 4807) [ClassicSimilarity], result of:
            0.24222952 = score(doc=4807,freq=1.0), product of:
              0.33114636 = queryWeight, product of:
                2.5300517 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.016774701 = queryNorm
              0.7314878 = fieldWeight in 4807, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.09375 = fieldNorm(doc=4807)
          0.24721698 = weight(abstract_txt:term in 4807) [ClassicSimilarity], result of:
            0.24721698 = score(doc=4807,freq=6.0), product of:
              0.22261259 = queryWeight, product of:
                2.7441843 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.016774701 = queryNorm
              1.1105256 = fieldWeight in 4807, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.09375 = fieldNorm(doc=4807)
          0.07992282 = weight(abstract_txt:document in 4807) [ClassicSimilarity], result of:
            0.07992282 = score(doc=4807,freq=1.0), product of:
              0.19921672 = queryWeight, product of:
                2.7752192 = boost
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.016774701 = queryNorm
              0.40118527 = fieldWeight in 4807, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.09375 = fieldNorm(doc=4807)
        0.32 = coord(8/25)
    
  3. Seo, H.-C.; Kim, S.-B.; Rim, H.-C.; Myaeng, S.-H.: lmproving query translation in English-Korean Cross-language information retrieval (2005) 0.29
    0.2895421 = sum of:
      0.2895421 = product of:
        0.90481913 = sum of:
          0.008567527 = weight(abstract_txt:with in 3024) [ClassicSimilarity], result of:
            0.008567527 = score(doc=3024,freq=1.0), product of:
              0.043404 = queryWeight, product of:
                1.0240927 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.016774701 = queryNorm
              0.19739026 = fieldWeight in 3024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.078125 = fieldNorm(doc=3024)
          0.08431375 = weight(abstract_txt:selecting in 3024) [ClassicSimilarity], result of:
            0.08431375 = score(doc=3024,freq=2.0), product of:
              0.11656505 = queryWeight, product of:
                1.0614232 = boost
                6.5467386 = idf(docFreq=164, maxDocs=42306)
                0.016774701 = queryNorm
              0.7233193 = fieldWeight in 3024, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5467386 = idf(docFreq=164, maxDocs=42306)
                0.078125 = fieldNorm(doc=3024)
          0.063583106 = weight(abstract_txt:terms in 3024) [ClassicSimilarity], result of:
            0.063583106 = score(doc=3024,freq=5.0), product of:
              0.08965213 = queryWeight, product of:
                1.3164358 = boost
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.016774701 = queryNorm
              0.7092203 = fieldWeight in 3024, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.078125 = fieldNorm(doc=3024)
          0.042781167 = weight(abstract_txt:each in 3024) [ClassicSimilarity], result of:
            0.042781167 = score(doc=3024,freq=2.0), product of:
              0.093428895 = queryWeight, product of:
                1.3438785 = boost
                4.1444454 = idf(docFreq=1822, maxDocs=42306)
                0.016774701 = queryNorm
              0.45790082 = fieldWeight in 3024, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1444454 = idf(docFreq=1822, maxDocs=42306)
                0.078125 = fieldNorm(doc=3024)
          0.11642261 = weight(abstract_txt:combination in 3024) [ClassicSimilarity], result of:
            0.11642261 = score(doc=3024,freq=2.0), product of:
              0.18211162 = queryWeight, product of:
                1.8762393 = boost
                5.7862163 = idf(docFreq=352, maxDocs=42306)
                0.016774701 = queryNorm
              0.6392926 = fieldWeight in 3024, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7862163 = idf(docFreq=352, maxDocs=42306)
                0.078125 = fieldNorm(doc=3024)
          0.22315319 = weight(abstract_txt:query in 3024) [ClassicSimilarity], result of:
            0.22315319 = score(doc=3024,freq=8.0), product of:
              0.21332456 = queryWeight, product of:
                2.6863267 = boost
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.016774701 = queryNorm
              1.0460736 = fieldWeight in 3024, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.078125 = fieldNorm(doc=3024)
          0.14567399 = weight(abstract_txt:term in 3024) [ClassicSimilarity], result of:
            0.14567399 = score(doc=3024,freq=3.0), product of:
              0.22261259 = queryWeight, product of:
                2.7441843 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.016774701 = queryNorm
              0.6543834 = fieldWeight in 3024, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.078125 = fieldNorm(doc=3024)
          0.2203238 = weight(abstract_txt:combinations in 3024) [ClassicSimilarity], result of:
            0.2203238 = score(doc=3024,freq=1.0), product of:
              0.40184706 = queryWeight, product of:
                3.4134648 = boost
                7.0179553 = idf(docFreq=102, maxDocs=42306)
                0.016774701 = queryNorm
              0.54827774 = fieldWeight in 3024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0179553 = idf(docFreq=102, maxDocs=42306)
                0.078125 = fieldNorm(doc=3024)
        0.32 = coord(8/25)
    
  4. Kim, W.; Wilbur, W.J.: Corpus-based statistical screening for content-bearing terms (2001) 0.27
    0.26921052 = sum of:
      0.26921052 = product of:
        0.6118421 = sum of:
          0.008903636 = weight(abstract_txt:with in 189) [ClassicSimilarity], result of:
            0.008903636 = score(doc=189,freq=3.0), product of:
              0.043404 = queryWeight, product of:
                1.0240927 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.016774701 = queryNorm
              0.20513397 = fieldWeight in 189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
          0.057753783 = weight(abstract_txt:ranked in 189) [ClassicSimilarity], result of:
            0.057753783 = score(doc=189,freq=3.0), product of:
              0.11123082 = queryWeight, product of:
                1.0368525 = boost
                6.395189 = idf(docFreq=191, maxDocs=42306)
                0.016774701 = queryNorm
              0.51922464 = fieldWeight in 189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.395189 = idf(docFreq=191, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
          0.019416012 = weight(abstract_txt:then in 189) [ClassicSimilarity], result of:
            0.019416012 = score(doc=189,freq=1.0), product of:
              0.08878693 = queryWeight, product of:
                1.1345524 = boost
                4.665194 = idf(docFreq=1082, maxDocs=42306)
                0.016774701 = queryNorm
              0.21868098 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.665194 = idf(docFreq=1082, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
          0.07540198 = weight(abstract_txt:specificity in 189) [ClassicSimilarity], result of:
            0.07540198 = score(doc=189,freq=2.0), product of:
              0.15209809 = queryWeight, product of:
                1.2124568 = boost
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.016774701 = queryNorm
              0.49574572 = fieldWeight in 189, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
          0.034122277 = weight(abstract_txt:terms in 189) [ClassicSimilarity], result of:
            0.034122277 = score(doc=189,freq=4.0), product of:
              0.08965213 = queryWeight, product of:
                1.3164358 = boost
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.016774701 = queryNorm
              0.38060755 = fieldWeight in 189, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
          0.0444595 = weight(abstract_txt:each in 189) [ClassicSimilarity], result of:
            0.0444595 = score(doc=189,freq=6.0), product of:
              0.093428895 = queryWeight, product of:
                1.3438785 = boost
                4.1444454 = idf(docFreq=1822, maxDocs=42306)
                0.016774701 = queryNorm
              0.47586456 = fieldWeight in 189, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.1444454 = idf(docFreq=1822, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
          0.07958476 = weight(abstract_txt:average in 189) [ClassicSimilarity], result of:
            0.07958476 = score(doc=189,freq=4.0), product of:
              0.14325473 = queryWeight, product of:
                1.4411345 = boost
                5.9258366 = idf(docFreq=306, maxDocs=42306)
                0.016774701 = queryNorm
              0.5555472 = fieldWeight in 189, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9258366 = idf(docFreq=306, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
          0.057223655 = weight(abstract_txt:frequency in 189) [ClassicSimilarity], result of:
            0.057223655 = score(doc=189,freq=2.0), product of:
              0.14486031 = queryWeight, product of:
                1.449188 = boost
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.016774701 = queryNorm
              0.39502645 = fieldWeight in 189, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
          0.04968707 = weight(abstract_txt:documents in 189) [ClassicSimilarity], result of:
            0.04968707 = score(doc=189,freq=5.0), product of:
              0.11517658 = queryWeight, product of:
                1.6682322 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.016774701 = queryNorm
              0.43139905 = fieldWeight in 189, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
          0.0874044 = weight(abstract_txt:term in 189) [ClassicSimilarity], result of:
            0.0874044 = score(doc=189,freq=3.0), product of:
              0.22261259 = queryWeight, product of:
                2.7441843 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.016774701 = queryNorm
              0.39263007 = fieldWeight in 189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
          0.097885065 = weight(abstract_txt:document in 189) [ClassicSimilarity], result of:
            0.097885065 = score(doc=189,freq=6.0), product of:
              0.19921672 = queryWeight, product of:
                2.7752192 = boost
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.016774701 = queryNorm
              0.49134964 = fieldWeight in 189, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.046875 = fieldNorm(doc=189)
        0.44 = coord(11/25)
    
  5. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: Beyond bag-of-words : bigram-enhanced context-dependent term weights (2014) 0.27
    0.26512343 = sum of:
      0.26512343 = product of:
        0.66280854 = sum of:
          0.011994538 = weight(abstract_txt:with in 3284) [ClassicSimilarity], result of:
            0.011994538 = score(doc=3284,freq=4.0), product of:
              0.043404 = queryWeight, product of:
                1.0240927 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.016774701 = queryNorm
              0.27634636 = fieldWeight in 3284, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3284)
          0.05037419 = weight(abstract_txt:weighting in 3284) [ClassicSimilarity], result of:
            0.05037419 = score(doc=3284,freq=1.0), product of:
              0.13214563 = queryWeight, product of:
                1.1301363 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.016774701 = queryNorm
              0.3812021 = fieldWeight in 3284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3284)
          0.019904662 = weight(abstract_txt:terms in 3284) [ClassicSimilarity], result of:
            0.019904662 = score(doc=3284,freq=1.0), product of:
              0.08965213 = queryWeight, product of:
                1.3164358 = boost
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.016774701 = queryNorm
              0.22202107 = fieldWeight in 3284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3284)
          0.021175599 = weight(abstract_txt:each in 3284) [ClassicSimilarity], result of:
            0.021175599 = score(doc=3284,freq=1.0), product of:
              0.093428895 = queryWeight, product of:
                1.3438785 = boost
                4.1444454 = idf(docFreq=1822, maxDocs=42306)
                0.016774701 = queryNorm
              0.22664936 = fieldWeight in 3284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1444454 = idf(docFreq=1822, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3284)
          0.06676093 = weight(abstract_txt:frequency in 3284) [ClassicSimilarity], result of:
            0.06676093 = score(doc=3284,freq=2.0), product of:
              0.14486031 = queryWeight, product of:
                1.449188 = boost
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.016774701 = queryNorm
              0.4608642 = fieldWeight in 3284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.958952 = idf(docFreq=296, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3284)
          0.025924187 = weight(abstract_txt:documents in 3284) [ClassicSimilarity], result of:
            0.025924187 = score(doc=3284,freq=1.0), product of:
              0.11517658 = queryWeight, product of:
                1.6682322 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.016774701 = queryNorm
              0.2250821 = fieldWeight in 3284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3284)
          0.14130056 = weight(abstract_txt:noise in 3284) [ClassicSimilarity], result of:
            0.14130056 = score(doc=3284,freq=1.0), product of:
              0.33114636 = queryWeight, product of:
                2.5300517 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.016774701 = queryNorm
              0.42670122 = fieldWeight in 3284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3284)
          0.07810362 = weight(abstract_txt:query in 3284) [ClassicSimilarity], result of:
            0.07810362 = score(doc=3284,freq=2.0), product of:
              0.21332456 = queryWeight, product of:
                2.6863267 = boost
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.016774701 = queryNorm
              0.36612576 = fieldWeight in 3284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.733989 = idf(docFreq=1010, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3284)
          0.16651924 = weight(abstract_txt:term in 3284) [ClassicSimilarity], result of:
            0.16651924 = score(doc=3284,freq=8.0), product of:
              0.22261259 = queryWeight, product of:
                2.7441843 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.016774701 = queryNorm
              0.74802256 = fieldWeight in 3284, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3284)
          0.080751054 = weight(abstract_txt:document in 3284) [ClassicSimilarity], result of:
            0.080751054 = score(doc=3284,freq=3.0), product of:
              0.19921672 = queryWeight, product of:
                2.7752192 = boost
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.016774701 = queryNorm
              0.40534276 = fieldWeight in 3284, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3284)
        0.4 = coord(10/25)