Document (#32978)

Author
Lam-Adesina, A.M.
Jones, G.J.F.
Title
Examining and improving the effectiveness of relevance feedback for retrieval of scanned text documents
Source
Information processing and management. 42(2006) no.3, p.633-649
Year
2006
Abstract
Important legacy paper documents are digitized and collected in online accessible archives. This enables the preservation, sharing and, significantly, the searching of these documents. The text contents of these document images can be transcribed automatically using OCR systems and then stored in an information retrieval system. However, OCR systems make character recognition errors, which have previously been shown to affect document retrieval behaviour. In particular, relevance feedback query-expansion methods, which are often effective for improving electronic text retrieval, are observed to be less reliable for retrieval of scanned document images. Our experimental examination of the effects of character recognition errors on an ad hoc OCR retrieval task demonstrates that, while baseline information retrieval can remain relatively unaffected by transcription errors, relevance feedback via query expansion becomes highly unstable. This paper examines the reason for this behaviour and introduces novel modifications to standard relevance feedback methods. These methods are shown experimentally to improve the effectiveness of relevance feedback for errorful OCR transcriptions. The new methods combine similar recognised character strings based on term collection frequency and a string edit-distance measure. The techniques are domain independent and make no use of external resources such as dictionaries or training data.
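The abstract names two ingredients for combining OCR variants: term collection frequency and a string edit-distance measure. The following is a minimal Python sketch of how such a combination could look. It is not the authors' algorithm: the function names, the `max_dist` and `min_ratio` thresholds, and the toy frequency table are all assumptions made for illustration, since the abstract does not give these details.

```python
def edit_distance(a, b):
    """Levenshtein distance between strings a and b (unit costs)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def merge_variants(coll_freq, max_dist=1, min_ratio=10):
    """Map each rare string to a much more frequent string close to it.

    coll_freq: dict of term -> collection frequency.
    max_dist and min_ratio are hypothetical thresholds, not values
    taken from the paper.
    """
    mapping = {}
    # Most frequent terms first, so candidates are tried in that order.
    by_freq = sorted(coll_freq, key=lambda t: (-coll_freq[t], t))
    for term in by_freq:
        for cand in by_freq:
            if coll_freq[cand] < min_ratio * coll_freq[term]:
                break  # all remaining candidates are too infrequent
            if cand != term and edit_distance(term, cand) <= max_dist:
                mapping[term] = cand
                break
    return mapping


# Toy collection frequencies (invented for this example).
freqs = {"retrieval": 120, "retrieva1": 3, "retneval": 2,
         "feedback": 80, "feedbock": 1}
print(merge_variants(freqs, max_dist=2))
```

With this toy table, the three rare strings are mapped onto "retrieval" and "feedback", while the frequent terms themselves are left unmapped. No dictionary or training data is consulted, which is in the spirit of the last sentence of the abstract.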
Theme
Document management

Similar documents (author)

  1. Jones, M.H.: Year's work in cataloging and classification : 1978 (1979) 4.33
    4.334196 = sum of:
      4.334196 = weight(author_txt:jones in 308) [ClassicSimilarity], result of:
        4.334196 = fieldWeight in 308, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.9347134 = idf(docFreq=116, maxDocs=44218)
          0.625 = fieldNorm(doc=308)
    
  2. Jones, K.P.: Natural-language processing and automatic indexing : a reply (1990) 4.33
    4.334196 = sum of:
      4.334196 = weight(author_txt:jones in 394) [ClassicSimilarity], result of:
        4.334196 = fieldWeight in 394, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.9347134 = idf(docFreq=116, maxDocs=44218)
          0.625 = fieldNorm(doc=394)
    
  3. Jones, R.M.: Online catalogue research in Europe (1989) 4.33
    4.334196 = sum of:
      4.334196 = weight(author_txt:jones in 796) [ClassicSimilarity], result of:
        4.334196 = fieldWeight in 796, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.9347134 = idf(docFreq=116, maxDocs=44218)
          0.625 = fieldNorm(doc=796)
    
  4. Jones, R.L.: Automatic document content analysis : the AIDA project (1992) 4.33
    4.334196 = sum of:
      4.334196 = weight(author_txt:jones in 2607) [ClassicSimilarity], result of:
        4.334196 = fieldWeight in 2607, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.9347134 = idf(docFreq=116, maxDocs=44218)
          0.625 = fieldNorm(doc=2607)
    
  5. Jones, K.P.: How do we index? : a report of some Aslib Information Group activity (1983) 4.33
    4.334196 = sum of:
      4.334196 = weight(author_txt:jones in 2736) [ClassicSimilarity], result of:
        4.334196 = fieldWeight in 2736, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.9347134 = idf(docFreq=116, maxDocs=44218)
          0.625 = fieldNorm(doc=2736)
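
The author scores above can be checked by hand. The sketch below recomputes one `fieldWeight` from the values printed in the listing, assuming the formulas that reproduce those numbers: tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative and are not part of any Lucene API.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as it appears in the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:jones entries above.
w = field_weight(freq=1.0, doc_freq=116, max_docs=44218, field_norm=0.625)
print(round(w, 4))  # about 4.3342, in line with 4.334196 in the listing
```

Because every author match here has the same term frequency, document frequency and field norm, all five entries receive the same score.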
    

Similar documents (content)

  1. Taghva, K.; Borsack, J.; Condit, A.: Effects of OCR errors on ranking and feedback using the vector space model (1996) 0.36
    0.36300078 = sum of:
      0.36300078 = product of:
        1.1343775 = sum of:
          0.050347336 = weight(abstract_txt:text in 4951) [ClassicSimilarity], result of:
            0.050347336 = score(doc=4951,freq=1.0), product of:
              0.11383128 = queryWeight, product of:
                1.3686874 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020566508 = queryNorm
              0.4422979 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.109375 = fieldNorm(doc=4951)
          0.11639889 = weight(abstract_txt:recognition in 4951) [ClassicSimilarity], result of:
            0.11639889 = score(doc=4951,freq=1.0), product of:
              0.17386524 = queryWeight, product of:
                1.3811289 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.020566508 = queryNorm
              0.66947764 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.109375 = fieldNorm(doc=4951)
          0.053295385 = weight(abstract_txt:documents in 4951) [ClassicSimilarity], result of:
            0.053295385 = score(doc=4951,freq=1.0), product of:
              0.11823255 = queryWeight, product of:
                1.3948965 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020566508 = queryNorm
              0.45076746 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.109375 = fieldNorm(doc=4951)
          0.06022126 = weight(abstract_txt:document in 4951) [ClassicSimilarity], result of:
            0.06022126 = score(doc=4951,freq=1.0), product of:
              0.1282657 = queryWeight, product of:
                1.4528766 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020566508 = queryNorm
              0.46950403 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.109375 = fieldNorm(doc=4951)
          0.21054752 = weight(abstract_txt:character in 4951) [ClassicSimilarity], result of:
            0.21054752 = score(doc=4951,freq=1.0), product of:
              0.29546818 = queryWeight, product of:
                2.2051027 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.020566508 = queryNorm
              0.7125895 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.109375 = fieldNorm(doc=4951)
          0.30248523 = weight(abstract_txt:errors in 4951) [ClassicSimilarity], result of:
            0.30248523 = score(doc=4951,freq=2.0), product of:
              0.29858646 = queryWeight, product of:
                2.2167082 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.020566508 = queryNorm
              1.0130575 = fieldWeight in 4951, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.109375 = fieldNorm(doc=4951)
          0.07455548 = weight(abstract_txt:retrieval in 4951) [ClassicSimilarity], result of:
            0.07455548 = score(doc=4951,freq=1.0), product of:
              0.19615044 = queryWeight, product of:
                2.7444572 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020566508 = queryNorm
              0.38009337 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=4951)
          0.2665264 = weight(abstract_txt:feedback in 4951) [ClassicSimilarity], result of:
            0.2665264 = score(doc=4951,freq=1.0), product of:
              0.40994006 = queryWeight, product of:
                3.3531888 = boost
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.020566508 = queryNorm
              0.6501594 = fieldWeight in 4951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.109375 = fieldNorm(doc=4951)
        0.32 = coord(8/25)
    
  2. Salton, G.; Buckley, C.: Improving retrieval performance by relevance feedback (1990) 0.35
    0.35367352 = sum of:
      0.35367352 = product of:
        1.2631197 = sum of:
          0.06231603 = weight(abstract_txt:query in 5442) [ClassicSimilarity], result of:
            0.06231603 = score(doc=5442,freq=1.0), product of:
              0.1048702 = queryWeight, product of:
                1.0726397 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.020566508 = queryNorm
              0.5942206 = fieldWeight in 5442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.125 = fieldNorm(doc=5442)
          0.076874614 = weight(abstract_txt:effectiveness in 5442) [ClassicSimilarity], result of:
            0.076874614 = score(doc=5442,freq=1.0), product of:
              0.12062599 = queryWeight, product of:
                1.1503984 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.020566508 = queryNorm
              0.6372973 = fieldWeight in 5442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.125 = fieldNorm(doc=5442)
          0.057539817 = weight(abstract_txt:text in 5442) [ClassicSimilarity], result of:
            0.057539817 = score(doc=5442,freq=1.0), product of:
              0.11383128 = queryWeight, product of:
                1.3686874 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020566508 = queryNorm
              0.5054833 = fieldWeight in 5442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=5442)
          0.11699194 = weight(abstract_txt:methods in 5442) [ClassicSimilarity], result of:
            0.11699194 = score(doc=5442,freq=2.0), product of:
              0.15959632 = queryWeight, product of:
                1.8713467 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020566508 = queryNorm
              0.7330491 = fieldWeight in 5442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.125 = fieldNorm(doc=5442)
          0.12049985 = weight(abstract_txt:retrieval in 5442) [ClassicSimilarity], result of:
            0.12049985 = score(doc=5442,freq=2.0), product of:
              0.19615044 = queryWeight, product of:
                2.7444572 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020566508 = queryNorm
              0.6143236 = fieldWeight in 5442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=5442)
          0.30131203 = weight(abstract_txt:relevance in 5442) [ClassicSimilarity], result of:
            0.30131203 = score(doc=5442,freq=3.0), product of:
              0.28218645 = queryWeight, product of:
                2.7820563 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.020566508 = queryNorm
              1.0677764 = fieldWeight in 5442, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.125 = fieldNorm(doc=5442)
          0.5275854 = weight(abstract_txt:feedback in 5442) [ClassicSimilarity], result of:
            0.5275854 = score(doc=5442,freq=3.0), product of:
              0.40994006 = queryWeight, product of:
                3.3531888 = boost
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.020566508 = queryNorm
              1.2869818 = fieldWeight in 5442, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.125 = fieldNorm(doc=5442)
        0.28 = coord(7/25)
    
  3. Ye, Z.; Huang, J.X.: A learning to rank approach for quality-aware pseudo-relevance feedback (2016) 0.35
    0.35131818 = sum of:
      0.35131818 = product of:
        0.8782954 = sum of:
          0.029803801 = weight(abstract_txt:make in 2855) [ClassicSimilarity], result of:
            0.029803801 = score(doc=2855,freq=1.0), product of:
              0.10180912 = queryWeight, product of:
                1.056869 = boost
                4.6838713 = idf(docFreq=1110, maxDocs=44218)
                0.020566508 = queryNorm
              0.29274195 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6838713 = idf(docFreq=1110, maxDocs=44218)
                0.0625 = fieldNorm(doc=2855)
          0.014097948 = weight(abstract_txt:these in 2855) [ClassicSimilarity], result of:
            0.014097948 = score(doc=2855,freq=1.0), product of:
              0.070752196 = queryWeight, product of:
                1.0790546 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.020566508 = queryNorm
              0.19925809 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.0625 = fieldNorm(doc=2855)
          0.05059901 = weight(abstract_txt:shown in 2855) [ClassicSimilarity], result of:
            0.05059901 = score(doc=2855,freq=1.0), product of:
              0.14488839 = queryWeight, product of:
                1.2607954 = boost
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.020566508 = queryNorm
              0.3492275 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.0625 = fieldNorm(doc=2855)
          0.028769908 = weight(abstract_txt:text in 2855) [ClassicSimilarity], result of:
            0.028769908 = score(doc=2855,freq=1.0), product of:
              0.11383128 = queryWeight, product of:
                1.3686874 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020566508 = queryNorm
              0.25274166 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2855)
          0.05274875 = weight(abstract_txt:documents in 2855) [ClassicSimilarity], result of:
            0.05274875 = score(doc=2855,freq=3.0), product of:
              0.11823255 = queryWeight, product of:
                1.3948965 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020566508 = queryNorm
              0.44614407 = fieldWeight in 2855, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2855)
          0.076947905 = weight(abstract_txt:document in 2855) [ClassicSimilarity], result of:
            0.076947905 = score(doc=2855,freq=5.0), product of:
              0.1282657 = queryWeight, product of:
                1.4528766 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.020566508 = queryNorm
              0.59991026 = fieldWeight in 2855, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=2855)
          0.041362897 = weight(abstract_txt:methods in 2855) [ClassicSimilarity], result of:
            0.041362897 = score(doc=2855,freq=1.0), product of:
              0.15959632 = queryWeight, product of:
                1.8713467 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020566508 = queryNorm
              0.259172 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=2855)
          0.060249925 = weight(abstract_txt:retrieval in 2855) [ClassicSimilarity], result of:
            0.060249925 = score(doc=2855,freq=2.0), product of:
              0.19615044 = queryWeight, product of:
                2.7444572 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020566508 = queryNorm
              0.3071618 = fieldWeight in 2855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2855)
          0.15065601 = weight(abstract_txt:relevance in 2855) [ClassicSimilarity], result of:
            0.15065601 = score(doc=2855,freq=3.0), product of:
              0.28218645 = queryWeight, product of:
                2.7820563 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.020566508 = queryNorm
              0.5338882 = fieldWeight in 2855, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=2855)
          0.37305924 = weight(abstract_txt:feedback in 2855) [ClassicSimilarity], result of:
            0.37305924 = score(doc=2855,freq=6.0), product of:
              0.40994006 = queryWeight, product of:
                3.3531888 = boost
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.020566508 = queryNorm
              0.91003364 = fieldWeight in 2855, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.0625 = fieldNorm(doc=2855)
        0.4 = coord(10/25)
    
  4. He, D.; Wu, D.: Enhancing query translation with relevance feedback in translingual information retrieval : a study of the medication process (2011) 0.34
    0.34413132 = sum of:
      0.34413132 = product of:
        0.95592034 = sum of:
          0.076321244 = weight(abstract_txt:query in 4244) [ClassicSimilarity], result of:
            0.076321244 = score(doc=4244,freq=6.0), product of:
              0.1048702 = queryWeight, product of:
                1.0726397 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.020566508 = queryNorm
              0.72776866 = fieldWeight in 4244, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.038437307 = weight(abstract_txt:effectiveness in 4244) [ClassicSimilarity], result of:
            0.038437307 = score(doc=4244,freq=1.0), product of:
              0.12062599 = queryWeight, product of:
                1.1503984 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.020566508 = queryNorm
              0.31864864 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.059685376 = weight(abstract_txt:improving in 4244) [ClassicSimilarity], result of:
            0.059685376 = score(doc=4244,freq=1.0), product of:
              0.16175245 = queryWeight, product of:
                1.3321503 = boost
                5.9038734 = idf(docFreq=327, maxDocs=44218)
                0.020566508 = queryNorm
              0.3689921 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9038734 = idf(docFreq=327, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.114358 = weight(abstract_txt:expansion in 4244) [ClassicSimilarity], result of:
            0.114358 = score(doc=4244,freq=3.0), product of:
              0.17301199 = queryWeight, product of:
                1.3777357 = boost
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.020566508 = queryNorm
              0.6609831 = fieldWeight in 4244, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.030454507 = weight(abstract_txt:documents in 4244) [ClassicSimilarity], result of:
            0.030454507 = score(doc=4244,freq=1.0), product of:
              0.11823255 = queryWeight, product of:
                1.3948965 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020566508 = queryNorm
              0.2575814 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.041362897 = weight(abstract_txt:methods in 4244) [ClassicSimilarity], result of:
            0.041362897 = score(doc=4244,freq=1.0), product of:
              0.15959632 = queryWeight, product of:
                1.8713467 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020566508 = queryNorm
              0.259172 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.060249925 = weight(abstract_txt:retrieval in 4244) [ClassicSimilarity], result of:
            0.060249925 = score(doc=4244,freq=2.0), product of:
              0.19615044 = queryWeight, product of:
                2.7444572 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020566508 = queryNorm
              0.3071618 = fieldWeight in 4244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.19449608 = weight(abstract_txt:relevance in 4244) [ClassicSimilarity], result of:
            0.19449608 = score(doc=4244,freq=5.0), product of:
              0.28218645 = queryWeight, product of:
                2.7820563 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.020566508 = queryNorm
              0.6892467 = fieldWeight in 4244, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
          0.34055492 = weight(abstract_txt:feedback in 4244) [ClassicSimilarity], result of:
            0.34055492 = score(doc=4244,freq=5.0), product of:
              0.40994006 = queryWeight, product of:
                3.3531888 = boost
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.020566508 = queryNorm
              0.8307432 = fieldWeight in 4244, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.0625 = fieldNorm(doc=4244)
        0.36 = coord(9/25)
    
  5. Colace, F.; Santo, M. De; Greco, L.; Napoletano, P.: Weighted word pairs for query expansion (2015) 0.31
    0.3057165 = sum of:
      0.3057165 = product of:
        0.9553641 = sum of:
          0.0809509 = weight(abstract_txt:query in 2687) [ClassicSimilarity], result of:
            0.0809509 = score(doc=2687,freq=3.0), product of:
              0.1048702 = queryWeight, product of:
                1.0726397 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.020566508 = queryNorm
              0.7719152 = fieldWeight in 2687, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.09375 = fieldNorm(doc=2687)
          0.05765596 = weight(abstract_txt:effectiveness in 2687) [ClassicSimilarity], result of:
            0.05765596 = score(doc=2687,freq=1.0), product of:
              0.12062599 = queryWeight, product of:
                1.1503984 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.020566508 = queryNorm
              0.47797295 = fieldWeight in 2687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.09375 = fieldNorm(doc=2687)
          0.043154858 = weight(abstract_txt:text in 2687) [ClassicSimilarity], result of:
            0.043154858 = score(doc=2687,freq=1.0), product of:
              0.11383128 = queryWeight, product of:
                1.3686874 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020566508 = queryNorm
              0.37911248 = fieldWeight in 2687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=2687)
          0.14005937 = weight(abstract_txt:expansion in 2687) [ClassicSimilarity], result of:
            0.14005937 = score(doc=2687,freq=2.0), product of:
              0.17301199 = queryWeight, product of:
                1.3777357 = boost
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.020566508 = queryNorm
              0.8095356 = fieldWeight in 2687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.09375 = fieldNorm(doc=2687)
          0.06204435 = weight(abstract_txt:methods in 2687) [ClassicSimilarity], result of:
            0.06204435 = score(doc=2687,freq=1.0), product of:
              0.15959632 = queryWeight, product of:
                1.8713467 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.020566508 = queryNorm
              0.388758 = fieldWeight in 2687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.09375 = fieldNorm(doc=2687)
          0.063904695 = weight(abstract_txt:retrieval in 2687) [ClassicSimilarity], result of:
            0.063904695 = score(doc=2687,freq=1.0), product of:
              0.19615044 = queryWeight, product of:
                2.7444572 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020566508 = queryNorm
              0.3257943 = fieldWeight in 2687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=2687)
          0.18451518 = weight(abstract_txt:relevance in 2687) [ClassicSimilarity], result of:
            0.18451518 = score(doc=2687,freq=2.0), product of:
              0.28218645 = queryWeight, product of:
                2.7820563 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.020566508 = queryNorm
              0.65387684 = fieldWeight in 2687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.09375 = fieldNorm(doc=2687)
          0.32307878 = weight(abstract_txt:feedback in 2687) [ClassicSimilarity], result of:
            0.32307878 = score(doc=2687,freq=2.0), product of:
              0.40994006 = queryWeight, product of:
                3.3531888 = boost
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.020566508 = queryNorm
              0.7881122 = fieldWeight in 2687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9443145 = idf(docFreq=314, maxDocs=44218)
                0.09375 = fieldNorm(doc=2687)
        0.32 = coord(8/25)
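
The content scores combine three pieces that are visible in each tree: a per-term `queryWeight` (boost * idf * queryNorm), a per-term `fieldWeight` (tf * idf * fieldNorm), and a final `coord` factor (matched query terms divided by total query terms). The sketch below recomputes the score of the first hit (doc 4951) from the values in its listing. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the printed numbers; the helper names are illustrative only.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.020566508
FIELD_NORM = 0.109375  # fieldNorm(doc=4951)


def idf(doc_freq, max_docs=MAX_DOCS):
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost):
    """queryWeight * fieldWeight for one matching term."""
    i = idf(doc_freq)
    query_weight = boost * i * QUERY_NORM
    field_weight = math.sqrt(freq) * i * FIELD_NORM
    return query_weight * field_weight


# (term, freq, docFreq, boost) copied from the explain tree for doc 4951.
terms = [
    ("text",        1.0, 2106, 1.3686874),
    ("recognition", 1.0,  263, 1.3811289),
    ("documents",   1.0, 1949, 1.3948965),
    ("document",    1.0, 1642, 1.4528766),
    ("character",   1.0,  177, 2.2051027),
    ("errors",      2.0,  171, 2.2167082),
    ("retrieval",   1.0, 3720, 2.7444572),
    ("feedback",    1.0,  314, 3.3531888),
]

raw = sum(term_score(f, df, b) for _, f, df, b in terms)
coord = len(terms) / 25  # 8 of the 25 query terms matched
score = raw * coord
print(round(raw, 4), round(score, 4))  # compare with 1.1343775 and 0.36300078
```

The `coord` factor explains why the third hit, with the lowest raw sum (0.8782954), still ranks above the fourth and fifth: it matches 10 of the 25 query terms, so its sum is scaled by 0.4 rather than 0.36 or 0.32.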