Document (#39690)

Author
Gencosman, B.C.
Ozmutlu, H.C.
Ozmutlu, S.
Title
Character n-gram application for automatic new topic identification
Source
Information processing and management. 50(2014) no.6, S.821-856
Year
2014
Abstract
The widespread availability of the Internet and the variety of Internet-based applications have resulted in a significant increase in the amount of web pages. Determining the behaviors of search engine users has become a critical step in enhancing search engine performance. Search engine user behaviors can be determined by content-based or content-ignorant algorithms. Although many content-ignorant studies have been performed to automatically identify new topics, previous results have demonstrated that spelling errors can cause significant errors in topic shift estimates. In this study, we focused on minimizing the number of wrong estimates that were based on spelling errors. We developed a new hybrid algorithm combining character n-gram and neural network methodologies, and compared the experimental results with results from previous studies. For the FAST and Excite datasets, the proposed algorithm improved topic shift estimates by 6.987% and 2.639%, respectively. Moreover, we analyzed the performance of the character n-gram method in different aspects including the comparison with Levenshtein edit-distance method. The experimental results demonstrated that the character n-gram method outperformed to the Levensthein edit distance method in terms of topic identification.
Content
Vgl.: doi: 10.1016/j.ipm.2014.06.005.
Theme
Computerlinguistik
Suchmaschinen
Object
n-grams

Similar documents (author)

  1. Spink, A.; Ozmutlu, H.C.; Ozmutlu, S.: Multitasking information seeking and searching processes (2002) 5.17
    5.1696143 = sum of:
      5.1696143 = weight(author_txt:ozmutlu in 1601) [ClassicSimilarity], result of:
        5.1696143 = score(doc=1601,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            9.747919 = idf(docFreq=6, maxDocs=44083)
            0.10258599 = queryNorm
          5.169615 = fieldWeight in 1601, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            9.747919 = idf(docFreq=6, maxDocs=44083)
            0.375 = fieldNorm(doc=1601)
    
  2. Ozmutlu, S.; Spink, A.; Ozmutlu, H.C.: ¬A day in the life of Web searching : an exploratory study (2004) 5.17
    5.1696143 = sum of:
      5.1696143 = weight(author_txt:ozmutlu in 3531) [ClassicSimilarity], result of:
        5.1696143 = score(doc=3531,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            9.747919 = idf(docFreq=6, maxDocs=44083)
            0.10258599 = queryNorm
          5.169615 = fieldWeight in 3531, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            9.747919 = idf(docFreq=6, maxDocs=44083)
            0.375 = fieldNorm(doc=3531)
    
  3. Ozmutlu, S.; Spink, A.; Ozmutlu, H.C.: Multimedia Web searching trends : 1997-2001 (2003) 5.17
    5.1696143 = sum of:
      5.1696143 = weight(author_txt:ozmutlu in 2073) [ClassicSimilarity], result of:
        5.1696143 = score(doc=2073,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            9.747919 = idf(docFreq=6, maxDocs=44083)
            0.10258599 = queryNorm
          5.169615 = fieldWeight in 2073, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            9.747919 = idf(docFreq=6, maxDocs=44083)
            0.375 = fieldNorm(doc=2073)
    
  4. Ozmutlu, H.C.; Cavdur, F.; Ozmutlu, S.: Cross-validation of neural network applications for automatic new topic identification (2008) 5.17
    5.1696143 = sum of:
      5.1696143 = weight(author_txt:ozmutlu in 2365) [ClassicSimilarity], result of:
        5.1696143 = score(doc=2365,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            9.747919 = idf(docFreq=6, maxDocs=44083)
            0.10258599 = queryNorm
          5.169615 = fieldWeight in 2365, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            9.747919 = idf(docFreq=6, maxDocs=44083)
            0.375 = fieldNorm(doc=2365)
    
  5. Ozmutlu, S.; Cosar, G.C.: Analyzing the results of automatic new topic identification (2008) 4.87
    4.873959 = sum of:
      4.873959 = weight(author_txt:ozmutlu in 3605) [ClassicSimilarity], result of:
        4.873959 = score(doc=3605,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.747919 = idf(docFreq=6, maxDocs=44083)
            0.10258599 = queryNorm
          4.8739595 = fieldWeight in 3605, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.747919 = idf(docFreq=6, maxDocs=44083)
            0.5 = fieldNorm(doc=3605)
    

Similar documents (content)

  1. Ozmutlu, S.; Cosar, G.C.: Analyzing the results of automatic new topic identification (2008) 0.23
    0.23155738 = sum of:
      0.23155738 = product of:
        0.72361684 = sum of:
          0.034599133 = weight(abstract_txt:performance in 3605) [ClassicSimilarity], result of:
            0.034599133 = score(doc=3605,freq=2.0), product of:
              0.0845466 = queryWeight, product of:
                1.0645835 = boost
                4.6299257 = idf(docFreq=1168, maxDocs=44083)
                0.017153092 = queryNorm
              0.40923148 = fieldWeight in 3605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6299257 = idf(docFreq=1168, maxDocs=44083)
                0.0625 = fieldNorm(doc=3605)
          0.01729593 = weight(abstract_txt:have in 3605) [ClassicSimilarity], result of:
            0.01729593 = score(doc=3605,freq=2.0), product of:
              0.06096011 = queryWeight, product of:
                1.107134 = boost
                3.2099862 = idf(docFreq=4835, maxDocs=44083)
                0.017153092 = queryNorm
              0.28372538 = fieldWeight in 3605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2099862 = idf(docFreq=4835, maxDocs=44083)
                0.0625 = fieldNorm(doc=3605)
          0.040456265 = weight(abstract_txt:search in 3605) [ClassicSimilarity], result of:
            0.040456265 = score(doc=3605,freq=5.0), product of:
              0.07914564 = queryWeight, product of:
                1.2615103 = boost
                3.6575797 = idf(docFreq=3090, maxDocs=44083)
                0.017153092 = queryNorm
              0.5111623 = fieldWeight in 3605, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6575797 = idf(docFreq=3090, maxDocs=44083)
                0.0625 = fieldNorm(doc=3605)
          0.13981397 = weight(abstract_txt:identification in 3605) [ClassicSimilarity], result of:
            0.13981397 = score(doc=3605,freq=8.0), product of:
              0.13512419 = queryWeight, product of:
                1.3458549 = boost
                5.853188 = idf(docFreq=343, maxDocs=44083)
                0.017153092 = queryNorm
              1.0347072 = fieldWeight in 3605, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.853188 = idf(docFreq=343, maxDocs=44083)
                0.0625 = fieldNorm(doc=3605)
          0.027113756 = weight(abstract_txt:content in 3605) [ClassicSimilarity], result of:
            0.027113756 = score(doc=3605,freq=1.0), product of:
              0.103646085 = queryWeight, product of:
                1.4436228 = boost
                4.1855907 = idf(docFreq=1822, maxDocs=44083)
                0.017153092 = queryNorm
              0.26159942 = fieldWeight in 3605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1855907 = idf(docFreq=1822, maxDocs=44083)
                0.0625 = fieldNorm(doc=3605)
          0.12467 = weight(abstract_txt:engine in 3605) [ClassicSimilarity], result of:
            0.12467 = score(doc=3605,freq=4.0), product of:
              0.18054318 = queryWeight, product of:
                1.9053196 = boost
                5.5242186 = idf(docFreq=477, maxDocs=44083)
                0.017153092 = queryNorm
              0.6905273 = fieldWeight in 3605, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5242186 = idf(docFreq=477, maxDocs=44083)
                0.0625 = fieldNorm(doc=3605)
          0.14709203 = weight(abstract_txt:errors in 3605) [ClassicSimilarity], result of:
            0.14709203 = score(doc=3605,freq=2.0), product of:
              0.25398567 = queryWeight, product of:
                2.259862 = boost
                6.552166 = idf(docFreq=170, maxDocs=44083)
                0.017153092 = queryNorm
              0.5791351 = fieldWeight in 3605, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.552166 = idf(docFreq=170, maxDocs=44083)
                0.0625 = fieldNorm(doc=3605)
          0.19257572 = weight(abstract_txt:topic in 3605) [ClassicSimilarity], result of:
            0.19257572 = score(doc=3605,freq=9.0), product of:
              0.20264049 = queryWeight, product of:
                2.3308256 = boost
                5.068437 = idf(docFreq=753, maxDocs=44083)
                0.017153092 = queryNorm
              0.9503319 = fieldWeight in 3605, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.068437 = idf(docFreq=753, maxDocs=44083)
                0.0625 = fieldNorm(doc=3605)
        0.32 = coord(8/25)
    
  2. Willson, R.; Given, L.M.: ¬The effect of spelling and retrieval system familiarity on search behavior in online public access catalogs : a mixed methods study (2010) 0.23
    0.2304435 = sum of:
      0.2304435 = product of:
        0.720136 = sum of:
          0.012230069 = weight(abstract_txt:have in 43) [ClassicSimilarity], result of:
            0.012230069 = score(doc=43,freq=1.0), product of:
              0.06096011 = queryWeight, product of:
                1.107134 = boost
                3.2099862 = idf(docFreq=4835, maxDocs=44083)
                0.017153092 = queryNorm
              0.20062414 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2099862 = idf(docFreq=4835, maxDocs=44083)
                0.0625 = fieldNorm(doc=43)
          0.06769629 = weight(abstract_txt:search in 43) [ClassicSimilarity], result of:
            0.06769629 = score(doc=43,freq=14.0), product of:
              0.07914564 = queryWeight, product of:
                1.2615103 = boost
                3.6575797 = idf(docFreq=3090, maxDocs=44083)
                0.017153092 = queryNorm
              0.85533816 = fieldWeight in 43, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                3.6575797 = idf(docFreq=3090, maxDocs=44083)
                0.0625 = fieldNorm(doc=43)
          0.09831582 = weight(abstract_txt:behaviors in 43) [ClassicSimilarity], result of:
            0.09831582 = score(doc=43,freq=3.0), product of:
              0.14817373 = queryWeight, product of:
                1.4093448 = boost
                6.129309 = idf(docFreq=260, maxDocs=44083)
                0.017153092 = queryNorm
              0.6635172 = fieldWeight in 43, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.129309 = idf(docFreq=260, maxDocs=44083)
                0.0625 = fieldNorm(doc=43)
          0.020897869 = weight(abstract_txt:results in 43) [ClassicSimilarity], result of:
            0.020897869 = score(doc=43,freq=1.0), product of:
              0.09589752 = queryWeight, product of:
                1.6034312 = boost
                3.4867003 = idf(docFreq=3666, maxDocs=44083)
                0.017153092 = queryNorm
              0.21791877 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4867003 = idf(docFreq=3666, maxDocs=44083)
                0.0625 = fieldNorm(doc=43)
          0.22078778 = weight(abstract_txt:spelling in 43) [ClassicSimilarity], result of:
            0.22078778 = score(doc=43,freq=4.0), product of:
              0.23086569 = queryWeight, product of:
                1.7591844 = boost
                7.6507783 = idf(docFreq=56, maxDocs=44083)
                0.017153092 = queryNorm
              0.9563473 = fieldWeight in 43, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.6507783 = idf(docFreq=56, maxDocs=44083)
                0.0625 = fieldNorm(doc=43)
          0.062335 = weight(abstract_txt:engine in 43) [ClassicSimilarity], result of:
            0.062335 = score(doc=43,freq=1.0), product of:
              0.18054318 = queryWeight, product of:
                1.9053196 = boost
                5.5242186 = idf(docFreq=477, maxDocs=44083)
                0.017153092 = queryNorm
              0.34526366 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5242186 = idf(docFreq=477, maxDocs=44083)
                0.0625 = fieldNorm(doc=43)
          0.14709203 = weight(abstract_txt:errors in 43) [ClassicSimilarity], result of:
            0.14709203 = score(doc=43,freq=2.0), product of:
              0.25398567 = queryWeight, product of:
                2.259862 = boost
                6.552166 = idf(docFreq=170, maxDocs=44083)
                0.017153092 = queryNorm
              0.5791351 = fieldWeight in 43, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.552166 = idf(docFreq=170, maxDocs=44083)
                0.0625 = fieldNorm(doc=43)
          0.09078107 = weight(abstract_txt:topic in 43) [ClassicSimilarity], result of:
            0.09078107 = score(doc=43,freq=2.0), product of:
              0.20264049 = queryWeight, product of:
                2.3308256 = boost
                5.068437 = idf(docFreq=753, maxDocs=44083)
                0.017153092 = queryNorm
              0.44799078 = fieldWeight in 43, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.068437 = idf(docFreq=753, maxDocs=44083)
                0.0625 = fieldNorm(doc=43)
        0.32 = coord(8/25)
    
  3. Bellaachia, A.; Amor-Tijani, G.: Proper nouns in English-Arabic cross language information retrieval (2008) 0.20
    0.19867885 = sum of:
      0.19867885 = product of:
        0.7095673 = sum of:
          0.034599133 = weight(abstract_txt:performance in 3373) [ClassicSimilarity], result of:
            0.034599133 = score(doc=3373,freq=2.0), product of:
              0.0845466 = queryWeight, product of:
                1.0645835 = boost
                4.6299257 = idf(docFreq=1168, maxDocs=44083)
                0.017153092 = queryNorm
              0.40923148 = fieldWeight in 3373, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6299257 = idf(docFreq=1168, maxDocs=44083)
                0.0625 = fieldNorm(doc=3373)
          0.012022812 = weight(abstract_txt:based in 3373) [ClassicSimilarity], result of:
            0.012022812 = score(doc=3373,freq=1.0), product of:
              0.06026944 = queryWeight, product of:
                1.1008443 = boost
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.017153092 = queryNorm
              0.19948438 = fieldWeight in 3373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.0625 = fieldNorm(doc=3373)
          0.012230069 = weight(abstract_txt:have in 3373) [ClassicSimilarity], result of:
            0.012230069 = score(doc=3373,freq=1.0), product of:
              0.06096011 = queryWeight, product of:
                1.107134 = boost
                3.2099862 = idf(docFreq=4835, maxDocs=44083)
                0.017153092 = queryNorm
              0.20062414 = fieldWeight in 3373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2099862 = idf(docFreq=4835, maxDocs=44083)
                0.0625 = fieldNorm(doc=3373)
          0.038784906 = weight(abstract_txt:experimental in 3373) [ClassicSimilarity], result of:
            0.038784906 = score(doc=3373,freq=1.0), product of:
              0.114948824 = queryWeight, product of:
                1.2413205 = boost
                5.3985634 = idf(docFreq=541, maxDocs=44083)
                0.017153092 = queryNorm
              0.3374102 = fieldWeight in 3373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3985634 = idf(docFreq=541, maxDocs=44083)
                0.0625 = fieldNorm(doc=3373)
          0.029554049 = weight(abstract_txt:results in 3373) [ClassicSimilarity], result of:
            0.029554049 = score(doc=3373,freq=2.0), product of:
              0.09589752 = queryWeight, product of:
                1.6034312 = boost
                3.4867003 = idf(docFreq=3666, maxDocs=44083)
                0.017153092 = queryNorm
              0.30818367 = fieldWeight in 3373, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4867003 = idf(docFreq=3666, maxDocs=44083)
                0.0625 = fieldNorm(doc=3373)
          0.15612054 = weight(abstract_txt:spelling in 3373) [ClassicSimilarity], result of:
            0.15612054 = score(doc=3373,freq=2.0), product of:
              0.23086569 = queryWeight, product of:
                1.7591844 = boost
                7.6507783 = idf(docFreq=56, maxDocs=44083)
                0.017153092 = queryNorm
              0.67623967 = fieldWeight in 3373, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6507783 = idf(docFreq=56, maxDocs=44083)
                0.0625 = fieldNorm(doc=3373)
          0.42625582 = weight(abstract_txt:gram in 3373) [ClassicSimilarity], result of:
            0.42625582 = score(doc=3373,freq=3.0), product of:
              0.49637797 = queryWeight, product of:
                3.6479838 = boost
                7.9326296 = idf(docFreq=42, maxDocs=44083)
                0.017153092 = queryNorm
              0.85873234 = fieldWeight in 3373, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.9326296 = idf(docFreq=42, maxDocs=44083)
                0.0625 = fieldNorm(doc=3373)
        0.28 = coord(7/25)
    
  4. Doval, Y.; Gómez-Rodríguez, C.: Comparing neural- and N-gram-based language models for word segmentation (2019) 0.20
    0.19690575 = sum of:
      0.19690575 = product of:
        0.61533046 = sum of:
          0.024465282 = weight(abstract_txt:performance in 5676) [ClassicSimilarity], result of:
            0.024465282 = score(doc=5676,freq=1.0), product of:
              0.0845466 = queryWeight, product of:
                1.0645835 = boost
                4.6299257 = idf(docFreq=1168, maxDocs=44083)
                0.017153092 = queryNorm
              0.28937036 = fieldWeight in 5676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6299257 = idf(docFreq=1168, maxDocs=44083)
                0.0625 = fieldNorm(doc=5676)
          0.012022812 = weight(abstract_txt:based in 5676) [ClassicSimilarity], result of:
            0.012022812 = score(doc=5676,freq=1.0), product of:
              0.06026944 = queryWeight, product of:
                1.1008443 = boost
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.017153092 = queryNorm
              0.19948438 = fieldWeight in 5676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.0625 = fieldNorm(doc=5676)
          0.012230069 = weight(abstract_txt:have in 5676) [ClassicSimilarity], result of:
            0.012230069 = score(doc=5676,freq=1.0), product of:
              0.06096011 = queryWeight, product of:
                1.107134 = boost
                3.2099862 = idf(docFreq=4835, maxDocs=44083)
                0.017153092 = queryNorm
              0.20062414 = fieldWeight in 5676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2099862 = idf(docFreq=4835, maxDocs=44083)
                0.0625 = fieldNorm(doc=5676)
          0.018092593 = weight(abstract_txt:search in 5676) [ClassicSimilarity], result of:
            0.018092593 = score(doc=5676,freq=1.0), product of:
              0.07914564 = queryWeight, product of:
                1.2615103 = boost
                3.6575797 = idf(docFreq=3090, maxDocs=44083)
                0.017153092 = queryNorm
              0.22859873 = fieldWeight in 5676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6575797 = idf(docFreq=3090, maxDocs=44083)
                0.0625 = fieldNorm(doc=5676)
          0.0457081 = weight(abstract_txt:algorithm in 5676) [ClassicSimilarity], result of:
            0.0457081 = score(doc=5676,freq=1.0), product of:
              0.12825023 = queryWeight, product of:
                1.3111752 = boost
                5.702365 = idf(docFreq=399, maxDocs=44083)
                0.017153092 = queryNorm
              0.3563978 = fieldWeight in 5676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.702365 = idf(docFreq=399, maxDocs=44083)
                0.0625 = fieldNorm(doc=5676)
          0.020897869 = weight(abstract_txt:results in 5676) [ClassicSimilarity], result of:
            0.020897869 = score(doc=5676,freq=1.0), product of:
              0.09589752 = queryWeight, product of:
                1.6034312 = boost
                3.4867003 = idf(docFreq=3666, maxDocs=44083)
                0.017153092 = queryNorm
              0.21791877 = fieldWeight in 5676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4867003 = idf(docFreq=3666, maxDocs=44083)
                0.0625 = fieldNorm(doc=5676)
          0.23581484 = weight(abstract_txt:character in 5676) [ClassicSimilarity], result of:
            0.23581484 = score(doc=5676,freq=3.0), product of:
              0.33451304 = queryWeight, product of:
                2.994699 = boost
                6.512046 = idf(docFreq=177, maxDocs=44083)
                0.017153092 = queryNorm
              0.7049496 = fieldWeight in 5676, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.512046 = idf(docFreq=177, maxDocs=44083)
                0.0625 = fieldNorm(doc=5676)
          0.2460989 = weight(abstract_txt:gram in 5676) [ClassicSimilarity], result of:
            0.2460989 = score(doc=5676,freq=1.0), product of:
              0.49637797 = queryWeight, product of:
                3.6479838 = boost
                7.9326296 = idf(docFreq=42, maxDocs=44083)
                0.017153092 = queryNorm
              0.49578935 = fieldWeight in 5676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9326296 = idf(docFreq=42, maxDocs=44083)
                0.0625 = fieldNorm(doc=5676)
        0.32 = coord(8/25)
    
  5. Lam-Adesina, A.M.; Jones, G.J.F.: Examining and improving the effectiveness of relevance feedback for retrieval of scanned text documents (2006) 0.19
    0.19115444 = sum of:
      0.19115444 = product of:
        0.68269444 = sum of:
          0.012022812 = weight(abstract_txt:based in 1978) [ClassicSimilarity], result of:
            0.012022812 = score(doc=1978,freq=1.0), product of:
              0.06026944 = queryWeight, product of:
                1.1008443 = boost
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.017153092 = queryNorm
              0.19948438 = fieldWeight in 1978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.0625 = fieldNorm(doc=1978)
          0.012230069 = weight(abstract_txt:have in 1978) [ClassicSimilarity], result of:
            0.012230069 = score(doc=1978,freq=1.0), product of:
              0.06096011 = queryWeight, product of:
                1.107134 = boost
                3.2099862 = idf(docFreq=4835, maxDocs=44083)
                0.017153092 = queryNorm
              0.20062414 = fieldWeight in 1978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2099862 = idf(docFreq=4835, maxDocs=44083)
                0.0625 = fieldNorm(doc=1978)
          0.038784906 = weight(abstract_txt:experimental in 1978) [ClassicSimilarity], result of:
            0.038784906 = score(doc=1978,freq=1.0), product of:
              0.114948824 = queryWeight, product of:
                1.2413205 = boost
                5.3985634 = idf(docFreq=541, maxDocs=44083)
                0.017153092 = queryNorm
              0.3374102 = fieldWeight in 1978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3985634 = idf(docFreq=541, maxDocs=44083)
                0.0625 = fieldNorm(doc=1978)
          0.066374116 = weight(abstract_txt:distance in 1978) [ClassicSimilarity], result of:
            0.066374116 = score(doc=1978,freq=1.0), product of:
              0.1644606 = queryWeight, product of:
                1.4847816 = boost
                6.4573874 = idf(docFreq=187, maxDocs=44083)
                0.017153092 = queryNorm
              0.40358672 = fieldWeight in 1978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4573874 = idf(docFreq=187, maxDocs=44083)
                0.0625 = fieldNorm(doc=1978)
          0.13731746 = weight(abstract_txt:edit in 1978) [ClassicSimilarity], result of:
            0.13731746 = score(doc=1978,freq=1.0), product of:
              0.2670217 = queryWeight, product of:
                1.8919295 = boost
                8.228093 = idf(docFreq=31, maxDocs=44083)
                0.017153092 = queryNorm
              0.5142558 = fieldWeight in 1978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.228093 = idf(docFreq=31, maxDocs=44083)
                0.0625 = fieldNorm(doc=1978)
          0.1801502 = weight(abstract_txt:errors in 1978) [ClassicSimilarity], result of:
            0.1801502 = score(doc=1978,freq=3.0), product of:
              0.25398567 = queryWeight, product of:
                2.259862 = boost
                6.552166 = idf(docFreq=170, maxDocs=44083)
                0.017153092 = queryNorm
              0.70929277 = fieldWeight in 1978, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.552166 = idf(docFreq=170, maxDocs=44083)
                0.0625 = fieldNorm(doc=1978)
          0.23581484 = weight(abstract_txt:character in 1978) [ClassicSimilarity], result of:
            0.23581484 = score(doc=1978,freq=3.0), product of:
              0.33451304 = queryWeight, product of:
                2.994699 = boost
                6.512046 = idf(docFreq=177, maxDocs=44083)
                0.017153092 = queryNorm
              0.7049496 = fieldWeight in 1978, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.512046 = idf(docFreq=177, maxDocs=44083)
                0.0625 = fieldNorm(doc=1978)
        0.28 = coord(7/25)