Document (#36956)

Author
Stamatatos, E.
Title
Plagiarism detection using stopword n-grams
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.12, S.2512-2527
Year
2011
Abstract
In this paper a novel method for detecting plagiarized passages in document collections is presented. In contrast to previous work in this field that uses content terms to represent documents, the proposed method is based on a small list of stopwords (i.e., very frequent words). We show that stopword n-grams reveal important information for plagiarism detection since they are able to capture syntactic similarities between suspicious and original documents and they can be used to detect the exact plagiarized passage boundaries. Experimental results on a publicly available corpus demonstrate that the performance of the proposed approach is competitive when compared with the best reported results. More importantly, it achieves significantly better results when dealing with difficult plagiarism cases where the plagiarized passages are highly modified and most of the words or phrases have been replaced with synonyms.
Object
n-grams

Similar documents (author)

  1. Stamatatos, E.: Author identification : using text sampling to handle the class imbalance problem (2008) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:stamatatos in 2063) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 2063, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=2063)
    
  2. Stamatatos, E.: ¬A survey of modern authorship attribution methods (2009) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:stamatatos in 2741) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 2741, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=2741)
    
  3. Stamatatos, E.: Masking topic-related information to enhance authorship attribution (2018) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:stamatatos in 4124) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 4124, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=4124)
    
  4. Potha, N.; Stamatatos, E.: Improving author verification based on topic modeling (2019) 4.95
    4.952564 = sum of:
      4.952564 = weight(author_txt:stamatatos in 5385) [ClassicSimilarity], result of:
        4.952564 = fieldWeight in 5385, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.5 = fieldNorm(doc=5385)
    

Similar documents (content)

  1. Agarwal, B.; Ramampiaro, H.; Langseth, H.; Ruocco, M.: ¬A deep network model for paraphrase detection in short text messages (2018) 0.43
    0.42871553 = sum of:
      0.42871553 = product of:
        0.89315736 = sum of:
          0.051628377 = weight(abstract_txt:detect in 5043) [ClassicSimilarity], result of:
            0.051628377 = score(doc=5043,freq=1.0), product of:
              0.11631668 = queryWeight, product of:
                1.0228236 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.016013078 = queryNorm
              0.44386047 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.011505269 = weight(abstract_txt:that in 5043) [ClassicSimilarity], result of:
            0.011505269 = score(doc=5043,freq=4.0), product of:
              0.038844954 = queryWeight, product of:
                1.0237824 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016013078 = queryNorm
              0.2961844 = fieldWeight in 5043, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.060261138 = weight(abstract_txt:achieves in 5043) [ClassicSimilarity], result of:
            0.060261138 = score(doc=5043,freq=1.0), product of:
              0.128946 = queryWeight, product of:
                1.0769206 = boost
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.016013078 = queryNorm
              0.4673362 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.01169909 = weight(abstract_txt:with in 5043) [ClassicSimilarity], result of:
            0.01169909 = score(doc=5043,freq=3.0), product of:
              0.043233234 = queryWeight, product of:
                1.0800633 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.016013078 = queryNorm
              0.27060407 = fieldWeight in 5043, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.015227258 = weight(abstract_txt:they in 5043) [ClassicSimilarity], result of:
            0.015227258 = score(doc=5043,freq=1.0), product of:
              0.0649343 = queryWeight, product of:
                1.0807663 = boost
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.016013078 = queryNorm
              0.23450254 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.065537915 = weight(abstract_txt:detecting in 5043) [ClassicSimilarity], result of:
            0.065537915 = score(doc=5043,freq=1.0), product of:
              0.13636768 = queryWeight, product of:
                1.1074789 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.016013078 = queryNorm
              0.48059714 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.020579716 = weight(abstract_txt:when in 5043) [ClassicSimilarity], result of:
            0.020579716 = score(doc=5043,freq=1.0), product of:
              0.0793754 = queryWeight, product of:
                1.1949168 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.016013078 = queryNorm
              0.2592707 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.028231105 = weight(abstract_txt:proposed in 5043) [ClassicSimilarity], result of:
            0.028231105 = score(doc=5043,freq=1.0), product of:
              0.097996734 = queryWeight, product of:
                1.3277017 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.016013078 = queryNorm
              0.2880821 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.018262263 = weight(abstract_txt:results in 5043) [ClassicSimilarity], result of:
            0.018262263 = score(doc=5043,freq=1.0), product of:
              0.08390603 = queryWeight, product of:
                1.504655 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.016013078 = queryNorm
              0.21765138 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.15591282 = weight(abstract_txt:detection in 5043) [ClassicSimilarity], result of:
            0.15591282 = score(doc=5043,freq=3.0), product of:
              0.21229535 = queryWeight, product of:
                1.9541818 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.016013078 = queryNorm
              0.73441464 = fieldWeight in 5043, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.15897211 = weight(abstract_txt:grams in 5043) [ClassicSimilarity], result of:
            0.15897211 = score(doc=5043,freq=1.0), product of:
              0.31017515 = queryWeight, product of:
                2.3620996 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.016013078 = queryNorm
              0.5125237 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.29534033 = weight(abstract_txt:plagiarism in 5043) [ClassicSimilarity], result of:
            0.29534033 = score(doc=5043,freq=1.0), product of:
              0.5365851 = queryWeight, product of:
                3.8050437 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.016013078 = queryNorm
              0.55040723 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
        0.48 = coord(12/25)
    
  2. Vani, K.; Gupta, D.: Integrating syntax-semantic-based text analysis with structural and citation information for scientific plagiarism detection (2018) 0.37
    0.37140837 = sum of:
      0.37140837 = product of:
        1.1606512 = sum of:
          0.008135454 = weight(abstract_txt:that in 4543) [ClassicSimilarity], result of:
            0.008135454 = score(doc=4543,freq=2.0), product of:
              0.038844954 = queryWeight, product of:
                1.0237824 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016013078 = queryNorm
              0.20943399 = fieldWeight in 4543, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4543)
          0.013508946 = weight(abstract_txt:with in 4543) [ClassicSimilarity], result of:
            0.013508946 = score(doc=4543,freq=4.0), product of:
              0.043233234 = queryWeight, product of:
                1.0800633 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.016013078 = queryNorm
              0.31246668 = fieldWeight in 4543, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=4543)
          0.021534594 = weight(abstract_txt:they in 4543) [ClassicSimilarity], result of:
            0.021534594 = score(doc=4543,freq=2.0), product of:
              0.0649343 = queryWeight, product of:
                1.0807663 = boost
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.016013078 = queryNorm
              0.33163667 = fieldWeight in 4543, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.0625 = fieldNorm(doc=4543)
          0.08131799 = weight(abstract_txt:passage in 4543) [ClassicSimilarity], result of:
            0.08131799 = score(doc=4543,freq=1.0), product of:
              0.1574614 = queryWeight, product of:
                1.1900543 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.016013078 = queryNorm
              0.5164313 = fieldWeight in 4543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=4543)
          0.05646221 = weight(abstract_txt:proposed in 4543) [ClassicSimilarity], result of:
            0.05646221 = score(doc=4543,freq=4.0), product of:
              0.097996734 = queryWeight, product of:
                1.3277017 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.016013078 = queryNorm
              0.5761642 = fieldWeight in 4543, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=4543)
          0.018262263 = weight(abstract_txt:results in 4543) [ClassicSimilarity], result of:
            0.018262263 = score(doc=4543,freq=1.0), product of:
              0.08390603 = queryWeight, product of:
                1.504655 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.016013078 = queryNorm
              0.21765138 = fieldWeight in 4543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=4543)
          0.18003263 = weight(abstract_txt:detection in 4543) [ClassicSimilarity], result of:
            0.18003263 = score(doc=4543,freq=4.0), product of:
              0.21229535 = queryWeight, product of:
                1.9541818 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.016013078 = queryNorm
              0.848029 = fieldWeight in 4543, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=4543)
          0.78139704 = weight(abstract_txt:plagiarism in 4543) [ClassicSimilarity], result of:
            0.78139704 = score(doc=4543,freq=7.0), product of:
              0.5365851 = queryWeight, product of:
                3.8050437 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.016013078 = queryNorm
              1.4562407 = fieldWeight in 4543, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0625 = fieldNorm(doc=4543)
        0.32 = coord(8/25)
    
  3. Gipp, B.; Meuschke, N.; Breitinger, C.: Citation-based plagiarism detection : practicability on a large-scale scientific corpus (2014) 0.36
    0.3621847 = sum of:
      0.3621847 = product of:
        1.2935168 = sum of:
          0.0057526347 = weight(abstract_txt:that in 3332) [ClassicSimilarity], result of:
            0.0057526347 = score(doc=3332,freq=1.0), product of:
              0.038844954 = queryWeight, product of:
                1.0237824 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016013078 = queryNorm
              0.1480922 = fieldWeight in 3332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.060261138 = weight(abstract_txt:achieves in 3332) [ClassicSimilarity], result of:
            0.060261138 = score(doc=3332,freq=1.0), product of:
              0.128946 = queryWeight, product of:
                1.0769206 = boost
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.016013078 = queryNorm
              0.4673362 = fieldWeight in 3332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.009552266 = weight(abstract_txt:with in 3332) [ClassicSimilarity], result of:
            0.009552266 = score(doc=3332,freq=2.0), product of:
              0.043233234 = queryWeight, product of:
                1.0800633 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.016013078 = queryNorm
              0.22094731 = fieldWeight in 3332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.065537915 = weight(abstract_txt:detecting in 3332) [ClassicSimilarity], result of:
            0.065537915 = score(doc=3332,freq=1.0), product of:
              0.13636768 = queryWeight, product of:
                1.1074789 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.016013078 = queryNorm
              0.48059714 = fieldWeight in 3332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.028231105 = weight(abstract_txt:proposed in 3332) [ClassicSimilarity], result of:
            0.028231105 = score(doc=3332,freq=1.0), product of:
              0.097996734 = queryWeight, product of:
                1.3277017 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.016013078 = queryNorm
              0.2880821 = fieldWeight in 3332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.23816076 = weight(abstract_txt:detection in 3332) [ClassicSimilarity], result of:
            0.23816076 = score(doc=3332,freq=7.0), product of:
              0.21229535 = queryWeight, product of:
                1.9541818 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.016013078 = queryNorm
              1.1218369 = fieldWeight in 3332, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.88602096 = weight(abstract_txt:plagiarism in 3332) [ClassicSimilarity], result of:
            0.88602096 = score(doc=3332,freq=9.0), product of:
              0.5365851 = queryWeight, product of:
                3.8050437 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.016013078 = queryNorm
              1.6512218 = fieldWeight in 3332, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
        0.28 = coord(7/25)
    
  4. Alzahrani, S.; Palade, V.; Salim, N.; Abraham, A.: Using structural information and citation evidence to detect significant plagiarism cases in scientific publications (2012) 0.33
    0.33236733 = sum of:
      0.33236733 = product of:
        1.0386479 = sum of:
          0.0071185217 = weight(abstract_txt:that in 4982) [ClassicSimilarity], result of:
            0.0071185217 = score(doc=4982,freq=2.0), product of:
              0.038844954 = queryWeight, product of:
                1.0237824 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016013078 = queryNorm
              0.18325473 = fieldWeight in 4982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.0118203275 = weight(abstract_txt:with in 4982) [ClassicSimilarity], result of:
            0.0118203275 = score(doc=4982,freq=4.0), product of:
              0.043233234 = queryWeight, product of:
                1.0800633 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.016013078 = queryNorm
              0.27340835 = fieldWeight in 4982, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.01884277 = weight(abstract_txt:they in 4982) [ClassicSimilarity], result of:
            0.01884277 = score(doc=4982,freq=2.0), product of:
              0.0649343 = queryWeight, product of:
                1.0807663 = boost
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.016013078 = queryNorm
              0.29018208 = fieldWeight in 4982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.024971558 = weight(abstract_txt:documents in 4982) [ClassicSimilarity], result of:
            0.024971558 = score(doc=4982,freq=2.0), product of:
              0.07834442 = queryWeight, product of:
                1.1871313 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.016013078 = queryNorm
              0.31874073 = fieldWeight in 4982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.024702216 = weight(abstract_txt:proposed in 4982) [ClassicSimilarity], result of:
            0.024702216 = score(doc=4982,freq=1.0), product of:
              0.097996734 = queryWeight, product of:
                1.3277017 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.016013078 = queryNorm
              0.25207183 = fieldWeight in 4982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.022598399 = weight(abstract_txt:results in 4982) [ClassicSimilarity], result of:
            0.022598399 = score(doc=4982,freq=2.0), product of:
              0.08390603 = queryWeight, product of:
                1.504655 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.016013078 = queryNorm
              0.26932985 = fieldWeight in 4982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.111389495 = weight(abstract_txt:detection in 4982) [ClassicSimilarity], result of:
            0.111389495 = score(doc=4982,freq=2.0), product of:
              0.21229535 = queryWeight, product of:
                1.9541818 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.016013078 = queryNorm
              0.52469116 = fieldWeight in 4982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.8172046 = weight(abstract_txt:plagiarism in 4982) [ClassicSimilarity], result of:
            0.8172046 = score(doc=4982,freq=10.0), product of:
              0.5365851 = queryWeight, product of:
                3.8050437 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.016013078 = queryNorm
              1.522973 = fieldWeight in 4982, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
        0.32 = coord(8/25)
    
  5. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 0.31
    0.31110632 = sum of:
      0.31110632 = product of:
        1.111094 = sum of:
          0.04517483 = weight(abstract_txt:detect in 2765) [ClassicSimilarity], result of:
            0.04517483 = score(doc=2765,freq=1.0), product of:
              0.11631668 = queryWeight, product of:
                1.0228236 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.016013078 = queryNorm
              0.3883779 = fieldWeight in 2765, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.010067111 = weight(abstract_txt:that in 2765) [ClassicSimilarity], result of:
            0.010067111 = score(doc=2765,freq=4.0), product of:
              0.038844954 = queryWeight, product of:
                1.0237824 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016013078 = queryNorm
              0.25916135 = fieldWeight in 2765, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.0118203275 = weight(abstract_txt:with in 2765) [ClassicSimilarity], result of:
            0.0118203275 = score(doc=2765,freq=4.0), product of:
              0.043233234 = queryWeight, product of:
                1.0800633 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.016013078 = queryNorm
              0.27340835 = fieldWeight in 2765, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.030583786 = weight(abstract_txt:documents in 2765) [ClassicSimilarity], result of:
            0.030583786 = score(doc=2765,freq=3.0), product of:
              0.07834442 = queryWeight, product of:
                1.1871313 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.016013078 = queryNorm
              0.39037606 = fieldWeight in 2765, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.26623106 = weight(abstract_txt:passage in 2765) [ClassicSimilarity], result of:
            0.26623106 = score(doc=2765,freq=14.0), product of:
              0.1574614 = queryWeight, product of:
                1.1900543 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.016013078 = queryNorm
              1.6907703 = fieldWeight in 2765, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.20839067 = weight(abstract_txt:detection in 2765) [ClassicSimilarity], result of:
            0.20839067 = score(doc=2765,freq=7.0), product of:
              0.21229535 = queryWeight, product of:
                1.9541818 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.016013078 = queryNorm
              0.9816073 = fieldWeight in 2765, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
          0.53882617 = weight(abstract_txt:passages in 2765) [ClassicSimilarity], result of:
            0.53882617 = score(doc=2765,freq=14.0), product of:
              0.31742716 = queryWeight, product of:
                2.3895535 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.016013078 = queryNorm
              1.6974797 = fieldWeight in 2765, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2765)
        0.28 = coord(7/25)