Document (#41544)

Author
Vani, K.
Gupta, D.
Title
Integrating syntax-semantic-based text analysis with structural and citation information for scientific plagiarism detection
Source
Journal of the Association for Information Science and Technology. 69(2018) no.11, S.1330-1345
Year
2018
Abstract
The objective of the work is to explore the potency of integrating structural and citation information with effective syntax-semantic text-based analysis for scientific plagiarism detection. One of the major limitations in today's plagiarism checkers is their sole dependence on text-based detection, where they ignore the citation and structural information. Further, the text-based detection approaches that they employ usually fail to trace out intelligent manipulations. In the proposed work, a plagiarism detection system is presented that employs the effective coupling of various modules, namely, logical structure classifications and citation parsing, two-stage candidate document selections, syntax-semantic-based exhaustive passage level analysis with plagiarism analysis using structural and citation information. Further, a new plagiarism score, namely, weighted overall similarity index is proposed, opposed to the general plagiarism scores. The proposed approach is evaluated on the data set created by Alzahrani et al. (2011),1 which contains scientific publications imposed with various plagiarism complexities. Comparison of the final system results is done against a potential baseline approach. The proposed approach exhibits considerable improvement over the comparative baseline, and hence reflects the potency of syntax-semantic text-based analysis with structural and citation information.
Content
Vgl.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24027.

Similar documents (author)

  1. Gupta, S.: Decimal Classification System : a bibliography for the period 1876-1994 (1997) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:gupta in 3935) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 3935, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=3935)
    
  2. Gupta, S.: Cataloging Ethiopian personal names (1991) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:gupta in 527) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 527, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=527)
    
  3. Gupta, S.: Communication clothing design (2009) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:gupta in 3105) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 3105, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=3105)
    
  4. Gupta, U.; Salisbury, L.: Is FirstSearch really attractive? (1992) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:gupta in 3863) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 3863, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=3863)
    
  5. Berkley, B.J.; Gupta, A.: Improving service quality with information technology (1994) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:gupta in 8021) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 8021, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=8021)
    

Similar documents (content)

  1. Gipp, B.; Meuschke, N.; Breitinger, C.: Citation-based plagiarism detection : practicability on a large-scale scientific corpus (2014) 0.95
    0.9461453 = sum of:
      0.9461453 = product of:
        1.9711361 = sum of:
          0.013235076 = weight(abstract_txt:various in 3332) [ClassicSimilarity], result of:
            0.013235076 = score(doc=3332,freq=1.0), product of:
              0.048196368 = queryWeight, product of:
                1.0634806 = boost
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.010314605 = queryNorm
              0.27460733 = fieldWeight in 3332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.017390255 = weight(abstract_txt:approach in 3332) [ClassicSimilarity], result of:
            0.017390255 = score(doc=3332,freq=2.0), product of:
              0.052531656 = queryWeight, product of:
                1.3598112 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.010314605 = queryNorm
              0.33104333 = fieldWeight in 3332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.007827816 = weight(abstract_txt:information in 3332) [ClassicSimilarity], result of:
            0.007827816 = score(doc=3332,freq=2.0), product of:
              0.03658141 = queryWeight, product of:
                1.4649495 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.010314605 = queryNorm
              0.21398345 = fieldWeight in 3332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.008617201 = weight(abstract_txt:with in 3332) [ClassicSimilarity], result of:
            0.008617201 = score(doc=3332,freq=2.0), product of:
              0.039001156 = queryWeight, product of:
                1.5126246 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.010314605 = queryNorm
              0.22094731 = fieldWeight in 3332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.027953852 = weight(abstract_txt:semantic in 3332) [ClassicSimilarity], result of:
            0.027953852 = score(doc=3332,freq=1.0), product of:
              0.09996189 = queryWeight, product of:
                2.165981 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.010314605 = queryNorm
              0.2796451 = fieldWeight in 3332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.030561091 = weight(abstract_txt:proposed in 3332) [ClassicSimilarity], result of:
            0.030561091 = score(doc=3332,freq=1.0), product of:
              0.10608466 = queryWeight, product of:
                2.2313297 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.010314605 = queryNorm
              0.2880821 = fieldWeight in 3332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.040125992 = weight(abstract_txt:based in 3332) [ClassicSimilarity], result of:
            0.040125992 = score(doc=3332,freq=7.0), product of:
              0.076118164 = queryWeight, product of:
                2.3148732 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.010314605 = queryNorm
              0.52715397 = fieldWeight in 3332, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.03648174 = weight(abstract_txt:text in 3332) [ClassicSimilarity], result of:
            0.03648174 = score(doc=3332,freq=2.0), product of:
              0.10206662 = queryWeight, product of:
                2.447002 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.010314605 = queryNorm
              0.3574307 = fieldWeight in 3332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.07788492 = weight(abstract_txt:structural in 3332) [ClassicSimilarity], result of:
            0.07788492 = score(doc=3332,freq=1.0), product of:
              0.21321231 = queryWeight, product of:
                3.5367029 = boost
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.010314605 = queryNorm
              0.3652928 = fieldWeight in 3332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.10992498 = weight(abstract_txt:citation in 3332) [ClassicSimilarity], result of:
            0.10992498 = score(doc=3332,freq=4.0), product of:
              0.17958967 = queryWeight, product of:
                3.5556889 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.010314605 = queryNorm
              0.61208963 = fieldWeight in 3332, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          0.322271 = weight(abstract_txt:detection in 3332) [ClassicSimilarity], result of:
            0.322271 = score(doc=3332,freq=7.0), product of:
              0.2872708 = queryWeight, product of:
                4.1052365 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.010314605 = queryNorm
              1.1218369 = fieldWeight in 3332, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
          1.2788621 = weight(abstract_txt:plagiarism in 3332) [ClassicSimilarity], result of:
            1.2788621 = score(doc=3332,freq=9.0), product of:
              0.77449447 = queryWeight, product of:
                8.5263195 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.010314605 = queryNorm
              1.6512218 = fieldWeight in 3332, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0625 = fieldNorm(doc=3332)
        0.48 = coord(12/25)
    
  2. K., Vani; Gupta, D.: Unmasking text plagiarism using syntactic-semantic based natural language processing techniques : comparisons, analysis and challenges (2018) 0.76
    0.76195055 = sum of:
      0.76195055 = product of:
        1.4652896 = sum of:
          0.018717224 = weight(abstract_txt:various in 5084) [ClassicSimilarity], result of:
            0.018717224 = score(doc=5084,freq=2.0), product of:
              0.048196368 = queryWeight, product of:
                1.0634806 = boost
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.010314605 = queryNorm
              0.3883534 = fieldWeight in 5084, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.015905855 = weight(abstract_txt:further in 5084) [ClassicSimilarity], result of:
            0.015905855 = score(doc=5084,freq=1.0), product of:
              0.0544797 = queryWeight, product of:
                1.1306802 = boost
                4.671349 = idf(docFreq=1124, maxDocs=44218)
                0.010314605 = queryNorm
              0.29195932 = fieldWeight in 5084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.671349 = idf(docFreq=1124, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.017650183 = weight(abstract_txt:effective in 5084) [ClassicSimilarity], result of:
            0.017650183 = score(doc=5084,freq=1.0), product of:
              0.058393274 = queryWeight, product of:
                1.1705874 = boost
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.010314605 = queryNorm
              0.30226398 = fieldWeight in 5084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.021298626 = weight(abstract_txt:approach in 5084) [ClassicSimilarity], result of:
            0.021298626 = score(doc=5084,freq=3.0), product of:
              0.052531656 = queryWeight, product of:
                1.3598112 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.010314605 = queryNorm
              0.40544364 = fieldWeight in 5084, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.0055351015 = weight(abstract_txt:information in 5084) [ClassicSimilarity], result of:
            0.0055351015 = score(doc=5084,freq=1.0), product of:
              0.03658141 = queryWeight, product of:
                1.4649495 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.010314605 = queryNorm
              0.15130915 = fieldWeight in 5084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.008617201 = weight(abstract_txt:with in 5084) [ClassicSimilarity], result of:
            0.008617201 = score(doc=5084,freq=2.0), product of:
              0.039001156 = queryWeight, product of:
                1.5126246 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.010314605 = queryNorm
              0.22094731 = fieldWeight in 5084, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.055907704 = weight(abstract_txt:semantic in 5084) [ClassicSimilarity], result of:
            0.055907704 = score(doc=5084,freq=4.0), product of:
              0.09996189 = queryWeight, product of:
                2.165981 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.010314605 = queryNorm
              0.5592902 = fieldWeight in 5084, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.026904726 = weight(abstract_txt:analysis in 5084) [ClassicSimilarity], result of:
            0.026904726 = score(doc=5084,freq=2.0), product of:
              0.08331421 = queryWeight, product of:
                2.2108128 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.010314605 = queryNorm
              0.3229308 = fieldWeight in 5084, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.05293336 = weight(abstract_txt:proposed in 5084) [ClassicSimilarity], result of:
            0.05293336 = score(doc=5084,freq=3.0), product of:
              0.10608466 = queryWeight, product of:
                2.2313297 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.010314605 = queryNorm
              0.4989728 = fieldWeight in 5084, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.015166201 = weight(abstract_txt:based in 5084) [ClassicSimilarity], result of:
            0.015166201 = score(doc=5084,freq=1.0), product of:
              0.076118164 = queryWeight, product of:
                2.3148732 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.010314605 = queryNorm
              0.19924548 = fieldWeight in 5084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.15163879 = weight(abstract_txt:potency in 5084) [ClassicSimilarity], result of:
            0.15163879 = score(doc=5084,freq=1.0), product of:
              0.24494593 = queryWeight, product of:
                2.397494 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.010314605 = queryNorm
              0.6190705 = fieldWeight in 5084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.121806994 = weight(abstract_txt:detection in 5084) [ClassicSimilarity], result of:
            0.121806994 = score(doc=5084,freq=1.0), product of:
              0.2872708 = queryWeight, product of:
                4.1052365 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.010314605 = queryNorm
              0.4240145 = fieldWeight in 5084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
          0.95320755 = weight(abstract_txt:plagiarism in 5084) [ClassicSimilarity], result of:
            0.95320755 = score(doc=5084,freq=5.0), product of:
              0.77449447 = queryWeight, product of:
                8.5263195 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.010314605 = queryNorm
              1.230748 = fieldWeight in 5084, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0625 = fieldNorm(doc=5084)
        0.52 = coord(13/25)
    
  3. Alzahrani, S.; Palade, V.; Salim, N.; Abraham, A.: Using structural information and citation evidence to detect significant plagiarism cases in scientific publications (2012) 0.74
    0.74414426 = sum of:
      0.74414426 = product of:
        1.691237 = sum of:
          0.010759672 = weight(abstract_txt:approach in 4982) [ClassicSimilarity], result of:
            0.010759672 = score(doc=4982,freq=1.0), product of:
              0.052531656 = queryWeight, product of:
                1.3598112 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.010314605 = queryNorm
              0.20482263 = fieldWeight in 4982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.0068493388 = weight(abstract_txt:information in 4982) [ClassicSimilarity], result of:
            0.0068493388 = score(doc=4982,freq=2.0), product of:
              0.03658141 = queryWeight, product of:
                1.4649495 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.010314605 = queryNorm
              0.18723552 = fieldWeight in 4982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.010663242 = weight(abstract_txt:with in 4982) [ClassicSimilarity], result of:
            0.010663242 = score(doc=4982,freq=4.0), product of:
              0.039001156 = queryWeight, product of:
                1.5126246 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.010314605 = queryNorm
              0.27340835 = fieldWeight in 4982, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.03441666 = weight(abstract_txt:namely in 4982) [ClassicSimilarity], result of:
            0.03441666 = score(doc=4982,freq=1.0), product of:
              0.09962549 = queryWeight, product of:
                1.5290006 = boost
                6.31699 = idf(docFreq=216, maxDocs=44218)
                0.010314605 = queryNorm
              0.34546039 = fieldWeight in 4982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.31699 = idf(docFreq=216, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.028962787 = weight(abstract_txt:scientific in 4982) [ClassicSimilarity], result of:
            0.028962787 = score(doc=4982,freq=2.0), product of:
              0.08068113 = queryWeight, product of:
                1.68521 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.010314605 = queryNorm
              0.35897845 = fieldWeight in 4982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.026740953 = weight(abstract_txt:proposed in 4982) [ClassicSimilarity], result of:
            0.026740953 = score(doc=4982,freq=1.0), product of:
              0.10608466 = queryWeight, product of:
                2.2313297 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.010314605 = queryNorm
              0.25207183 = fieldWeight in 4982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.02298505 = weight(abstract_txt:based in 4982) [ClassicSimilarity], result of:
            0.02298505 = score(doc=4982,freq=3.0), product of:
              0.076118164 = queryWeight, product of:
                2.3148732 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.010314605 = queryNorm
              0.3019654 = fieldWeight in 4982, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.13629861 = weight(abstract_txt:structural in 4982) [ClassicSimilarity], result of:
            0.13629861 = score(doc=4982,freq=4.0), product of:
              0.21321231 = queryWeight, product of:
                3.5367029 = boost
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.010314605 = queryNorm
              0.6392624 = fieldWeight in 4982, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.083298095 = weight(abstract_txt:citation in 4982) [ClassicSimilarity], result of:
            0.083298095 = score(doc=4982,freq=3.0), product of:
              0.17958967 = queryWeight, product of:
                3.5556889 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.010314605 = queryNorm
              0.4638245 = fieldWeight in 4982, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          0.15072846 = weight(abstract_txt:detection in 4982) [ClassicSimilarity], result of:
            0.15072846 = score(doc=4982,freq=2.0), product of:
              0.2872708 = queryWeight, product of:
                4.1052365 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.010314605 = queryNorm
              0.52469116 = fieldWeight in 4982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
          1.1795341 = weight(abstract_txt:plagiarism in 4982) [ClassicSimilarity], result of:
            1.1795341 = score(doc=4982,freq=10.0), product of:
              0.77449447 = queryWeight, product of:
                8.5263195 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.010314605 = queryNorm
              1.522973 = fieldWeight in 4982, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4982)
        0.44 = coord(11/25)
    
  4. Stamatatos, E.: Plagiarism detection using stopword n-grams (2011) 0.34
    0.34218192 = sum of:
      0.34218192 = product of:
        1.0693185 = sum of:
          0.055018365 = weight(abstract_txt:passage in 4955) [ClassicSimilarity], result of:
            0.055018365 = score(doc=4955,freq=1.0), product of:
              0.085228555 = queryWeight, product of:
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.010314605 = queryNorm
              0.6455391 = fieldWeight in 4955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
          0.0153709585 = weight(abstract_txt:approach in 4955) [ClassicSimilarity], result of:
            0.0153709585 = score(doc=4955,freq=1.0), product of:
              0.052531656 = queryWeight, product of:
                1.3598112 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.010314605 = queryNorm
              0.29260373 = fieldWeight in 4955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
          0.006918877 = weight(abstract_txt:information in 4955) [ClassicSimilarity], result of:
            0.006918877 = score(doc=4955,freq=1.0), product of:
              0.03658141 = queryWeight, product of:
                1.4649495 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.010314605 = queryNorm
              0.18913643 = fieldWeight in 4955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
          0.01319234 = weight(abstract_txt:with in 4955) [ClassicSimilarity], result of:
            0.01319234 = score(doc=4955,freq=3.0), product of:
              0.039001156 = queryWeight, product of:
                1.5126246 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.010314605 = queryNorm
              0.3382551 = fieldWeight in 4955, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
          0.05402489 = weight(abstract_txt:proposed in 4955) [ClassicSimilarity], result of:
            0.05402489 = score(doc=4955,freq=2.0), product of:
              0.10608466 = queryWeight, product of:
                2.2313297 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.010314605 = queryNorm
              0.509262 = fieldWeight in 4955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
          0.018957749 = weight(abstract_txt:based in 4955) [ClassicSimilarity], result of:
            0.018957749 = score(doc=4955,freq=1.0), product of:
              0.076118164 = queryWeight, product of:
                2.3148732 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.010314605 = queryNorm
              0.24905685 = fieldWeight in 4955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
          0.15225874 = weight(abstract_txt:detection in 4955) [ClassicSimilarity], result of:
            0.15225874 = score(doc=4955,freq=1.0), product of:
              0.2872708 = queryWeight, product of:
                4.1052365 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.010314605 = queryNorm
              0.53001815 = fieldWeight in 4955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
          0.75357664 = weight(abstract_txt:plagiarism in 4955) [ClassicSimilarity], result of:
            0.75357664 = score(doc=4955,freq=2.0), product of:
              0.77449447 = queryWeight, product of:
                8.5263195 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.010314605 = queryNorm
              0.97299165 = fieldWeight in 4955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
        0.32 = coord(8/25)
    
  5. Pertile, S. de L.; Moreira, V.P.: Comparing and combining content- and citation-based approaches for plagiarism detection (2016) 0.34
    0.33904648 = sum of:
      0.33904648 = product of:
        1.2108803 = sum of:
          0.008617201 = weight(abstract_txt:with in 3123) [ClassicSimilarity], result of:
            0.008617201 = score(doc=3123,freq=2.0), product of:
              0.039001156 = queryWeight, product of:
                1.5126246 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.010314605 = queryNorm
              0.22094731 = fieldWeight in 3123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=3123)
          0.040539455 = weight(abstract_txt:scientific in 3123) [ClassicSimilarity], result of:
            0.040539455 = score(doc=3123,freq=3.0), product of:
              0.08068113 = queryWeight, product of:
                1.68521 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.010314605 = queryNorm
              0.5024651 = fieldWeight in 3123, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=3123)
          0.030332401 = weight(abstract_txt:based in 3123) [ClassicSimilarity], result of:
            0.030332401 = score(doc=3123,freq=4.0), product of:
              0.076118164 = queryWeight, product of:
                2.3148732 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.010314605 = queryNorm
              0.39849097 = fieldWeight in 3123, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=3123)
          0.051592976 = weight(abstract_txt:text in 3123) [ClassicSimilarity], result of:
            0.051592976 = score(doc=3123,freq=4.0), product of:
              0.10206662 = queryWeight, product of:
                2.447002 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.010314605 = queryNorm
              0.5054833 = fieldWeight in 3123, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=3123)
          0.05496249 = weight(abstract_txt:citation in 3123) [ClassicSimilarity], result of:
            0.05496249 = score(doc=3123,freq=1.0), product of:
              0.17958967 = queryWeight, product of:
                3.5556889 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.010314605 = queryNorm
              0.30604482 = fieldWeight in 3123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.0625 = fieldNorm(doc=3123)
          0.17226109 = weight(abstract_txt:detection in 3123) [ClassicSimilarity], result of:
            0.17226109 = score(doc=3123,freq=2.0), product of:
              0.2872708 = queryWeight, product of:
                4.1052365 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.010314605 = queryNorm
              0.59964705 = fieldWeight in 3123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.0625 = fieldNorm(doc=3123)
          0.8525747 = weight(abstract_txt:plagiarism in 3123) [ClassicSimilarity], result of:
            0.8525747 = score(doc=3123,freq=4.0), product of:
              0.77449447 = queryWeight, product of:
                8.5263195 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.010314605 = queryNorm
              1.1008145 = fieldWeight in 3123, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0625 = fieldNorm(doc=3123)
        0.28 = coord(7/25)