Document (#39939)

Author
Savoy, J.
Title
Estimating the probability of an authorship attribution
Source
Journal of the Association for Information Science and Technology. 67(2016) no.6, S.1462-1472
Year
2016
Abstract
In authorship attribution, various distance-based metrics have been proposed to determine the most probable author of a disputed text. In this paradigm, a distance is computed between each author profile and the query text. These values are then employed only to rank the possible authors. In this article, we analyze their distribution and show that we can model it as a mixture of 2 Beta distributions. Based on this finding, we demonstrate how we can derive a more accurate probability that the closest author is, in fact, the real author. To evaluate this approach, we have chosen 4 authorship attribution methods (Burrows' Delta, Kullback-Leibler divergence, Labbé's intertextual distance, and the naïve Bayes). As the first test collection, we have downloaded 224 State of the Union addresses (from 1790 to 2014) delivered by 41 U.S. presidents. The second test collection is formed by the Federalist Papers. The evaluations indicate that the accuracy rate of some authorship decisions can be improved. The suggested method can signal that the proposed assignment should be interpreted as possible, without strong certainty. Being able to quantify the certainty associated with an authorship decision can be a useful component when important decisions must be taken.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23455/abstract.
Theme
Formalerschließung

Similar documents (author)

  1. Savoy, J.: Stemming of French words based on grammatical categories (1993) 5.21
    5.2141504 = sum of:
      5.2141504 = weight(author_txt:savoy in 4650) [ClassicSimilarity], result of:
        5.2141504 = fieldWeight in 4650, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.342641 = idf(docFreq=27, maxDocs=43254)
          0.625 = fieldNorm(doc=4650)
    
  2. Savoy, J.: Effectiveness of information retrieval systems used in a hypertext environment (1993) 5.21
    5.2141504 = sum of:
      5.2141504 = weight(author_txt:savoy in 6511) [ClassicSimilarity], result of:
        5.2141504 = fieldWeight in 6511, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.342641 = idf(docFreq=27, maxDocs=43254)
          0.625 = fieldNorm(doc=6511)
    
  3. Savoy, J.: ¬A learning scheme for information retrieval in hypertext (1994) 5.21
    5.2141504 = sum of:
      5.2141504 = weight(author_txt:savoy in 292) [ClassicSimilarity], result of:
        5.2141504 = fieldWeight in 292, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.342641 = idf(docFreq=27, maxDocs=43254)
          0.625 = fieldNorm(doc=292)
    
  4. Savoy, J.: Bayesian inference networks and spreading activation in hypertext systems (1992) 5.21
    5.2141504 = sum of:
      5.2141504 = weight(author_txt:savoy in 1261) [ClassicSimilarity], result of:
        5.2141504 = fieldWeight in 1261, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.342641 = idf(docFreq=27, maxDocs=43254)
          0.625 = fieldNorm(doc=1261)
    
  5. Savoy, J.: Searching information in legal hypertext systems (1993/94) 5.21
    5.2141504 = sum of:
      5.2141504 = weight(author_txt:savoy in 1826) [ClassicSimilarity], result of:
        5.2141504 = fieldWeight in 1826, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.342641 = idf(docFreq=27, maxDocs=43254)
          0.625 = fieldNorm(doc=1826)
    

Similar documents (content)

  1. Kocher, M.; Savoy, J.: ¬A simple and efficient algorithm for authorship verification (2017) 0.56
    0.56097263 = sum of:
      0.56097263 = product of:
        1.2749377 = sum of:
          0.030118773 = weight(abstract_txt:text in 4795) [ClassicSimilarity], result of:
            0.030118773 = score(doc=4795,freq=2.0), product of:
              0.06731399 = queryWeight, product of:
                1.0010073 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016605088 = queryNorm
              0.44743705 = fieldWeight in 4795, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
          0.031753205 = weight(abstract_txt:proposed in 4795) [ClassicSimilarity], result of:
            0.031753205 = score(doc=4795,freq=1.0), product of:
              0.087851435 = queryWeight, product of:
                1.14356 = boost
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.016605088 = queryNorm
              0.3614421 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
          0.19302982 = weight(abstract_txt:disputed in 4795) [ClassicSimilarity], result of:
            0.19302982 = score(doc=4795,freq=2.0), product of:
              0.18434022 = queryWeight, product of:
                1.1713309 = boost
                9.47762 = idf(docFreq=8, maxDocs=43254)
                0.016605088 = queryNorm
              1.0471389 = fieldWeight in 4795, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.47762 = idf(docFreq=8, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
          0.008704905 = weight(abstract_txt:that in 4795) [ClassicSimilarity], result of:
            0.008704905 = score(doc=4795,freq=1.0), product of:
              0.04671 = queryWeight, product of:
                1.1792462 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.016605088 = queryNorm
              0.18636064 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
          0.009218731 = weight(abstract_txt:this in 4795) [ClassicSimilarity], result of:
            0.009218731 = score(doc=4795,freq=1.0), product of:
              0.048530478 = queryWeight, product of:
                1.2020066 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.016605088 = queryNorm
              0.18995756 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
          0.041479576 = weight(abstract_txt:test in 4795) [ClassicSimilarity], result of:
            0.041479576 = score(doc=4795,freq=1.0), product of:
              0.104981646 = queryWeight, product of:
                1.2500899 = boost
                5.057442 = idf(docFreq=747, maxDocs=43254)
                0.016605088 = queryNorm
              0.39511266 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.057442 = idf(docFreq=747, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
          0.23655905 = weight(abstract_txt:certainty in 4795) [ClassicSimilarity], result of:
            0.23655905 = score(doc=4795,freq=1.0), product of:
              0.335107 = queryWeight, product of:
                2.2334504 = boost
                9.035788 = idf(docFreq=13, maxDocs=43254)
                0.016605088 = queryNorm
              0.70592093 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.035788 = idf(docFreq=13, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
          0.1299902 = weight(abstract_txt:distance in 4795) [ClassicSimilarity], result of:
            0.1299902 = score(doc=4795,freq=1.0), product of:
              0.25735226 = queryWeight, product of:
                2.3971443 = boost
                6.4653587 = idf(docFreq=182, maxDocs=43254)
                0.016605088 = queryNorm
              0.50510615 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4653587 = idf(docFreq=182, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
          0.0793974 = weight(abstract_txt:author in 4795) [ClassicSimilarity], result of:
            0.0793974 = score(doc=4795,freq=1.0), product of:
              0.20390975 = queryWeight, product of:
                2.4638743 = boost
                4.9840026 = idf(docFreq=804, maxDocs=43254)
                0.016605088 = queryNorm
              0.3893752 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9840026 = idf(docFreq=804, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
          0.23837595 = weight(abstract_txt:attribution in 4795) [ClassicSimilarity], result of:
            0.23837595 = score(doc=4795,freq=1.0), product of:
              0.38556346 = queryWeight, product of:
                2.9341216 = boost
                7.913645 = idf(docFreq=42, maxDocs=43254)
                0.016605088 = queryNorm
              0.61825347 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.913645 = idf(docFreq=42, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
          0.27631018 = weight(abstract_txt:authorship in 4795) [ClassicSimilarity], result of:
            0.27631018 = score(doc=4795,freq=1.0), product of:
              0.50443095 = queryWeight, product of:
                4.3326683 = boost
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.016605088 = queryNorm
              0.5477661 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.078125 = fieldNorm(doc=4795)
        0.44 = coord(11/25)
    
  2. Savoy, J.: Text clustering : an application with the 'State of the Union' addresses (2015) 0.37
    0.36861694 = sum of:
      0.36861694 = product of:
        1.0239359 = sum of:
          0.025402565 = weight(abstract_txt:proposed in 3593) [ClassicSimilarity], result of:
            0.025402565 = score(doc=3593,freq=1.0), product of:
              0.087851435 = queryWeight, product of:
                1.14356 = boost
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.016605088 = queryNorm
              0.28915367 = fieldWeight in 3593, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.0625 = fieldNorm(doc=3593)
          0.009848476 = weight(abstract_txt:that in 3593) [ClassicSimilarity], result of:
            0.009848476 = score(doc=3593,freq=2.0), product of:
              0.04671 = queryWeight, product of:
                1.1792462 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.016605088 = queryNorm
              0.210843 = fieldWeight in 3593, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=3593)
          0.012910066 = weight(abstract_txt:have in 3593) [ClassicSimilarity], result of:
            0.012910066 = score(doc=3593,freq=1.0), product of:
              0.064044215 = queryWeight, product of:
                1.1958319 = boost
                3.2252884 = idf(docFreq=4672, maxDocs=43254)
                0.016605088 = queryNorm
              0.20158052 = fieldWeight in 3593, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2252884 = idf(docFreq=4672, maxDocs=43254)
                0.0625 = fieldNorm(doc=3593)
          0.010429803 = weight(abstract_txt:this in 3593) [ClassicSimilarity], result of:
            0.010429803 = score(doc=3593,freq=2.0), product of:
              0.048530478 = queryWeight, product of:
                1.2020066 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.016605088 = queryNorm
              0.21491244 = fieldWeight in 3593, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=3593)
          0.11811294 = weight(abstract_txt:1790 in 3593) [ClassicSimilarity], result of:
            0.11811294 = score(doc=3593,freq=1.0), product of:
              0.19424602 = queryWeight, product of:
                1.2023908 = boost
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.016605088 = queryNorm
              0.60805845 = fieldWeight in 3593, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.0625 = fieldNorm(doc=3593)
          0.17510322 = weight(abstract_txt:presidents in 3593) [ClassicSimilarity], result of:
            0.17510322 = score(doc=3593,freq=2.0), product of:
              0.20045024 = queryWeight, product of:
                1.221442 = boost
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.016605088 = queryNorm
              0.8735496 = fieldWeight in 3593, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.0625 = fieldNorm(doc=3593)
          0.0898279 = weight(abstract_txt:author in 3593) [ClassicSimilarity], result of:
            0.0898279 = score(doc=3593,freq=2.0), product of:
              0.20390975 = queryWeight, product of:
                2.4638743 = boost
                4.9840026 = idf(docFreq=804, maxDocs=43254)
                0.016605088 = queryNorm
              0.44052774 = fieldWeight in 3593, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9840026 = idf(docFreq=804, maxDocs=43254)
                0.0625 = fieldNorm(doc=3593)
          0.26969162 = weight(abstract_txt:attribution in 3593) [ClassicSimilarity], result of:
            0.26969162 = score(doc=3593,freq=2.0), product of:
              0.38556346 = queryWeight, product of:
                2.9341216 = boost
                7.913645 = idf(docFreq=42, maxDocs=43254)
                0.016605088 = queryNorm
              0.699474 = fieldWeight in 3593, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.913645 = idf(docFreq=42, maxDocs=43254)
                0.0625 = fieldNorm(doc=3593)
          0.3126093 = weight(abstract_txt:authorship in 3593) [ClassicSimilarity], result of:
            0.3126093 = score(doc=3593,freq=2.0), product of:
              0.50443095 = queryWeight, product of:
                4.3326683 = boost
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.016605088 = queryNorm
              0.6197266 = fieldWeight in 3593, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.0625 = fieldNorm(doc=3593)
        0.36 = coord(9/25)
    
  3. Stamatatos, E.: Masking topic-related information to enhance authorship attribution (2018) 0.28
    0.27888432 = sum of:
      0.27888432 = product of:
        0.9960154 = sum of:
          0.02409502 = weight(abstract_txt:text in 125) [ClassicSimilarity], result of:
            0.02409502 = score(doc=125,freq=2.0), product of:
              0.06731399 = queryWeight, product of:
                1.0010073 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016605088 = queryNorm
              0.35794964 = fieldWeight in 125, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=125)
          0.03592465 = weight(abstract_txt:proposed in 125) [ClassicSimilarity], result of:
            0.03592465 = score(doc=125,freq=2.0), product of:
              0.087851435 = queryWeight, product of:
                1.14356 = boost
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.016605088 = queryNorm
              0.40892503 = fieldWeight in 125, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.0625 = fieldNorm(doc=125)
          0.012061871 = weight(abstract_txt:that in 125) [ClassicSimilarity], result of:
            0.012061871 = score(doc=125,freq=3.0), product of:
              0.04671 = queryWeight, product of:
                1.1792462 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.016605088 = queryNorm
              0.25822887 = fieldWeight in 125, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=125)
          0.010429803 = weight(abstract_txt:this in 125) [ClassicSimilarity], result of:
            0.010429803 = score(doc=125,freq=2.0), product of:
              0.048530478 = queryWeight, product of:
                1.2020066 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.016605088 = queryNorm
              0.21491244 = fieldWeight in 125, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=125)
          0.06351792 = weight(abstract_txt:author in 125) [ClassicSimilarity], result of:
            0.06351792 = score(doc=125,freq=1.0), product of:
              0.20390975 = queryWeight, product of:
                2.4638743 = boost
                4.9840026 = idf(docFreq=804, maxDocs=43254)
                0.016605088 = queryNorm
              0.31150016 = fieldWeight in 125, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9840026 = idf(docFreq=804, maxDocs=43254)
                0.0625 = fieldNorm(doc=125)
          0.46711957 = weight(abstract_txt:attribution in 125) [ClassicSimilarity], result of:
            0.46711957 = score(doc=125,freq=6.0), product of:
              0.38556346 = queryWeight, product of:
                2.9341216 = boost
                7.913645 = idf(docFreq=42, maxDocs=43254)
                0.016605088 = queryNorm
              1.2115245 = fieldWeight in 125, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.913645 = idf(docFreq=42, maxDocs=43254)
                0.0625 = fieldNorm(doc=125)
          0.3828666 = weight(abstract_txt:authorship in 125) [ClassicSimilarity], result of:
            0.3828666 = score(doc=125,freq=3.0), product of:
              0.50443095 = queryWeight, product of:
                4.3326683 = boost
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.016605088 = queryNorm
              0.7590069 = fieldWeight in 125, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.0625 = fieldNorm(doc=125)
        0.28 = coord(7/25)
    
  4. Koppel, M.; Schler, J.; Argamon, S.: Computational methods in authorship attribution (2009) 0.26
    0.25917438 = sum of:
      0.25917438 = product of:
        0.80991995 = sum of:
          0.017037751 = weight(abstract_txt:text in 4684) [ClassicSimilarity], result of:
            0.017037751 = score(doc=4684,freq=1.0), product of:
              0.06731399 = queryWeight, product of:
                1.0010073 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016605088 = queryNorm
              0.25310862 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=4684)
          0.025517626 = weight(abstract_txt:possible in 4684) [ClassicSimilarity], result of:
            0.025517626 = score(doc=4684,freq=1.0), product of:
              0.08811652 = queryWeight, product of:
                1.145284 = boost
                4.6334333 = idf(docFreq=1142, maxDocs=43254)
                0.016605088 = queryNorm
              0.28958958 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6334333 = idf(docFreq=1142, maxDocs=43254)
                0.0625 = fieldNorm(doc=4684)
          0.0069639245 = weight(abstract_txt:that in 4684) [ClassicSimilarity], result of:
            0.0069639245 = score(doc=4684,freq=1.0), product of:
              0.04671 = queryWeight, product of:
                1.1792462 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.016605088 = queryNorm
              0.14908852 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=4684)
          0.012910066 = weight(abstract_txt:have in 4684) [ClassicSimilarity], result of:
            0.012910066 = score(doc=4684,freq=1.0), product of:
              0.064044215 = queryWeight, product of:
                1.1958319 = boost
                3.2252884 = idf(docFreq=4672, maxDocs=43254)
                0.016605088 = queryNorm
              0.20158052 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2252884 = idf(docFreq=4672, maxDocs=43254)
                0.0625 = fieldNorm(doc=4684)
          0.01474997 = weight(abstract_txt:this in 4684) [ClassicSimilarity], result of:
            0.01474997 = score(doc=4684,freq=4.0), product of:
              0.048530478 = queryWeight, product of:
                1.2020066 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.016605088 = queryNorm
              0.3039321 = fieldWeight in 4684, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=4684)
          0.0898279 = weight(abstract_txt:author in 4684) [ClassicSimilarity], result of:
            0.0898279 = score(doc=4684,freq=2.0), product of:
              0.20390975 = queryWeight, product of:
                2.4638743 = boost
                4.9840026 = idf(docFreq=804, maxDocs=43254)
                0.016605088 = queryNorm
              0.44052774 = fieldWeight in 4684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9840026 = idf(docFreq=804, maxDocs=43254)
                0.0625 = fieldNorm(doc=4684)
          0.33030343 = weight(abstract_txt:attribution in 4684) [ClassicSimilarity], result of:
            0.33030343 = score(doc=4684,freq=3.0), product of:
              0.38556346 = queryWeight, product of:
                2.9341216 = boost
                7.913645 = idf(docFreq=42, maxDocs=43254)
                0.016605088 = queryNorm
              0.8566772 = fieldWeight in 4684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.913645 = idf(docFreq=42, maxDocs=43254)
                0.0625 = fieldNorm(doc=4684)
          0.3126093 = weight(abstract_txt:authorship in 4684) [ClassicSimilarity], result of:
            0.3126093 = score(doc=4684,freq=2.0), product of:
              0.50443095 = queryWeight, product of:
                4.3326683 = boost
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.016605088 = queryNorm
              0.6197266 = fieldWeight in 4684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.0625 = fieldNorm(doc=4684)
        0.32 = coord(8/25)
    
  5. Stover, J.A.; Winter, Y.; Koppel, M.; Kestemont, M.: Computational authorship verification method attributes a new work to a major 2nd century African author (2016) 0.23
    0.22637707 = sum of:
      0.22637707 = product of:
        0.70742834 = sum of:
          0.034075502 = weight(abstract_txt:text in 3968) [ClassicSimilarity], result of:
            0.034075502 = score(doc=3968,freq=4.0), product of:
              0.06731399 = queryWeight, product of:
                1.0010073 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016605088 = queryNorm
              0.50621724 = fieldWeight in 3968, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=3968)
          0.025402565 = weight(abstract_txt:proposed in 3968) [ClassicSimilarity], result of:
            0.025402565 = score(doc=3968,freq=1.0), product of:
              0.087851435 = queryWeight, product of:
                1.14356 = boost
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.016605088 = queryNorm
              0.28915367 = fieldWeight in 3968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.0625 = fieldNorm(doc=3968)
          0.0069639245 = weight(abstract_txt:that in 3968) [ClassicSimilarity], result of:
            0.0069639245 = score(doc=3968,freq=1.0), product of:
              0.04671 = queryWeight, product of:
                1.1792462 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.016605088 = queryNorm
              0.14908852 = fieldWeight in 3968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=3968)
          0.012910066 = weight(abstract_txt:have in 3968) [ClassicSimilarity], result of:
            0.012910066 = score(doc=3968,freq=1.0), product of:
              0.064044215 = queryWeight, product of:
                1.1958319 = boost
                3.2252884 = idf(docFreq=4672, maxDocs=43254)
                0.016605088 = queryNorm
              0.20158052 = fieldWeight in 3968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2252884 = idf(docFreq=4672, maxDocs=43254)
                0.0625 = fieldNorm(doc=3968)
          0.01474997 = weight(abstract_txt:this in 3968) [ClassicSimilarity], result of:
            0.01474997 = score(doc=3968,freq=4.0), product of:
              0.048530478 = queryWeight, product of:
                1.2020066 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.016605088 = queryNorm
              0.3039321 = fieldWeight in 3968, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=3968)
          0.110016264 = weight(abstract_txt:author in 3968) [ClassicSimilarity], result of:
            0.110016264 = score(doc=3968,freq=3.0), product of:
              0.20390975 = queryWeight, product of:
                2.4638743 = boost
                4.9840026 = idf(docFreq=804, maxDocs=43254)
                0.016605088 = queryNorm
              0.5395341 = fieldWeight in 3968, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9840026 = idf(docFreq=804, maxDocs=43254)
                0.0625 = fieldNorm(doc=3968)
          0.19070077 = weight(abstract_txt:attribution in 3968) [ClassicSimilarity], result of:
            0.19070077 = score(doc=3968,freq=1.0), product of:
              0.38556346 = queryWeight, product of:
                2.9341216 = boost
                7.913645 = idf(docFreq=42, maxDocs=43254)
                0.016605088 = queryNorm
              0.4946028 = fieldWeight in 3968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.913645 = idf(docFreq=42, maxDocs=43254)
                0.0625 = fieldNorm(doc=3968)
          0.3126093 = weight(abstract_txt:authorship in 3968) [ClassicSimilarity], result of:
            0.3126093 = score(doc=3968,freq=2.0), product of:
              0.50443095 = queryWeight, product of:
                4.3326683 = boost
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.016605088 = queryNorm
              0.6197266 = fieldWeight in 3968, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.011406 = idf(docFreq=105, maxDocs=43254)
                0.0625 = fieldNorm(doc=3968)
        0.32 = coord(8/25)