Search (2 results, page 1 of 1)

  • × author_ss:"Savoy, J."
  • × theme_ss:"Formalerschließung"
  1. Savoy, J.: Estimating the probability of an authorship attribution (2016) 0.08
    0.079768166 = sum of:
      0.0150408745 = product of:
        0.060163498 = sum of:
          0.060163498 = weight(_text_:authors in 2937) [ClassicSimilarity], result of:
            0.060163498 = score(doc=2937,freq=2.0), product of:
              0.2388945 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.052402776 = queryNorm
              0.25184128 = fieldWeight in 2937, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2937)
        0.25 = coord(1/4)
      0.06472729 = sum of:
        0.029228024 = weight(_text_:j in 2937) [ClassicSimilarity], result of:
          0.029228024 = score(doc=2937,freq=2.0), product of:
            0.16650963 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.052402776 = queryNorm
            0.17553353 = fieldWeight in 2937, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2937)
        0.035499264 = weight(_text_:22 in 2937) [ClassicSimilarity], result of:
          0.035499264 = score(doc=2937,freq=2.0), product of:
            0.1835056 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052402776 = queryNorm
            0.19345059 = fieldWeight in 2937, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2937)
    
    Abstract
    In authorship attribution, various distance-based metrics have been proposed to determine the most probable author of a disputed text. In this paradigm, a distance is computed between each author profile and the query text. These values are then employed only to rank the possible authors. In this article, we analyze their distribution and show that we can model it as a mixture of 2 Beta distributions. Based on this finding, we demonstrate how we can derive a more accurate probability that the closest author is, in fact, the real author. To evaluate this approach, we have chosen 4 authorship attribution methods (Burrows' Delta, Kullback-Leibler divergence, Labbé's intertextual distance, and the naïve Bayes). As the first test collection, we have downloaded 224 State of the Union addresses (from 1790 to 2014) delivered by 41 U.S. presidents. The second test collection is formed by the Federalist Papers. The evaluations indicate that the accuracy rate of some authorship decisions can be improved. The suggested method can signal that the proposed assignment should be interpreted as possible, without strong certainty. Being able to quantify the certainty associated with an authorship decision can be a useful component when important decisions must be taken.
    Date
    7. 5.2016 21:22:27
  2. Kocher, M.; Savoy, J.: ¬A simple and efficient algorithm for authorship verification (2017) 0.01
    0.008768408 = product of:
      0.017536815 = sum of:
        0.017536815 = product of:
          0.03507363 = sum of:
            0.03507363 = weight(_text_:j in 3330) [ClassicSimilarity], result of:
              0.03507363 = score(doc=3330,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.21064025 = fieldWeight in 3330, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3330)
          0.5 = coord(1/2)
      0.5 = coord(1/2)