Document (#43848)

Author
Corbara, S.
Moreo, A.
Sebastiani, F.
Title
Syllabic quantity patterns as rhythmic features for Latin authorship attribution
Source
Journal of the Association for Information Science and Technology. 74(2023) no.1, pp.128-141
Year
2023
Abstract
It is well known that, within the Latin production of written text, peculiar metric schemes were followed not only in poetic compositions, but also in many prose works. Such metric patterns were based on so-called syllabic quantity, that is, on the length of the involved syllables, and there is substantial evidence suggesting that certain authors had a preference for certain metric patterns over others. In this research we investigate the possibility of employing syllabic quantity as a basis for deriving rhythmic features for the task of computational authorship attribution of Latin prose texts. We test the impact of these features on the authorship attribution task when combined with other topic-agnostic features. Our experiments, carried out on three different datasets using support vector machines (SVMs), show that rhythmic features based on syllabic quantity are beneficial in discriminating among Latin prose authors.
Content
Cf.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24660. https://doi.org/10.1002/asi.24660.
Theme
Computational linguistics
Descriptive cataloguing

Similar documents (author)

  1. Sebastiani, F.: On the role of logic in information retrieval (1998) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:sebastiani in 1140) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 1140, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=1140)
    
  2. Sebastiani, F.: Machine learning in automated text categorization (2002) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:sebastiani in 3389) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 3389, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=3389)
    
  3. Sebastiani, F.: ¬A tutorial on automated text categorisation (1999) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:sebastiani in 3390) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 3390, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=3390)
    
  4. Sebastiani, F.: Classification of text, automatic (2006) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:sebastiani in 5003) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 5003, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=5003)
    
  5. Debole, F.; Sebastiani, F.: ¬An analysis of the relative hardness of Reuters-21578 subsets (2005) 4.75
    4.749831 = sum of:
      4.749831 = weight(author_txt:sebastiani in 3456) [ClassicSimilarity], result of:
        4.749831 = fieldWeight in 3456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.5 = fieldNorm(doc=3456)
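
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are the formulas that match the numbers printed in these explain trees. The function names are illustrative and are not part of any Lucene API.

```python
import math


def classic_tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency as it appears in the explain output
    # (assumed formula; it reproduces idf(docFreq=8, maxDocs=44218)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as listed under each hit.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Hits 1-4 have fieldNorm 0.625; hit 5 (two authors) has fieldNorm 0.5.
single_author = field_weight(1.0, 8, 44218, 0.625)
two_authors = field_weight(1.0, 8, 44218, 0.5)
print(round(single_author, 2), round(two_authors, 2))  # 5.94 4.75
```

The only factor that differs between the first four hits and the fifth is fieldNorm, which is smaller for the record whose author field holds two names.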
    

Similar documents (content)

  1. Stover, J.A.; Winter, Y.; Koppel, M.; Kestemont, M.: Computational authorship verification method attributes a new work to a major 2nd century African author (2016) 0.14
    0.14449203 = sum of:
      0.14449203 = product of:
        0.6020501 = sum of:
          0.038400132 = weight(abstract_txt:authors in 2503) [ClassicSimilarity], result of:
            0.038400132 = score(doc=2503,freq=3.0), product of:
              0.0763096 = queryWeight, product of:
                1.4079674 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.011659331 = queryNorm
              0.50321496 = fieldWeight in 2503, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=2503)
          0.0058724564 = weight(abstract_txt:that in 2503) [ClassicSimilarity], result of:
            0.0058724564 = score(doc=2503,freq=1.0), product of:
              0.039654057 = queryWeight, product of:
                1.4353633 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.011659331 = queryNorm
              0.1480922 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2503)
          0.026147047 = weight(abstract_txt:task in 2503) [ClassicSimilarity], result of:
            0.026147047 = score(doc=2503,freq=1.0), product of:
              0.085181676 = queryWeight, product of:
                1.4875656 = boost
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.011659331 = queryNorm
              0.30695623 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.0625 = fieldNorm(doc=2503)
          0.15911275 = weight(abstract_txt:authorship in 2503) [ClassicSimilarity], result of:
            0.15911275 = score(doc=2503,freq=2.0), product of:
              0.25796148 = queryWeight, product of:
                3.1704879 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.011659331 = queryNorm
              0.6168082 = fieldWeight in 2503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=2503)
          0.16262703 = weight(abstract_txt:attribution in 2503) [ClassicSimilarity], result of:
            0.16262703 = score(doc=2503,freq=1.0), product of:
              0.32977924 = queryWeight, product of:
                3.584762 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.011659331 = queryNorm
              0.49313906 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=2503)
          0.20989072 = weight(abstract_txt:latin in 2503) [ClassicSimilarity], result of:
            0.20989072 = score(doc=2503,freq=1.0), product of:
              0.43026558 = queryWeight, product of:
                4.7280965 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.011659331 = queryNorm
              0.4878167 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.0625 = fieldNorm(doc=2503)
        0.24 = coord(6/25)
    
  2. Yuan, Q.; Xu, S.; Jian, L.: ¬A new method for retrieving batik shape patterns (2018) 0.12
    0.119216435 = sum of:
      0.119216435 = product of:
        0.49673516 = sum of:
          0.013501349 = weight(abstract_txt:were in 4186) [ClassicSimilarity], result of:
            0.013501349 = score(doc=4186,freq=2.0), product of:
              0.047566425 = queryWeight, product of:
                1.1116122 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.011659331 = queryNorm
              0.283842 = fieldWeight in 4186, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4186)
          0.07353603 = weight(abstract_txt:compositions in 4186) [ClassicSimilarity], result of:
            0.07353603 = score(doc=4186,freq=1.0), product of:
              0.14724793 = queryWeight, product of:
                1.3829696 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.011659331 = queryNorm
              0.49940285 = fieldWeight in 4186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4186)
          0.010276799 = weight(abstract_txt:that in 4186) [ClassicSimilarity], result of:
            0.010276799 = score(doc=4186,freq=4.0), product of:
              0.039654057 = queryWeight, product of:
                1.4353633 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.011659331 = queryNorm
              0.25916135 = fieldWeight in 4186, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4186)
          0.11272462 = weight(abstract_txt:patterns in 4186) [ClassicSimilarity], result of:
            0.11272462 = score(doc=4186,freq=7.0), product of:
              0.14759421 = queryWeight, product of:
                2.3981886 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.011659331 = queryNorm
              0.76374686 = fieldWeight in 4186, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4186)
          0.19638643 = weight(abstract_txt:metric in 4186) [ClassicSimilarity], result of:
            0.19638643 = score(doc=4186,freq=3.0), product of:
              0.2834371 = queryWeight, product of:
                3.323357 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.011659331 = queryNorm
              0.6928748 = fieldWeight in 4186, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4186)
          0.09030992 = weight(abstract_txt:features in 4186) [ClassicSimilarity], result of:
            0.09030992 = score(doc=4186,freq=4.0), product of:
              0.18190418 = queryWeight, product of:
                3.4371176 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.011659331 = queryNorm
              0.4964697 = fieldWeight in 4186, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4186)
        0.24 = coord(6/25)
    
  3. Zheng, R.; Li, J.; Chen, H.; Huang, Z.: ¬A framework for authorship identification of online messages : writing-style features and classification techniques (2006) 0.11
    0.11431208 = sum of:
      0.11431208 = product of:
        0.47630036 = sum of:
          0.054745313 = weight(abstract_txt:machines in 5276) [ClassicSimilarity], result of:
            0.054745313 = score(doc=5276,freq=2.0), product of:
              0.08782316 = queryWeight, product of:
                1.0680524 = boost
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.011659331 = queryNorm
              0.6233585 = fieldWeight in 5276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.0625 = fieldNorm(doc=5276)
          0.12200493 = weight(abstract_txt:discriminating in 5276) [ClassicSimilarity], result of:
            0.12200493 = score(doc=5276,freq=2.0), product of:
              0.14984055 = queryWeight, product of:
                1.3950915 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.011659331 = queryNorm
              0.81423175 = fieldWeight in 5276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=5276)
          0.031353578 = weight(abstract_txt:authors in 5276) [ClassicSimilarity], result of:
            0.031353578 = score(doc=5276,freq=2.0), product of:
              0.0763096 = queryWeight, product of:
                1.4079674 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.011659331 = queryNorm
              0.4108733 = fieldWeight in 5276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=5276)
          0.0058724564 = weight(abstract_txt:that in 5276) [ClassicSimilarity], result of:
            0.0058724564 = score(doc=5276,freq=1.0), product of:
              0.039654057 = queryWeight, product of:
                1.4353633 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.011659331 = queryNorm
              0.1480922 = fieldWeight in 5276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=5276)
          0.15911275 = weight(abstract_txt:authorship in 5276) [ClassicSimilarity], result of:
            0.15911275 = score(doc=5276,freq=2.0), product of:
              0.25796148 = queryWeight, product of:
                3.1704879 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.011659331 = queryNorm
              0.6168082 = fieldWeight in 5276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=5276)
          0.103211336 = weight(abstract_txt:features in 5276) [ClassicSimilarity], result of:
            0.103211336 = score(doc=5276,freq=4.0), product of:
              0.18190418 = queryWeight, product of:
                3.4371176 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.011659331 = queryNorm
              0.56739396 = fieldWeight in 5276, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=5276)
        0.24 = coord(6/25)
    
  4. Stamatatos, E.: Masking topic-related information to enhance authorship attribution (2018) 0.10
    0.101560116 = sum of:
      0.101560116 = product of:
        0.6347507 = sum of:
          0.031353578 = weight(abstract_txt:authors in 4124) [ClassicSimilarity], result of:
            0.031353578 = score(doc=4124,freq=2.0), product of:
              0.0763096 = queryWeight, product of:
                1.4079674 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.011659331 = queryNorm
              0.4108733 = fieldWeight in 4124, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=4124)
          0.010171392 = weight(abstract_txt:that in 4124) [ClassicSimilarity], result of:
            0.010171392 = score(doc=4124,freq=3.0), product of:
              0.039654057 = queryWeight, product of:
                1.4353633 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.011659331 = queryNorm
              0.2565032 = fieldWeight in 4124, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4124)
          0.19487253 = weight(abstract_txt:authorship in 4124) [ClassicSimilarity], result of:
            0.19487253 = score(doc=4124,freq=3.0), product of:
              0.25796148 = queryWeight, product of:
                3.1704879 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.011659331 = queryNorm
              0.75543267 = fieldWeight in 4124, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=4124)
          0.39835325 = weight(abstract_txt:attribution in 4124) [ClassicSimilarity], result of:
            0.39835325 = score(doc=4124,freq=6.0), product of:
              0.32977924 = queryWeight, product of:
                3.584762 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.011659331 = queryNorm
              1.2079391 = fieldWeight in 4124, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=4124)
        0.16 = coord(4/25)
    
  5. Potha, N.; Stamatatos, E.: Improving author verification based on topic modeling (2019) 0.10
    0.09935103 = sum of:
      0.09935103 = product of:
        0.41396266 = sum of:
          0.022170328 = weight(abstract_txt:authors in 5385) [ClassicSimilarity], result of:
            0.022170328 = score(doc=5385,freq=1.0), product of:
              0.0763096 = queryWeight, product of:
                1.4079674 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.011659331 = queryNorm
              0.2905313 = fieldWeight in 5385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
          0.011744913 = weight(abstract_txt:that in 5385) [ClassicSimilarity], result of:
            0.011744913 = score(doc=5385,freq=4.0), product of:
              0.039654057 = queryWeight, product of:
                1.4353633 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.011659331 = queryNorm
              0.2961844 = fieldWeight in 5385, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
          0.102317 = weight(abstract_txt:agnostic in 5385) [ClassicSimilarity], result of:
            0.102317 = score(doc=5385,freq=1.0), product of:
              0.16788799 = queryWeight, product of:
                1.4767189 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.011659331 = queryNorm
              0.6094361 = fieldWeight in 5385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
          0.04528801 = weight(abstract_txt:task in 5385) [ClassicSimilarity], result of:
            0.04528801 = score(doc=5385,freq=3.0), product of:
              0.085181676 = queryWeight, product of:
                1.4875656 = boost
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.011659331 = queryNorm
              0.5316638 = fieldWeight in 5385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
          0.03756987 = weight(abstract_txt:certain in 5385) [ClassicSimilarity], result of:
            0.03756987 = score(doc=5385,freq=1.0), product of:
              0.108465314 = queryWeight, product of:
                1.6786048 = boost
                5.542029 = idf(docFreq=470, maxDocs=44218)
                0.011659331 = queryNorm
              0.3463768 = fieldWeight in 5385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.542029 = idf(docFreq=470, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
          0.19487253 = weight(abstract_txt:authorship in 5385) [ClassicSimilarity], result of:
            0.19487253 = score(doc=5385,freq=3.0), product of:
              0.25796148 = queryWeight, product of:
                3.1704879 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.011659331 = queryNorm
              0.75543267 = fieldWeight in 5385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=5385)
        0.24 = coord(6/25)
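
The content scores combine a query-side weight, a field-side weight, and a coordination factor. The following is a minimal sketch that recomputes the score of the first hit (doc 2503) from the boost, termFreq, docFreq, fieldNorm, queryNorm, and coord values printed above. It assumes the same tf and idf formulas that match the explain output; all function and variable names are illustrative.

```python
import math

QUERY_NORM = 0.011659331  # queryNorm, taken from the explain output
MAX_DOCS = 44218          # maxDocs, taken from the explain output


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed formula; it reproduces the idf values listed above.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    # score = queryWeight * fieldWeight
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm: float, matched: int, query_terms: int) -> float:
    # Sum the per-term scores, then scale by coord(matched/query_terms).
    total = sum(term_score(b, f, df, field_norm) for b, f, df in terms)
    return total * (matched / query_terms)


# (boost, termFreq, docFreq) for the six query terms matched in doc 2503.
terms_2503 = [
    (1.4079674, 3.0, 1150),   # authors
    (1.4353633, 1.0, 11241),  # that
    (1.4875656, 1.0, 884),    # task
    (3.1704879, 2.0, 111),    # authorship
    (3.584762, 1.0, 44),      # attribution
    (4.7280965, 1.0, 48),     # latin
]

score_2503 = document_score(terms_2503, field_norm=0.0625, matched=6, query_terms=25)
print(round(score_2503, 2))  # 0.14
```

The coord factor explains why hit 4 ranks below hit 1 despite a larger term sum (0.6347507 against 0.6020501): it matches only 4 of the 25 query terms, so its sum is scaled by 0.16 rather than 0.24.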