Document (#38505)

Author
Koppel, M.
Schweitzer, N.
Title
Measuring direct and indirect authorial influence in historical corpora
Source
Journal of the Association for Information Science and Technology. 65(2014) no.10, pp.2138-2144
Year
2014
Abstract
We show how automatically extracted citations in historical corpora can be used to measure the direct and indirect influence of authors on each other. These measures can in turn be used to determine an author's overall prominence in the corpus and to identify distinct schools of thought. We apply our methods to two major historical corpora. Using scholarly consensus as a gold standard, we demonstrate empirically the superiority of indirect influence over direct influence as a basis for various measures of authorial impact.

Similar documents (author)

  1. Koppel, T.P.: Public access catalogs through Internet (1990) 6.00
    6.0014763 = sum of:
      6.0014763 = weight(author_txt:koppel in 4070) [ClassicSimilarity], result of:
        6.0014763 = fieldWeight in 4070, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.602362 = idf(docFreq=7, maxDocs=43556)
          0.625 = fieldNorm(doc=4070)
    
  2. Akiva, N.; Koppel, M.: A generic unsupervised method for decomposing multi-author documents (2013) 4.80
    4.801181 = sum of:
      4.801181 = weight(author_txt:koppel in 3096) [ClassicSimilarity], result of:
        4.801181 = fieldWeight in 3096, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.602362 = idf(docFreq=7, maxDocs=43556)
          0.5 = fieldNorm(doc=3096)
    
  3. Koppel, M.; Winter, Y.: Determining if two documents are written by the same author (2014) 4.80
    4.801181 = sum of:
      4.801181 = weight(author_txt:koppel in 3600) [ClassicSimilarity], result of:
        4.801181 = fieldWeight in 3600, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.602362 = idf(docFreq=7, maxDocs=43556)
          0.5 = fieldNorm(doc=3600)
    
  4. Koppel, M.; Akiva, N.; Dagan, I.: Feature instability as a criterion for selecting potential style markers (2006) 3.60
    3.6008856 = sum of:
      3.6008856 = weight(author_txt:koppel in 1090) [ClassicSimilarity], result of:
        3.6008856 = fieldWeight in 1090, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.602362 = idf(docFreq=7, maxDocs=43556)
          0.375 = fieldNorm(doc=1090)
    
  5. Koppel, M.; Schler, J.; Argamon, S.: Computational methods in authorship attribution (2009) 3.60
    3.6008856 = sum of:
      3.6008856 = weight(author_txt:koppel in 4681) [ClassicSimilarity], result of:
        3.6008856 = fieldWeight in 4681, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.602362 = idf(docFreq=7, maxDocs=43556)
          0.375 = fieldNorm(doc=4681)
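
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function names are hypothetical, and the inputs are copied from the first explanation in the list (doc 4070).

```python
import math

def classic_tf(freq: float) -> float:
    # ClassicSimilarity term-frequency factor: square root of the raw count.
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the explanation tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values taken from the explanation for doc 4070 (author_txt:koppel).
idf_koppel = classic_idf(doc_freq=7, max_docs=43556)
score_4070 = field_weight(freq=1.0, doc_freq=7, max_docs=43556, field_norm=0.625)
print(round(idf_koppel, 4), round(score_4070, 4))
```

Because every hit matches the same single term once, the ranking among these five records is decided entirely by fieldNorm (0.625, 0.5, 0.375), which stores a lossy encoding of field length: records with fewer author tokens score higher.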
    

Similar documents (content)

  1. Akter, S.; D'Ambra, J.; Ray, P.: Trustworthiness in mHealth information services: an assessment of a hierarchical model with mediating and moderating effects using partial least squares (PLS) (2011) 0.14
    0.14440109 = sum of:
      0.14440109 = product of:
        0.6016712 = sum of:
          0.033021074 = weight(abstract_txt:overall in 1134) [ClassicSimilarity], result of:
            0.033021074 = score(doc=1134,freq=1.0), product of:
              0.07708276 = queryWeight, product of:
                5.483324 = idf(docFreq=491, maxDocs=43556)
                0.014057671 = queryNorm
              0.4283847 = fieldWeight in 1134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.483324 = idf(docFreq=491, maxDocs=43556)
                0.078125 = fieldNorm(doc=1134)
          0.059111997 = weight(abstract_txt:empirically in 1134) [ClassicSimilarity], result of:
            0.059111997 = score(doc=1134,freq=1.0), product of:
              0.11364409 = queryWeight, product of:
                1.2142128 = boost
                6.6579223 = idf(docFreq=151, maxDocs=43556)
                0.014057671 = queryNorm
              0.5201502 = fieldWeight in 1134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6579223 = idf(docFreq=151, maxDocs=43556)
                0.078125 = fieldNorm(doc=1134)
          0.015276666 = weight(abstract_txt:used in 1134) [ClassicSimilarity], result of:
            0.015276666 = score(doc=1134,freq=1.0), product of:
              0.05809323 = queryWeight, product of:
                1.2277194 = boost
                3.3659916 = idf(docFreq=4087, maxDocs=43556)
                0.014057671 = queryNorm
              0.2629681 = fieldWeight in 1134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3659916 = idf(docFreq=4087, maxDocs=43556)
                0.078125 = fieldNorm(doc=1134)
          0.12176072 = weight(abstract_txt:direct in 1134) [ClassicSimilarity], result of:
            0.12176072 = score(doc=1134,freq=1.0), product of:
              0.26534343 = queryWeight, product of:
                3.213558 = boost
                5.87366 = idf(docFreq=332, maxDocs=43556)
                0.014057671 = queryNorm
              0.4588797 = fieldWeight in 1134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.87366 = idf(docFreq=332, maxDocs=43556)
                0.078125 = fieldNorm(doc=1134)
          0.11825599 = weight(abstract_txt:influence in 1134) [ClassicSimilarity], result of:
            0.11825599 = score(doc=1134,freq=1.0), product of:
              0.28641683 = queryWeight, product of:
                3.855233 = boost
                5.284873 = idf(docFreq=599, maxDocs=43556)
                0.014057671 = queryNorm
              0.41288072 = fieldWeight in 1134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.284873 = idf(docFreq=599, maxDocs=43556)
                0.078125 = fieldNorm(doc=1134)
          0.25424474 = weight(abstract_txt:indirect in 1134) [ClassicSimilarity], result of:
            0.25424474 = score(doc=1134,freq=1.0), product of:
              0.43348247 = queryWeight, product of:
                4.107407 = boost
                7.5074153 = idf(docFreq=64, maxDocs=43556)
                0.014057671 = queryNorm
              0.5865168 = fieldWeight in 1134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5074153 = idf(docFreq=64, maxDocs=43556)
                0.078125 = fieldNorm(doc=1134)
        0.24 = coord(6/25)
    
  2. Cui, H.; Heidorn, P.B.: The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions (2007) 0.14
    0.1370893 = sum of:
      0.1370893 = product of:
        0.6854465 = sum of:
          0.046481825 = weight(abstract_txt:automatically in 2082) [ClassicSimilarity], result of:
            0.046481825 = score(doc=2082,freq=3.0), product of:
              0.07789654 = queryWeight, product of:
                1.0052648 = boost
                5.5121922 = idf(docFreq=477, maxDocs=43556)
                0.014057671 = queryNorm
              0.5967123 = fieldWeight in 2082, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5121922 = idf(docFreq=477, maxDocs=43556)
                0.0625 = fieldNorm(doc=2082)
          0.03667964 = weight(abstract_txt:corpus in 2082) [ClassicSimilarity], result of:
            0.03667964 = score(doc=2082,freq=1.0), product of:
              0.09593709 = queryWeight, product of:
                1.1156157 = boost
                6.1172824 = idf(docFreq=260, maxDocs=43556)
                0.014057671 = queryNorm
              0.38233015 = fieldWeight in 2082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1172824 = idf(docFreq=260, maxDocs=43556)
                0.0625 = fieldNorm(doc=2082)
          0.012221333 = weight(abstract_txt:used in 2082) [ClassicSimilarity], result of:
            0.012221333 = score(doc=2082,freq=1.0), product of:
              0.05809323 = queryWeight, product of:
                1.2277194 = boost
                3.3659916 = idf(docFreq=4087, maxDocs=43556)
                0.014057671 = queryNorm
              0.21037447 = fieldWeight in 2082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3659916 = idf(docFreq=4087, maxDocs=43556)
                0.0625 = fieldNorm(doc=2082)
          0.0513591 = weight(abstract_txt:measures in 2082) [ClassicSimilarity], result of:
            0.0513591 = score(doc=2082,freq=1.0), product of:
              0.15128344 = queryWeight, product of:
                1.981217 = boost
                5.4318275 = idf(docFreq=517, maxDocs=43556)
                0.014057671 = queryNorm
              0.33948922 = fieldWeight in 2082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4318275 = idf(docFreq=517, maxDocs=43556)
                0.0625 = fieldNorm(doc=2082)
          0.5387046 = weight(abstract_txt:corpora in 2082) [ClassicSimilarity], result of:
            0.5387046 = score(doc=2082,freq=10.0), product of:
              0.3851625 = queryWeight, product of:
                3.8717203 = boost
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.014057671 = queryNorm
              1.3986423 = fieldWeight in 2082, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.0625 = fieldNorm(doc=2082)
        0.2 = coord(5/25)
    
  3. Herdagdelen, A.; Baroni, M.: Stereotypical gender actions can be extracted from web text (2011) 0.10
    0.10109299 = sum of:
      0.10109299 = product of:
        0.42122078 = sum of:
          0.026836295 = weight(abstract_txt:automatically in 1750) [ClassicSimilarity], result of:
            0.026836295 = score(doc=1750,freq=1.0), product of:
              0.07789654 = queryWeight, product of:
                1.0052648 = boost
                5.5121922 = idf(docFreq=477, maxDocs=43556)
                0.014057671 = queryNorm
              0.34451202 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5121922 = idf(docFreq=477, maxDocs=43556)
                0.0625 = fieldNorm(doc=1750)
          0.063531004 = weight(abstract_txt:corpus in 1750) [ClassicSimilarity], result of:
            0.063531004 = score(doc=1750,freq=3.0), product of:
              0.09593709 = queryWeight, product of:
                1.1156157 = boost
                6.1172824 = idf(docFreq=260, maxDocs=43556)
                0.014057671 = queryNorm
              0.66221523 = fieldWeight in 1750, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1172824 = idf(docFreq=260, maxDocs=43556)
                0.0625 = fieldNorm(doc=1750)
          0.037606385 = weight(abstract_txt:extracted in 1750) [ClassicSimilarity], result of:
            0.037606385 = score(doc=1750,freq=1.0), product of:
              0.09754632 = queryWeight, product of:
                1.1249334 = boost
                6.168374 = idf(docFreq=247, maxDocs=43556)
                0.014057671 = queryNorm
              0.38552338 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.168374 = idf(docFreq=247, maxDocs=43556)
                0.0625 = fieldNorm(doc=1750)
          0.012221333 = weight(abstract_txt:used in 1750) [ClassicSimilarity], result of:
            0.012221333 = score(doc=1750,freq=1.0), product of:
              0.05809323 = queryWeight, product of:
                1.2277194 = boost
                3.3659916 = idf(docFreq=4087, maxDocs=43556)
                0.014057671 = queryNorm
              0.21037447 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3659916 = idf(docFreq=4087, maxDocs=43556)
                0.0625 = fieldNorm(doc=1750)
          0.1106724 = weight(abstract_txt:gold in 1750) [ClassicSimilarity], result of:
            0.1106724 = score(doc=1750,freq=2.0), product of:
              0.15899594 = queryWeight, product of:
                1.4361982 = boost
                7.87514 = idf(docFreq=44, maxDocs=43556)
                0.014057671 = queryNorm
              0.6960706 = fieldWeight in 1750, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.87514 = idf(docFreq=44, maxDocs=43556)
                0.0625 = fieldNorm(doc=1750)
          0.17035334 = weight(abstract_txt:corpora in 1750) [ClassicSimilarity], result of:
            0.17035334 = score(doc=1750,freq=1.0), product of:
              0.3851625 = queryWeight, product of:
                3.8717203 = boost
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.014057671 = queryNorm
              0.44228953 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.0625 = fieldNorm(doc=1750)
        0.24 = coord(6/25)
    
  4. Akiva, N.; Koppel, M.: A generic unsupervised method for decomposing multi-author documents (2013) 0.10
    0.100025095 = sum of:
      0.100025095 = product of:
        0.83354247 = sum of:
          0.05367259 = weight(abstract_txt:automatically in 3096) [ClassicSimilarity], result of:
            0.05367259 = score(doc=3096,freq=1.0), product of:
              0.07789654 = queryWeight, product of:
                1.0052648 = boost
                5.5121922 = idf(docFreq=477, maxDocs=43556)
                0.014057671 = queryNorm
              0.68902403 = fieldWeight in 3096, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5121922 = idf(docFreq=477, maxDocs=43556)
                0.125 = fieldNorm(doc=3096)
          0.07477377 = weight(abstract_txt:distinct in 3096) [ClassicSimilarity], result of:
            0.07477377 = score(doc=3096,freq=1.0), product of:
              0.097166374 = queryWeight, product of:
                1.1227404 = boost
                6.1563497 = idf(docFreq=250, maxDocs=43556)
                0.014057671 = queryNorm
              0.7695437 = fieldWeight in 3096, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1563497 = idf(docFreq=250, maxDocs=43556)
                0.125 = fieldNorm(doc=3096)
          0.7050961 = weight(abstract_txt:authorial in 3096) [ClassicSimilarity], result of:
            0.7050961 = score(doc=3096,freq=2.0), product of:
              0.43369263 = queryWeight, product of:
                3.3544967 = boost
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.014057671 = queryNorm
              1.6257969 = fieldWeight in 3096, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.125 = fieldNorm(doc=3096)
        0.12 = coord(3/25)
    
  5. Clavier, V.; Paganelli, C.: Including authorial stance in the indexing of scientific documents (2012) 0.09
    0.09332346 = sum of:
      0.09332346 = product of:
        0.77769554 = sum of:
          0.015276666 = weight(abstract_txt:used in 2318) [ClassicSimilarity], result of:
            0.015276666 = score(doc=2318,freq=1.0), product of:
              0.05809323 = queryWeight, product of:
                1.2277194 = boost
                3.3659916 = idf(docFreq=4087, maxDocs=43556)
                0.014057671 = queryNorm
              0.2629681 = fieldWeight in 2318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3659916 = idf(docFreq=4087, maxDocs=43556)
                0.078125 = fieldNorm(doc=2318)
          0.06563449 = weight(abstract_txt:author's in 2318) [ClassicSimilarity], result of:
            0.06563449 = score(doc=2318,freq=1.0), product of:
              0.1218572 = queryWeight, product of:
                1.2573233 = boost
                6.894311 = idf(docFreq=119, maxDocs=43556)
                0.014057671 = queryNorm
              0.538618 = fieldWeight in 2318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.894311 = idf(docFreq=119, maxDocs=43556)
                0.078125 = fieldNorm(doc=2318)
          0.6967844 = weight(abstract_txt:authorial in 2318) [ClassicSimilarity], result of:
            0.6967844 = score(doc=2318,freq=5.0), product of:
              0.43369263 = queryWeight, product of:
                3.3544967 = boost
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.014057671 = queryNorm
              1.6066318 = fieldWeight in 2318, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.078125 = fieldNorm(doc=2318)
        0.12 = coord(3/25)
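
The content-similarity scores combine several per-term scores and then scale by a coordination factor. The following is a minimal sketch of that arithmetic for the last hit (doc 2318), assuming the ClassicSimilarity formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The helper names are hypothetical; the boosts, queryNorm, fieldNorm, and the 25-clause query size are copied from the explanation output rather than derived.

```python
import math

MAX_DOCS = 43556
QUERY_NORM = 0.014057671   # copied from the explanation output
QUERY_CLAUSES = 25         # denominator of coord(3/25)

def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM          # query side
    field_weight = math.sqrt(freq) * idf * field_norm  # document side
    return query_weight * field_weight

# (boost, docFreq, termFreq) for the terms matched in doc 2318.
matched_terms = {
    "used":      (1.2277194, 4087, 1.0),
    "author's":  (1.2573233, 119, 1.0),
    "authorial": (3.3544967, 11, 5.0),
}
FIELD_NORM_2318 = 0.078125

term_scores = {
    term: term_score(boost, doc_freq, freq, FIELD_NORM_2318)
    for term, (boost, doc_freq, freq) in matched_terms.items()
}
coord = len(matched_terms) / QUERY_CLAUSES   # fraction of query clauses matched
total_2318 = coord * sum(term_scores.values())
print(round(total_2318, 4))
```

The coord factor explains why hits 4 and 5, whose raw sums (0.83 and 0.78) are the largest in the list, rank below hits that match more of the 25 query terms: matching only 3 clauses scales the sum by 0.12, whereas 6 clauses scale it by 0.24.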