Search (27 results, page 1 of 2)

  • × author_ss:"Savoy, J."
  1. Savoy, J.: Estimating the probability of an authorship attribution (2016) 0.05
    0.05463498 = sum of:
      0.01522842 = product of:
        0.06091368 = sum of:
          0.06091368 = weight(_text_:authors in 2937) [ClassicSimilarity], result of:
            0.06091368 = score(doc=2937,freq=2.0), product of:
              0.2418733 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.053056188 = queryNorm
              0.25184128 = fieldWeight in 2937, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2937)
        0.25 = coord(1/4)
      0.03940656 = sum of:
        0.003464655 = weight(_text_:s in 2937) [ClassicSimilarity], result of:
          0.003464655 = score(doc=2937,freq=2.0), product of:
            0.057684682 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.053056188 = queryNorm
            0.060061958 = fieldWeight in 2937, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2937)
        0.035941906 = weight(_text_:22 in 2937) [ClassicSimilarity], result of:
          0.035941906 = score(doc=2937,freq=2.0), product of:
            0.18579373 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.053056188 = queryNorm
            0.19345059 = fieldWeight in 2937, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2937)
    
    Abstract
    In authorship attribution, various distance-based metrics have been proposed to determine the most probable author of a disputed text. In this paradigm, a distance is computed between each author profile and the query text. These values are then employed only to rank the possible authors. In this article, we analyze their distribution and show that we can model it as a mixture of 2 Beta distributions. Based on this finding, we demonstrate how we can derive a more accurate probability that the closest author is, in fact, the real author. To evaluate this approach, we have chosen 4 authorship attribution methods (Burrows' Delta, Kullback-Leibler divergence, Labbé's intertextual distance, and the naïve Bayes). As the first test collection, we have downloaded 224 State of the Union addresses (from 1790 to 2014) delivered by 41 U.S. presidents. The second test collection is formed by the Federalist Papers. The evaluations indicate that the accuracy rate of some authorship decisions can be improved. The suggested method can signal that the proposed assignment should be interpreted as possible, without strong certainty. Being able to quantify the certainty associated with an authorship decision can be a useful component when important decisions must be taken.
    Date
    7. 5.2016 21:22:27
    Source
    Journal of the Association for Information Science and Technology. 67(2016) no.6, S.1462-1472
  2. Savoy, J.: Stemming of French words based on grammatical categories (1993) 0.00
    0.002771724 = product of:
      0.005543448 = sum of:
        0.005543448 = product of:
          0.011086896 = sum of:
            0.011086896 = weight(_text_:s in 4650) [ClassicSimilarity], result of:
              0.011086896 = score(doc=4650,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.19219826 = fieldWeight in 4650, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.125 = fieldNorm(doc=4650)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science. 44(1993) no.1, S.1-9
  3. Savoy, J.: Bayesian inference networks and spreading activation in hypertext systems (1992) 0.00
    0.002771724 = product of:
      0.005543448 = sum of:
        0.005543448 = product of:
          0.011086896 = sum of:
            0.011086896 = weight(_text_:s in 192) [ClassicSimilarity], result of:
              0.011086896 = score(doc=192,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.19219826 = fieldWeight in 192, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.125 = fieldNorm(doc=192)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 28(1992), S.389-405
  4. Savoy, J.; Picard, J.: Retrieval effectiveness on the web (2001) 0.00
    0.0024252585 = product of:
      0.004850517 = sum of:
        0.004850517 = product of:
          0.009701034 = sum of:
            0.009701034 = weight(_text_:s in 775) [ClassicSimilarity], result of:
              0.009701034 = score(doc=775,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.16817348 = fieldWeight in 775, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.109375 = fieldNorm(doc=775)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 37(2001) no.4, S.543-569
  5. Savoy, J.; Ndarugendamwo, M.; Vrajitoru, D.: Report on the TREC-4 experiment : combining probabilistic and vector-space schemes (1996) 0.00
    0.0020787928 = product of:
      0.0041575856 = sum of:
        0.0041575856 = product of:
          0.008315171 = sum of:
            0.008315171 = weight(_text_:s in 7574) [ClassicSimilarity], result of:
              0.008315171 = score(doc=7574,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.14414869 = fieldWeight in 7574, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.09375 = fieldNorm(doc=7574)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Pages
    S.537-547
  6. Savoy, J.; Calvé, A. le; Vrajitoru, D.: Report on the TREC5 experiment : data fusion and collection fusion (1997) 0.00
    0.0020787928 = product of:
      0.0041575856 = sum of:
        0.0041575856 = product of:
          0.008315171 = sum of:
            0.008315171 = weight(_text_:s in 3108) [ClassicSimilarity], result of:
              0.008315171 = score(doc=3108,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.14414869 = fieldWeight in 3108, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.09375 = fieldNorm(doc=3108)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Pages
    S.489-502
  7. Savoy, J.: ¬A stemming procedure and stopword list for general French Corpora (1999) 0.00
    0.0020787928 = product of:
      0.0041575856 = sum of:
        0.0041575856 = product of:
          0.008315171 = sum of:
            0.008315171 = weight(_text_:s in 4314) [ClassicSimilarity], result of:
              0.008315171 = score(doc=4314,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.14414869 = fieldWeight in 4314, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4314)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science. 50(1999) no.10, S.944-954
  8. Abdou, S.; Savoy, J.: Searching in Medline : query expansion and manual indexing evaluation (2008) 0.00
    0.0014699287 = product of:
      0.0029398573 = sum of:
        0.0029398573 = product of:
          0.0058797146 = sum of:
            0.0058797146 = weight(_text_:s in 2062) [ClassicSimilarity], result of:
              0.0058797146 = score(doc=2062,freq=4.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.101928525 = fieldWeight in 2062, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2062)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 44(2008) no.2, S.781-789
  9. Savoy, J.; Desbois, D.: Information retrieval in hypertext systems (1991) 0.00
    0.001385862 = product of:
      0.002771724 = sum of:
        0.002771724 = product of:
          0.005543448 = sum of:
            0.005543448 = weight(_text_:s in 4452) [ClassicSimilarity], result of:
              0.005543448 = score(doc=4452,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.09609913 = fieldWeight in 4452, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4452)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Electronic publishing. 4(1991) no.2, S.87-108
  10. Savoy, J.: Effectiveness of information retrieval systems used in a hypertext environment (1993) 0.00
    0.001385862 = product of:
      0.002771724 = sum of:
        0.002771724 = product of:
          0.005543448 = sum of:
            0.005543448 = weight(_text_:s in 6511) [ClassicSimilarity], result of:
              0.005543448 = score(doc=6511,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.09609913 = fieldWeight in 6511, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6511)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Hypermedia. 5(1993) no.1, S.23-46
  11. Savoy, J.: ¬A learning scheme for information retrieval in hypertext (1994) 0.00
    0.001385862 = product of:
      0.002771724 = sum of:
        0.002771724 = product of:
          0.005543448 = sum of:
            0.005543448 = weight(_text_:s in 7292) [ClassicSimilarity], result of:
              0.005543448 = score(doc=7292,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.09609913 = fieldWeight in 7292, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7292)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 30(1994) no.4, S.515-533
  12. Savoy, J.: Searching information in legal hypertext systems (1993/94) 0.00
    0.001385862 = product of:
      0.002771724 = sum of:
        0.002771724 = product of:
          0.005543448 = sum of:
            0.005543448 = weight(_text_:s in 757) [ClassicSimilarity], result of:
              0.005543448 = score(doc=757,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.09609913 = fieldWeight in 757, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.0625 = fieldNorm(doc=757)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Artificial intelligence and law. 2(1993/94) no.3, S.205-232
  13. Savoy, J.: ¬A new probabilistic scheme for information retrieval in hypertext (1995) 0.00
    0.0012126293 = product of:
      0.0024252585 = sum of:
        0.0024252585 = product of:
          0.004850517 = sum of:
            0.004850517 = weight(_text_:s in 7254) [ClassicSimilarity], result of:
              0.004850517 = score(doc=7254,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.08408674 = fieldWeight in 7254, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=7254)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    New review of hypermedia and multimedia. 1995, no.1, S.107-134
  14. Savoy, J.: Bibliographic database access using free-text and controlled vocabulary : an evaluation (2005) 0.00
    0.0012126293 = product of:
      0.0024252585 = sum of:
        0.0024252585 = product of:
          0.004850517 = sum of:
            0.004850517 = weight(_text_:s in 1053) [ClassicSimilarity], result of:
              0.004850517 = score(doc=1053,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.08408674 = fieldWeight in 1053, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1053)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 41(2005) no.4, S.873-890
  15. Dolamic, L.; Savoy, J.: When stopword lists make the difference (2009) 0.00
    0.0012126293 = product of:
      0.0024252585 = sum of:
        0.0024252585 = product of:
          0.004850517 = sum of:
            0.004850517 = weight(_text_:s in 3319) [ClassicSimilarity], result of:
              0.004850517 = score(doc=3319,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.08408674 = fieldWeight in 3319, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3319)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and Technology. 61(2010) no.1, S.200-203
  16. Savoy, J.: Ranking schemes in hybrid Boolean systems : a new approach (1997) 0.00
    0.0010393964 = product of:
      0.0020787928 = sum of:
        0.0020787928 = product of:
          0.0041575856 = sum of:
            0.0041575856 = weight(_text_:s in 393) [ClassicSimilarity], result of:
              0.0041575856 = score(doc=393,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.072074346 = fieldWeight in 393, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.046875 = fieldNorm(doc=393)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science. 48(1997) no.3, S.235-253
  17. Savoy, J.: Searching strategies for the Hungarian language (2008) 0.00
    0.0010393964 = product of:
      0.0020787928 = sum of:
        0.0020787928 = product of:
          0.0041575856 = sum of:
            0.0041575856 = weight(_text_:s in 2037) [ClassicSimilarity], result of:
              0.0041575856 = score(doc=2037,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.072074346 = fieldWeight in 2037, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2037)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 44(2008) no.1, S.310-324
  18. Fautsch, C.; Savoy, J.: Algorithmic stemmers or morphological analysis? : an evaluation (2009) 0.00
    0.0010393964 = product of:
      0.0020787928 = sum of:
        0.0020787928 = product of:
          0.0041575856 = sum of:
            0.0041575856 = weight(_text_:s in 2950) [ClassicSimilarity], result of:
              0.0041575856 = score(doc=2950,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.072074346 = fieldWeight in 2950, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2950)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and Technology. 60(2009) no.8, S.1616-1624
  19. Dolamic, L.; Savoy, J.: Retrieval effectiveness of machine translated queries (2010) 0.00
    0.0010393964 = product of:
      0.0020787928 = sum of:
        0.0020787928 = product of:
          0.0041575856 = sum of:
            0.0041575856 = weight(_text_:s in 4102) [ClassicSimilarity], result of:
              0.0041575856 = score(doc=4102,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.072074346 = fieldWeight in 4102, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4102)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and Technology. 61(2010) no.11, S.2266-2273
  20. Kocher, M.; Savoy, J.: ¬A simple and efficient algorithm for authorship verification (2017) 0.00
    0.0010393964 = product of:
      0.0020787928 = sum of:
        0.0020787928 = product of:
          0.0041575856 = sum of:
            0.0041575856 = weight(_text_:s in 3330) [ClassicSimilarity], result of:
              0.0041575856 = score(doc=3330,freq=2.0), product of:
                0.057684682 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.053056188 = queryNorm
                0.072074346 = fieldWeight in 3330, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3330)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the Association for Information Science and Technology. 68(2017) no.1, S.259-269