Search (27 results, page 1 of 2)

  • × author_ss:"Savoy, J."
  1. Savoy, J.: Estimating the probability of an authorship attribution (2016) 0.08
    0.079768166 = sum of:
      0.0150408745 = product of:
        0.060163498 = sum of:
          0.060163498 = weight(_text_:authors in 2937) [ClassicSimilarity], result of:
            0.060163498 = score(doc=2937,freq=2.0), product of:
              0.2388945 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.052402776 = queryNorm
              0.25184128 = fieldWeight in 2937, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2937)
        0.25 = coord(1/4)
      0.06472729 = sum of:
        0.029228024 = weight(_text_:j in 2937) [ClassicSimilarity], result of:
          0.029228024 = score(doc=2937,freq=2.0), product of:
            0.16650963 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.052402776 = queryNorm
            0.17553353 = fieldWeight in 2937, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2937)
        0.035499264 = weight(_text_:22 in 2937) [ClassicSimilarity], result of:
          0.035499264 = score(doc=2937,freq=2.0), product of:
            0.1835056 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052402776 = queryNorm
            0.19345059 = fieldWeight in 2937, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2937)
    
    Abstract
    In authorship attribution, various distance-based metrics have been proposed to determine the most probable author of a disputed text. In this paradigm, a distance is computed between each author profile and the query text. These values are then employed only to rank the possible authors. In this article, we analyze their distribution and show that we can model it as a mixture of 2 Beta distributions. Based on this finding, we demonstrate how we can derive a more accurate probability that the closest author is, in fact, the real author. To evaluate this approach, we have chosen 4 authorship attribution methods (Burrows' Delta, Kullback-Leibler divergence, Labbé's intertextual distance, and the naïve Bayes). As the first test collection, we have downloaded 224 State of the Union addresses (from 1790 to 2014) delivered by 41 U.S. presidents. The second test collection is formed by the Federalist Papers. The evaluations indicate that the accuracy rate of some authorship decisions can be improved. The suggested method can signal that the proposed assignment should be interpreted as possible, without strong certainty. Being able to quantify the certainty associated with an authorship decision can be a useful component when important decisions must be taken.
    Date
    7. 5.2016 21:22:27
  2. Savoy, J.; Picard, J.: Retrieval effectiveness on the web (2001) 0.03
    0.028934268 = product of:
      0.057868537 = sum of:
        0.057868537 = product of:
          0.11573707 = sum of:
            0.11573707 = weight(_text_:j in 775) [ClassicSimilarity], result of:
              0.11573707 = score(doc=775,freq=4.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.69507736 = fieldWeight in 775, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.109375 = fieldNorm(doc=775)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  3. Savoy, J.: Stemming of French words based on grammatical categories (1993) 0.02
    0.02338242 = product of:
      0.04676484 = sum of:
        0.04676484 = product of:
          0.09352968 = sum of:
            0.09352968 = weight(_text_:j in 4650) [ClassicSimilarity], result of:
              0.09352968 = score(doc=4650,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.5617073 = fieldWeight in 4650, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.125 = fieldNorm(doc=4650)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  4. Savoy, J.: Bayesian inference networks and spreading activation in hypertext systems (1992) 0.02
    0.02338242 = product of:
      0.04676484 = sum of:
        0.04676484 = product of:
          0.09352968 = sum of:
            0.09352968 = weight(_text_:j in 192) [ClassicSimilarity], result of:
              0.09352968 = score(doc=192,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.5617073 = fieldWeight in 192, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.125 = fieldNorm(doc=192)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  5. Savoy, J.; Ndarugendamwo, M.; Vrajitoru, D.: Report on the TREC-4 experiment : combining probabilistic and vector-space schemes (1996) 0.02
    0.017536815 = product of:
      0.03507363 = sum of:
        0.03507363 = product of:
          0.07014726 = sum of:
            0.07014726 = weight(_text_:j in 7574) [ClassicSimilarity], result of:
              0.07014726 = score(doc=7574,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.4212805 = fieldWeight in 7574, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.09375 = fieldNorm(doc=7574)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  6. Savoy, J.; Calvé, A. le; Vrajitoru, D.: Report on the TREC5 experiment : data fusion and collection fusion (1997) 0.02
    0.017536815 = product of:
      0.03507363 = sum of:
        0.03507363 = product of:
          0.07014726 = sum of:
            0.07014726 = weight(_text_:j in 3108) [ClassicSimilarity], result of:
              0.07014726 = score(doc=3108,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.4212805 = fieldWeight in 3108, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.09375 = fieldNorm(doc=3108)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  7. Savoy, J.: ¬A stemming procedure and stopword list for general French Corpora (1999) 0.02
    0.017536815 = product of:
      0.03507363 = sum of:
        0.03507363 = product of:
          0.07014726 = sum of:
            0.07014726 = weight(_text_:j in 4314) [ClassicSimilarity], result of:
              0.07014726 = score(doc=4314,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.4212805 = fieldWeight in 4314, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4314)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  8. Savoy, J.; Desbois, D.: Information retrieval in hypertext systems (1991) 0.01
    0.01169121 = product of:
      0.02338242 = sum of:
        0.02338242 = product of:
          0.04676484 = sum of:
            0.04676484 = weight(_text_:j in 4452) [ClassicSimilarity], result of:
              0.04676484 = score(doc=4452,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.28085366 = fieldWeight in 4452, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4452)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  9. Savoy, J.: Effectiveness of information retrieval systems used in a hypertext environment (1993) 0.01
    0.01169121 = product of:
      0.02338242 = sum of:
        0.02338242 = product of:
          0.04676484 = sum of:
            0.04676484 = weight(_text_:j in 6511) [ClassicSimilarity], result of:
              0.04676484 = score(doc=6511,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.28085366 = fieldWeight in 6511, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6511)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  10. Savoy, J.: ¬A learning scheme for information retrieval in hypertext (1994) 0.01
    0.01169121 = product of:
      0.02338242 = sum of:
        0.02338242 = product of:
          0.04676484 = sum of:
            0.04676484 = weight(_text_:j in 7292) [ClassicSimilarity], result of:
              0.04676484 = score(doc=7292,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.28085366 = fieldWeight in 7292, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7292)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  11. Savoy, J.: Searching information in legal hypertext systems (1993/94) 0.01
    0.01169121 = product of:
      0.02338242 = sum of:
        0.02338242 = product of:
          0.04676484 = sum of:
            0.04676484 = weight(_text_:j in 757) [ClassicSimilarity], result of:
              0.04676484 = score(doc=757,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.28085366 = fieldWeight in 757, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0625 = fieldNorm(doc=757)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  12. Picard, J.; Savoy, J.: Enhancing retrieval with hyperlinks : a general model based on propositional argumentation systems (2003) 0.01
    0.010333667 = product of:
      0.020667333 = sum of:
        0.020667333 = product of:
          0.041334666 = sum of:
            0.041334666 = weight(_text_:j in 1427) [ClassicSimilarity], result of:
              0.041334666 = score(doc=1427,freq=4.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.2482419 = fieldWeight in 1427, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1427)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  13. Savoy, J.: ¬A new probabilistic scheme for information retrieval in hypertext (1995) 0.01
    0.010229808 = product of:
      0.020459617 = sum of:
        0.020459617 = product of:
          0.040919233 = sum of:
            0.040919233 = weight(_text_:j in 7254) [ClassicSimilarity], result of:
              0.040919233 = score(doc=7254,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.24574696 = fieldWeight in 7254, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=7254)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  14. Savoy, J.: Bibliographic database access using free-text and controlled vocabulary : an evaluation (2005) 0.01
    0.010229808 = product of:
      0.020459617 = sum of:
        0.020459617 = product of:
          0.040919233 = sum of:
            0.040919233 = weight(_text_:j in 1053) [ClassicSimilarity], result of:
              0.040919233 = score(doc=1053,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.24574696 = fieldWeight in 1053, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1053)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  15. Dolamic, L.; Savoy, J.: When stopword lists make the difference (2009) 0.01
    0.010229808 = product of:
      0.020459617 = sum of:
        0.020459617 = product of:
          0.040919233 = sum of:
            0.040919233 = weight(_text_:j in 3319) [ClassicSimilarity], result of:
              0.040919233 = score(doc=3319,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.24574696 = fieldWeight in 3319, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3319)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  16. Savoy, J.: Ranking schemes in hybrid Boolean systems : a new approach (1997) 0.01
    0.008768408 = product of:
      0.017536815 = sum of:
        0.017536815 = product of:
          0.03507363 = sum of:
            0.03507363 = weight(_text_:j in 393) [ClassicSimilarity], result of:
              0.03507363 = score(doc=393,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.21064025 = fieldWeight in 393, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.046875 = fieldNorm(doc=393)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  17. Savoy, J.: Searching strategies for the Hungarian language (2008) 0.01
    0.008768408 = product of:
      0.017536815 = sum of:
        0.017536815 = product of:
          0.03507363 = sum of:
            0.03507363 = weight(_text_:j in 2037) [ClassicSimilarity], result of:
              0.03507363 = score(doc=2037,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.21064025 = fieldWeight in 2037, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2037)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  18. Abdou, S.; Savoy, J.: Searching in Medline : query expansion and manual indexing evaluation (2008) 0.01
    0.008768408 = product of:
      0.017536815 = sum of:
        0.017536815 = product of:
          0.03507363 = sum of:
            0.03507363 = weight(_text_:j in 2062) [ClassicSimilarity], result of:
              0.03507363 = score(doc=2062,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.21064025 = fieldWeight in 2062, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2062)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  19. Fautsch, C.; Savoy, J.: Algorithmic stemmers or morphological analysis? : an evaluation (2009) 0.01
    0.008768408 = product of:
      0.017536815 = sum of:
        0.017536815 = product of:
          0.03507363 = sum of:
            0.03507363 = weight(_text_:j in 2950) [ClassicSimilarity], result of:
              0.03507363 = score(doc=2950,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.21064025 = fieldWeight in 2950, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2950)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  20. Dolamic, L.; Savoy, J.: Retrieval effectiveness of machine translated queries (2010) 0.01
    0.008768408 = product of:
      0.017536815 = sum of:
        0.017536815 = product of:
          0.03507363 = sum of:
            0.03507363 = weight(_text_:j in 4102) [ClassicSimilarity], result of:
              0.03507363 = score(doc=4102,freq=2.0), product of:
                0.16650963 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.052402776 = queryNorm
                0.21064025 = fieldWeight in 4102, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4102)
          0.5 = coord(1/2)
      0.5 = coord(1/2)