Search (5 results, page 1 of 1)

  • × author_ss:"Savoy, J."
  • × language_ss:"e"
  • × year_i:[1990 TO 2000}
  1. Savoy, J.: ¬A new probabilistic scheme for information retrieval in hypertext (1995) 0.06
    0.060013108 = product of:
      0.120026216 = sum of:
        0.00823978 = product of:
          0.03295912 = sum of:
            0.03295912 = weight(_text_:based in 7254) [ClassicSimilarity], result of:
              0.03295912 = score(doc=7254,freq=2.0), product of:
                0.14144066 = queryWeight, product of:
                  3.0129938 = idf(docFreq=5906, maxDocs=44218)
                  0.04694356 = queryNorm
                0.23302436 = fieldWeight in 7254, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0129938 = idf(docFreq=5906, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=7254)
          0.25 = coord(1/4)
        0.11178643 = weight(_text_:term in 7254) [ClassicSimilarity], result of:
          0.11178643 = score(doc=7254,freq=4.0), product of:
            0.21904005 = queryWeight, product of:
              4.66603 = idf(docFreq=1130, maxDocs=44218)
              0.04694356 = queryNorm
            0.510347 = fieldWeight in 7254, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.66603 = idf(docFreq=1130, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7254)
      0.5 = coord(2/4)
    
    Abstract
    The aim of probabilistic models is to define a retrieval strategy within which documents can be optimally ranked according to their relevance probability with respect to a given request. Presents a study which suggests representing documents not only by index term vendors, as proposed by previous probabilistic models but also by considering relevance hypertext links. To enhance retrieval effectiveness, the learning retrieval scheme should modify the weight assigned to each indexing terms, the importance attached to each search term, and the relationships between documents. Evaluation of the proposed retrieval scheme with a hypertext based on the CACM test collection which includes 3.204 documents and the CISI corpus (1,460 documents), yields interesting results on the retrieval effectiveness of this approach
  2. Savoy, J.: Stemming of French words based on grammatical categories (1993) 0.00
    0.0047084456 = product of:
      0.018833783 = sum of:
        0.018833783 = product of:
          0.07533513 = sum of:
            0.07533513 = weight(_text_:based in 4650) [ClassicSimilarity], result of:
              0.07533513 = score(doc=4650,freq=2.0), product of:
                0.14144066 = queryWeight, product of:
                  3.0129938 = idf(docFreq=5906, maxDocs=44218)
                  0.04694356 = queryNorm
                0.5326271 = fieldWeight in 4650, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0129938 = idf(docFreq=5906, maxDocs=44218)
                  0.125 = fieldNorm(doc=4650)
          0.25 = coord(1/4)
      0.25 = coord(1/4)
    
  3. Savoy, J.: ¬An extended vector-processing scheme for searching information in hypertext systems (1996) 0.00
    0.0032901266 = product of:
      0.013160506 = sum of:
        0.013160506 = product of:
          0.052642025 = sum of:
            0.052642025 = weight(_text_:based in 4036) [ClassicSimilarity], result of:
              0.052642025 = score(doc=4036,freq=10.0), product of:
                0.14144066 = queryWeight, product of:
                  3.0129938 = idf(docFreq=5906, maxDocs=44218)
                  0.04694356 = queryNorm
                0.37218451 = fieldWeight in 4036, product of:
                  3.1622777 = tf(freq=10.0), with freq of:
                    10.0 = termFreq=10.0
                  3.0129938 = idf(docFreq=5906, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4036)
          0.25 = coord(1/4)
      0.25 = coord(1/4)
    
    Abstract
    When searching information in a hypertext is limited to navigation, it is not an easy task, especially when the number of nodes and/or links becomes very large. A query based access mechanism must therefore be provided to complement the navigational tools inherent in hypertext systems. Most mechanisms currently proposed are based on conventional information retrieval models which consider documents as indepent entities, and ignore hypertext links. To promote the use of other information retrieval mechnaisms adapted to hypertext systems, responds to the following questions; how can we integrate information given by hypertext links into an information retrieval scheme; are these hypertext links (and link semantics) clues to the enhancement of retrieval effectiveness; if so, how can we use them. 2 solutions are: using a default weight function based on link tape or assigning the same strength to all link types; or using a specific weight for each particular link, i.e. the level of association or a similarity measure. Proposes an extended vector processing scheme which extracts additional information from hypertext links to enhance retrieval effectiveness. A hypertext based on 2 medium size collections, the CACM and the CISI collection has been built. The hypergraph is composed of explicit links (bibliographic references), computed links based on bibliographic information, or on hypertext links established according to document representatives (nearest neighbour)
  4. Savoy, J.: Effectiveness of information retrieval systems used in a hypertext environment (1993) 0.00
    0.0023542228 = product of:
      0.009416891 = sum of:
        0.009416891 = product of:
          0.037667565 = sum of:
            0.037667565 = weight(_text_:based in 6511) [ClassicSimilarity], result of:
              0.037667565 = score(doc=6511,freq=2.0), product of:
                0.14144066 = queryWeight, product of:
                  3.0129938 = idf(docFreq=5906, maxDocs=44218)
                  0.04694356 = queryNorm
                0.26631355 = fieldWeight in 6511, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0129938 = idf(docFreq=5906, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6511)
          0.25 = coord(1/4)
      0.25 = coord(1/4)
    
    Abstract
    In most hypertext systems, information retrieval techniques emphasize browsing or navigational methods which are not thorough enough to find all relevant material, especially when the number of nodes and/or links becomes very large. Reviews the main query-based search techniques currently used in hypertext environments. Explains the experimental methodology. Concentrates on the retrieval effectiveness of these retrieval strategies. Considers ways of improving search effectiveness
  5. Savoy, J.: Ranking schemes in hybrid Boolean systems : a new approach (1997) 0.00
    0.0017656671 = product of:
      0.0070626684 = sum of:
        0.0070626684 = product of:
          0.028250674 = sum of:
            0.028250674 = weight(_text_:based in 393) [ClassicSimilarity], result of:
              0.028250674 = score(doc=393,freq=2.0), product of:
                0.14144066 = queryWeight, product of:
                  3.0129938 = idf(docFreq=5906, maxDocs=44218)
                  0.04694356 = queryNorm
                0.19973516 = fieldWeight in 393, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0129938 = idf(docFreq=5906, maxDocs=44218)
                  0.046875 = fieldNorm(doc=393)
          0.25 = coord(1/4)
      0.25 = coord(1/4)
    
    Abstract
    In most commercial online systems, the retrieval system is based on the Boolean model and its inverted file organization. Since the investment in these systems is so great and changing them could be economically unfeasible, this article suggests a new ranking scheme especially adapted for hypertext environments in order to produce more effective retrieval results and yet maintain the effectiveness of the investment made to date in the Boolean model. To select the retrieved documents, the suggested ranking strategy uses multiple sources of document content evidence. The proposed scheme integrates both the information provided by the index and query terms, and the inherent relationships between documents such as bibliographic references or hypertext links. We will demonstrate that our scheme represents an integration of both subject and citation indexing, and results in a significant imporvement over classical ranking schemes uses in hybrid Boolean systems, while preserving its efficiency. Moreover, through knowing the nearest neighbor and the hypertext links which constitute additional sources of evidence, our strategy will take them into account in order to further improve retrieval effectiveness and to provide 'good' starting points for browsing in a hypertext or hypermedia environement