Search (4 results, page 1 of 1)

  • × author_ss:"Crestani, F."
  • × year_i:[2000 TO 2010}
  1. Bache, R.; Baillie, M.; Crestani, F.: Measuring the likelihood property of scoring functions in general retrieval models (2009) 0.01
    0.014097675 = product of:
      0.042293023 = sum of:
        0.042293023 = product of:
          0.12687907 = sum of:
            0.12687907 = weight(_text_:objects in 2860) [ClassicSimilarity], result of:
              0.12687907 = score(doc=2860,freq=2.0), product of:
                0.3086582 = queryWeight, product of:
                  5.315071 = idf(docFreq=590, maxDocs=44218)
                  0.05807226 = queryNorm
                0.41106653 = fieldWeight in 2860, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.315071 = idf(docFreq=590, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2860)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Abstract
    Although retrieval systems based on probabilistic models will rank the objects (e.g., documents) being retrieved according to the probability of some matching criterion (e.g., relevance), they rarely yield an actual probability, and the scoring function is interpreted to be purely ordinal within a given retrieval task. In this brief communication, it is shown that some scoring functions possess the likelihood property, which means that the scoring function indicates the likelihood of matching when compared to other retrieval tasks, which is potentially more useful than pure ranking although it cannot be interpreted as an actual probability. This property can be detected by using two modified effectiveness measures: entire precision and entire recall.
  2. Crestani, F.; Dominich, S.; Lalmas, M.; Rijsbergen, C.J.K. van: Mathematical, logical, and formal methods in information retrieval : an introduction to the special issue (2003) 0.01
    0.00786799 = product of:
      0.023603968 = sum of:
        0.023603968 = product of:
          0.047207937 = sum of:
            0.047207937 = weight(_text_:22 in 1451) [ClassicSimilarity], result of:
              0.047207937 = score(doc=1451,freq=2.0), product of:
                0.20335917 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05807226 = queryNorm
                0.23214069 = fieldWeight in 1451, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1451)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    22. 3.2003 19:27:36
  3. Crestani, F.; Du, H.: Written versus spoken queries : a qualitative and quantitative comparative analysis (2006) 0.01
    0.00786799 = product of:
      0.023603968 = sum of:
        0.023603968 = product of:
          0.047207937 = sum of:
            0.047207937 = weight(_text_:22 in 5047) [ClassicSimilarity], result of:
              0.047207937 = score(doc=5047,freq=2.0), product of:
                0.20335917 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05807226 = queryNorm
                0.23214069 = fieldWeight in 5047, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5047)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    5. 6.2006 11:22:23
  4. Simeoni, F.; Yakici, M.; Neely, S.; Crestani, F.: Metadata harvesting for content-based distributed information retrieval (2008) 0.01
    0.007834411 = product of:
      0.023503233 = sum of:
        0.023503233 = product of:
          0.047006465 = sum of:
            0.047006465 = weight(_text_:indexing in 1336) [ClassicSimilarity], result of:
              0.047006465 = score(doc=1336,freq=2.0), product of:
                0.22229293 = queryWeight, product of:
                  3.8278677 = idf(docFreq=2614, maxDocs=44218)
                  0.05807226 = queryNorm
                0.21146181 = fieldWeight in 1336, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.8278677 = idf(docFreq=2614, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1336)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    We propose an approach to content-based Distributed Information Retrieval based on the periodic and incremental centralization of full-content indices of widely dispersed and autonomously managed document sources. Inspired by the success of the Open Archive Initiative's (OAI) Protocol for metadata harvesting, the approach occupies middle ground between content crawling and distributed retrieval. As in crawling, some data move toward the retrieval process, but it is statistics about the content rather than content itself; this grants more efficient use of network resources and wider scope of application. As in distributed retrieval, some processing is distributed along with the data, but it is indexing rather than retrieval; this reduces the costs of content provision while promoting the simplicity, effectiveness, and responsiveness of retrieval. Overall, we argue that the approach retains the good properties of centralized retrieval without renouncing to cost-effective, large-scale resource pooling. We discuss the requirements associated with the approach and identify two strategies to deploy it on top of the OAI infrastructure. In particular, we define a minimal extension of the OAI protocol which supports the coordinated harvesting of full-content indices and descriptive metadata for content resources. Finally, we report on the implementation of a proof-of-concept prototype service for multimodel content-based retrieval of distributed file collections.