Search (10 results, page 1 of 1)

Crestani, F.; Wu, S.: Testing the cluster hypothesis in distributed information retrieval (2006) 0.04
```
0.035790663 = product of:
  0.053685993 = sum of:
    0.037639882 = weight(_text_:resources in 984) [ClassicSimilarity], result of:
      0.037639882 = score(doc=984,freq=2.0), product of:
        0.18665522 = queryWeight, product of:
          3.650338 = idf(docFreq=3122, maxDocs=44218)
          0.051133685 = queryNorm
        0.20165458 = fieldWeight in 984, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.650338 = idf(docFreq=3122, maxDocs=44218)
          0.0390625 = fieldNorm(doc=984)
    0.016046109 = product of:
      0.032092217 = sum of:
        0.032092217 = weight(_text_:management in 984) [ClassicSimilarity], result of:
          0.032092217 = score(doc=984,freq=2.0), product of:
            0.17235184 = queryWeight, product of:
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.051133685 = queryNorm
            0.18620178 = fieldWeight in 984, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.0390625 = fieldNorm(doc=984)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
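The explain tree above can be checked by hand. The following is a minimal sketch, not the search engine's own code: it copies the leaf values (idf, queryNorm, fieldNorm, termFreq) from the output for doc 984 and multiplies them back up the tree. The function name `term_weight` and the variable names are illustrative assumptions.

```python
import math

# Leaf values copied from the explain output for doc 984.
QUERY_NORM = 0.051133685
FIELD_NORM = 0.0390625


def term_weight(freq: float, idf: float, query_norm: float, field_norm: float) -> float:
    """Return queryWeight * fieldWeight for a single query term."""
    query_weight = idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf = sqrt(termFreq)
    return query_weight * field_weight


resources = term_weight(2.0, 3.650338, QUERY_NORM, FIELD_NORM)
management = term_weight(2.0, 3.3706124, QUERY_NORM, FIELD_NORM)

# "management" sits in a nested clause where 1 of 2 sub-clauses matched.
inner = management * 0.5  # coord(1/2)

# 2 of the 3 top-level clauses matched.
score = (resources + inner) * (2 / 3)  # coord(2/3)

print(f"{score:.9f}")
```

Up to rounding (the engine works in 32-bit floats), `score` agrees with the 0.035790663 shown at the top of the tree, which the result list rounds to 0.04.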
Abstract

How to merge and organise query results retrieved from different resources is one of the key issues in distributed information retrieval. Some previous research and experiments suggest that cluster-based document browsing is more effective than a single merged list. Cluster-based retrieval results presentation is based on the cluster hypothesis, which states that documents that cluster together have a similar relevance to a given query. However, while this hypothesis has been demonstrated to hold in classical information retrieval environments, it has never been fully tested in heterogeneous distributed information retrieval environments. Heterogeneous document representations, the presence of document duplicates, and disparate qualities of retrieval results are major features of a heterogeneous distributed information retrieval environment that might disrupt the effectiveness of the cluster hypothesis. In this paper we report on an experimental investigation into the validity and effectiveness of the cluster hypothesis in highly heterogeneous distributed information retrieval environments. The results show that although clustering is affected by different retrieval results representations and quality, the cluster hypothesis still holds and that generating hierarchical clusters in highly heterogeneous distributed information retrieval environments is still a very effective way of presenting retrieval results to users.

Source

Information processing and management. 42(2006) no.5, S.1137-1150

Simeoni, F.; Yakici, M.; Neely, S.; Crestani, F.: Metadata harvesting for content-based distributed information retrieval (2008) 0.02
```
0.017743612 = product of:
  0.053230833 = sum of:
    0.053230833 = weight(_text_:resources in 1336) [ClassicSimilarity], result of:
      0.053230833 = score(doc=1336,freq=4.0), product of:
        0.18665522 = queryWeight, product of:
          3.650338 = idf(docFreq=3122, maxDocs=44218)
          0.051133685 = queryNorm
        0.28518265 = fieldWeight in 1336, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.650338 = idf(docFreq=3122, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1336)
  0.33333334 = coord(1/3)
```
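The tf and idf factors in this tree are consistent with the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The sketch below assumes that variant, since it reproduces the printed numbers; other similarity versions may use a slightly different idf. queryNorm and fieldNorm are copied from the output rather than derived, and the helper names are hypothetical.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """idf variant that matches the printed values (an assumption)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


QUERY_NORM = 0.051133685  # copied from the explain output
FIELD_NORM = 0.0390625    # fieldNorm(doc=1336), copied from the explain output

idf = classic_idf(3122, 44218)                       # term "resources"
field_weight = classic_tf(4.0) * idf * FIELD_NORM    # tf(freq=4.0) = 2.0
query_weight = idf * QUERY_NORM

# Only 1 of the 3 query clauses matched doc 1336.
score_1336 = query_weight * field_weight * (1 / 3)   # coord(1/3)

print(f"idf={idf:.6f} fieldWeight={field_weight:.8f} score={score_1336:.9f}")
```

Doubling the term frequency from 2 to 4 raises tf only from about 1.41 to 2.0, which is why doc 1336 still ranks below doc 984: it matches one clause instead of two, and coord(1/3) outweighs the higher frequency.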
Abstract

We propose an approach to content-based Distributed Information Retrieval based on the periodic and incremental centralization of full-content indices of widely dispersed and autonomously managed document sources. Inspired by the success of the Open Archives Initiative's (OAI) Protocol for Metadata Harvesting, the approach occupies middle ground between content crawling and distributed retrieval. As in crawling, some data move toward the retrieval process, but it is statistics about the content rather than content itself; this grants more efficient use of network resources and wider scope of application. As in distributed retrieval, some processing is distributed along with the data, but it is indexing rather than retrieval; this reduces the costs of content provision while promoting the simplicity, effectiveness, and responsiveness of retrieval. Overall, we argue that the approach retains the good properties of centralized retrieval without renouncing cost-effective, large-scale resource pooling. We discuss the requirements associated with the approach and identify two strategies to deploy it on top of the OAI infrastructure. In particular, we define a minimal extension of the OAI protocol which supports the coordinated harvesting of full-content indices and descriptive metadata for content resources. Finally, we report on the implementation of a proof-of-concept prototype service for multimodel content-based retrieval of distributed file collections.

Crestani, F.; Lee, P.L.: Searching the web by constraining spreading activities (2000) 0.01

0.014976369 = product of:
  0.044929106 = sum of:
    0.044929106 = product of:
      0.08985821 = sum of:
        0.08985821 = weight(_text_:management in 1326) [ClassicSimilarity], result of:
          0.08985821 = score(doc=1326,freq=2.0), product of:
            0.17235184 = queryWeight, product of:
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.051133685 = queryNorm
            0.521365 = fieldWeight in 1326, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.109375 = fieldNorm(doc=1326)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Information processing and management. 36(2000) no.4, S.585-605

Agosti, M.; Crestani, F.; Melucci, M.: Design and implementation of a tool for the automatic construction of hypertexts for information retrieval (1996) 0.01

0.0074881846 = product of:
  0.022464553 = sum of:
    0.022464553 = product of:
      0.044929106 = sum of:
        0.044929106 = weight(_text_:management in 5571) [ClassicSimilarity], result of:
          0.044929106 = score(doc=5571,freq=2.0), product of:
            0.17235184 = queryWeight, product of:
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.051133685 = queryNorm
            0.2606825 = fieldWeight in 5571, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5571)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Information processing and management. 32(1996) no.4, S.459-476

Agosti, M.; Crestani, F.; Melucci, M.: On the use of information retrieval techniques for the automatic construction of hypertext (1997) 0.01

0.0074881846 = product of:
  0.022464553 = sum of:
    0.022464553 = product of:
      0.044929106 = sum of:
        0.044929106 = weight(_text_:management in 150) [ClassicSimilarity], result of:
          0.044929106 = score(doc=150,freq=2.0), product of:
            0.17235184 = queryWeight, product of:
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.051133685 = queryNorm
            0.2606825 = fieldWeight in 150, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.0546875 = fieldNorm(doc=150)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Information processing and management. 33(1997) no.2, S.133-144

Crestani, F.; Rijsbergen, C.J. van: Information retrieval by imaging (1996) 0.01

0.0069279084 = product of:
  0.020783724 = sum of:
    0.020783724 = product of:
      0.04156745 = sum of:
        0.04156745 = weight(_text_:22 in 6967) [ClassicSimilarity], result of:
          0.04156745 = score(doc=6967,freq=2.0), product of:
            0.17906146 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051133685 = queryNorm
            0.23214069 = fieldWeight in 6967, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=6967)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Information retrieval: new systems and current research. Proceedings of the 16th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Drymen, Scotland, 22-23 Mar 94. Ed.: R. Leon

Crestani, F.; Dominich, S.; Lalmas, M.; Rijsbergen, C.J.K. van: Mathematical, logical, and formal methods in information retrieval : an introduction to the special issue (2003) 0.01

0.0069279084 = product of:
  0.020783724 = sum of:
    0.020783724 = product of:
      0.04156745 = sum of:
        0.04156745 = weight(_text_:22 in 1451) [ClassicSimilarity], result of:
          0.04156745 = score(doc=1451,freq=2.0), product of:
            0.17906146 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051133685 = queryNorm
            0.23214069 = fieldWeight in 1451, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=1451)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 22. 3.2003 19:27:36

Crestani, F.; Du, H.: Written versus spoken queries : a qualitative and quantitative comparative analysis (2006) 0.01

0.0069279084 = product of:
  0.020783724 = sum of:
    0.020783724 = product of:
      0.04156745 = sum of:
        0.04156745 = weight(_text_:22 in 5047) [ClassicSimilarity], result of:
          0.04156745 = score(doc=5047,freq=2.0), product of:
            0.17906146 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.051133685 = queryNorm
            0.23214069 = fieldWeight in 5047, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=5047)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 5. 6.2006 11:22:23

Crestani, F.; Vegas, J.; Fuente, P. de la: A graphical user interface for the retrieval of hierarchically structured documents (2004) 0.01

0.0064184438 = product of:
  0.01925533 = sum of:
    0.01925533 = product of:
      0.03851066 = sum of:
        0.03851066 = weight(_text_:management in 2555) [ClassicSimilarity], result of:
          0.03851066 = score(doc=2555,freq=2.0), product of:
            0.17235184 = queryWeight, product of:
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.051133685 = queryNorm
            0.22344214 = fieldWeight in 2555, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.046875 = fieldNorm(doc=2555)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Information processing and management. 40(2004) no.2, S.269-289

Sweeney, S.; Crestani, F.; Losada, D.E.: 'Show me more' : incremental length summarisation using novelty detection (2008) 0.01

0.005348703 = product of:
  0.016046109 = sum of:
    0.016046109 = product of:
      0.032092217 = sum of:
        0.032092217 = weight(_text_:management in 2054) [ClassicSimilarity], result of:
          0.032092217 = score(doc=2054,freq=2.0), product of:
            0.17235184 = queryWeight, product of:
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.051133685 = queryNorm
            0.18620178 = fieldWeight in 2054, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.3706124 = idf(docFreq=4130, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2054)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Information processing and management. 44(2008) no.2, S.663-686
