Search (2 results, page 1 of 1)

Fujita, S.: NTCIR-2 as a Rosetta stone in laboratory experiments of IR systems (2005) 0.00
```
0.003462655 = product of:
  0.010387965 = sum of:
    0.010387965 = weight(_text_:a in 1017) [ClassicSimilarity], result of:
      0.010387965 = score(doc=1017,freq=10.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.19940455 = fieldWeight in 1017, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1017)
  0.33333334 = coord(1/3)
```
Abstract

This paper presents a laboratory based evaluation study of cross-language information retrieval technologies, utilizing partially parallel test collections, NTCIR-2 (used together with NTCIR-1), where Japanese-English parallel document collections, parallel topic sets and their relevance judgments are available. These enable us to observe and compare monolingual retrieval processes in two languages as well as retrieval across languages. Our experiments focused on (1) the Rosetta stone question (whether a partially parallel collection helps in cross-language information access or not?) and (2) two aspects of retrieval difficulties namely "collection discrepancy" and "query discrepancy". Japanese and English monolingual retrieval systems are combined by dictionary based query translation modules so that a symmetrical bilingual evaluation environment is implemented.

Type

a
Fujita, S.: Technology survey and invalidity search : a comparative study of different tasks for Japanese patent document retrieval (2007) 0.00
```
0.0030970925 = product of:
  0.009291277 = sum of:
    0.009291277 = weight(_text_:a in 918) [ClassicSimilarity], result of:
      0.009291277 = score(doc=918,freq=8.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.17835285 = fieldWeight in 918, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0546875 = fieldNorm(doc=918)
  0.33333334 = coord(1/3)
```
Abstract

A comparative study of two types of patent retrieval tasks, technology survey and invalidity search, using the NTCIR-3 and -4 test collections is described, with a focus on pseudo-feedback effectiveness and different retrieval models. Invalidity searches are peculiar to patent retrieval tasks and feature small numbers of relevant documents and long queries. Different behaviors of effectiveness are observed when applying different retrieval models and pseudo-feedback. These different behaviors are analyzed in terms of the "weak cluster hypothesis", i.e., terminological cohesiveness through relevant documents.

Type

a