Search (1 results, page 1 of 1)

Did you mean:
rswk_00%3a%22Suchmaschine %2f information retrieval %2f mathematisches modell %28HEBIS%29%22 1
rswk_00%3a%22Suchmaschine %2f information retrieval %2f mathematischen modell %28HEBIS%29%22 1
rswk_00%3a%22Suchmaschine %2f information retrieval %2f mathematisches models %28HEBIS%29%22 1
rswk_00%3a%22Suchmaschine %2f information retrieval %2f mathematischer modell %28HEBIS%29%22 1
rswk_00%3a%22Suchmaschine %2f information retrieval %2f mathematisches modelle %28HEBIS%29%22 1

Airio, E.; Kettunen, K.: Does dictionary based bilingual retrieval work in a non-normalized index? (2009) 0.01
```
0.012762025 = product of:
  0.0510481 = sum of:
    0.0073573855 = weight(_text_:information in 4224) [ClassicSimilarity], result of:
      0.0073573855 = score(doc=4224,freq=2.0), product of:
        0.06322253 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.036014426 = queryNorm
        0.116372846 = fieldWeight in 4224, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=4224)
    0.043690715 = weight(_text_:retrieval in 4224) [ClassicSimilarity], result of:
      0.043690715 = score(doc=4224,freq=8.0), product of:
        0.10894058 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.036014426 = queryNorm
        0.40105087 = fieldWeight in 4224, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=4224)
  0.25 = coord(2/8)
```
Abstract

Many operational IR indexes are non-normalized, i.e. no lemmatization or stemming techniques, etc. have been employed in indexing. This poses a challenge for dictionary-based cross-language retrieval (CLIR), because translations are mostly lemmas. In this study, we face the challenge of dictionary-based CLIR in a non-normalized index. We test two optional approaches: FCG (Frequent Case Generation) and s-gramming. The idea of FCG is to automatically generate the most frequent inflected forms for a given lemma. FCG has been tested in monolingual retrieval and has been shown to be a good method for inflected retrieval, especially for highly inflected languages. S-gramming is an approximate string matching technique (an extension of n-gramming). The language pairs in our tests were English-Finnish, English-Swedish, Swedish-Finnish and Finnish-Swedish. Both our approaches performed quite well, but the results varied depending on the language pair. S-gramming and FCG performed quite equally in all the other language pairs except Finnish-Swedish, where s-gramming outperformed FCG.

Source

Information processing and management. 45(2009) no.6, S.703-713