Search (5 results, page 1 of 1)

  • × theme_ss:"Multilinguale Probleme"
  • × author_ss:"Järvelin, K."
  1. Lehtokangas, R.; Keskustalo, H.; Järvelin, K.: Experiments with transitive dictionary translation and pseudo-relevance feedback using graded relevance assessments (2008) 0.01
    0.011332112 = sum of:
      0.009680318 = product of:
        0.058081906 = sum of:
          0.058081906 = weight(_text_:authors in 1349) [ClassicSimilarity], result of:
            0.058081906 = score(doc=1349,freq=2.0), product of:
              0.19219086 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.042158082 = queryNorm
              0.30220953 = fieldWeight in 1349, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.046875 = fieldNorm(doc=1349)
        0.16666667 = coord(1/6)
      0.0016517945 = product of:
        0.003303589 = sum of:
          0.003303589 = weight(_text_:s in 1349) [ClassicSimilarity], result of:
            0.003303589 = score(doc=1349,freq=2.0), product of:
              0.045835853 = queryWeight, product of:
                1.0872376 = idf(docFreq=40523, maxDocs=44218)
                0.042158082 = queryNorm
              0.072074346 = fieldWeight in 1349, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.0872376 = idf(docFreq=40523, maxDocs=44218)
                0.046875 = fieldNorm(doc=1349)
        0.5 = coord(1/2)
    
    Abstract
    In this article, the authors present evaluation results for transitive dictionary-based cross-language information retrieval (CLIR) using graded relevance assessments in a best match retrieval environment. A text database containing newspaper articles and a related set of 35 search topics were used in the tests. Source language topics (in English, German, and Swedish) were automatically translated into the target language (Finnish) via an intermediate (or pivot) language. Effectiveness of the transitively translated queries was compared to that of the directly translated and monolingual Finnish queries. Pseudo-relevance feedback (PRF) was also used to expand the original transitive target queries. Cross-language information retrieval performance was evaluated on three relevance thresholds: stringent, regular, and liberal. The transitive translations performed well achieving, on the average, 85-93% of the direct translation performance, and 66-72% of monolingual performance. Moreover, PRF was successful in raising the performance of transitive translation routes in absolute terms as well as in relation to monolingual and direct translation performance applying PRF.
    Source
    Journal of the American Society for Information Science and Technology. 59(2008) no.3, S.476-488
  2. Talvensaari, T.; Juhola, M.; Laurikkala, J.; Järvelin, K.: Corpus-based cross-language information retrieval in retrieval of highly relevant documents (2007) 0.01
    0.009443427 = sum of:
      0.008066932 = product of:
        0.04840159 = sum of:
          0.04840159 = weight(_text_:authors in 139) [ClassicSimilarity], result of:
            0.04840159 = score(doc=139,freq=2.0), product of:
              0.19219086 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.042158082 = queryNorm
              0.25184128 = fieldWeight in 139, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=139)
        0.16666667 = coord(1/6)
      0.0013764955 = product of:
        0.002752991 = sum of:
          0.002752991 = weight(_text_:s in 139) [ClassicSimilarity], result of:
            0.002752991 = score(doc=139,freq=2.0), product of:
              0.045835853 = queryWeight, product of:
                1.0872376 = idf(docFreq=40523, maxDocs=44218)
                0.042158082 = queryNorm
              0.060061958 = fieldWeight in 139, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.0872376 = idf(docFreq=40523, maxDocs=44218)
                0.0390625 = fieldNorm(doc=139)
        0.5 = coord(1/2)
    
    Abstract
    Information retrieval systems' ability to retrieve highly relevant documents has become more and more important in the age of extremely large collections, such as the World Wide Web (WWW). The authors' aim was to find out how corpus-based cross-language information retrieval (CLIR) manages in retrieving highly relevant documents. They created a Finnish-Swedish comparable corpus from two loosely related document collections and used it as a source of knowledge for query translation. Finnish test queries were translated into Swedish and run against a Swedish test collection. Graded relevance assessments were used in evaluating the results and three relevance criterion levels-liberal, regular, and stringent-were applied. The runs were also evaluated with generalized recall and precision, which weight the retrieved documents according to their relevance level. The performance of the Comparable Corpus Translation system (COCOT) was compared to that of a dictionarybased query translation program; the two translation methods were also combined. The results indicate that corpus-based CUR performs particularly well with highly relevant documents. In average precision, COCOT even matched the monolingual baseline on the highest relevance level. The performance of the different query translation methods was further analyzed by finding out reasons for poor rankings of highly relevant documents.
    Source
    Journal of the American Society for Information Science and Technology. 58(2007) no.3, S.322-334
  3. Toivonen, J.; Pirkola, A.; Keskustalo, H.; Visala, K.; Järvelin, K.: Translating cross-lingual spelling variants using transformation rules (2005) 0.00
    8.2589727E-4 = product of:
      0.0016517945 = sum of:
        0.0016517945 = product of:
          0.003303589 = sum of:
            0.003303589 = weight(_text_:s in 1052) [ClassicSimilarity], result of:
              0.003303589 = score(doc=1052,freq=2.0), product of:
                0.045835853 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.042158082 = queryNorm
                0.072074346 = fieldWeight in 1052, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1052)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 41(2005) no.4, S.859-872
  4. Pirkola, A.; Puolamäki, D.; Järvelin, K.: Applying query structuring in cross-language retrieval (2003) 0.00
    8.2589727E-4 = product of:
      0.0016517945 = sum of:
        0.0016517945 = product of:
          0.003303589 = sum of:
            0.003303589 = weight(_text_:s in 1074) [ClassicSimilarity], result of:
              0.003303589 = score(doc=1074,freq=2.0), product of:
                0.045835853 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.042158082 = queryNorm
                0.072074346 = fieldWeight in 1074, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1074)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 39(2003) no.3, S.391-402
  5. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.00
    6.8824773E-4 = product of:
      0.0013764955 = sum of:
        0.0013764955 = product of:
          0.002752991 = sum of:
            0.002752991 = weight(_text_:s in 5601) [ClassicSimilarity], result of:
              0.002752991 = score(doc=5601,freq=2.0), product of:
                0.045835853 = queryWeight, product of:
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.042158082 = queryNorm
                0.060061958 = fieldWeight in 5601, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.0872376 = idf(docFreq=40523, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5601)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of documentation. 62(2006) no.3, S.372-387