Search (4 results, page 1 of 1)

  • × author_ss:"Oard, D.W."
  • × theme_ss:"Multilinguale Probleme"
  1. Oard, D.W.: Multilingual information access (2009) 0.00
    0.0036535384 = product of:
      0.032881845 = sum of:
        0.032881845 = weight(_text_:access in 3850) [ClassicSimilarity], result of:
          0.032881845 = score(doc=3850,freq=2.0), product of:
            0.10975764 = queryWeight, product of:
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.03238235 = queryNorm
            0.29958594 = fieldWeight in 3850, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.0625 = fieldNorm(doc=3850)
      0.11111111 = coord(1/9)
    
  2. Kim, S.; Ko, Y.; Oard, D.W.: Combining lexical and statistical translation evidence for cross-language information retrieval (2015) 0.00
    0.0027401538 = product of:
      0.024661385 = sum of:
        0.024661385 = weight(_text_:access in 1606) [ClassicSimilarity], result of:
          0.024661385 = score(doc=1606,freq=2.0), product of:
            0.10975764 = queryWeight, product of:
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.03238235 = queryNorm
            0.22468945 = fieldWeight in 1606, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.046875 = fieldNorm(doc=1606)
      0.11111111 = coord(1/9)
    
    Abstract
    This article explores how best to use lexical and statistical translation evidence together for cross-language information retrieval (CLIR). Lexical translation evidence is assembled from Wikipedia and from a large machine-readable dictionary, statistical translation evidence is drawn from parallel corpora, and evidence from co-occurrence in the document language provides a basis for limiting the adverse effect of translation ambiguity. Coverage statistics for NII Testbeds and Community for Information Access Research (NTCIR) queries confirm that these resources have complementary strengths. Experiments with translation evidence from a small parallel corpus indicate that even rather rough estimates of translation probabilities can yield further improvements over a strong technique for translation weighting based on using Jensen-Shannon divergence as a term-association measure. Finally, a novel approach to posttranslation query expansion using a random walk over the Wikipedia concept link graph is shown to yield further improvements over alternative techniques for posttranslation query expansion. Evaluation results on the NTCIR-5 English-Korean test collection show statistically significant improvements over strong baselines.
  3. Oard, D.W.: Serving users in many languages : cross-language information retrieval for digital libraries (1997) 0.00
    0.0022834614 = product of:
      0.020551153 = sum of:
        0.020551153 = weight(_text_:access in 1261) [ClassicSimilarity], result of:
          0.020551153 = score(doc=1261,freq=2.0), product of:
            0.10975764 = queryWeight, product of:
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.03238235 = queryNorm
            0.18724121 = fieldWeight in 1261, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1261)
      0.11111111 = coord(1/9)
    
    Abstract
    We are rapidly constructing an extensive network infrastructure for moving information across national boundaries, but much remains to be done before linguistic barriers can be surmounted as effectively as geographic ones. Users seeking information from a digital library could benefit from the ability to query large collections once using a single language, even when more than one language is present in the collection. If the information they locate is not available in a language that they can read, some form of translation will be needed. At present, multilingual thesauri such as EUROVOC help to address this challenge by facilitating controlled vocabulary search using terms from several languages, and services such as INSPEC produce English abstracts for documents in other languages. On the other hand, support for free text searching across languages is not yet widely deployed, and fully automatic machine translation is presently neither sufficiently fast nor sufficiently accurate to adequately support interactive cross-language information seeking. An active and rapidly growing research community has coalesced around these and other related issues, applying techniques drawn from several fields - notably information retrieval and natural language processing - to provide access to large multilingual collections.
  4. Oard, D.W.: Alternative approaches for cross-language text retrieval (1997) 0.00
    0.0015984231 = product of:
      0.014385808 = sum of:
        0.014385808 = weight(_text_:access in 1164) [ClassicSimilarity], result of:
          0.014385808 = score(doc=1164,freq=2.0), product of:
            0.10975764 = queryWeight, product of:
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.03238235 = queryNorm
            0.13106886 = fieldWeight in 1164, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.389428 = idf(docFreq=4053, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1164)
      0.11111111 = coord(1/9)
    
    Abstract
    The explosive growth of the Internet and other sources of networked information have made automatic mediation of access to networked information sources an increasingly important problem. Much of this information is expressed as electronic text, and it is becoming practical to automatically convert some printed documents and recorded speech to electronic text as well. Thus, automated systems capable of detecting useful documents are finding widespread application. With even a small number of languages it can be inconvenient to issue the same query repeatedly in every language, so users who are able to read more than one language will likely prefer a multilingual text retrieval system over a collection of monolingual systems. And since reading ability in a language does not always imply fluent writing ability in that language, such users will likely find cross-language text retrieval particularly useful for languages in which they are less confident of their ability to express their information needs effectively. The use of such systems can be also be beneficial if the user is able to read only a single language. For example, when only a small portion of the document collection will ever be examined by the user, performing retrieval before translation can be significantly more economical than performing translation before retrieval. So when the application is sufficiently important to justify the time and effort required for translation, those costs can be minimized if an effective cross-language text retrieval system is available. Even when translation is not available, there are circumstances in which cross-language text retrieval could be useful to a monolingual user. For example, a researcher might find a paper published in an unfamiliar language useful if that paper contains references to works by the same author that are in the researcher's native language.