Search (305 results, page 1 of 16)

  • theme_ss:"Multilinguale Probleme"
  1. Dabbadie, M.; Blancherie, J.M.: Alexandria, a multilingual dictionary for knowledge management purposes (2006) 0.08
    0.07513174 = product of:
      0.09391467 = sum of:
        0.037908506 = weight(_text_:g in 2465) [ClassicSimilarity], result of:
          0.037908506 = score(doc=2465,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.24898648 = fieldWeight in 2465, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.046875 = fieldNorm(doc=2465)
        0.028811965 = weight(_text_:u in 2465) [ClassicSimilarity], result of:
          0.028811965 = score(doc=2465,freq=2.0), product of:
            0.13273303 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.040536046 = queryNorm
            0.21706703 = fieldWeight in 2465, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.046875 = fieldNorm(doc=2465)
        0.01071798 = weight(_text_:a in 2465) [ClassicSimilarity], result of:
          0.01071798 = score(doc=2465,freq=18.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.22931081 = fieldWeight in 2465, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2465)
        0.016476223 = product of:
          0.032952446 = sum of:
            0.032952446 = weight(_text_:22 in 2465) [ClassicSimilarity], result of:
              0.032952446 = score(doc=2465,freq=2.0), product of:
                0.14195032 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.040536046 = queryNorm
                0.23214069 = fieldWeight in 2465, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2465)
          0.5 = coord(1/2)
      0.8 = coord(4/5)
    
    Abstract
    Alexandria is an innovation of international impact. It is the only multilingual dictionary for websites and PCs. A double click on a word opens a small window that gives interactive translations between 22 languages and includes meanings, synonyms and associated expressions. It is an ASP application grounded on a semantic network and is portable to any operating system or platform. Behind the application is the Integral Dictionary, the semantic network created by Memodata. Alexandria can be customized with specific vocabulary, descriptive articles, images, sounds, videos, etc. Its domains of application are considerable: e-tourism, online media, language learning, international websites. Alexandria has also proved to be a basic tool for knowledge management purposes. The application can be customized according to a user's or an organization's needs. An application dedicated to mobile devices is currently being developed. Future developments are planned in the field of e-tourism in connection with the French "pôles de compétitivité".
    Source
    Knowledge organization for a global learning society: Proceedings of the 9th International ISKO Conference, 4-7 July 2006, Vienna, Austria. Ed.: G. Budin, C. Swertz u. K. Mitgutsch
    Type
    a
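
The score breakdown for result 1 (doc=2465) can be reproduced by hand. The following is a minimal sketch, not the search engine's own code: it assumes the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the tf and idf values printed in the listing. The function and variable names are hypothetical; all numeric inputs are copied from the listing.

```python
import math

# Constants copied from the explain output for doc=2465.
MAX_DOCS = 44218          # maxDocs
QUERY_NORM = 0.040536046  # queryNorm
FIELD_NORM = 0.046875     # fieldNorm(doc=2465)


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency (assumed formula, matches the listing)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, field_norm=FIELD_NORM, query_norm=QUERY_NORM):
    """score = queryWeight * fieldWeight, as in each weight(...) subtree."""
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm             # idf * queryNorm
    field_weight = tf(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# One entry per matching term: (freq, docFreq) taken from the listing.
score_g = term_score(2.0, 2809)
score_u = term_score(2.0, 4547)
score_a = term_score(18.0, 37942)
# The term "22" sits in a nested clause scaled by coord(1/2).
score_22 = term_score(2.0, 3622) * 0.5

# Outer sum over the matching clauses, scaled by coord(4/5) = 0.8.
clause_sum = score_g + score_u + score_a + score_22
total = clause_sum * 0.8

print(f"sum of clauses: {clause_sum:.8f}")
print(f"final score:    {total:.8f}")
```

The same function applies to the other records by swapping in their freq and fieldNorm values and the coord factor shown at the bottom of each tree, for example coord(3/5) = 0.6 for result 2.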
  2. Hull, D.A.; Grefenstette, G.: Querying across languages : a dictionary-based approach to multilingual information retrieval (1996) 0.07
    0.071772985 = product of:
      0.119621634 = sum of:
        0.06318085 = weight(_text_:g in 1947) [ClassicSimilarity], result of:
          0.06318085 = score(doc=1947,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.4149775 = fieldWeight in 1947, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.078125 = fieldNorm(doc=1947)
        0.048019946 = weight(_text_:u in 1947) [ClassicSimilarity], result of:
          0.048019946 = score(doc=1947,freq=2.0), product of:
            0.13273303 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.040536046 = queryNorm
            0.3617784 = fieldWeight in 1947, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.078125 = fieldNorm(doc=1947)
        0.00842084 = weight(_text_:a in 1947) [ClassicSimilarity], result of:
          0.00842084 = score(doc=1947,freq=4.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.18016359 = fieldWeight in 1947, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=1947)
      0.6 = coord(3/5)
    
    Footnote
    Reprinted in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. pp. 484-492
    Type
    a
  3. Lehtinen, R.; Clavel-Merrin, G.: Mehrsprachige und verschiedenartige Daten in Bibliothekssystemen und -netzen : Erfahrungen und Perspektiven aus der Schweiz und Finnland (1998) 0.07
    0.070293136 = product of:
      0.117155224 = sum of:
        0.06318085 = weight(_text_:g in 4186) [ClassicSimilarity], result of:
          0.06318085 = score(doc=4186,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.4149775 = fieldWeight in 4186, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.078125 = fieldNorm(doc=4186)
        0.048019946 = weight(_text_:u in 4186) [ClassicSimilarity], result of:
          0.048019946 = score(doc=4186,freq=2.0), product of:
            0.13273303 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.040536046 = queryNorm
            0.3617784 = fieldWeight in 4186, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.078125 = fieldNorm(doc=4186)
        0.0059544328 = weight(_text_:a in 4186) [ClassicSimilarity], result of:
          0.0059544328 = score(doc=4186,freq=2.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.12739488 = fieldWeight in 4186, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=4186)
      0.6 = coord(3/5)
    
    Source
    Multi-sprachige, multi-charakter Fragestellungen für die Online-Umgebung: Vorträge, von einer Arbeitsgruppe von der IFLA Sektion für Katalogisierung finanziert, Istanbul, Türkei, 24.8.1999. Ed.: J.D. Byrum u. O. Madison
    Type
    a
  4. Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.06
    0.062275004 = product of:
      0.10379167 = sum of:
        0.067910455 = weight(_text_:u in 4157) [ClassicSimilarity], result of:
          0.067910455 = score(doc=4157,freq=4.0), product of:
            0.13273303 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.040536046 = queryNorm
            0.5116319 = fieldWeight in 4157, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
        0.00842084 = weight(_text_:a in 4157) [ClassicSimilarity], result of:
          0.00842084 = score(doc=4157,freq=4.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.18016359 = fieldWeight in 4157, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
        0.027460374 = product of:
          0.054920748 = sum of:
            0.054920748 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
              0.054920748 = score(doc=4157,freq=2.0), product of:
                0.14195032 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.040536046 = queryNorm
                0.38690117 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
          0.5 = coord(1/2)
      0.6 = coord(3/5)
    
    Source
    Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Ed.: Marlies Ockenfeld u. Gerhard J. Mantwill
    Type
    a
  5. Rosemblat, G.; Graham, L.: Cross-language search in a monolingual health information system : flexible designs and lexical processes (2006) 0.06
    0.05551662 = product of:
      0.092527695 = sum of:
        0.053610723 = weight(_text_:g in 241) [ClassicSimilarity], result of:
          0.053610723 = score(doc=241,freq=4.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.35212007 = fieldWeight in 241, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.046875 = fieldNorm(doc=241)
        0.028811965 = weight(_text_:u in 241) [ClassicSimilarity], result of:
          0.028811965 = score(doc=241,freq=2.0), product of:
            0.13273303 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.040536046 = queryNorm
            0.21706703 = fieldWeight in 241, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.046875 = fieldNorm(doc=241)
        0.010105007 = weight(_text_:a in 241) [ClassicSimilarity], result of:
          0.010105007 = score(doc=241,freq=16.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.2161963 = fieldWeight in 241, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=241)
      0.6 = coord(3/5)
    
    Abstract
    The predominance of English-only online health information poses a serious challenge to non-English speakers. To overcome this barrier, we incorporated cross-language information retrieval (CLIR) techniques into a fully functional prototype. It supports Spanish language searches over an English data set using a Spanish-English bilingual term list (BTL). The modular design allows for system and BTL growth and takes advantage of English-system enhancements. Language-based design decisions and implications for integrating non-English components with the existing monolingual architecture are presented. Algorithmic and BTL improvements are used to bring CLIR retrieval scores in line with the monolingual values. After validating these changes, we conducted a failure analysis and error categorization for the worst performing queries. We conclude with a comprehensive discussion and directions for future work.
    Source
    Knowledge organization for a global learning society: Proceedings of the 9th International ISKO Conference, 4-7 July 2006, Vienna, Austria. Ed.: G. Budin, C. Swertz u. K. Mitgutsch
    Type
    a
  6. Keränen, S.: Equivalence and focus of translation in multicultural thesaurus construction (2006) 0.05
    0.05103595 = product of:
      0.08505992 = sum of:
        0.044226594 = weight(_text_:g in 237) [ClassicSimilarity], result of:
          0.044226594 = score(doc=237,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.29048425 = fieldWeight in 237, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.0546875 = fieldNorm(doc=237)
        0.03361396 = weight(_text_:u in 237) [ClassicSimilarity], result of:
          0.03361396 = score(doc=237,freq=2.0), product of:
            0.13273303 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.040536046 = queryNorm
            0.25324488 = fieldWeight in 237, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.0546875 = fieldNorm(doc=237)
        0.0072193667 = weight(_text_:a in 237) [ClassicSimilarity], result of:
          0.0072193667 = score(doc=237,freq=6.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.1544581 = fieldWeight in 237, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=237)
      0.6 = coord(3/5)
    
    Abstract
    This paper reports part of an ongoing PhD study on problems related to multicultural social science thesaurus construction in the general frame of information science. The main analysis methods used are discourse analysis and co-word analysis. In the theoretical framework, the emphasis is on communicative equivalence theories, and different aims of thesaurus translation are discussed. Some examples are given of how co-word analysis can be used to study contextual equivalence.
    Source
    Knowledge organization for a global learning society: Proceedings of the 9th International ISKO Conference, 4-7 July 2006, Vienna, Austria. Ed.: G. Budin, C. Swertz u. K. Mitgutsch
    Type
    a
  7. Drexel, G.: Knowledge engineering for intelligent information retrieval (2001) 0.05
    0.04609529 = product of:
      0.07682548 = sum of:
        0.037908506 = weight(_text_:g in 4043) [ClassicSimilarity], result of:
          0.037908506 = score(doc=4043,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.24898648 = fieldWeight in 4043, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.046875 = fieldNorm(doc=4043)
        0.028811965 = weight(_text_:u in 4043) [ClassicSimilarity], result of:
          0.028811965 = score(doc=4043,freq=2.0), product of:
            0.13273303 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.040536046 = queryNorm
            0.21706703 = fieldWeight in 4043, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.046875 = fieldNorm(doc=4043)
        0.010105007 = weight(_text_:a in 4043) [ClassicSimilarity], result of:
          0.010105007 = score(doc=4043,freq=16.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.2161963 = fieldWeight in 4043, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=4043)
      0.6 = coord(3/5)
    
    Abstract
    This paper presents a clustered approach to designing an overall ontological model together with a general rule-based component that serves as a mapping device. By observational criteria, a multi-lingual team of experts excerpts concepts from general communication in the media. The team then finds equivalent expressions in English, German, French, and Spanish. On the basis of a set of ontological and lexical relations, a conceptual network is built up. Concepts are thought to be universal. Objects unique in time and space are identified by names and will be explained by the universals as their instances. Our approach relies on multi-relational descriptions of concepts. It provides a powerful tool for documentation and conceptual language learning. First and foremost, our multi-lingual, polyhierarchical ontology fills the gap of semantically-based information retrieval by generating enhanced and improved queries for internet search.
    Theme
    Semantisches Umfeld in Indexierung u. Retrieval
    Type
    a
  8. Grefenstette, G.: The problem of cross-language information retrieval (1998) 0.05
    0.04574671 = product of:
      0.11436677 = sum of:
        0.10722145 = weight(_text_:g in 6301) [ClassicSimilarity], result of:
          0.10722145 = score(doc=6301,freq=4.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.70424014 = fieldWeight in 6301, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.09375 = fieldNorm(doc=6301)
        0.0071453196 = weight(_text_:a in 6301) [ClassicSimilarity], result of:
          0.0071453196 = score(doc=6301,freq=2.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.15287387 = fieldWeight in 6301, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=6301)
      0.4 = coord(2/5)
    
    Source
    Cross-language information retrieval. Ed.: G. Grefenstette
    Type
    a
  9. Clavel-Merrin, G.: Multilingual access to libraries' databases (1996) 0.04
    0.044319477 = product of:
      0.07386579 = sum of:
        0.037908506 = weight(_text_:g in 4187) [ClassicSimilarity], result of:
          0.037908506 = score(doc=4187,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.24898648 = fieldWeight in 4187, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.046875 = fieldNorm(doc=4187)
        0.028811965 = weight(_text_:u in 4187) [ClassicSimilarity], result of:
          0.028811965 = score(doc=4187,freq=2.0), product of:
            0.13273303 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.040536046 = queryNorm
            0.21706703 = fieldWeight in 4187, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.046875 = fieldNorm(doc=4187)
        0.0071453196 = weight(_text_:a in 4187) [ClassicSimilarity], result of:
          0.0071453196 = score(doc=4187,freq=8.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.15287387 = fieldWeight in 4187, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=4187)
      0.6 = coord(3/5)
    
    Abstract
    Multilingual access to library databases is a topic of concern not only to users in countries such as Switzerland in which several languages are spoken, but also to those who search for information in databases containing material in more than one language. The growth of networks means that libraries can access databases outside their own immediate circle but problems of differences in interfaces will continue until there is widespread compliance with Z39.50. Considers 2 approaches to multilingual access: the use of multilingual thesauri or authority records (which implies translation work before users search the database); and the translation of the search statement at the time of searching (which implies the existence of parsers and multilingual dictionaries)
    Source
    Towards a worldwide library: a ten year forecast. Proceedings of the 19th International Essen Symposium, 23-26 Sept 1996. Ed.: A.H. Helal u. J.W. Weiss
    Type
    a
  10. Bian, G.-W.; Chen, H.-H.: Cross-language information access to multilingual collections on the Internet (2000) 0.04
    0.036343656 = product of:
      0.06057276 = sum of:
        0.037908506 = weight(_text_:g in 4436) [ClassicSimilarity], result of:
          0.037908506 = score(doc=4436,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.24898648 = fieldWeight in 4436, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.046875 = fieldNorm(doc=4436)
        0.0061880285 = weight(_text_:a in 4436) [ClassicSimilarity], result of:
          0.0061880285 = score(doc=4436,freq=6.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.13239266 = fieldWeight in 4436, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=4436)
        0.016476223 = product of:
          0.032952446 = sum of:
            0.032952446 = weight(_text_:22 in 4436) [ClassicSimilarity], result of:
              0.032952446 = score(doc=4436,freq=2.0), product of:
                0.14195032 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.040536046 = queryNorm
                0.23214069 = fieldWeight in 4436, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4436)
          0.5 = coord(1/2)
      0.6 = coord(3/5)
    
    Abstract
    The language barrier is the major problem that people face in searching for, retrieving, and understanding multilingual collections on the Internet. This paper deals with query translation and document translation in a Chinese-English information retrieval system called MTIR. Bilingual dictionary and monolingual corpus-based approaches are adopted to select suitable translated query terms. A machine transliteration algorithm is introduced to resolve proper name searching. We consider several design issues for document translation, including which material is translated, what roles the HTML tags play in translation, what the tradeoff is between speed and translation performance, and what form the translated result is presented in. About 100,000 Web pages translated in the last 4 months of 1997 are used for a quantitative study of online and real-time Web page translation.
    Date
    16. 2.2000 14:22:39
    Type
    a
  11. Sheridan, P.; Ballerini, J.P.; Schäuble, P.: Building a large multilingual test collection from comparable news documents (1998) 0.03
    0.03436881 = product of:
      0.08592202 = sum of:
        0.07581701 = weight(_text_:g in 6298) [ClassicSimilarity], result of:
          0.07581701 = score(doc=6298,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.49797297 = fieldWeight in 6298, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.09375 = fieldNorm(doc=6298)
        0.010105007 = weight(_text_:a in 6298) [ClassicSimilarity], result of:
          0.010105007 = score(doc=6298,freq=4.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.2161963 = fieldWeight in 6298, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=6298)
      0.4 = coord(2/5)
    
    Source
    Cross-language information retrieval. Ed.: G. Grefenstette
    Type
    a
  12. Picchi, E.; Peters, C.: Cross-language information retrieval : a system for comparable corpus querying (1998) 0.03
    0.03436881 = product of:
      0.08592202 = sum of:
        0.07581701 = weight(_text_:g in 6305) [ClassicSimilarity], result of:
          0.07581701 = score(doc=6305,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.49797297 = fieldWeight in 6305, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.09375 = fieldNorm(doc=6305)
        0.010105007 = weight(_text_:a in 6305) [ClassicSimilarity], result of:
          0.010105007 = score(doc=6305,freq=4.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.2161963 = fieldWeight in 6305, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=6305)
      0.4 = coord(2/5)
    
    Source
    Cross-language information retrieval. Ed.: G. Grefenstette
    Type
    a
  13. Yamabana, K.; Muraki, K.; Doi, S.; Kamei, S.: A language conversion front-end for cross-language information retrieval (1998) 0.03
    0.03436881 = product of:
      0.08592202 = sum of:
        0.07581701 = weight(_text_:g in 6306) [ClassicSimilarity], result of:
          0.07581701 = score(doc=6306,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.49797297 = fieldWeight in 6306, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.09375 = fieldNorm(doc=6306)
        0.010105007 = weight(_text_:a in 6306) [ClassicSimilarity], result of:
          0.010105007 = score(doc=6306,freq=4.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.2161963 = fieldWeight in 6306, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=6306)
      0.4 = coord(2/5)
    
    Source
    Cross-language information retrieval. Ed.: G. Grefenstette
    Type
    a
  14. Hull, D.: A weighted Boolean model for cross-language text retrieval (1998) 0.03
    0.03436881 = product of:
      0.08592202 = sum of:
        0.07581701 = weight(_text_:g in 6307) [ClassicSimilarity], result of:
          0.07581701 = score(doc=6307,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.49797297 = fieldWeight in 6307, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.09375 = fieldNorm(doc=6307)
        0.010105007 = weight(_text_:a in 6307) [ClassicSimilarity], result of:
          0.010105007 = score(doc=6307,freq=4.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.2161963 = fieldWeight in 6307, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=6307)
      0.4 = coord(2/5)
    
    Source
    Cross-language information retrieval. Ed.: G. Grefenstette
    Type
    a
  15. Clavel-Merrin, G.: The need for co-operation in creating and maintaining multilingual subject authority files (2000) 0.03
    0.033184934 = product of:
      0.082962334 = sum of:
        0.07581701 = weight(_text_:g in 6064) [ClassicSimilarity], result of:
          0.07581701 = score(doc=6064,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.49797297 = fieldWeight in 6064, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.09375 = fieldNorm(doc=6064)
        0.0071453196 = weight(_text_:a in 6064) [ClassicSimilarity], result of:
          0.0071453196 = score(doc=6064,freq=2.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.15287387 = fieldWeight in 6064, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=6064)
      0.4 = coord(2/5)
    
    Type
    a
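The score tree for entry 15 can be checked by hand. The sketch below is a minimal reconstruction, not the search engine's own code: it assumes the classic TF-IDF form in which tf is the square root of the term frequency and idf is 1 + ln(maxDocs / (docFreq + 1)), which reproduces the idf values displayed above. The function names are hypothetical; the numbers are copied from the explanation for doc 6064.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed idf form; it reproduces the displayed 3.7559474 and 1.153047.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in the explain tree.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm       # idf * queryNorm
    field_weight = tf * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Values copied from the explanation for doc 6064 (entry 15).
MAX_DOCS = 44218
QUERY_NORM = 0.040536046
FIELD_NORM = 0.09375

score_g = term_score(2.0, 2809, MAX_DOCS, QUERY_NORM, FIELD_NORM)
score_a = term_score(2.0, 37942, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# coord(2/5): two of the five query clauses matched this document.
coord = 2 / 5
total = coord * (score_g + score_a)

print(f"g: {score_g:.8f}  a: {score_a:.8f}  total: {total:.8f}")
```

Small differences in the last digits are expected, because the displayed values come from single-precision arithmetic while Python uses double precision.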
  16. Davis, M.W.: On the effective use of large parallel corpora in cross-language text retrieval (1998) 0.03
    0.033184934 = product of:
      0.082962334 = sum of:
        0.07581701 = weight(_text_:g in 6302) [ClassicSimilarity], result of:
          0.07581701 = score(doc=6302,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.49797297 = fieldWeight in 6302, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.09375 = fieldNorm(doc=6302)
        0.0071453196 = weight(_text_:a in 6302) [ClassicSimilarity], result of:
          0.0071453196 = score(doc=6302,freq=2.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.15287387 = fieldWeight in 6302, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=6302)
      0.4 = coord(2/5)
    
    Source
    Cross-language information retrieval. Ed.: G. Grefenstette
    Type
    a
  17. Ballesteros, L.; Croft, W.B.: Statistical methods for cross-language information retrieval (1998) 0.03
    0.033184934 = product of:
      0.082962334 = sum of:
        0.07581701 = weight(_text_:g in 6303) [ClassicSimilarity], result of:
          0.07581701 = score(doc=6303,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.49797297 = fieldWeight in 6303, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.09375 = fieldNorm(doc=6303)
        0.0071453196 = weight(_text_:a in 6303) [ClassicSimilarity], result of:
          0.0071453196 = score(doc=6303,freq=2.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.15287387 = fieldWeight in 6303, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=6303)
      0.4 = coord(2/5)
    
    Source
    Cross-language information retrieval. Ed.: G. Grefenstette
    Type
    a
  18. Evans, D.A.; Handerson, S.K.; Monarch, I.A.; Pereiro, J.; Delon, L.; Hersch, W.R.: Mapping vocabularies using latent semantics (1998) 0.03
    0.033184934 = product of:
      0.082962334 = sum of:
        0.07581701 = weight(_text_:g in 6304) [ClassicSimilarity], result of:
          0.07581701 = score(doc=6304,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.49797297 = fieldWeight in 6304, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.09375 = fieldNorm(doc=6304)
        0.0071453196 = weight(_text_:a in 6304) [ClassicSimilarity], result of:
          0.0071453196 = score(doc=6304,freq=2.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.15287387 = fieldWeight in 6304, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=6304)
      0.4 = coord(2/5)
    
    Source
    Cross-language information retrieval. Ed.: G. Grefenstette
    Type
    a
  19. Peters, W.; Vossen, P.; Diez-Orzas, P.; Adriaens, G.: Cross-linguistic alignment of WordNets with an inter-lingual-index (1998) 0.03
    0.033184934 = product of:
      0.082962334 = sum of:
        0.07581701 = weight(_text_:g in 6446) [ClassicSimilarity], result of:
          0.07581701 = score(doc=6446,freq=2.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.49797297 = fieldWeight in 6446, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.09375 = fieldNorm(doc=6446)
        0.0071453196 = weight(_text_:a in 6446) [ClassicSimilarity], result of:
          0.0071453196 = score(doc=6446,freq=2.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.15287387 = fieldWeight in 6446, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=6446)
      0.4 = coord(2/5)
    
    Type
    a
  20. Cross-language information retrieval (1998) 0.03
    0.03131154 = product of:
      0.0521859 = sum of:
        0.022337802 = weight(_text_:g in 6299) [ClassicSimilarity], result of:
          0.022337802 = score(doc=6299,freq=4.0), product of:
            0.15225126 = queryWeight, product of:
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.040536046 = queryNorm
            0.1467167 = fieldWeight in 6299, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.7559474 = idf(docFreq=2809, maxDocs=44218)
              0.01953125 = fieldNorm(doc=6299)
        0.020793248 = weight(_text_:u in 6299) [ClassicSimilarity], result of:
          0.020793248 = score(doc=6299,freq=6.0), product of:
            0.13273303 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.040536046 = queryNorm
            0.15665466 = fieldWeight in 6299, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.01953125 = fieldNorm(doc=6299)
        0.009054851 = weight(_text_:a in 6299) [ClassicSimilarity], result of:
          0.009054851 = score(doc=6299,freq=74.0), product of:
            0.046739966 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.040536046 = queryNorm
            0.19372822 = fieldWeight in 6299, product of:
              8.602325 = tf(freq=74.0), with freq of:
                74.0 = termFreq=74.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.01953125 = fieldNorm(doc=6299)
      0.6 = coord(3/5)
    
    Content
    Contains the contributions: GREFENSTETTE, G.: The Problem of Cross-Language Information Retrieval; DAVIS, M.W.: On the Effective Use of Large Parallel Corpora in Cross-Language Text Retrieval; BALLESTEROS, L. and W.B. CROFT: Statistical Methods for Cross-Language Information Retrieval; Distributed Cross-Lingual Information Retrieval; Automatic Cross-Language Information Retrieval Using Latent Semantic Indexing; EVANS, D.A. et al.: Mapping Vocabularies Using Latent Semantics; PICCHI, E. and C. PETERS: Cross-Language Information Retrieval: A System for Comparable Corpus Querying; YAMABANA, K. et al.: A Language Conversion Front-End for Cross-Language Information Retrieval; GACHOT, D.A. et al.: The Systran NLP Browser: An Application of Machine Translation Technology in Cross-Language Information Retrieval; HULL, D.: A Weighted Boolean Model for Cross-Language Text Retrieval; SHERIDAN, P. et al.: Building a Large Multilingual Test Collection from Comparable News Documents; OARD, D.W. and B.J. DORR: Evaluating Cross-Language Text Filtering Effectiveness
    Editor
    Grefenstette, G.
    Footnote
    Reviewed in: Machine translation review: 1999, no.10, pp. 26-27 (D. Lewis): "Cross Language Information Retrieval (CLIR) addresses the growing need to access large volumes of data across language boundaries. The typical requirement is for the user to input a free form query, usually a brief description of a topic, into a search or retrieval engine which returns a list, in ranked order, of documents or web pages that are relevant to the topic. The search engine matches the terms in the query to indexed terms, usually keywords previously derived from the target documents. Unlike monolingual information retrieval, CLIR requires query terms in one language to be matched to indexed terms in another. Matching can be done by bilingual dictionary lookup, full machine translation, or by applying statistical methods. A query's success is measured in terms of recall (how many potentially relevant target documents are found) and precision (what proportion of documents found are relevant). Issues in CLIR are how to translate query terms into index terms, how to eliminate alternative translations (e.g. to decide that French 'traitement' in a query means 'treatment' and not 'salary'), and how to rank or weight translation alternatives that are retained (e.g. how to order the French terms 'aventure', 'business', 'affaire', and 'liaison' as relevant translations of English 'affair'). Grefenstette provides a lucid and useful overview of the field and the problems. The volume brings together a number of experiments and projects in CLIR. Mark Davis (New Mexico State University) describes Recuerdo, a Spanish retrieval engine which reduces translation ambiguities by scanning indexes for parallel texts; it also uses either a bilingual dictionary or direct equivalents from a parallel corpus in order to compare results for queries on parallel texts.
    Lisa Ballesteros and Bruce Croft (University of Massachusetts) use a 'local feedback' technique which automatically enhances a query by adding extra terms to it both before and after translation; such terms can be derived from documents known to be relevant to the query.
    Christian Fluhr et al (DIST/SMTI, France) outline the EMIR (European Multilingual Information Retrieval) and ESPRIT projects. They found that using SYSTRAN to machine translate queries and to access material from various multilingual databases produced less relevant results than a method referred to as 'multilingual reformulation' (the mechanics of which are only hinted at). An interesting technique is Latent Semantic Indexing (LSI), described by Michael Littman et al (Brown University) and, most clearly, by David Evans et al (Carnegie Mellon University). LSI involves creating matrices of documents and the terms they contain and 'fitting' related documents into a reduced matrix space. This effectively allows queries to be mapped onto a common semantic representation of the documents. Eugenio Picchi and Carol Peters (Pisa) report on a procedure to create links between translation equivalents in an Italian-English parallel corpus. The links are used to construct parallel linguistic contexts in real-time for any term or combination of terms that is being searched for in either language. Their interest is primarily lexicographic but they plan to apply the same procedure to comparable corpora, i.e. to texts which are not translations of each other but which share the same domain. Kiyoshi Yamabana et al (NEC, Japan) address the issue of how to disambiguate between alternative translations of query terms. Their DMAX (double maximise) method looks at co-occurrence frequencies between both source language words and target language words in order to arrive at the most probable translation. The statistical data for the decision are derived, not from the translation texts but independently from monolingual corpora in each language. An interactive user interface allows the user to influence the selection of terms during the matching process.
    Denis Gachot et al (SYSTRAN) describe the SYSTRAN NLP browser, a prototype tool which collects parsing information derived from a text or corpus previously translated with SYSTRAN. The user enters queries into the browser in either a structured or free form and receives grammatical and lexical information about the source text and/or its translation.
    The retrieved output from a query including the phrase 'big rockets' may be, for instance, a sentence containing 'giant rocket' which is semantically ranked above 'military rocket'. David Hull (Xerox Research Centre, Grenoble) describes an implementation of a weighted Boolean model for Spanish-English CLIR. Users construct Boolean-type queries, weighting each term in the query, which is then translated by an on-line dictionary before being applied to the database. Comparisons with the performance of unweighted free-form queries ('vector space' models) proved encouraging. Two contributions consider the evaluation of CLIR systems. In order to by-pass the time-consuming and expensive process of assembling a standard collection of documents and of user queries against which the performance of a CLIR system is manually assessed, Páraic Sheridan et al (ETH Zurich) propose a method based on retrieving 'seed documents'. This involves identifying a unique document in a database (the 'seed document') and, for a number of queries, measuring how fast it is retrieved. The authors have also assembled a large database of multilingual news documents for testing purposes. By storing the (fairly short) documents in a structured form tagged with descriptor codes (e.g. for topic, country and area), the test suite is easily expanded while remaining consistent for the purposes of testing. Douglas Oard and Bonnie Dorr (University of Maryland) describe an evaluation methodology which appears to apply LSI techniques in order to filter and rank incoming documents designed for testing CLIR systems. The volume provides the reader with an excellent overview of several projects in CLIR. It is well supported with references and is intended as a secondary text for researchers and practitioners. It highlights the need for a good, general tutorial introduction to the field."
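The review measures a query's success by recall and precision. As a minimal sketch of those two definitions, the following computes both for one query. The function name and the document identifiers are invented for illustration and do not come from the reviewed volume.

```python
def recall_and_precision(retrieved, relevant):
    """Return (recall, precision) for a single query.

    recall    = share of the relevant documents that were retrieved
    precision = share of the retrieved documents that are relevant
    """
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = retrieved & relevant
    # Guard against empty sets so the ratios stay defined.
    recall = len(hits) / len(relevant) if relevant else 0.0
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    return recall, precision


# Hypothetical query: four documents returned, three known to be relevant.
recall, precision = recall_and_precision(
    retrieved=["d1", "d2", "d3", "d4"],
    relevant=["d2", "d4", "d7"],
)
print(f"recall={recall:.2f} precision={precision:.2f}")
```

Here two of the three relevant documents are found and two of the four returned documents are relevant, so the query scores higher on recall than on precision.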

Types

  • a 279
  • el 25
  • m 5
  • s 4
  • p 3
  • r 3
  • x 3