Search (43 results, page 1 of 3)

Park, J.-r.: Cross-lingual name and subject access : mechanisms and challenge (2007) 0.08

0.078236274 = product of:
  0.15647255 = sum of:
    0.15647255 = sum of:
      0.10786875 = weight(_text_:e.g in 255) [ClassicSimilarity], result of:
        0.10786875 = score(doc=255,freq=2.0), product of:
          0.23393378 = queryWeight, product of:
            5.2168427 = idf(docFreq=651, maxDocs=44218)
            0.044842023 = queryNorm
          0.4611081 = fieldWeight in 255, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            5.2168427 = idf(docFreq=651, maxDocs=44218)
            0.0625 = fieldNorm(doc=255)
      0.0486038 = weight(_text_:22 in 255) [ClassicSimilarity], result of:
        0.0486038 = score(doc=255,freq=2.0), product of:
          0.15702912 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.044842023 = queryNorm
          0.30952093 = fieldWeight in 255, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=255)
  0.5 = coord(1/2)

Abstract: This paper considers issues surrounding name and subject access across languages and cultures, particularly mechanisms and knowledge organization tools (e.g., cataloging, metadata) for cross-lingual information access. The author examines current mechanisms for cross-lingual name and subject access and identifies major factors that hinder cross-lingual information access. The author provides examples from the Korean language that demonstrate the problems with cross-language name and subject access.
Date: 10. 9.2000 17:38:22

De Luca, E.W.; Dahlberg, I.: Including knowledge domains from the ICC into the multilingual lexical linked data cloud (2014) 0.07
```
0.06915175 = product of:
  0.1383035 = sum of:
    0.1383035 = sum of:
      0.09534341 = weight(_text_:e.g in 1493) [ClassicSimilarity], result of:
        0.09534341 = score(doc=1493,freq=4.0), product of:
          0.23393378 = queryWeight, product of:
            5.2168427 = idf(docFreq=651, maxDocs=44218)
            0.044842023 = queryNorm
          0.40756583 = fieldWeight in 1493, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            5.2168427 = idf(docFreq=651, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1493)
      0.042960096 = weight(_text_:22 in 1493) [ClassicSimilarity], result of:
        0.042960096 = score(doc=1493,freq=4.0), product of:
          0.15702912 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.044842023 = queryNorm
          0.27358043 = fieldWeight in 1493, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1493)
  0.5 = coord(1/2)
```
Abstract

A lot of information that is already available on the Web, or retrieved from local information systems and social networks is structured in data silos that are not semantically related. Semantic technologies make it emerge that the use of typed links that directly express their relations are an advantage for every application that can reuse the incorporated knowledge about the data. For this reason, data integration, through reengineering (e.g. triplify), or querying (e.g. D2R) is an important task in order to make information available for everyone. Thus, in order to build a semantic map of the data, we need knowledge about data items itself and the relation between heterogeneous data items. In this paper, we present our work of providing Lexical Linked Data (LLD) through a meta-model that contains all the resources and gives the possibility to retrieve and navigate them from different perspectives. We combine the existing work done on knowledge domains (based on the Information Coding Classification) within the Multilingual Lexical Linked Data Cloud (based on the RDF/OWL EurowordNet and the related integrated lexical resources (MultiWordNet, EuroWordNet, MEMODATA Lexicon, Hamburg Methaphor DB).

Date

22. 9.2014 19:01:18

Source

Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Weihs, J.: Three tales of multilingual cataloguing (1998) 0.02

0.0243019 = product of:
  0.0486038 = sum of:
    0.0486038 = product of:
      0.0972076 = sum of:
        0.0972076 = weight(_text_:22 in 6063) [ClassicSimilarity], result of:
          0.0972076 = score(doc=6063,freq=2.0), product of:
            0.15702912 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044842023 = queryNorm
            0.61904186 = fieldWeight in 6063, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=6063)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 2. 8.2001 8:55:22

Luca, E.W. de: Extending the linked data cloud with multilingual lexical linked data (2013) 0.02
```
0.023835853 = product of:
  0.047671705 = sum of:
    0.047671705 = product of:
      0.09534341 = sum of:
        0.09534341 = weight(_text_:e.g in 1073) [ClassicSimilarity], result of:
          0.09534341 = score(doc=1073,freq=4.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.40756583 = fieldWeight in 1073, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1073)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

A lot of information that is already available on the Web, or retrieved from local information systems and social networks, is structured in data silos that are not semantically related. Semantic technologies make it apparent that the use of typed links that directly express their relations are an advantage for every application that can reuse the incorporated knowledge about the data. For this reason, data integration, through reengineering (e.g., triplify) or querying (e.g., D2R), is an important task in order to make information available for everyone. Thus, in order to build a semantic map of the data, we need knowledge about data items itself and the relation between heterogeneous data items. Here we present our work of providing Lexical Linked Data (LLD) through a meta-model that contains all the resources and gives the possibility to retrieve and navigate them from different perspectives. After giving the definition of Lexical Linked Data, we describe the existing datasets we collected and the new datasets we included. Here we describe their format and show some use cases where we link lexical data, and show how to reuse and inference semantic data derived from lexical data. Different lexical resources (MultiWordNet, EuroWordNet, MEMODATA Lexicon, the Hamburg Methaphor Database) are connected to each other towards an Integrated Vocabulary for LLD that we evaluate and present.
Austin, D.: PRECIS in a multilingual context : Pt.1: PRECIS: an overview (1976) 0.02
```
0.02359629 = product of:
  0.04719258 = sum of:
    0.04719258 = product of:
      0.09438516 = sum of:
        0.09438516 = weight(_text_:e.g in 983) [ClassicSimilarity], result of:
          0.09438516 = score(doc=983,freq=2.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.40346956 = fieldWeight in 983, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.0546875 = fieldNorm(doc=983)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

The present paper is offered as the first of a series of articles in which PRECIS will be reviewed as a potential multi-lingual system, having in mind the obvious need, notably in a European context, for standard and language independent methods of subject analysis and document description. This first paper outlines the origins of PRECIS, and considers its use in English language indexing. A second paper will deal in general terms with the syntactical model which is used for producing PRECIS input strings and index entries. Later papers will then review the application of this model to indexing in, firstly, the Germanic languages (e.g. German and Danish), and, secondly, the Romance languages, illustrated by French.
Green, R.; Bean, C.A.; Hudon, M.: Universality and basic level concepts (2003) 0.02
```
0.023354271 = product of:
  0.046708543 = sum of:
    0.046708543 = product of:
      0.093417086 = sum of:
        0.093417086 = weight(_text_:e.g in 2730) [ClassicSimilarity], result of:
          0.093417086 = score(doc=2730,freq=6.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.39933133 = fieldWeight in 2730, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.03125 = fieldNorm(doc=2730)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This paper examines whether a concept's hierarchical level affects the likelihood of its universality across schemes for knowledge representation and knowledge organization. Empirical data an equivalents are drawn from a bilingual thesaurus, a pair of biomedical vocabularies, and two ontologies. Conceptual equivalence across resources occurs significantly more often at the basic level than at subordinate or superordinate levels. Attempts to integrate knowledge representation or knowledge organization tools should concentrate an establishing equivalences at the basic level. 1. Rationale The degree of success attainable in the integration of multiple knowledge representation systems or knowledge organization schemes is constrained by limitations an the universality of human conceptual systems. For example, human languages do not all lexicalize the same set of concepts; nor do they structure (quasi-)equivalent concepts in the same relational patterns (Riesthuis, 2001). As a consequence, even multilingual thesauri designed from the outset from the perspective of multiple languages may routinely include situations where corresponding terms are not truly equivalent (Hudon, 1997, 2001). Intuitively, where inexactness and partialness in equivalence mappings across knowledge representation schemes and knowledge organizations schemes exist, a more difficult retrieval scenario arises than where equivalence mappings reflect full and exact conceptual matches. The question we address in this paper is whether a concept's hierarchical level af ects the likelihood of its universality/full equivalence across schemes for knowledge representation and knowledge organization. Cognitive science research has shown that one particular hierarchical level-called the basic level--enjoys a privileged status (Brown, 1958; Rosch et al., 1976). Our underlying hypothesis is that concepts at the basic level (e.g., apple, shoe, chair) are more likely to match across knowledge representation schemes and knowledge organization schemes than concepts at the superordinate (e.g., fruit, footwear, furniture) or subordinate (e.g., Granny Smith, sneaker, recliner) levels. This hypothesis is consistent with ethnobiological data showing that folk classifications of flora are more likely to agree at the basic level than at superordinate or subordinate levels (Berlin, 1992).
Pearce, C.; Nicholas, C.: TELLTALE: Experiments in a dynamic hypertext environment for degraded and multilingual data (1996) 0.02
```
0.020225393 = product of:
  0.040450785 = sum of:
    0.040450785 = product of:
      0.08090157 = sum of:
        0.08090157 = weight(_text_:e.g in 4071) [ClassicSimilarity], result of:
          0.08090157 = score(doc=4071,freq=2.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.34583107 = fieldWeight in 4071, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.046875 = fieldNorm(doc=4071)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Methods and tools for finding documents relevant to a user's needs in a document corpora can be found in the information retrieval, library science, and hypertext communities. Typically, these systems provide retrieval capabilities for fairly static copora, their algorithms are dependent on the language for which they are written, e.g. English, and they do not perform well when presented with misspelled words or text that has been degraded by OCR techniques. In this article, we present experimentation results for the TELLTALE system. TELLTALE is a dynamic hypertext environment that provides full-text search from a hypertext-style user interface for text corpora that may be garbled by OCR or transmission errors, and that may contain languages other than English. TELLTALE uses several techniques based on n-grams (n character sequences of text). With these results we show that the dynamic linkage mechanisms in TELLTALE are tolerant of garbles in up to 30% of the characters in the body of the texts
Landry, P.; Zumer, M.; Clavel-Merrin, G.: Report on cross-language subject access options (2006) 0.02
```
0.020225393 = product of:
  0.040450785 = sum of:
    0.040450785 = product of:
      0.08090157 = sum of:
        0.08090157 = weight(_text_:e.g in 2433) [ClassicSimilarity], result of:
          0.08090157 = score(doc=2433,freq=2.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.34583107 = fieldWeight in 2433, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.046875 = fieldNorm(doc=2433)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This report presents the results of desk-top based study of projects and initiatives in the area of linking and mapping subject tools. While its goal is to provide areas of further study for cross-language subject access in the European Library, and specifically the national libraries of the Ten New Member States, it is not restricted to cross-language mappings since some of the tools used to create links across thesauri or subject headings in the same language may also be appropriate for cross-language mapping. Tools reviewed have been selected to represent a variety of approaches (e.g. subject heading to subject heading, thesaurus to thesaurus, classification to subject heading) reflecting the variety of subject access tools in use in the European Library. The results show that there is no single solution that would be appropriate for all libraries but that parts of several initiatives may be applicable on a technical, organisational or content level.

Dini, L.: CACAO : multilingual access to bibliographic records (2007) 0.02

0.018226424 = product of:
  0.03645285 = sum of:
    0.03645285 = product of:
      0.0729057 = sum of:
        0.0729057 = weight(_text_:22 in 126) [ClassicSimilarity], result of:
          0.0729057 = score(doc=126,freq=2.0), product of:
            0.15702912 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044842023 = queryNorm
            0.46428138 = fieldWeight in 126, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=126)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Content: Vortrag anlässlich des Workshops: "Extending the multilingual capacity of The European Library in the EDL project Stockholm, Swedish National Library, 22-23 November 2007".

Niininen, S.; Nykyri, S.; Suominen, O.: ¬The future of metadata : open, linked, and multilingual - the YSO case (2017) 0.02
```
0.016854495 = product of:
  0.03370899 = sum of:
    0.03370899 = product of:
      0.06741798 = sum of:
        0.06741798 = weight(_text_:e.g in 3707) [ClassicSimilarity], result of:
          0.06741798 = score(doc=3707,freq=2.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.28819257 = fieldWeight in 3707, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3707)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Purpose The purpose of this paper is threefold: to focus on the process of multilingual concept scheme construction and the challenges involved; to addresses concrete challenges faced in the construction process and especially those related to equivalence between terms and concepts; and to briefly outlines the translation strategies developed during the process of concept scheme construction. Design/methodology/approach The analysis is based on experience acquired during the establishment of the Finnish thesaurus and ontology service Finto as well as the trilingual General Finnish Ontology YSO, both of which are being maintained and further developed at the National Library of Finland. Findings Although uniform resource identifiers can be considered language-independent, they do not render concept schemes and their construction free of language-related challenges. The fundamental issue with all the challenges faced is how to maintain consistency and predictability when the nature of language requires each concept to be treated individually. The key to such challenges is to recognise the function of the vocabulary and the needs of its intended users. Social implications Open science increases the transparency of not only research products, but also metadata tools. Gaining a deeper understanding of the challenges involved in their construction is important for a great variety of users - e.g. indexers, vocabulary builders and information seekers. Today, multilingualism is an essential aspect at both the national and international information society level. Originality/value This paper draws on the practical challenges faced in concept scheme construction in a trilingual environment, with a focus on "concept scheme" as a translation and mapping unit.
Steichen, B.; Lowe, R.: How do multilingual users search? : An investigation of query and result list language choices (2021) 0.02
```
0.016854495 = product of:
  0.03370899 = sum of:
    0.03370899 = product of:
      0.06741798 = sum of:
        0.06741798 = weight(_text_:e.g in 246) [ClassicSimilarity], result of:
          0.06741798 = score(doc=246,freq=2.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.28819257 = fieldWeight in 246, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.0390625 = fieldNorm(doc=246)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Many users of search systems are multilingual, that is, they are proficient in two or more languages. In order to better understand and support the language preferences and behaviors of such multilingual users, this paper presents a series of five large-scale studies that specifically elicit language choices regarding search queries and result lists. Overall, the results from the studies indicate that users frequently make use of different languages (i.e., not just their primary language), especially when they are provided with choices (e.g., when provided with a secondary language query or result list choice). In particular, when presented with a mixed-language list choice, participants choose this option to an almost equal extent compared to primary-language-only lists. Important factors leading to language choices are user-, task- and system-related, including proficiency, task topic, and result layout. Moreover, participants' subjective reasons for making particular choices indicate that their primary language is considered more comfortable, that the secondary language often has more relevant and trustworthy results, and that mixed-language lists provide a better overview. These results provide crucial insights into multilingual user preferences and behaviors, and may help in the design of systems that can better support the querying and result exploration of multilingual users.

Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.02

0.0151886875 = product of:
  0.030377375 = sum of:
    0.030377375 = product of:
      0.06075475 = sum of:
        0.06075475 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
          0.06075475 = score(doc=4157,freq=2.0), product of:
            0.15702912 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044842023 = queryNorm
            0.38690117 = fieldWeight in 4157, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill

Landry, P.: MACS: multilingual access to subject and link management : Extending the Multilingual Capacity of TEL in the EDL Project (2007) 0.02

0.0151886875 = product of:
  0.030377375 = sum of:
    0.030377375 = product of:
      0.06075475 = sum of:
        0.06075475 = weight(_text_:22 in 1287) [ClassicSimilarity], result of:
          0.06075475 = score(doc=1287,freq=2.0), product of:
            0.15702912 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044842023 = queryNorm
            0.38690117 = fieldWeight in 1287, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1287)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Content: Vortrag anlässlich des Workshops: "Extending the multilingual capacity of The European Library in the EDL project Stockholm, Swedish National Library, 22-23 November 2007".

Zhou, Y. et al.: Analysing entity context in multilingual Wikipedia to support entity-centric retrieval applications (2016) 0.02

0.0151886875 = product of:
  0.030377375 = sum of:
    0.030377375 = product of:
      0.06075475 = sum of:
        0.06075475 = weight(_text_:22 in 2758) [ClassicSimilarity], result of:
          0.06075475 = score(doc=2758,freq=2.0), product of:
            0.15702912 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044842023 = queryNorm
            0.38690117 = fieldWeight in 2758, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=2758)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 1. 2.2016 18:25:22

Celli, F. et al.: Enabling multilingual search through controlled vocabularies : the AGRIS approach (2016) 0.02

0.0151886875 = product of:
  0.030377375 = sum of:
    0.030377375 = product of:
      0.06075475 = sum of:
        0.06075475 = weight(_text_:22 in 3278) [ClassicSimilarity], result of:
          0.06075475 = score(doc=3278,freq=2.0), product of:
            0.15702912 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044842023 = queryNorm
            0.38690117 = fieldWeight in 3278, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=3278)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Metadata and semantics research: 10th International Conference, MTSR 2016, Göttingen, Germany, November 22-25, 2016, Proceedings. Eds.: E. Garoufallou

Cross-language information retrieval (1998) 0.01
```
0.014596419 = product of:
  0.029192839 = sum of:
    0.029192839 = product of:
      0.058385678 = sum of:
        0.058385678 = weight(_text_:e.g in 6299) [ClassicSimilarity], result of:
          0.058385678 = score(doc=6299,freq=6.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.24958208 = fieldWeight in 6299, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.01953125 = fieldNorm(doc=6299)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Footnote

Rez. in: Machine translation review: 1999, no.10, S.26-27 (D. Lewis): "Cross Language Information Retrieval (CLIR) addresses the growing need to access large volumes of data across language boundaries. The typical requirement is for the user to input a free form query, usually a brief description of a topic, into a search or retrieval engine which returns a list, in ranked order, of documents or web pages that are relevant to the topic. The search engine matches the terms in the query to indexed terms, usually keywords previously derived from the target documents. Unlike monolingual information retrieval, CLIR requires query terms in one language to be matched to indexed terms in another. Matching can be done by bilingual dictionary lookup, full machine translation, or by applying statistical methods. A query's success is measured in terms of recall (how many potentially relevant target documents are found) and precision (what proportion of documents found are relevant). Issues in CLIR are how to translate query terms into index terms, how to eliminate alternative translations (e.g. to decide that French 'traitement' in a query means 'treatment' and not 'salary'), and how to rank or weight translation alternatives that are retained (e.g. how to order the French terms 'aventure', 'business', 'affaire', and 'liaison' as relevant translations of English 'affair'). Grefenstette provides a lucid and useful overview of the field and the problems. The volume brings together a number of experiments and projects in CLIR. Mark Davies (New Mexico State University) describes Recuerdo, a Spanish retrieval engine which reduces translation ambiguities by scanning indexes for parallel texts; it also uses either a bilingual dictionary or direct equivalents from a parallel corpus in order to compare results for queries on parallel texts. Lisa Ballesteros and Bruce Croft (University of Massachusetts) use a 'local feedback' technique which automatically enhances a query by adding extra terms to it both before and after translation; such terms can be derived from documents known to be relevant to the query.
The retrieved output from a query including the phrase 'big rockets' may be, for instance, a sentence containing 'giant rocket' which is semantically ranked above 'military ocket'. David Hull (Xerox Research Centre, Grenoble) describes an implementation of a weighted Boolean model for Spanish-English CLIR. Users construct Boolean-type queries, weighting each term in the query, which is then translated by an on-line dictionary before being applied to the database. Comparisons with the performance of unweighted free-form queries ('vector space' models) proved encouraging. Two contributions consider the evaluation of CLIR systems. In order to by-pass the time-consuming and expensive process of assembling a standard collection of documents and of user queries against which the performance of an CLIR system is manually assessed, Páriac Sheridan et al (ETH Zurich) propose a method based on retrieving 'seed documents'. This involves identifying a unique document in a database (the 'seed document') and, for a number of queries, measuring how fast it is retrieved. The authors have also assembled a large database of multilingual news documents for testing purposes. By storing the (fairly short) documents in a structured form tagged with descriptor codes (e.g. for topic, country and area), the test suite is easily expanded while remaining consistent for the purposes of testing. Douglas Ouard and Bonne Dorr (University of Maryland) describe an evaluation methodology which appears to apply LSI techniques in order to filter and rank incoming documents designed for testing CLIR systems. The volume provides the reader an excellent overview of several projects in CLIR. It is well supported with references and is intended as a secondary text for researchers and practitioners. It highlights the need for a good, general tutorial introduction to the field."
Riesthuis, G.J.A.: Information languages and multilingual subject access (2003) 0.01
```
0.013483594 = product of:
  0.026967188 = sum of:
    0.026967188 = product of:
      0.053934377 = sum of:
        0.053934377 = weight(_text_:e.g in 3963) [ClassicSimilarity], result of:
          0.053934377 = score(doc=3963,freq=2.0), product of:
            0.23393378 = queryWeight, product of:
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.044842023 = queryNorm
            0.23055404 = fieldWeight in 3963, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2168427 = idf(docFreq=651, maxDocs=44218)
              0.03125 = fieldNorm(doc=3963)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Content

"1. Introduction Multilingual and crosslingual access to information is receiving more and more attention. Maybe the most important reason for this development is the Internet. There are estimations that about half of its users are people with a mother tongue other than English and that this proportion is growing. Crosslingual access in this context means the possibility to get free text access to information using another (natural) language than the language of the information itself. This type of access is important for users with a good passive knowledge of a language but with only a small active vocabulary of the same language, e.g. a Englishman who can read Russian, but has difficulties in formulating adequate search request in that language. Crosslingual access can also be valuable for monolingual users who can automatically or manually have translations of foreign language documents. The search requests will be translated or converted into the language of the information. Multilingual access assumes that the instruments used for access, the controlled information languages, are available in more than one language. An classic example is the Englishman who uses his English edition of the Universal Decimal Classification to search the catalogue of a library in China, although the classification of the library is done using a Chinese edition. In this case the searching and the classifying results in a notation that is the same irrespective which language edition was used for indexing. Another possibility is the use of a multilingual thesaurus or subject headings list, such the trilingual edition of the Library of Congress Subject Headings built at the Royal Library in Brussels (Belgium) or the Macrothesaurus of the OECD. Here, words are the access points - in one language into which each search request will be converted, or, alternatively, into all the languages involved. Multilingual information languages and guidelines an how to build them are the subject of this paper. Particular attention will be paid to multilingual thesauri."
Mitchell, J.S.; Zeng, M.L.; Zumer, M.: Modeling classification systems in multicultural and multilingual contexts (2012) 0.01
```
0.012888029 = product of:
  0.025776058 = sum of:
    0.025776058 = product of:
      0.051552117 = sum of:
        0.051552117 = weight(_text_:22 in 1967) [ClassicSimilarity], result of:
          0.051552117 = score(doc=1967,freq=4.0), product of:
            0.15702912 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044842023 = queryNorm
            0.32829654 = fieldWeight in 1967, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=1967)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This paper reports on the second part of an initiative of the authors on researching classification systems with the conceptual model defined by the Functional Requirements for Subject Authority Data (FRSAD) final report. In an earlier study, the authors explored whether the FRSAD conceptual model could be extended beyond subject authority data to model classification data. The focus of the current study is to determine if classification data modeled using FRSAD can be used to solve real-world discovery problems in multicultural and multilingual contexts. The paper discusses the relationships between entities (same type or different types) in the context of classification systems that involve multiple translations and /or multicultural implementations. Results of two case studies are presented in detail: (a) two instances of the DDC (DDC 22 in English, and the Swedish-English mixed translation of DDC 22), and (b) Chinese Library Classification. The use cases of conceptual models in practice are also discussed.

Timotin, A.: Multilingvism si tezaure de concepte (1994) 0.01

0.01215095 = product of:
  0.0243019 = sum of:
    0.0243019 = product of:
      0.0486038 = sum of:
        0.0486038 = weight(_text_:22 in 7887) [ClassicSimilarity], result of:
          0.0486038 = score(doc=7887,freq=2.0), product of:
            0.15702912 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044842023 = queryNorm
            0.30952093 = fieldWeight in 7887, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=7887)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Probleme de Informare si Documentare. 28(1994) no.1, S.13-22

Cao, L.; Leong, M.-K.; Low, H.-B.: Searching heterogeneous multilingual bibliographic sources (1998) 0.01

0.01215095 = product of:
  0.0243019 = sum of:
    0.0243019 = product of:
      0.0486038 = sum of:
        0.0486038 = weight(_text_:22 in 3564) [ClassicSimilarity], result of:
          0.0486038 = score(doc=3564,freq=2.0), product of:
            0.15702912 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044842023 = queryNorm
            0.30952093 = fieldWeight in 3564, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=3564)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 1. 8.1996 22:08:06

Search (43 results, page 1 of 3)

Authors

Years

Languages

Types

Themes

Classifications