Search (40 results, page 1 of 2)

  • × theme_ss:"Multilinguale Probleme"
  • × year_i:[2000 TO 2010}
  1. Widdra, M.: Besonderheiten eines multilingualen Thesaurus : Am Beispiel des "Europäischen Thesaurus Internationale Beziehungen und Länderkunde" (2004) 0.06
    0.0629623 = product of:
      0.1259246 = sum of:
        0.1259246 = product of:
          0.2518492 = sum of:
            0.2518492 = weight(_text_:thesaurus in 4628) [ClassicSimilarity], result of:
              0.2518492 = score(doc=4628,freq=6.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                1.061183 = fieldWeight in 4628, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4628)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  2. Subirats, I.; Prasad, A.R.D.; Keizer, J.; Bagdanov, A.: Implementation of rich metadata formats and demantic tools using DSpace (2008) 0.06
    0.06238496 = product of:
      0.12476992 = sum of:
        0.12476992 = sum of:
          0.09693679 = weight(_text_:thesaurus in 2656) [ClassicSimilarity], result of:
            0.09693679 = score(doc=2656,freq=8.0), product of:
              0.23732872 = queryWeight, product of:
                4.6210785 = idf(docFreq=1182, maxDocs=44218)
                0.051357865 = queryNorm
              0.40844947 = fieldWeight in 2656, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.6210785 = idf(docFreq=1182, maxDocs=44218)
                0.03125 = fieldNorm(doc=2656)
          0.027833126 = weight(_text_:22 in 2656) [ClassicSimilarity], result of:
            0.027833126 = score(doc=2656,freq=2.0), product of:
              0.1798465 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.051357865 = queryNorm
              0.15476047 = fieldWeight in 2656, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.03125 = fieldNorm(doc=2656)
      0.5 = coord(1/2)
    
    Abstract
    This poster explores the customization of DSpace to allow the use of the AGRIS Application Profile metadata standard and the AGROVOC thesaurus. The objective is the adaptation of DSpace, through the least invasive code changes either in the form of plug-ins or add-ons, to the specific needs of the Agricultural Sciences and Technology community. Metadata standards such as AGRIS AP, and Knowledge Organization Systems such as the AGROVOC thesaurus, provide mechanisms for sharing information in a standardized manner by recommending the use of common semantics and interoperable syntax (Subirats et al., 2007). AGRIS AP was created to enhance the description, exchange and subsequent retrieval of agricultural Document-like Information Objects (DLIOs). It is a metadata schema which draws from Metadata standards such as Dublin Core (DC), the Australian Government Locator Service Metadata (AGLS) and the Agricultural Metadata Element Set (AgMES) namespaces. It allows sharing of information across dispersed bibliographic systems (FAO, 2005). AGROVOC68 is a multilingual structured thesaurus covering agricultural and related domains. Its main role is to standardize the indexing process in order to make searching simpler and more efficient. AGROVOC is developed by FAO (Lauser et al., 2006). The customization of the DSpace is taking place in several phases. First, the AGRIS AP metadata schema was mapped onto the metadata DSpace model, with several enhancements implemented to support AGRIS AP elements. Next, AGROVOC will be integrated as a controlled vocabulary accessed through a local SKOS or OWL file. Eventually the system will be configurable to access AGROVOC through local files or remotely via webservices. Finally, spell checking and tooltips will be incorporated in the user interface to support metadata editing. Adapting DSpace to support AGRIS AP and annotation using the semantically-rich AGROVOC thesaurus transform DSpace into a powerful, domain-specific system for annotation and exchange of bibliographic metadata in the agricultural domain.
    Source
    Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas
  3. Retti, G.; Stehno, B.: ¬The Laurin thesaurus : a large, multilingual, electronic thesaurus for newspaper clipping archives (2004) 0.06
    0.056102958 = product of:
      0.112205915 = sum of:
        0.112205915 = product of:
          0.22441183 = sum of:
            0.22441183 = weight(_text_:thesaurus in 4431) [ClassicSimilarity], result of:
              0.22441183 = score(doc=4431,freq=14.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.9455738 = fieldWeight in 4431, product of:
                  3.7416575 = tf(freq=14.0), with freq of:
                    14.0 = termFreq=14.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=4431)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This paper describes the Laurin thesaurus, which is used for indexing and searching in the Laurin system, a software package for digital clipping archives. As a multilingual thesaurus it complies with the corresponding standards, though presenting some approaches going beyond some of the standards' recommendations. The Laurin thesaurus integrates all kind of indexing terms, not only keywords, but proper names as well. The system of categories and relationships is described in detail.
    Object
    Laurin-Thesaurus
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  4. Erstellung multilingualer Thesauri mit Hilfe moderner Softwaretools (2000) 0.05
    0.0514085 = product of:
      0.102817 = sum of:
        0.102817 = product of:
          0.205634 = sum of:
            0.205634 = weight(_text_:thesaurus in 4910) [ClassicSimilarity], result of:
              0.205634 = score(doc=4910,freq=4.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.8664522 = fieldWeight in 4910, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4910)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Object
    UDK-Thesaurus
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  5. Schruff, R.; Dickhaus, C.: ¬Der multilinguale Thesaurus Ethik in den Biowissenschaften (2004) 0.05
    0.048468396 = product of:
      0.09693679 = sum of:
        0.09693679 = product of:
          0.19387358 = sum of:
            0.19387358 = weight(_text_:thesaurus in 3045) [ClassicSimilarity], result of:
              0.19387358 = score(doc=3045,freq=8.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.81689894 = fieldWeight in 3045, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3045)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Das Deutsche Referenzzentrum für Ethik in den Biowissenschaften (DRZE) in Bonn veröffentlicht zusammen mit nationalen und internationalen Partnern den neuen, mehrsprachigen Thesaurus Ethik in den Biowissenschaften. Das Werkzeug zur Recherche und Verschlagwortung liegt derzeit in deutscher und englischer Sprache vor. Eine französische Version folgt 2005. Insbesondere die Online-Version des Thesaurus bietet komfortable Navigations- und Suchmöglichkeiten.
    Object
    Bioethik-Thesaurus
  6. Keränen, S.: Equivalence and focus of translation in multicultural thesaurus construction (2006) 0.04
    0.04240985 = product of:
      0.0848197 = sum of:
        0.0848197 = product of:
          0.1696394 = sum of:
            0.1696394 = weight(_text_:thesaurus in 237) [ClassicSimilarity], result of:
              0.1696394 = score(doc=237,freq=8.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.7147866 = fieldWeight in 237, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=237)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This paper reports a part of an on-going PhD study on problems related to multicultural social science thesaurus construction in the general frame of information science. The main analysis methods used are discourse analysis and co-word analysis. In theoretical framework the emphasis is on communicative equivalence theories and different aims of thesaurus translation are discussed. Some examples are given how co-word analysis can be used to study contextual equivalence.
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  7. Frâncu, V.: Harmonizing a universal classification system with an interdisciplinary multilingual thesaurus : advantages and limitations (2000) 0.04
    0.03710089 = product of:
      0.07420178 = sum of:
        0.07420178 = product of:
          0.14840356 = sum of:
            0.14840356 = weight(_text_:thesaurus in 108) [ClassicSimilarity], result of:
              0.14840356 = score(doc=108,freq=12.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.62530804 = fieldWeight in 108, product of:
                  3.4641016 = tf(freq=12.0), with freq of:
                    12.0 = termFreq=12.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=108)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The case under consideration is a project of building an interdisciplinary multilingual thesaurus (Romanian-English-French) starting from a list of indexing terms based on an abridged version of the Universal Decimal Classification (UDC). The resulting thesaurus is intended for public libraries for both indexing and searching purposes in bibliographic databases covering a wide range of topics but with a fairly low level of specificity. The problems encountered in such an approach fall into two groups: 1) concordance or compatibility problems in terms of the indexing languages considered (between a classification system and a thesaurus); 2) equivalence and, hence, translatability problems in terms of the natural languages involved. Additionally, the question of ambiguity given the co-occurrence of terms in more than one class, will be discussed with reference to homographs and polysemantic words. In a thesaurus with such a wide coverage yet with a low specificity level, the method adopted in the thesaurus construction was to provide as many lead-in terms as possible and post them up to the closest in meaning broader term in order to improve the recall ratio
  8. Riesthuis, G.J.A.: Multilingual subject access and the Guidelines for the establishment and development of multilingual thesauri : an experimental study (2000) 0.04
    0.036351297 = product of:
      0.072702594 = sum of:
        0.072702594 = product of:
          0.14540519 = sum of:
            0.14540519 = weight(_text_:thesaurus in 131) [ClassicSimilarity], result of:
              0.14540519 = score(doc=131,freq=8.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.61267424 = fieldWeight in 131, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.046875 = fieldNorm(doc=131)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    In this paper, after an introduction about problems of multilingual information languages, the rules and recommendations of the Guidelines for the establishment and development of multilingual thesauri for non-equivalence and partial equivalence of terms in different languages are discussed. Artificial terms are not very useful in searching, because most users are not willing to use a thesaurus to find the right descriptor. On the other hand indexers need guidance on how to index and therefore need a thesaurus with all desirable and necessary relations. It is suggested that bibliographic online systems can take over some of the functions for the searcher from the thesaurus and that a few new relations could be helpful to an indexer
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  9. Francu, V.: Multilingual access to information using an intermediate language (2003) 0.03
    0.03205883 = product of:
      0.06411766 = sum of:
        0.06411766 = product of:
          0.12823533 = sum of:
            0.12823533 = weight(_text_:thesaurus in 1742) [ClassicSimilarity], result of:
              0.12823533 = score(doc=1742,freq=14.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.5403279 = fieldWeight in 1742, product of:
                  3.7416575 = tf(freq=14.0), with freq of:
                    14.0 = termFreq=14.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1742)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    While being theoretically so widely available, information can be restricted from a more general use by linguistic barriers. The linguistic aspects of the information languages and particularly the chances of an enhanced access to information by means of multilingual access facilities will make the substance of this thesis. The main problem of this research is thus to demonstrate that information retrieval can be improved by using multilingual thesaurus terms based on an intermediate or switching language to search with. Universal classification systems in general can play the role of switching languages for reasons dealt with in the forthcoming pages. The Universal Decimal Classification (UDC) in particular is the classification system used as example of a switching language for our objectives. The question may arise: why a universal classification system and not another thesaurus? Because the UDC like most of the classification systems uses symbols. Therefore, it is language independent and the problems of compatibility between such a thesaurus and different other thesauri in different languages are avoided. Another question may still arise? Why not then, assign running numbers to the descriptors in a thesaurus and make a switching language out of the resulting enumerative system? Because of some other characteristics of the UDC: hierarchical structure and terminological richness, consistency and control. One big problem to find an answer to is: can a thesaurus be made having as a basis a classification system in any and all its parts? To what extent this question can be given an affirmative answer? This depends much on the attributes of the universal classification system which can be favourably used to this purpose. Examples of different situations will be given and discussed upon beginning with those classes of UDC which are best fitted for building a thesaurus structure out of them (classes which are both hierarchical and faceted)...
    Content
    Inhalt: INFORMATION LANGUAGES: A LINGUISTIC APPROACH MULTILINGUAL ASPECTS IN INFORMATION STORAGE AND RETRIEVAL COMPATIBILITY AND CONVERTIBILITY OF INFORMATION LANGUAGES CURRENT TRENDS IN MULTILINGUAL ACCESS BUILDING UDC-BASED MULTILINGUAL THESAURI ONLINE APPLICATIONS OF THE UDC-BASED MULTILINGUAL THESAURI THE IMPACT OF SPECIFICITY ON THE RETRIEVAL POWER OF A UDC-BASED MULTILINGUAL THESAURUS FINAL REMARKS AND GENERAL CONCLUSIONS Proefschrift voorgelegd tot het behalen van de graad van doctor in de Taal- en Letterkunde aan de Universiteit Antwerpen. - Vgl.: http://dlist.sir.arizona.edu/1862/.
  10. Hudon, M.: Relationships in multilingual thesauri (2001) 0.03
    0.03148115 = product of:
      0.0629623 = sum of:
        0.0629623 = product of:
          0.1259246 = sum of:
            0.1259246 = weight(_text_:thesaurus in 1147) [ClassicSimilarity], result of:
              0.1259246 = score(doc=1147,freq=6.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.5305915 = fieldWeight in 1147, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1147)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Because the multilingual thesaurus has a critical role to play in the global networked information world, its relational structure must come under close scrutiny. Traditionally, identity of relational structures has been sought for the different language versions of a multilingual thesaurus, often leading to the artificialization of all target languages. The various types of cross-lingual and intralingual relations found in thesauri are examined in the context of two questions: Are all types of thesaural relations transferable from one language to another? and Are the two members of a valid relation in a source language always the same in the target language(s)? Two options for resolving semantic conflicts in multilingual thesauri are presented.
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  11. Jorna, K.; Davies, S.: Multilingual thesauri for the modern world : no ideal solution? (2001) 0.03
    0.03148115 = product of:
      0.0629623 = sum of:
        0.0629623 = product of:
          0.1259246 = sum of:
            0.1259246 = weight(_text_:thesaurus in 4486) [ClassicSimilarity], result of:
              0.1259246 = score(doc=4486,freq=6.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.5305915 = fieldWeight in 4486, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4486)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    In the 21st century, multilingual tools are gaining importance as increasingly diverse user groups from different cultural and linguistic backgrounds seek access to equally diverse pieces of information. The authors of this paper believe that most current forms of multilingual information access are inadequate for this role, and that a new form of multilingual thesaurus is required. The core of this paper introduces their pilot thesaurus InfoDEFT as a possible model for new online thesauri, which are semantically structured, encyclopedic and multilingual. The authors conclude that while the manual construction of such thesauri is labour intensive and hence costly, pilot thesauri can be used as training sets for artificial learning programmes, thus increasing their volume considerably at relatively little extra cost.
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  12. Landry, P.: Multilingual subject access : the linking approach of MACS (2004) 0.03
    0.03148115 = product of:
      0.0629623 = sum of:
        0.0629623 = product of:
          0.1259246 = sum of:
            0.1259246 = weight(_text_:thesaurus in 5009) [ClassicSimilarity], result of:
              0.1259246 = score(doc=5009,freq=6.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.5305915 = fieldWeight in 5009, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5009)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The MACS (Multilingual access to subjects) project is one of the many projects that are currently exploring solutions to multilingual subject access to online catalogs. Its strategy is to develop a Web based link and search interface through which equivalents between three Subject Heading Languages: SWD/RSWK (Schlagwortnormdatei/Regeln für den Schlagwortkatalog) for German, RAMEAU (Repertoire d'Autorite-Matière Encyclopedique et Alphabetique Unifie) for French and LCSH (Library of Congress Subject Headings) for English can be created and maintained, and by which users can access online databases in the language of their choice. Factors that have lead to this approach will be examined and the MACS linking strategy will be explained. The trend to using mapping or linking strategies between different controlled vocabularies to create multilingual access challenges the traditional view of the multilingual thesaurus.
    Footnote
    Auch in: The thesaurus: review, renaissance and revision. Ed. by S.K. Roe u. A.R. Thomas. Binghamton: Haworth 2004.
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  13. Raghavan, K.S.; Neelameghan, A.: Design and development of a bilingual thesaurus for classical Tamil studies : experiences and issues (2008) 0.03
    0.03148115 = product of:
      0.0629623 = sum of:
        0.0629623 = product of:
          0.1259246 = sum of:
            0.1259246 = weight(_text_:thesaurus in 2245) [ClassicSimilarity], result of:
              0.1259246 = score(doc=2245,freq=6.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.5305915 = fieldWeight in 2245, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2245)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Content
    Based on experiences with the design and development of a Tamil-English bilingual thesaurus this paper discusses with examples: (1) some issues related to vocabulary management in multilingual thesauri in culture-specific domains; (2) special aspects of the Tamil language in this regard; (3) alternative ways of linking certain descriptors to lengthy lists of NTs and RTs; (4) advantages of integrated use of two or more knowledge organization tools; and (5) use of the bilingual thesaurus for certain types of research in Tamil. Issues related to equivalence, non-hierarchic associative relationships, homographs, NTs are discussed. The paper suggests that integrated use of two or more knowledge organization tools adds value.
  14. Li, K.W.; Yang, C.C.: Automatic crosslingual thesaurus generated from the Hong Kong SAR Police Department Web Corpus for Crime Analysis (2005) 0.03
    0.027094658 = product of:
      0.054189317 = sum of:
        0.054189317 = product of:
          0.108378634 = sum of:
            0.108378634 = weight(_text_:thesaurus in 3391) [ClassicSimilarity], result of:
              0.108378634 = score(doc=3391,freq=10.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.45666042 = fieldWeight in 3391, product of:
                  3.1622777 = tf(freq=10.0), with freq of:
                    10.0 = termFreq=10.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3391)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    For the sake of national security, very large volumes of data and information are generated and gathered daily. Much of this data and information is written in different languages, stored in different locations, and may be seemingly unconnected. Crosslingual semantic interoperability is a major challenge to generate an overview of this disparate data and information so that it can be analyzed, shared, searched, and summarized. The recent terrorist attacks and the tragic events of September 11, 2001 have prompted increased attention an national security and criminal analysis. Many Asian countries and cities, such as Japan, Taiwan, and Singapore, have been advised that they may become the next targets of terrorist attacks. Semantic interoperability has been a focus in digital library research. Traditional information retrieval (IR) approaches normally require a document to share some common keywords with the query. Generating the associations for the related terms between the two term spaces of users and documents is an important issue. The problem can be viewed as the creation of a thesaurus. Apart from this, terrorists and criminals may communicate through letters, e-mails, and faxes in languages other than English. The translation ambiguity significantly exacerbates the retrieval problem. The problem is expanded to crosslingual semantic interoperability. In this paper, we focus an the English/Chinese crosslingual semantic interoperability problem. However, the developed techniques are not limited to English and Chinese languages but can be applied to many other languages. English and Chinese are popular languages in the Asian region. Much information about national security or crime is communicated in these languages. An efficient automatically generated thesaurus between these languages is important to crosslingual information retrieval between English and Chinese languages. To facilitate crosslingual information retrieval, a corpus-based approach uses the term co-occurrence statistics in parallel or comparable corpora to construct a statistical translation model to cross the language boundary. In this paper, the text based approach to align English/Chinese Hong Kong Police press release documents from the Web is first presented. We also introduce an algorithmic approach to generate a robust knowledge base based an statistical correlation analysis of the semantics (knowledge) embedded in the bilingual press release corpus. The research output consisted of a thesaurus-like, semantic network knowledge base, which can aid in semanticsbased crosslingual information management and retrieval.
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  15. Francu, V.: ¬The impact of specificity on the retrieval power of a UDC-based multilingual thesaurus (2003) 0.03
    0.02570425 = product of:
      0.0514085 = sum of:
        0.0514085 = product of:
          0.102817 = sum of:
            0.102817 = weight(_text_:thesaurus in 5518) [ClassicSimilarity], result of:
              0.102817 = score(doc=5518,freq=4.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.4332261 = fieldWeight in 5518, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5518)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The article describes the research done over a bibliographic database in order to show the impact the specificity of the knowledge organising tools may have on information retrieval (IR). For this purpose two multilingual Universal Decimal Classification (UDC) based thesauri having different degrees of specificity are considered. Issues of harmonising a classificatory structure with a thesaurus structure are introduced, and significant aspects of information retrieval in a multilingual environment are examined in an extensive manner. Aspects of complementarity are discussed with particular emphasis on the real impact produced on IR by alternative search facilities. Finally, a number of conclusions are formulated as they arise from the study.
  16. Landry, P.; Zumer, M.; Clavel-Merrin, G.: Report on cross-language subject access options (2006) 0.03
    0.02570425 = product of:
      0.0514085 = sum of:
        0.0514085 = product of:
          0.102817 = sum of:
            0.102817 = weight(_text_:thesaurus in 2433) [ClassicSimilarity], result of:
              0.102817 = score(doc=2433,freq=4.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.4332261 = fieldWeight in 2433, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2433)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This report presents the results of desk-top based study of projects and initiatives in the area of linking and mapping subject tools. While its goal is to provide areas of further study for cross-language subject access in the European Library, and specifically the national libraries of the Ten New Member States, it is not restricted to cross-language mappings since some of the tools used to create links across thesauri or subject headings in the same language may also be appropriate for cross-language mapping. Tools reviewed have been selected to represent a variety of approaches (e.g. subject heading to subject heading, thesaurus to thesaurus, classification to subject heading) reflecting the variety of subject access tools in use in the European Library. The results show that there is no single solution that would be appropriate for all libraries but that parts of several initiatives may be applicable on a technical, organisational or content level.
  17. Landry, P.: Multilingual subject access : the linking approach of MACS (2004) 0.02
    0.021204924 = product of:
      0.04240985 = sum of:
        0.04240985 = product of:
          0.0848197 = sum of:
            0.0848197 = weight(_text_:thesaurus in 4825) [ClassicSimilarity], result of:
              0.0848197 = score(doc=4825,freq=2.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.3573933 = fieldWeight in 4825, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=4825)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    The thesaurus: review, renaissance and revision. Ed. by S.K. Roe u. A.R. Thomas
  18. Riesthuis, G.J.A.: Information languages and multilingual subject access (2003) 0.02
    0.020987432 = product of:
      0.041974865 = sum of:
        0.041974865 = product of:
          0.08394973 = sum of:
            0.08394973 = weight(_text_:thesaurus in 3963) [ClassicSimilarity], result of:
              0.08394973 = score(doc=3963,freq=6.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.35372764 = fieldWeight in 3963, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3963)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    In this paper the possibilities for a multilingual thesaurus in which not all descriptors in a given language have equivalent descriptors in all other languages and in which the hierarchical structure can have variations in the different languages and a small model of such a thesaurus is given. lt is argued that the searching possibilities that more recent programs for bibliographic databases offer make such non-identical thesauri possible.
    Content
    "1. Introduction Multilingual and crosslingual access to information is receiving more and more attention. Maybe the most important reason for this development is the Internet. There are estimations that about half of its users are people with a mother tongue other than English and that this proportion is growing. Crosslingual access in this context means the possibility to get free text access to information using another (natural) language than the language of the information itself. This type of access is important for users with a good passive knowledge of a language but with only a small active vocabulary of the same language, e.g. a Englishman who can read Russian, but has difficulties in formulating adequate search request in that language. Crosslingual access can also be valuable for monolingual users who can automatically or manually have translations of foreign language documents. The search requests will be translated or converted into the language of the information. Multilingual access assumes that the instruments used for access, the controlled information languages, are available in more than one language. An classic example is the Englishman who uses his English edition of the Universal Decimal Classification to search the catalogue of a library in China, although the classification of the library is done using a Chinese edition. In this case the searching and the classifying results in a notation that is the same irrespective which language edition was used for indexing. Another possibility is the use of a multilingual thesaurus or subject headings list, such the trilingual edition of the Library of Congress Subject Headings built at the Royal Library in Brussels (Belgium) or the Macrothesaurus of the OECD. Here, words are the access points - in one language into which each search request will be converted, or, alternatively, into all the languages involved. Multilingual information languages and guidelines an how to build them are the subject of this paper. Particular attention will be paid to multilingual thesauri."
  19. Dini, L.: CACAO : multilingual access to bibliographic records (2007) 0.02
    0.020874845 = product of:
      0.04174969 = sum of:
        0.04174969 = product of:
          0.08349938 = sum of:
            0.08349938 = weight(_text_:22 in 126) [ClassicSimilarity], result of:
              0.08349938 = score(doc=126,freq=2.0), product of:
                0.1798465 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051357865 = queryNorm
                0.46428138 = fieldWeight in 126, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=126)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Content
    Vortrag anlässlich des Workshops: "Extending the multilingual capacity of The European Library in the EDL project Stockholm, Swedish National Library, 22-23 November 2007".
  20. Evens, M.: Thesaural relations in information retrieval (2002) 0.02
    0.018175649 = product of:
      0.036351297 = sum of:
        0.036351297 = product of:
          0.072702594 = sum of:
            0.072702594 = weight(_text_:thesaurus in 1201) [ClassicSimilarity], result of:
              0.072702594 = score(doc=1201,freq=2.0), product of:
                0.23732872 = queryWeight, product of:
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.051357865 = queryNorm
                0.30633712 = fieldWeight in 1201, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.6210785 = idf(docFreq=1182, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1201)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Thesaural relations have long been used in information retrieval to enrich queries; they have sometimes been used to cluster documents as well. Sometimes the first query to an information retrieval system yields no results at all, or, what can be even more disconcerting, many thousands of hits. One solution is to rephrase the query, improving the choice of query terms by using related terms of different types. A collection of related terms is often called a thesaurus. This chapter describes the lexical-semantic relations that have been used in building thesauri and summarizes some of the effects of using these relational thesauri in information retrieval experiments