Search (113 results, page 1 of 6)

Wang, F.L.; Yang, C.C.: ¬The impact analysis of language differences on an automatic multilingual text summarization system (2006) 0.03

0.031518973 = product of:
  0.11556956 = sum of:
    0.07568369 = weight(_text_:effect in 5049) [ClassicSimilarity], result of:
      0.07568369 = score(doc=5049,freq=4.0), product of:
        0.18289955 = queryWeight, product of:
          5.29663 = idf(docFreq=601, maxDocs=44218)
          0.034531306 = queryNorm
        0.41379923 = fieldWeight in 5049, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.29663 = idf(docFreq=601, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5049)
    0.015471167 = weight(_text_:of in 5049) [ClassicSimilarity], result of:
      0.015471167 = score(doc=5049,freq=22.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.28651062 = fieldWeight in 5049, product of:
          4.690416 = tf(freq=22.0), with freq of:
            22.0 = termFreq=22.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5049)
    0.024414703 = weight(_text_:on in 5049) [ClassicSimilarity], result of:
      0.024414703 = score(doc=5049,freq=14.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.3214632 = fieldWeight in 5049, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5049)
  0.27272728 = coord(3/11)

Abstract: Based on the salient features of the documents, automatic text summarization systems extract the key sentences from source documents. This process supports the users in evaluating the relevance of the extracted documents returned by information retrieval systems. Because of this tool, efficient filtering can be achieved. Indirectly, these systems help to resolve the problem of information overloading. Many automatic text summarization systems have been implemented for use with different languages. It has been established that the grammatical and lexical differences between languages have a significant effect on text processing. However, the impact of the language differences on the automatic text summarization systems has not yet been investigated. The authors provide an impact analysis of language difference on automatic text summarization. It includes the effect on the extraction processes, the scoring mechanisms, the performance, and the matching of the extracted sentences, using the parallel corpus in English and Chinese as the tested object. The analysis results provide a greater understanding of language differences and promote the future development of more advanced text summarization techniques.
Footnote: Beitrag einer special topic section on multilingual information systems
Source: Journal of the American Society for Information Science and Technology. 57(2006) no.5, S.684-696

Kralisch, A.; Berendt, B.: Language-sensitive search behaviour and the role of domain knowledge (2005) 0.03

0.026784439 = product of:
  0.098209605 = sum of:
    0.064219736 = weight(_text_:effect in 5919) [ClassicSimilarity], result of:
      0.064219736 = score(doc=5919,freq=2.0), product of:
        0.18289955 = queryWeight, product of:
          5.29663 = idf(docFreq=601, maxDocs=44218)
          0.034531306 = queryNorm
        0.35112026 = fieldWeight in 5919, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.29663 = idf(docFreq=601, maxDocs=44218)
          0.046875 = fieldNorm(doc=5919)
    0.014810067 = weight(_text_:of in 5919) [ClassicSimilarity], result of:
      0.014810067 = score(doc=5919,freq=14.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.2742677 = fieldWeight in 5919, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=5919)
    0.01917981 = weight(_text_:on in 5919) [ClassicSimilarity], result of:
      0.01917981 = score(doc=5919,freq=6.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.25253648 = fieldWeight in 5919, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=5919)
  0.27272728 = coord(3/11)

Abstract: While many websites aim at a large and linguistically diversified audience, they present their information mostly in the languages of larger speakers groups. Little is known about the effect on accessibility. We investigated the influence of a site's language offer on website access and search behaviour with two studies, and studied the interaction of language offers and domain knowledge. To achieve high ecological validity, we analysed data from a multilingual site's web-server logfile and from a questionnaire posted on it, and compared the behaviour of users who accessed the site in a non-native language to that of users who accessed it in their native language. Results from 277,809 user sessions and 165 international survey participants indicate that a website's languages may strongly reduce website access by users not supplied with information in their native language. Once inside a site, non-native speakers with high domain knowledge behave similarly to native speakers. However, non-native speakers' behaviour becomes language-sensitive when they have low domain knowledge.
Source: New review of hypermedia and multimedia. 11(2005) no.2, S.221-246

Airio, E.: Who benefits from CLIR in web retrieval? (2008) 0.03

0.026784439 = product of:
  0.098209605 = sum of:
    0.064219736 = weight(_text_:effect in 2342) [ClassicSimilarity], result of:
      0.064219736 = score(doc=2342,freq=2.0), product of:
        0.18289955 = queryWeight, product of:
          5.29663 = idf(docFreq=601, maxDocs=44218)
          0.034531306 = queryNorm
        0.35112026 = fieldWeight in 2342, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.29663 = idf(docFreq=601, maxDocs=44218)
          0.046875 = fieldNorm(doc=2342)
    0.014810067 = weight(_text_:of in 2342) [ClassicSimilarity], result of:
      0.014810067 = score(doc=2342,freq=14.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.2742677 = fieldWeight in 2342, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2342)
    0.01917981 = weight(_text_:on in 2342) [ClassicSimilarity], result of:
      0.01917981 = score(doc=2342,freq=6.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.25253648 = fieldWeight in 2342, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=2342)
  0.27272728 = coord(3/11)

Abstract: Purpose - The aim of the current paper is to test whether query translation is beneficial in web retrieval. Design/methodology/approach - The language pairs were Finnish-Swedish, English-German and Finnish-French. A total of 12-18 participants were recruited for each language pair. Each participant performed four retrieval tasks. The author's aim was to compare the performance of the translated queries with that of the target language queries. Thus, the author asked participants to formulate a source language query and a target language query for each task. The source language queries were translated into the target language utilizing a dictionary-based system. In English-German, also machine translation was utilized. The author used Google as the search engine. Findings - The results differed depending on the language pair. The author concluded that the dictionary coverage had an effect on the results. On average, the results of query-translation were better than in the traditional laboratory tests. Originality/value - This research shows that query translation in web is beneficial especially for users with moderate and non-active language skills. This is valuable information for developers of cross-language information retrieval systems.
Source: Journal of documentation. 64(2008) no.5, S.760-778

Yang, C.C.; Lam, W.: Introduction to the special topic section on multilingual information systems (2006) 0.03

0.025535608 = product of:
  0.09363056 = sum of:
    0.06316024 = weight(_text_:higher in 5043) [ClassicSimilarity], result of:
      0.06316024 = score(doc=5043,freq=2.0), product of:
        0.18138453 = queryWeight, product of:
          5.252756 = idf(docFreq=628, maxDocs=44218)
          0.034531306 = queryNorm
        0.34821182 = fieldWeight in 5043, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.252756 = idf(docFreq=628, maxDocs=44218)
          0.046875 = fieldNorm(doc=5043)
    0.014810067 = weight(_text_:of in 5043) [ClassicSimilarity], result of:
      0.014810067 = score(doc=5043,freq=14.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.2742677 = fieldWeight in 5043, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=5043)
    0.015660247 = weight(_text_:on in 5043) [ClassicSimilarity], result of:
      0.015660247 = score(doc=5043,freq=4.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.20619515 = fieldWeight in 5043, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=5043)
  0.27272728 = coord(3/11)

Abstract: The information available in languages other than English on the World Wide Web and global information systems is increasing significantly. According to some recent reports. the growth of non-English speaking Internet users is significantly higher than the growth of English-speaking Internet users. Asia and Europe have become the two most-populated regions of Internet users. However, there are many different languages in the many different countries of Asia and Europe. And there are many countries in the world using more than one language as their official languages. For example, Chinese and English are official languages in Hong Kong SAR; English and French are official languages in Canada. In the global economy, information systems are no longer utilized by users in a single geographical region but all over the world. Information can be generated, stored, processed, and accessed in several different languages. All of this reveals the importance of research in multilingual information systems.
Source: Journal of the American Society for Information Science and Technology. 57(2006) no.5, S.629-631

Levow, G.-A.; Oard, D.W.; Resnik, P.: Dictionary-based techniques for cross-language information retrieval (2005) 0.02

0.0248525 = product of:
  0.09112583 = sum of:
    0.064219736 = weight(_text_:effect in 1025) [ClassicSimilarity], result of:
      0.064219736 = score(doc=1025,freq=2.0), product of:
        0.18289955 = queryWeight, product of:
          5.29663 = idf(docFreq=601, maxDocs=44218)
          0.034531306 = queryNorm
        0.35112026 = fieldWeight in 1025, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.29663 = idf(docFreq=601, maxDocs=44218)
          0.046875 = fieldNorm(doc=1025)
    0.015832627 = weight(_text_:of in 1025) [ClassicSimilarity], result of:
      0.015832627 = score(doc=1025,freq=16.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.2932045 = fieldWeight in 1025, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=1025)
    0.011073467 = weight(_text_:on in 1025) [ClassicSimilarity], result of:
      0.011073467 = score(doc=1025,freq=2.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.14580199 = fieldWeight in 1025, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=1025)
  0.27272728 = coord(3/11)

Abstract: Cross-language information retrieval (CLIR) systems allow users to find documents written in different languages from that of their query. Simple knowledge structures such as bilingual term lists have proven to be a remarkably useful basis for bridging that language gap. A broad array of dictionary-based techniques have demonstrated utility, but comparison across techniques has been difficult because evaluation results often span only a limited range of conditions. This article identifies the key issues in dictionary-based CLIR, develops unified frameworks for term selection and term translation that help to explain the relationships among existing techniques, and illustrates the effect of those techniques using four contrasting languages for systematic experiments with a uniform query translation architecture. Key results include identification of a previously unseen dependence of pre- and post-translation expansion on orthographic cognates and development of a query-specific measure for translation fanout that helps to explain the utility of structured query methods.

Mustafa el Hadi, W.: Human language technology and its role in information access and management (2003) 0.02
```
0.02291468 = product of:
  0.08402049 = sum of:
    0.053516448 = weight(_text_:effect in 5524) [ClassicSimilarity], result of:
      0.053516448 = score(doc=5524,freq=2.0), product of:
        0.18289955 = queryWeight, product of:
          5.29663 = idf(docFreq=601, maxDocs=44218)
          0.034531306 = queryNorm
        0.2926002 = fieldWeight in 5524, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.29663 = idf(docFreq=601, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5524)
    0.01745383 = weight(_text_:of in 5524) [ClassicSimilarity], result of:
      0.01745383 = score(doc=5524,freq=28.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.32322758 = fieldWeight in 5524, product of:
          5.2915025 = tf(freq=28.0), with freq of:
            28.0 = termFreq=28.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5524)
    0.013050207 = weight(_text_:on in 5524) [ClassicSimilarity], result of:
      0.013050207 = score(doc=5524,freq=4.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.1718293 = fieldWeight in 5524, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5524)
  0.27272728 = coord(3/11)
```
Abstract

The role of linguistics in information access, extraction and dissemination is essential. Radical changes in the techniques of information and communication at the end of the twentieth century have had a significant effect on the function of the linguistic paradigm and its applications in all forms of communication. The introduction of new technical means have deeply changed the possibilities for the distribution of information. In this situation, what is the role of the linguistic paradigm and its practical applications, i.e., natural language processing (NLP) techniques when applied to information access? What solutions can linguistics offer in human computer interaction, extraction and management? Many fields show the relevance of the linguistic paradigm through the various technologies that require NLP, such as document and message understanding, information detection, extraction, and retrieval, question and answer, cross-language information retrieval (CLIR), text summarization, filtering, and spoken document retrieval. This paper focuses on the central role of human language technologies in the information society, surveys the current situation, describes the benefits of the above mentioned applications, outlines successes and challenges, and discusses solutions. It reviews the resources and means needed to advance information access and dissemination across language boundaries in the twenty-first century. Multilingualism, which is a natural result of globalization, requires more effort in the direction of language technology. The scope of human language technology (HLT) is large, so we limit our review to applications that involve multilinguality.

Chen, H.-H.; Lin, W.-C.; Yang, C.; Lin, W.-H.: Translating-transliterating named entities for multilingual information access (2006) 0.01

0.012351968 = product of:
  0.04529055 = sum of:
    0.0159967 = weight(_text_:of in 1080) [ClassicSimilarity], result of:
      0.0159967 = score(doc=1080,freq=12.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.29624295 = fieldWeight in 1080, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1080)
    0.012919044 = weight(_text_:on in 1080) [ClassicSimilarity], result of:
      0.012919044 = score(doc=1080,freq=2.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.17010231 = fieldWeight in 1080, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1080)
    0.016374804 = product of:
      0.03274961 = sum of:
        0.03274961 = weight(_text_:22 in 1080) [ClassicSimilarity], result of:
          0.03274961 = score(doc=1080,freq=2.0), product of:
            0.12092275 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034531306 = queryNorm
            0.2708308 = fieldWeight in 1080, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1080)
      0.5 = coord(1/2)
  0.27272728 = coord(3/11)

Abstract: Named entities are major constituents of a document but are usually unknown words. This work proposes a systematic way of dealing with formulation, transformation, translation, and transliteration of multilingual-named entities. The rules and similarity matrices for translation and transliteration are learned automatically from parallel-named-entity corpora. The results are applied in cross-language access to collections of images with captions. Experimental results demonstrate that the similarity-based transliteration of named entities is effective, and runs in which transliteration is considered outperform the runs in which it is neglected.
Date: 4. 6.2006 19:52:22
Footnote: Beitrag einer special topic section on multilingual information systems
Source: Journal of the American Society for Information Science and Technology. 57(2006) no.5, S.645-659

MacEwan, A.: Crossing language barriers in Europe : Linking LCSH to other subject heading languages (2000) 0.01

0.012137962 = product of:
  0.044505857 = sum of:
    0.014810067 = weight(_text_:of in 5618) [ClassicSimilarity], result of:
      0.014810067 = score(doc=5618,freq=14.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.2742677 = fieldWeight in 5618, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=5618)
    0.015660247 = weight(_text_:on in 5618) [ClassicSimilarity], result of:
      0.015660247 = score(doc=5618,freq=4.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.20619515 = fieldWeight in 5618, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=5618)
    0.014035545 = product of:
      0.02807109 = sum of:
        0.02807109 = weight(_text_:22 in 5618) [ClassicSimilarity], result of:
          0.02807109 = score(doc=5618,freq=2.0), product of:
            0.12092275 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034531306 = queryNorm
            0.23214069 = fieldWeight in 5618, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=5618)
      0.5 = coord(1/2)
  0.27272728 = coord(3/11)

Abstract: A study group representing four European national libraries (the Swiss National Library, Die Deutsche Bibliothek, the Bibliothèque nationale de France and The British Library) recently conducted a study on the possibility of establishing multilingual thesaural links between the headings in the LCSH authority file and the authority files of the German indexing system SWD/RSWK and the French indexing system RAMEAU. The study demonstrated a high level of correspondence in main headings, but also revealed a number of issues requiring further investigation. The study group's findings led to recommendations on the scope for the development of a prototype system for linking the three Subject Heading Languages (SHLs) in the databases of the four institutions
Date: 27. 5.2001 16:22:10
Source: The LCSH century: one hundred years with the Library of Congress Subject Headings system. Ed.: A.T.Stone

Dabbadie, M.; Blancherie, J.M.: Alexandria, a multilingual dictionary for knowledge management purposes (2006) 0.01

0.012112013 = product of:
  0.044410713 = sum of:
    0.011195358 = weight(_text_:of in 2465) [ClassicSimilarity], result of:
      0.011195358 = score(doc=2465,freq=8.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.20732689 = fieldWeight in 2465, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2465)
    0.01917981 = weight(_text_:on in 2465) [ClassicSimilarity], result of:
      0.01917981 = score(doc=2465,freq=6.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.25253648 = fieldWeight in 2465, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=2465)
    0.014035545 = product of:
      0.02807109 = sum of:
        0.02807109 = weight(_text_:22 in 2465) [ClassicSimilarity], result of:
          0.02807109 = score(doc=2465,freq=2.0), product of:
            0.12092275 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034531306 = queryNorm
            0.23214069 = fieldWeight in 2465, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2465)
      0.5 = coord(1/2)
  0.27272728 = coord(3/11)

Abstract: Alexandria is an innovation of international impact. It is the only multilingual dictionary for websites and PCs. A double click on a word opens a small window that gives interactive translations between 22 languages and includes meaning, synonyms and associated expressions. It is an ASP application grounded on a semantic network that is portable on any operating system or platform. Behind the application is the Integral Dictionary is the semantic network created by Memodata. Alexandria can be customized with specific vocabulary, descriptive articles, images, sounds, videos, etc. Its domains of application are considerable: e-tourism, online medias, language learning, international websites. Alexandria has also proved to be a basic tool for knowledge management purposes. The application can be customized according to a user or an organization needs. An application dedicated to mobile devices is currently being developed. Future developments are planned in the field of e-tourism in relation with French "pôles de compétitivité".
Source: Knowledge organization for a global learning society: Proceedings of the 9th International ISKO Conference, 4-7 July 2006, Vienna, Austria. Hrsg.: G. Budin, C. Swertz u. K. Mitgutsch

Francu, V.: Language-independent structures and multilingual information access (2003) 0.01
```
0.011906668 = product of:
  0.06548667 = sum of:
    0.017101191 = weight(_text_:of in 2753) [ClassicSimilarity], result of:
      0.017101191 = score(doc=2753,freq=42.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.31669703 = fieldWeight in 2753, product of:
          6.4807405 = tf(freq=42.0), with freq of:
            42.0 = termFreq=42.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03125 = fieldNorm(doc=2753)
    0.048385475 = weight(_text_:great in 2753) [ClassicSimilarity], result of:
      0.048385475 = score(doc=2753,freq=2.0), product of:
        0.19443816 = queryWeight, product of:
          5.6307793 = idf(docFreq=430, maxDocs=44218)
          0.034531306 = queryNorm
        0.24884763 = fieldWeight in 2753, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.6307793 = idf(docFreq=430, maxDocs=44218)
          0.03125 = fieldNorm(doc=2753)
  0.18181819 = coord(2/11)
```
Abstract

The existence of huge amounts of information available in information systems and networks worldwide imposes the creation of adequate tools able to efficiently organize it and allow its retrieval across geographical, linguistic and cultural boundaries. An indexing language covering all areas of knowledge and converting the language-independent structure of a classification system like the Universal Decimal Classification into a thesaurus structure in more than one language seems to be a solution. Among the key attributes of the indexing language thus obtained we can mention: consistency in indexing, control an terms, user-friendliness. The paper presents the great potential in information retrieval of the combined retrieval method by means of a case study. 1. Introduction Among the consequences of the rapid development of the global information society a major one is the existence of huge amounts of information stored in information systems and networks across geographical, linguistic and cultural boundaries. The need was imposed to create tools and technologies able to efficiently organize and allow retrieval of information in this universal context. Information professionals had to cope not only with the multitude of knowledge organisation and representation systems but also with the multitude of languages the available information is stored in order to provide the users with effective information retrieval tools. For this purpose a real language industry has been developed, theoreticians and researchers making considerable efforts to find feasible solutions to problems of multilingual access by way of natural language processing and machine translation methodologies. Such corporate efforts belong to the CoBRA+ working group for multilingual access to subjects (MACS) or to the cross-language information retrieval (CLIR) tracks of the Text Retrieval Conferences that annually report the progress made in multilingual information access and retrieval. The encouraging results they have obtained so far are still confined to discipline/domain restrictions and most of their achievements are based an language pairs rather than multiple language combinations.

Source

Challenges in knowledge representation and organization for the 21st century: Integration of knowledge across boundaries. Proceedings of the 7th ISKO International Conference Granada, Spain, July 10-13, 2002. Ed.: M. López-Huertas
Larkey, L.S.; Connell, M.E.: Structured queries, language modelling, and relevance modelling in cross-language information retrieval (2005) 0.01
```
0.011589216 = product of:
  0.04249379 = sum of:
    0.012341722 = weight(_text_:of in 1022) [ClassicSimilarity], result of:
      0.012341722 = score(doc=1022,freq=14.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.22855641 = fieldWeight in 1022, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1022)
    0.018455777 = weight(_text_:on in 1022) [ClassicSimilarity], result of:
      0.018455777 = score(doc=1022,freq=8.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.24300331 = fieldWeight in 1022, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1022)
    0.011696288 = product of:
      0.023392577 = sum of:
        0.023392577 = weight(_text_:22 in 1022) [ClassicSimilarity], result of:
          0.023392577 = score(doc=1022,freq=2.0), product of:
            0.12092275 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034531306 = queryNorm
            0.19345059 = fieldWeight in 1022, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1022)
      0.5 = coord(1/2)
  0.27272728 = coord(3/11)
```
Abstract

Two probabilistic approaches to cross-lingual retrieval are in wide use today, those based on probabilistic models of relevance, as exemplified by INQUERY, and those based on language modeling. INQUERY, as a query net model, allows the easy incorporation of query operators, including a synonym operator, which has proven to be extremely useful in cross-language information retrieval (CLIR), in an approach often called structured query translation. In contrast, language models incorporate translation probabilities into a unified framework. We compare the two approaches on Arabic and Spanish data sets, using two kinds of bilingual dictionaries--one derived from a conventional dictionary, and one derived from a parallel corpus. We find that structured query processing gives slightly better results when queries are not expanded. On the other hand, when queries are expanded, language modeling gives better results, but only when using a probabilistic dictionary derived from a parallel corpus. We pursue two additional issues inherent in the comparison of structured query processing with language modeling. The first concerns query expansion, and the second is the role of translation probabilities. We compare conventional expansion techniques (pseudo-relevance feedback) with relevance modeling, a new IR approach which fits into the formal framework of language modeling. We find that relevance modeling and pseudo-relevance feedback achieve comparable levels of retrieval and that good translation probabilities confer a small but significant advantage.

Date

26.12.2007 20:22:11

Seo, H.-C.; Kim, S.-B.; Rim, H.-C.; Myaeng, S.-H.: lmproving query translation in English-Korean Cross-language information retrieval (2005) 0.01

0.011165902 = product of:
  0.04094164 = sum of:
    0.015832627 = weight(_text_:of in 1023) [ClassicSimilarity], result of:
      0.015832627 = score(doc=1023,freq=16.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.2932045 = fieldWeight in 1023, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=1023)
    0.011073467 = weight(_text_:on in 1023) [ClassicSimilarity], result of:
      0.011073467 = score(doc=1023,freq=2.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.14580199 = fieldWeight in 1023, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=1023)
    0.014035545 = product of:
      0.02807109 = sum of:
        0.02807109 = weight(_text_:22 in 1023) [ClassicSimilarity], result of:
          0.02807109 = score(doc=1023,freq=2.0), product of:
            0.12092275 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034531306 = queryNorm
            0.23214069 = fieldWeight in 1023, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=1023)
      0.5 = coord(1/2)
  0.27272728 = coord(3/11)

Abstract: Query translation is a viable method for cross-language information retrieval (CLIR), but it suffers from translation ambiguities caused by multiple translations of individual query terms. Previous research has employed various methods for disambiguation, including the method of selecting an individual target query term from multiple candidates by comparing their statistical associations with the candidate translations of other query terms. This paper proposes a new method where we examine all combinations of target query term translations corresponding to the source query terms, instead of looking at the candidates for each query term and selecting the best one at a time. The goodness value for a combination of target query terms is computed based on the association value between each pair of the terms in the combination. We tested our method using the NTCIR-3 English-Korean CLIR test collection. The results show some improvements regardless of the association measures we used.
Date: 26.12.2007 20:22:38

Bian, G.-W.; Chen, H.-H.: Cross-language information access to multilingual collections on the Internet (2000) 0.01

0.01074307 = product of:
  0.039391257 = sum of:
    0.009695465 = weight(_text_:of in 4436) [ClassicSimilarity], result of:
      0.009695465 = score(doc=4436,freq=6.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.17955035 = fieldWeight in 4436, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=4436)
    0.015660247 = weight(_text_:on in 4436) [ClassicSimilarity], result of:
      0.015660247 = score(doc=4436,freq=4.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.20619515 = fieldWeight in 4436, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=4436)
    0.014035545 = product of:
      0.02807109 = sum of:
        0.02807109 = weight(_text_:22 in 4436) [ClassicSimilarity], result of:
          0.02807109 = score(doc=4436,freq=2.0), product of:
            0.12092275 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034531306 = queryNorm
            0.23214069 = fieldWeight in 4436, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=4436)
      0.5 = coord(1/2)
  0.27272728 = coord(3/11)

Abstract: Language barrier is the major problem that people face in searching for, retrieving, and understanding multilingual collections on the Internet. This paper deals with query translation and document translation in a Chinese-English information retrieval system called MTIR. Bilingual dictionary and monolingual corpus-based approaches are adopted to select suitable tranlated query terms. A machine transliteration algorithm is introduced to resolve proper name searching. We consider several design issues for document translation, including which material is translated, what roles the HTML tags play in translation, what the tradeoff is between the speed performance and the translation performance, and what from the translated result is presented in. About 100.000 Web pages translated in the last 4 months of 1997 are used for quantitative study of online and real-time Web page translation
Date: 16. 2.2000 14:22:39
Source: Journal of the American Society for Information Science. 51(2000) no.3, S.281-296

Levergood, B.; Farrenkopf, S.; Frasnelli, E.: ¬The specification of the language of the field and interoperability : cross-language access to catalogues and online libraries (CACAO) (2008) 0.01

0.009901192 = product of:
  0.03630437 = sum of:
    0.011195358 = weight(_text_:of in 2646) [ClassicSimilarity], result of:
      0.011195358 = score(doc=2646,freq=8.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.20732689 = fieldWeight in 2646, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2646)
    0.011073467 = weight(_text_:on in 2646) [ClassicSimilarity], result of:
      0.011073467 = score(doc=2646,freq=2.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.14580199 = fieldWeight in 2646, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=2646)
    0.014035545 = product of:
      0.02807109 = sum of:
        0.02807109 = weight(_text_:22 in 2646) [ClassicSimilarity], result of:
          0.02807109 = score(doc=2646,freq=2.0), product of:
            0.12092275 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034531306 = queryNorm
            0.23214069 = fieldWeight in 2646, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2646)
      0.5 = coord(1/2)
  0.27272728 = coord(3/11)

Abstract: The CACAO Project (Cross-language Access to Catalogues and Online Libraries) has been designed to implement natural language processing and cross-language information retrieval techniques to provide cross-language access to information in libraries, a critical issue in the linguistically diverse European Union. This project report addresses two metadata-related challenges for the library community in this context: "false friends" (identical words having different meanings in different languages) and term ambiguity. The possible solutions involve enriching the metadata with attributes specifying language or the source authority file, or associating potential search terms to classes in a classification system. The European Library will evaluate an early implementation of this work in late 2008.
Source: Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas

Freitas-Junior, H.R.; Ribeiro-Neto, B.A.; Freitas-Vale, R. de; Laender, A.H.F.; Lima, L.R.S. de: Categorization-driven cross-language retrieval of medical information (2006) 0.01

0.008551319 = product of:
  0.031354837 = sum of:
    0.010430659 = weight(_text_:of in 5282) [ClassicSimilarity], result of:
      0.010430659 = score(doc=5282,freq=10.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.19316542 = fieldWeight in 5282, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5282)
    0.009227889 = weight(_text_:on in 5282) [ClassicSimilarity], result of:
      0.009227889 = score(doc=5282,freq=2.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.121501654 = fieldWeight in 5282, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5282)
    0.011696288 = product of:
      0.023392577 = sum of:
        0.023392577 = weight(_text_:22 in 5282) [ClassicSimilarity], result of:
          0.023392577 = score(doc=5282,freq=2.0), product of:
            0.12092275 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034531306 = queryNorm
            0.19345059 = fieldWeight in 5282, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5282)
      0.5 = coord(1/2)
  0.27272728 = coord(3/11)

Abstract: The Web has become a large repository of documents (or pages) written in many different languages. In this context, traditional information retrieval (IR) techniques cannot be used whenever the user query and the documents being retrieved are in different languages. To address this problem, new cross-language information retrieval (CLIR) techniques have been proposed. In this work, we describe a method for cross-language retrieval of medical information. This method combines query terms and related medical concepts obtained automatically through a categorization procedure. The medical concepts are used to create a linguistic abstraction that allows retrieval of information in a language-independent way, minimizing linguistic problems such as polysemy. To evaluate our method, we carried out experiments using the OHSUMED test collection, whose documents are written in English, with queries expressed in Portuguese, Spanish, and French. The results indicate that our cross-language retrieval method is as effective as a standard vector space model algorithm operating on queries and documents in the same language. Further, our results are better than previous results in the literature.
Date: 22. 7.2006 16:46:36
Source: Journal of the American Society for Information Science and Technology. 57(2006) no.4, S.501-510

Subirats, I.; Prasad, A.R.D.; Keizer, J.; Bagdanov, A.: Implementation of rich metadata formats and demantic tools using DSpace (2008) 0.01
```
0.008090899 = product of:
  0.029666629 = sum of:
    0.012927286 = weight(_text_:of in 2656) [ClassicSimilarity], result of:
      0.012927286 = score(doc=2656,freq=24.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.23940048 = fieldWeight in 2656, product of:
          4.8989797 = tf(freq=24.0), with freq of:
            24.0 = termFreq=24.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03125 = fieldNorm(doc=2656)
    0.0073823114 = weight(_text_:on in 2656) [ClassicSimilarity], result of:
      0.0073823114 = score(doc=2656,freq=2.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.097201325 = fieldWeight in 2656, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.03125 = fieldNorm(doc=2656)
    0.0093570305 = product of:
      0.018714061 = sum of:
        0.018714061 = weight(_text_:22 in 2656) [ClassicSimilarity], result of:
          0.018714061 = score(doc=2656,freq=2.0), product of:
            0.12092275 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034531306 = queryNorm
            0.15476047 = fieldWeight in 2656, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=2656)
      0.5 = coord(1/2)
  0.27272728 = coord(3/11)
```
Abstract

This poster explores the customization of DSpace to allow the use of the AGRIS Application Profile metadata standard and the AGROVOC thesaurus. The objective is the adaptation of DSpace, through the least invasive code changes either in the form of plug-ins or add-ons, to the specific needs of the Agricultural Sciences and Technology community. Metadata standards such as AGRIS AP, and Knowledge Organization Systems such as the AGROVOC thesaurus, provide mechanisms for sharing information in a standardized manner by recommending the use of common semantics and interoperable syntax (Subirats et al., 2007). AGRIS AP was created to enhance the description, exchange and subsequent retrieval of agricultural Document-like Information Objects (DLIOs). It is a metadata schema which draws from Metadata standards such as Dublin Core (DC), the Australian Government Locator Service Metadata (AGLS) and the Agricultural Metadata Element Set (AgMES) namespaces. It allows sharing of information across dispersed bibliographic systems (FAO, 2005). AGROVOC68 is a multilingual structured thesaurus covering agricultural and related domains. Its main role is to standardize the indexing process in order to make searching simpler and more efficient. AGROVOC is developed by FAO (Lauser et al., 2006). The customization of the DSpace is taking place in several phases. First, the AGRIS AP metadata schema was mapped onto the metadata DSpace model, with several enhancements implemented to support AGRIS AP elements. Next, AGROVOC will be integrated as a controlled vocabulary accessed through a local SKOS or OWL file. Eventually the system will be configurable to access AGROVOC through local files or remotely via webservices. Finally, spell checking and tooltips will be incorporated in the user interface to support metadata editing. Adapting DSpace to support AGRIS AP and annotation using the semantically-rich AGROVOC thesaurus transform DSpace into a powerful, domain-specific system for annotation and exchange of bibliographic metadata in the agricultural domain.

Source

Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas
Sieglerschmidt, J.: Convergence of internet services in the cultural heritage sector : the long way to common vocabularies, metadata formats, ontologies (2008) 0.01
```
0.0075523397 = product of:
  0.041537866 = sum of:
    0.01939093 = weight(_text_:of in 1686) [ClassicSimilarity], result of:
      0.01939093 = score(doc=1686,freq=24.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.3591007 = fieldWeight in 1686, product of:
          4.8989797 = tf(freq=24.0), with freq of:
            24.0 = termFreq=24.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=1686)
    0.022146935 = weight(_text_:on in 1686) [ClassicSimilarity], result of:
      0.022146935 = score(doc=1686,freq=8.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.29160398 = fieldWeight in 1686, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=1686)
  0.18181819 = coord(2/11)
```
Abstract

Since several years it has been observed that information offered by different knowledge producing institutions on the internet is more and more interlinked. This tendency will increase, because the fragmented information offers on the internet make the retrieval of information difficult as even impossible. At the same time the quantity of information offered on the internet grows exponentially in Europe - and elsewhere - due to many digitization projects. Insofar as funding institutions base the acceptance of projects on the observation of certain documentation standards the knowledge created will be retrievable and will remain so for a longer time. Otherwise the retrieval of information will become a matter of chance due to the limits of fragmented, knowledge producing social groups.

Source

Kompatibilität, Medien und Ethik in der Wissensorganisation - Compatibility, Media and Ethics in Knowledge Organization: Proceedings der 10. Tagung der Deutschen Sektion der Internationalen Gesellschaft für Wissensorganisation Wien, 3.-5. Juli 2006 - Proceedings of the 10th Conference of the German Section of the International Society of Knowledge Organization Vienna, 3-5 July 2006. Ed.: H.P. Ohly, S. Netscher u. K. Mitgutsch

Peters, C.; Braschler, M.: Cross-language system evaluation : the CLEF campaigns (2001) 0.01

0.007386743 = product of:
  0.040627085 = sum of:
    0.019746756 = weight(_text_:of in 6979) [ClassicSimilarity], result of:
      0.019746756 = score(doc=6979,freq=14.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.36569026 = fieldWeight in 6979, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=6979)
    0.02088033 = weight(_text_:on in 6979) [ClassicSimilarity], result of:
      0.02088033 = score(doc=6979,freq=4.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.27492687 = fieldWeight in 6979, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.0625 = fieldNorm(doc=6979)
  0.18181819 = coord(2/11)

Abstract: The goals of the CLEF (Cross-Language Evaluation Forum) series of evaluation campaigns for information retrieval systems operating on European languages are described. The difficulties of organizing an activity which aims at an objective evaluation of systems running on and over a number' of different languages are examined. The discussion includes an analysis of the first results and proposals for possible developments in the future.
Source: Journal of the American Society for Information Science and technology. 52(2001) no.12, S.1067-1072

Francu, V.: ¬The impact of specificity on the retrieval power of a UDC-based multilingual thesaurus (2003) 0.01

0.006905375 = product of:
  0.03797956 = sum of:
    0.015832627 = weight(_text_:of in 5518) [ClassicSimilarity], result of:
      0.015832627 = score(doc=5518,freq=16.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.2932045 = fieldWeight in 5518, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=5518)
    0.022146935 = weight(_text_:on in 5518) [ClassicSimilarity], result of:
      0.022146935 = score(doc=5518,freq=8.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.29160398 = fieldWeight in 5518, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.046875 = fieldNorm(doc=5518)
  0.18181819 = coord(2/11)

Abstract: The article describes the research done over a bibliographic database in order to show the impact the specificity of the knowledge organising tools may have on information retrieval (IR). For this purpose two multilingual Universal Decimal Classification (UDC) based thesauri having different degrees of specificity are considered. Issues of harmonising a classificatory structure with a thesaurus structure are introduced, and significant aspects of information retrieval in a multilingual environment are examined in an extensive manner. Aspects of complementarity are discussed with particular emphasis on the real impact produced on IR by alternative search facilities. Finally, a number of conclusions are formulated as they arise from the study.

Keränen, S.: Equivalence and focus of translation in multicultural thesaurus construction (2006) 0.01

0.006723522 = product of:
  0.03697937 = sum of:
    0.014602924 = weight(_text_:of in 237) [ClassicSimilarity], result of:
      0.014602924 = score(doc=237,freq=10.0), product of:
        0.053998582 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.034531306 = queryNorm
        0.2704316 = fieldWeight in 237, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=237)
    0.022376444 = weight(_text_:on in 237) [ClassicSimilarity], result of:
      0.022376444 = score(doc=237,freq=6.0), product of:
        0.07594867 = queryWeight, product of:
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.034531306 = queryNorm
        0.29462588 = fieldWeight in 237, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.199415 = idf(docFreq=13325, maxDocs=44218)
          0.0546875 = fieldNorm(doc=237)
  0.18181819 = coord(2/11)

Abstract: This paper reports a part of an on-going PhD study on problems related to multicultural social science thesaurus construction in the general frame of information science. The main analysis methods used are discourse analysis and co-word analysis. In theoretical framework the emphasis is on communicative equivalence theories and different aims of thesaurus translation are discussed. Some examples are given how co-word analysis can be used to study contextual equivalence.
Source: Knowledge organization for a global learning society: Proceedings of the 9th International ISKO Conference, 4-7 July 2006, Vienna, Austria. Hrsg.: G. Budin, C. Swertz u. K. Mitgutsch

Search (113 results, page 1 of 6)

Authors

Languages

Themes