Search (52 results, page 3 of 3)

Oard, D.W.: Alternative approaches for cross-language text retrieval (1997) 0.00
```
2.5739305E-4 = product of:
  0.0038608958 = sum of:
    0.0038608958 = product of:
      0.0077217915 = sum of:
        0.0077217915 = weight(_text_:information in 1164) [ClassicSimilarity], result of:
          0.0077217915 = score(doc=1164,freq=10.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.1517936 = fieldWeight in 1164, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1164)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

The explosive growth of the Internet and other sources of networked information have made automatic mediation of access to networked information sources an increasingly important problem. Much of this information is expressed as electronic text, and it is becoming practical to automatically convert some printed documents and recorded speech to electronic text as well. Thus, automated systems capable of detecting useful documents are finding widespread application. With even a small number of languages it can be inconvenient to issue the same query repeatedly in every language, so users who are able to read more than one language will likely prefer a multilingual text retrieval system over a collection of monolingual systems. And since reading ability in a language does not always imply fluent writing ability in that language, such users will likely find cross-language text retrieval particularly useful for languages in which they are less confident of their ability to express their information needs effectively. The use of such systems can be also be beneficial if the user is able to read only a single language. For example, when only a small portion of the document collection will ever be examined by the user, performing retrieval before translation can be significantly more economical than performing translation before retrieval. So when the application is sufficiently important to justify the time and effort required for translation, those costs can be minimized if an effective cross-language text retrieval system is available. Even when translation is not available, there are circumstances in which cross-language text retrieval could be useful to a monolingual user. For example, a researcher might find a paper published in an unfamiliar language useful if that paper contains references to works by the same author that are in the researcher's native language.
Multilingual text retrieval can be defined as selection of useful documents from collections that may contain several languages (English, French, Chinese, etc.). This formulation allows for the possibility that individual documents might contain more than one language, a common occurrence in some applications. Both cross-language and within-language retrieval are included in this formulation, but it is the cross-language aspect of the problem which distinguishes multilingual text retrieval from its well studied monolingual counterpart. At the SIGIR 96 workshop on "Cross-Linguistic Information Retrieval" the participants discussed the proliferation of terminology being used to describe the field and settled on "Cross-Language" as the best single description of the salient aspect of the problem. "Multilingual" was felt to be too broad, since that term has also been used to describe systems able to perform within-language retrieval in more than one language but that lack any cross-language capability. "Cross-lingual" and "cross-linguistic" were felt to be equally good descriptions of the field, but "crosslanguage" was selected as the preferred term in the interest of standardization. Unfortunately, at about the same time the U.S. Defense Advanced Research Projects Agency (DARPA) introduced "translingual" as their preferred term, so we are still some distance from reaching consensus on this matter.
Soergel, D.: SemWeb: proposal for an open, multifunctional, multilingual system for integrated access to knowledge about concepts and terminology (1996) 0.00
```
2.3255666E-4 = product of:
  0.0034883497 = sum of:
    0.0034883497 = product of:
      0.0069766995 = sum of:
        0.0069766995 = weight(_text_:information in 3575) [ClassicSimilarity], result of:
          0.0069766995 = score(doc=3575,freq=4.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.13714671 = fieldWeight in 3575, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3575)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

Presents a proposal for the long-range development of an open, multifunctional, multilingual system for integrated access to many kinds of knowledge about concepts and terminology. The system would draw on existing knowledge bases that are accessible through the Internet or on CD-ROM and on a common integrated distributed knowledge base that would grow incrementally over time. Existing knowledge bases would be accessed througha common interface that would search several knowledge bases, collate the data into a common format, and present them to the user. The common integrated distributed knowldge base would provide an environment in which many contributors could carry out classification and terminological projects more efficiently, with the results available in a common format. Over time, data from other knowledge bases could be incorporated into the common knowledge base, either by actual transfer (provided the knowledge base producers are willing) or by reference through a link. Either way, such incorporation requires intellectual work but allows for tighter integration than common interface access to multiple knowledge bases. Each piece of information in the common knowledge base will have all its sources attached, providing an acknowledgment mechanism that gives due credit to all contributors. The whole system would be designed to be usable by many levels of users for improved information exchange.
Soergel, D.: SemWeb: Proposal for an Open, multifunctional, multilingual system for integrated access to knowledge about concepts and terminology : exploration and development of the concept (1996) 0.00
```
2.3255666E-4 = product of:
  0.0034883497 = sum of:
    0.0034883497 = product of:
      0.0069766995 = sum of:
        0.0069766995 = weight(_text_:information in 3576) [ClassicSimilarity], result of:
          0.0069766995 = score(doc=3576,freq=4.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.13714671 = fieldWeight in 3576, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3576)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

This paper presents a proposal for the long-range development of an open, multifunctional, multilingual system for integrated access to many kinds of knowledge about concepts and terminology. The system would draw on existing knowledge bases that are accessible through the Internet or on CD-ROM an on a common integrated distributed knowledge base that would grow incrementally over time. Existing knowledge bases would be accessed through a common interface that would search several knowledge bases, collate the data into a common format, and present them to the user. The common integrated distributed knowledge base would provide an environment in which many contributors could carry out classification and terminological projects more efficiently, with the results available in a common format. Over time, data from other knowledge bases could be incorporated into the common knowledge base, either by actual transfer (provided the knowledge base producers are willing) or by reference through a link. Either way, such incorporation requires intellectual work but allows for tighter integration than common interface access to multiple knowledge bases. Each piece of information in the common knowledge base will have all its sources attached, providing an acknowledgment mechanism that gives due credit to all contributors. The whole system woul be designed to be usable by many levels of users for improved information exchange.
Cousins, S.A.; Hartley, R.J.: Towards multilingual online public access catalogues (1994) 0.00
```
2.3021935E-4 = product of:
  0.00345329 = sum of:
    0.00345329 = product of:
      0.00690658 = sum of:
        0.00690658 = weight(_text_:information in 7207) [ClassicSimilarity], result of:
          0.00690658 = score(doc=7207,freq=2.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.13576832 = fieldWeight in 7207, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7207)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

With increasing moves towards an integrated Europe the need for multilingual access to information becomes more pressing. One aspect of this need which has largely been neglected is the provision of multilingual access to OPACs and this paper is concerned with exploring this problem area. The need for multilingual OPAC search capabilities and the difficulties associated with this are discussed. The problems of subject access in particular are highlighted. Research into subject searching in monolingual OPACs is reviewed and its relevance to multilingual OPACs is outlined. Given the limitations of current machine translation of natural language it is likely that the utilisation of controlled subject search facilities. Finally some possible directions for further research are considered
Lassalle, E.: Text retrieval : from a monolingual system to a multilingual system (1993) 0.00
```
2.3021935E-4 = product of:
  0.00345329 = sum of:
    0.00345329 = product of:
      0.00690658 = sum of:
        0.00690658 = weight(_text_:information in 7403) [ClassicSimilarity], result of:
          0.00690658 = score(doc=7403,freq=2.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.13576832 = fieldWeight in 7403, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7403)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

Describes the TELMI monolingual text retrieval system and its future extension, a multilingual system. TELMI is designed for medium sized databases containing short texts. The characteristics of the system are fine-grained natural language processing (NLP); an open domain and a large scale knowledge base; automated indexing based on conceptual representation of texts and reusability of the NLP tools. Discusses the French MINITEL service, the MGS information service and the TELMI research system covering the full text system; NLP architecture; the lexical level; the syntactic level; the semantic level and an example of the use of a generic system

Hlava, M.M.K.: Machine aided indexing (MAI) in a multilingual environment (1993) 0.00

2.3021935E-4 = product of:
  0.00345329 = sum of:
    0.00345329 = product of:
      0.00690658 = sum of:
        0.00690658 = weight(_text_:information in 7405) [ClassicSimilarity], result of:
          0.00690658 = score(doc=7405,freq=2.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.13576832 = fieldWeight in 7405, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7405)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)

Imprint: Medford, NJ : Learned Information

Diaz, P.: Multilingual tools for accessing a Spanish library catalogue (1997) 0.00
```
2.3021935E-4 = product of:
  0.00345329 = sum of:
    0.00345329 = product of:
      0.00690658 = sum of:
        0.00690658 = weight(_text_:information in 1163) [ClassicSimilarity], result of:
          0.00690658 = score(doc=1163,freq=2.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.13576832 = fieldWeight in 1163, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1163)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

The use of library resources will no longer be restricted to the physical location of libraries thanks to networking technologies and standard protocols for information retrieval. These technical achievements allow users to access geographically scattered libraries but they do not ease their intellectual access. Indeed, users need a certain command of different languages to find publications whose records are written in a unique language. Multilingual facilities, including multilingual presentation and retrieval, can intellectually open the library catalogue to a wider range of international users. Describes an attempt at using multilingual resources with a view to improving user OPAC interaction through the TRANSLIB project, which provides library users with advanced tools that support multilingual access
Borgman, C.L.: Multi-media, multi-cultural, and multi-lingual digital libraries : or how do we exchange data In 400 languages? (1997) 0.00
```
1.993758E-4 = product of:
  0.002990637 = sum of:
    0.002990637 = product of:
      0.005981274 = sum of:
        0.005981274 = weight(_text_:information in 1263) [ClassicSimilarity], result of:
          0.005981274 = score(doc=1263,freq=6.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.11757882 = fieldWeight in 1263, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1263)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

The Internet would not be very useful if communication were limited to textual exchanges between speakers of English located in the United States. Rather, its value lies in its ability to enable people from multiple nations, speaking multiple languages, to employ multiple media in interacting with each other. While computer networks broke through national boundaries long ago, they remain much more effective for textual communication than for exchanges of sound, images, or mixed media -- and more effective for communication in English than for exchanges in most other languages, much less interactions involving multiple languages. Supporting searching and display in multiple languages is an increasingly important issue for all digital libraries accessible on the Internet. Even if a digital library contains materials in only one language, the content needs to be searchable and displayable on computers in countries speaking other languages. We need to exchange data between digital libraries, whether in a single language or in multiple languages. Data exchanges may be large batch updates or interactive hyperlinks. In any of these cases, character sets must be represented in a consistent manner if exchanges are to succeed. Issues of interoperability, portability, and data exchange related to multi-lingual character sets have received surprisingly little attention in the digital library community or in discussions of standards for information infrastructure, except in Europe. The landmark collection of papers on Standards Policy for Information Infrastructure, for example, contains no discussion of multi-lingual issues except for a passing reference to the Unicode standard. The goal of this short essay is to draw attention to the multi-lingual issues involved in designing digital libraries accessible on the Internet. Many of the multi-lingual design issues parallel those of multi-media digital libraries, a topic more familiar to most readers of D-Lib Magazine. This essay draws examples from multi-media DLs to illustrate some of the urgent design challenges in creating a globally distributed network serving people who speak many languages other than English. First we introduce some general issues of medium, culture, and language, then discuss the design challenges in the transition from local to global systems, lastly addressing technical matters. The technical issues involve the choice of character sets to represent languages, similar to the choices made in representing images or sound. However, the scale of the language problem is far greater. Standards for multi-media representation are being adopted fairly rapidly, in parallel with the availability of multi-media content in electronic form. By contrast, we have hundreds (and sometimes thousands) of years worth of textual materials in hundreds of languages, created long before data encoding standards existed. Textual content from past and present is being encoded in language and application-specific representations that are difficult to exchange without losing data -- if they exchange at all. We illustrate the multi-language DL challenge with examples drawn from the research library community, which typically handles collections of materials in 400 or so languages. These are problems faced not only by developers of digital libraries, but by those who develop and manage any communication technology that crosses national or linguistic boundaries.

Theme

Information Gateway
Clavel-Merrin, G.: Multilingual access to libraries' databases (1996) 0.00
```
1.9733087E-4 = product of:
  0.002959963 = sum of:
    0.002959963 = product of:
      0.005919926 = sum of:
        0.005919926 = weight(_text_:information in 4187) [ClassicSimilarity], result of:
          0.005919926 = score(doc=4187,freq=2.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.116372846 = fieldWeight in 4187, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=4187)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

Multilingual access to library databases is a topic of concern not only to users in countries such as Switzerland in which several languages are spoken, but also to those who search for information in databases containing material in more than one language. The growth of networks means that libraries can access databases outside their own immediate circle but problems of differences in interfaces will continue until there is widespread compliance with Z39.50. Considers 2 approaches to multilingual access: the use of multilingual thesauri or authority records (which implies translation work before users search the database); and the translation of the search statement at the time of searching (which implies the existence of parsers and multilingual dictionaries)
Martinez Arellano, F.F.: Subject searching in online catalogs including Spanish and English material (1999) 0.00
```
1.9733087E-4 = product of:
  0.002959963 = sum of:
    0.002959963 = product of:
      0.005919926 = sum of:
        0.005919926 = weight(_text_:information in 5350) [ClassicSimilarity], result of:
          0.005919926 = score(doc=5350,freq=2.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.116372846 = fieldWeight in 5350, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=5350)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

The use of title words, the combination of these through the use of logic operators, and the possibility of truncating them when carrying out subject searches, are some of the search options that have been incorporated into the online catalog. Several arguments in favor of these options have been expressed which state that they represent an approach for the use of natural language and that they facilitate information retrieval. However, expressed arguments against them that support the necessity of using controlled language to obtain more precision in search results also exist. This paper reports the main results from a study whose objective was to compare advantages and disadvantages of retrieval by keywords from the title and by subject headings included in the records of LIBRUNAM, an online catalog containing records for English and Spanish items at the National Autonomous University of Mexico.
Ferber, R.: Automated indexing with thesaurus descriptors : a co-occurence based approach to multilingual retrieval (1997) 0.00
```
1.6444239E-4 = product of:
  0.0024666358 = sum of:
    0.0024666358 = product of:
      0.0049332716 = sum of:
        0.0049332716 = weight(_text_:information in 4144) [ClassicSimilarity], result of:
          0.0049332716 = score(doc=4144,freq=2.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.09697737 = fieldWeight in 4144, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4144)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

Indexing documents with descriptors from a multilingual thesaurus is an approach to multilingual information retrieval. However, manual indexing is expensive. Automazed indexing methods in general use terms found in the document. Thesaurus descriptors are complex terms that are often not used in documents or have specific meanings within the thesaurus; therefore most weighting schemes of automated indexing methods are not suited to select thesaurus descriptors. In this paper a linear associative system is described that uses similarity values extracted from a large corpus of manually indexed documents to construct a rank ordering of the descriptors for a given document title. The system is adaptive and has to be tuned with a training sample of records for the specific task. The system was tested on a corpus of some 80.000 bibliographic records. The results show a high variability with changing parameter values. This indicated that it is very important to empirically adapt the model to the specific situation it is used in. The overall median of the manually assigned descriptors in the automatically generated ranked list of all 3.631 descriptors is 14 for the set used to adapt the system and 11 for a test set not used in the optimization process. This result shows that the optimization is not a fitting to a specific training set but a real adaptation of the model to the setting

Clavel, G.; Dale, P.; Heiner-Freiling, M.; Kunz, M.; Landry, P.; MacEwan, A.; Naudi, M.; Oddy, P.; Saget, A.: CoBRA+ working group on multilingual subject access : final report (1999) 0.00

1.1510967E-4 = product of:
  0.001726645 = sum of:
    0.001726645 = product of:
      0.00345329 = sum of:
        0.00345329 = weight(_text_:information in 6067) [ClassicSimilarity], result of:
          0.00345329 = score(doc=6067,freq=2.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.06788416 = fieldWeight in 6067, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02734375 = fieldNorm(doc=6067)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)

Footnote: Vgl. auch: http://www.bl.uk/information/finrap3.html

Search (52 results, page 3 of 3)

Authors

Types

Themes