Search (51 results, page 2 of 3)

Turner, J.M.: Cross-language transfer of indexing concepts for storage and retrieval of moving images : preliminary results (1996) 0.00

0.0022155237 = product of:
  0.015508666 = sum of:
    0.015508666 = weight(_text_:information in 7400) [ClassicSimilarity], result of:
      0.015508666 = score(doc=7400,freq=6.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.23515764 = fieldWeight in 7400, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7400)
  0.14285715 = coord(1/7)
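
The explain tree above can be reproduced by hand. The following is a minimal Python sketch, assuming the usual Lucene ClassicSimilarity definitions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The function names are illustrative and not part of any Lucene API. The values of queryNorm, fieldNorm and coord are copied from the tree rather than derived.

```python
import math

def tf(freq: float) -> float:
    # ClassicSimilarity term frequency: square root of the raw count
    return math.sqrt(freq)

def idf(doc_freq: int, max_docs: int) -> float:
    # ClassicSimilarity inverse document frequency
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm, coord):
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm            # "queryWeight" in the tree
    field_weight = tf(freq) * term_idf * field_norm  # "fieldWeight" in the tree
    return query_weight * field_weight * coord

# Values taken from the explain tree for doc 7400
score = classic_term_score(freq=6.0, doc_freq=20772, max_docs=44218,
                           query_norm=0.037568163, field_norm=0.0546875,
                           coord=1 / 7)
print(f"{score:.7f}")
```

The result should agree with the reported 0.0022155237 only to about six significant digits, since Lucene computes these values in 32-bit floats while Python uses doubles.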

Imprint: Medford, NJ : Learned Information
Source: Global complexity: information, chaos and control. Proceedings of the 59th Annual Meeting of the American Society for Information Science, ASIS'96, Baltimore, Maryland, 21-24 Oct 1996. Ed.: S. Hardin

Aguilar-Amat, A.; Parra, J.; Piqué, R.: Logical organization of information at BACO : a knowledge multilingual database for translation purposes (1996) 0.00

0.0021927997 = product of:
  0.015349597 = sum of:
    0.015349597 = weight(_text_:information in 5170) [ClassicSimilarity], result of:
      0.015349597 = score(doc=5170,freq=2.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.23274569 = fieldWeight in 5170, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.09375 = fieldNorm(doc=5170)
  0.14285715 = coord(1/7)

Pollitt, A.S.; Ellis, G.: Multilingual access to document databases (1993) 0.00

0.0021927997 = product of:
  0.015349597 = sum of:
    0.015349597 = weight(_text_:information in 1302) [ClassicSimilarity], result of:
      0.015349597 = score(doc=1302,freq=8.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.23274569 = fieldWeight in 1302, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=1302)
  0.14285715 = coord(1/7)

Imprint: Antigonish, NS : Canadian Association for Information Science
Series: Annual Conference / Canadian Association for Information Science ; 21
Source: Information as a Global Commodity - Communication, Processing and Use (CAIS/ACSI '93) : 21st Annual Conference Canadian Association for Information Science, Antigonish, Nova Scotia, Canada. July 1993

Multilingual information management : current levels and future abilities. A report Commissioned by the US National Science Foundation and also delivered to the European Commission's Language Engineering Office and the US Defense Advanced Research Projects Agency, April 1999 (1999) 0.00
```
0.0021927995 = product of:
  0.015349596 = sum of:
    0.015349596 = weight(_text_:information in 6068) [ClassicSimilarity], result of:
      0.015349596 = score(doc=6068,freq=18.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.23274568 = fieldWeight in 6068, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03125 = fieldNorm(doc=6068)
  0.14285715 = coord(1/7)
```
Abstract

Over the past 50 years, a variety of language-related capabilities has been developed in machine translation, information retrieval, speech recognition, text summarization, and so on. These applications rest upon a set of core techniques such as language modeling, information extraction, parsing, generation, and multimedia planning and integration; and they involve methods using statistics, rules, grammars, lexicons, ontologies, training techniques, and so on. It is a puzzling fact that although all of this work deals with language in some form or other, the major applications have each developed a separate research field. For example, there is no reason why speech recognition techniques involving n-grams and hidden Markov models could not have been used in machine translation 15 years earlier than they were, or why some of the lexical and semantic insights from the subarea called Computational Linguistics are still not used in information retrieval.
This picture will rapidly change. The twin challenges of massive information overload via the web and ubiquitous computers present us with an unavoidable task: developing techniques to handle multilingual and multi-modal information robustly and efficiently, with as high quality performance as possible. The most effective way for us to address such a mammoth task, and to ensure that our various techniques and applications fit together, is to start talking across the artificial research boundaries. Extending the current technologies will require integrating the various capabilities into multi-functional and multi-lingual natural language systems. However, at this time there is no clear vision of how these technologies could or should be assembled into a coherent framework. What would be involved in connecting a speech recognition system to an information retrieval engine, and then using machine translation and summarization software to process the retrieved text? How can traditional parsing and generation be enhanced with statistical techniques? What would be the effect of carefully crafted lexicons on traditional information retrieval? At which points should machine translation be interleaved within information retrieval systems to enable multilingual processing?

Peters, C.; Picchi, E.: Across languages, across cultures : issues in multilinguality and digital libraries (1997) 0.00

0.0020673913 = product of:
  0.014471739 = sum of:
    0.014471739 = weight(_text_:information in 1233) [ClassicSimilarity], result of:
      0.014471739 = score(doc=1233,freq=4.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.21943474 = fieldWeight in 1233, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0625 = fieldNorm(doc=1233)
  0.14285715 = coord(1/7)

Abstract: With the recent rapid diffusion over the international computer networks of world-wide distributed document bases, the question of multilingual access and multilingual information retrieval is becoming increasingly relevant. We briefly discuss just some of the issues that must be addressed in order to implement a multilingual interface for a Digital Library system and describe our own approach to this problem.
Theme: Information Gateway

Ata, B.M.A.: SISDOM: a multilingual document retrieval system (1995) 0.00

0.0019566366 = product of:
  0.013696454 = sum of:
    0.013696454 = product of:
      0.041089363 = sum of:
        0.041089363 = weight(_text_:29 in 895) [ClassicSimilarity], result of:
          0.041089363 = score(doc=895,freq=2.0), product of:
            0.13215305 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.037568163 = queryNorm
            0.31092256 = fieldWeight in 895, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0625 = fieldNorm(doc=895)
      0.33333334 = coord(1/3)
  0.14285715 = coord(1/7)
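
In this tree the matching term sits inside a nested clause, so two coord factors apply: 1/3 for the inner clause and 1/7 for the outer query. A minimal sketch of that arithmetic, using the leaf weight reported above as given (the variable names are illustrative):

```python
from fractions import Fraction

# Leaf term weight copied from the explain tree for doc 895
leaf_weight = 0.041089363

# coord factors on the path from the leaf to the root: inner clause, then outer query
coords = [Fraction(1, 3), Fraction(1, 7)]

score = leaf_weight
for c in coords:
    score *= float(c)

print(f"{score:.10f}")
```

The product should match the top-level 0.0019566366 up to float rounding.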

Date: 31. 7.1996 9:29:12

Timotin, A.: Multilingvism si tezaure de concepte (1994) 0.00

0.0019390353 = product of:
  0.013573246 = sum of:
    0.013573246 = product of:
      0.040719736 = sum of:
        0.040719736 = weight(_text_:22 in 7887) [ClassicSimilarity], result of:
          0.040719736 = score(doc=7887,freq=2.0), product of:
            0.1315573 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.037568163 = queryNorm
            0.30952093 = fieldWeight in 7887, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=7887)
      0.33333334 = coord(1/3)
  0.14285715 = coord(1/7)

Source: Probleme de Informare si Documentare. 28(1994) no.1, S.13-22

Cao, L.; Leong, M.-K.; Low, H.-B.: Searching heterogeneous multilingual bibliographic sources (1998) 0.00

0.0019390353 = product of:
  0.013573246 = sum of:
    0.013573246 = product of:
      0.040719736 = sum of:
        0.040719736 = weight(_text_:22 in 3564) [ClassicSimilarity], result of:
          0.040719736 = score(doc=3564,freq=2.0), product of:
            0.1315573 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.037568163 = queryNorm
            0.30952093 = fieldWeight in 3564, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=3564)
      0.33333334 = coord(1/3)
  0.14285715 = coord(1/7)

Date: 1. 8.1996 22:08:06

Heinzelin, D. de; d'Hautcourt, F.; Pols, R.: Un nouveaux thesaurus multilingue informatise relatif aux instruments de musique (1998) 0.00

0.0019390353 = product of:
  0.013573246 = sum of:
    0.013573246 = product of:
      0.040719736 = sum of:
        0.040719736 = weight(_text_:22 in 932) [ClassicSimilarity], result of:
          0.040719736 = score(doc=932,freq=2.0), product of:
            0.1315573 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.037568163 = queryNorm
            0.30952093 = fieldWeight in 932, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=932)
      0.33333334 = coord(1/3)
  0.14285715 = coord(1/7)

Date: 1. 8.1996 22:01:00

Automated systems for access to multilingual and multiscript library materials : Proceedings of the ... IFLA satellite meeting ... Madrid, August 18-19, 1993 (1994) 0.00

0.0018273331 = product of:
  0.012791331 = sum of:
    0.012791331 = weight(_text_:information in 7705) [ClassicSimilarity], result of:
      0.012791331 = score(doc=7705,freq=2.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.19395474 = fieldWeight in 7705, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.078125 = fieldNorm(doc=7705)
  0.14285715 = coord(1/7)

Editor: IFLA Section on Information Technology

Krieger, C.; Schmid, H.: The thesaurus implementation for AGRIS on CD-ROM (1993) 0.00
```
0.0018089674 = product of:
  0.012662771 = sum of:
    0.012662771 = weight(_text_:information in 6950) [ClassicSimilarity], result of:
      0.012662771 = score(doc=6950,freq=4.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.1920054 = fieldWeight in 6950, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6950)
  0.14285715 = coord(1/7)
```
Abstract

AGRIS, the International System for Agricultural Sciences and Technology, became available on CD-ROM in 1989. In 1992, the AGROVOC thesaurus was added to the CD-ROM database as a special searching feature. AGROVOC is a multilingual thesaurus of agricultural terminology, in English, French, Spanish, Italian and German, and is AGRIS's attempt to respond to demands for multilingual access to the database. In 1986, AGROVOC became the mandatory indexing tool for AGRIS and replaced the commodity and geographical codes used previously. The thesaurus is divided into 2 main sections: a list of permuted terms providing access to the online thesaurus via descriptors and cross references; and a term detail section providing information about relationships between descriptors

Source

Quarterly bulletin of the International Association of Agricultural Information Specialists. 38(1993) no.4, S.185-189

Pollitt, A.S.; Ellis, G.P.; Smith, M.P.; Gregory, M.R.; Li, C.S.; Zangenberg, H.: A common query interface for multilingual document retrieval from databases of the European Community Institutions (1993) 0.00

0.0018089674 = product of:
  0.012662771 = sum of:
    0.012662771 = weight(_text_:information in 7736) [ClassicSimilarity], result of:
      0.012662771 = score(doc=7736,freq=4.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.1920054 = fieldWeight in 7736, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7736)
  0.14285715 = coord(1/7)

Imprint: Oxford : Learned Information
Source: Online information 93: 17th International Online Meeting Proceedings, London, 7.-9.12.1993. Ed. by D.I. Raitt et al

Cross-language information retrieval (1998) 0.00
```
0.0017093135 = product of:
  0.011965194 = sum of:
    0.011965194 = weight(_text_:information in 6299) [ClassicSimilarity], result of:
      0.011965194 = score(doc=6299,freq=28.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.18142805 = fieldWeight in 6299, product of:
          5.2915025 = tf(freq=28.0), with freq of:
            28.0 = termFreq=28.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.01953125 = fieldNorm(doc=6299)
  0.14285715 = coord(1/7)
```
Content

Contains the contributions: GREFENSTETTE, G.: The Problem of Cross-Language Information Retrieval; DAVIS, M.W.: On the Effective Use of Large Parallel Corpora in Cross-Language Text Retrieval; BALLESTEROS, L. and W.B. CROFT: Statistical Methods for Cross-Language Information Retrieval; Distributed Cross-Lingual Information Retrieval; Automatic Cross-Language Information Retrieval Using Latent Semantic Indexing; EVANS, D.A. et al.: Mapping Vocabularies Using Latent Semantics; PICCHI, E. and C. PETERS: Cross-Language Information Retrieval: A System for Comparable Corpus Querying; YAMABANA, K. et al.: A Language Conversion Front-End for Cross-Language Information Retrieval; GACHOT, D.A. et al.: The Systran NLP Browser: An Application of Machine Translation Technology in Cross-Language Information Retrieval; HULL, D.: A Weighted Boolean Model for Cross-Language Text Retrieval; SHERIDAN, P. et al.: Building a Large Multilingual Test Collection from Comparable News Documents; OARD, D.W. and B.J. DORR: Evaluating Cross-Language Text Filtering Effectiveness

Footnote

Reviewed in: Machine translation review: 1999, no.10, S.26-27 (D. Lewis): "Cross Language Information Retrieval (CLIR) addresses the growing need to access large volumes of data across language boundaries. The typical requirement is for the user to input a free form query, usually a brief description of a topic, into a search or retrieval engine which returns a list, in ranked order, of documents or web pages that are relevant to the topic. The search engine matches the terms in the query to indexed terms, usually keywords previously derived from the target documents. Unlike monolingual information retrieval, CLIR requires query terms in one language to be matched to indexed terms in another. Matching can be done by bilingual dictionary lookup, full machine translation, or by applying statistical methods. A query's success is measured in terms of recall (how many potentially relevant target documents are found) and precision (what proportion of documents found are relevant). Issues in CLIR are how to translate query terms into index terms, how to eliminate alternative translations (e.g. to decide that French 'traitement' in a query means 'treatment' and not 'salary'), and how to rank or weight translation alternatives that are retained (e.g. how to order the French terms 'aventure', 'business', 'affaire', and 'liaison' as relevant translations of English 'affair'). Grefenstette provides a lucid and useful overview of the field and the problems. The volume brings together a number of experiments and projects in CLIR. Mark Davies (New Mexico State University) describes Recuerdo, a Spanish retrieval engine which reduces translation ambiguities by scanning indexes for parallel texts; it also uses either a bilingual dictionary or direct equivalents from a parallel corpus in order to compare results for queries on parallel texts.
Lisa Ballesteros and Bruce Croft (University of Massachusetts) use a 'local feedback' technique which automatically enhances a query by adding extra terms to it both before and after translation; such terms can be derived from documents known to be relevant to the query.
Christian Fluhr et al (DIST/SMTI, France) outline the EMIR (European Multilingual Information Retrieval) and ESPRIT projects. They found that using SYSTRAN to machine translate queries and to access material from various multilingual databases produced less relevant results than a method referred to as 'multilingual reformulation' (the mechanics of which are only hinted at). An interesting technique is Latent Semantic Indexing (LSI), described by Michael Littman et al (Brown University) and, most clearly, by David Evans et al (Carnegie Mellon University). LSI involves creating matrices of documents and the terms they contain and 'fitting' related documents into a reduced matrix space. This effectively allows queries to be mapped onto a common semantic representation of the documents. Eugenio Picchi and Carol Peters (Pisa) report on a procedure to create links between translation equivalents in an Italian-English parallel corpus. The links are used to construct parallel linguistic contexts in real-time for any term or combination of terms that is being searched for in either language. Their interest is primarily lexicographic but they plan to apply the same procedure to comparable corpora, i.e. to texts which are not translations of each other but which share the same domain. Kiyoshi Yamabana et al (NEC, Japan) address the issue of how to disambiguate between alternative translations of query terms. Their DMAX (double maximise) method looks at co-occurrence frequencies between both source language words and target language words in order to arrive at the most probable translation. The statistical data for the decision are derived, not from the translation texts but independently from monolingual corpora in each language. An interactive user interface allows the user to influence the selection of terms during the matching process.
Denis Gachot et al (SYSTRAN) describe the SYSTRAN NLP browser, a prototype tool which collects parsing information derived from a text or corpus previously translated with SYSTRAN. The user enters queries into the browser in either a structured or free form and receives grammatical and lexical information about the source text and/or its translation.
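
The review's recall and precision measures can be made concrete with a small example. This is a minimal Python sketch assuming the standard set-based definitions (recall as the fraction of relevant documents that were retrieved, precision as the fraction of retrieved documents that are relevant). The document identifiers are hypothetical and do not come from the reviewed volume.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query, treating both inputs as sets of doc ids."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant  # relevant documents that were actually returned
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall

# Hypothetical run: four documents returned, three documents judged relevant
p, r = precision_recall(["d1", "d2", "d3", "d4"], ["d2", "d4", "d7"])
print(p, r)
```

Here two of the four returned documents are relevant (precision 0.5), and two of the three relevant documents were found (recall 2/3).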

Series

The Kluwer International series on information retrieval

Schubert, K.: Parameters for the design of an intermediate language for multilingual thesauri (1995) 0.00

0.0016966559 = product of:
  0.011876591 = sum of:
    0.011876591 = product of:
      0.03562977 = sum of:
        0.03562977 = weight(_text_:22 in 2092) [ClassicSimilarity], result of:
          0.03562977 = score(doc=2092,freq=2.0), product of:
            0.1315573 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.037568163 = queryNorm
            0.2708308 = fieldWeight in 2092, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2092)
      0.33333334 = coord(1/3)
  0.14285715 = coord(1/7)

Source: Knowledge organization. 22(1995) nos.3/4, S.136-140

Pearce, C.; Nicholas, C.: TELLTALE: Experiments in a dynamic hypertext environment for degraded and multilingual data (1996) 0.00
```
0.0015505435 = product of:
  0.010853804 = sum of:
    0.010853804 = weight(_text_:information in 4071) [ClassicSimilarity], result of:
      0.010853804 = score(doc=4071,freq=4.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.16457605 = fieldWeight in 4071, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=4071)
  0.14285715 = coord(1/7)
```
Abstract

Methods and tools for finding documents relevant to a user's needs in document corpora can be found in the information retrieval, library science, and hypertext communities. Typically, these systems provide retrieval capabilities for fairly static corpora, their algorithms are dependent on the language for which they are written, e.g. English, and they do not perform well when presented with misspelled words or text that has been degraded by OCR techniques. In this article, we present experimentation results for the TELLTALE system. TELLTALE is a dynamic hypertext environment that provides full-text search from a hypertext-style user interface for text corpora that may be garbled by OCR or transmission errors, and that may contain languages other than English. TELLTALE uses several techniques based on n-grams (n character sequences of text). With these results we show that the dynamic linkage mechanisms in TELLTALE are tolerant of garbles in up to 30% of the characters in the body of the texts
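
To illustrate why character n-grams tolerate garbled text, here is a minimal Python sketch. It compares strings by the Dice overlap of their character trigram sets. The Dice measure and the sample strings are assumptions chosen for illustration; the abstract does not say which n-gram techniques TELLTALE itself uses.

```python
def char_ngrams(text: str, n: int = 3) -> set:
    """Set of overlapping character n-grams of the lowercased text."""
    text = text.lower()
    return {text[i:i + n] for i in range(len(text) - n + 1)}

def dice(a: str, b: str, n: int = 3) -> float:
    """Dice coefficient between the n-gram sets of two strings."""
    ga, gb = char_ngrams(a, n), char_ngrams(b, n)
    if not ga and not gb:
        return 1.0
    return 2 * len(ga & gb) / (len(ga) + len(gb))

clean = "multilingual retrieval"
garbled = "multilingua1 retrleval"    # two characters corrupted, as OCR might do
unrelated = "agricultural thesaurus"

print(dice(clean, garbled), dice(clean, unrelated))
```

Each corrupted character only disturbs the few n-grams that contain it, so the garbled string still shares most of its trigrams with the clean one, while the unrelated string shares very few.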

Source

Journal of the American Society for Information Science. 47(1996) no.4, S.263-275

Senez, D.: Developments in Systran (1995) 0.00
```
0.0014618665 = product of:
  0.010233065 = sum of:
    0.010233065 = weight(_text_:information in 8546) [ClassicSimilarity], result of:
      0.010233065 = score(doc=8546,freq=2.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.1551638 = fieldWeight in 8546, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0625 = fieldNorm(doc=8546)
  0.14285715 = coord(1/7)
```
Abstract

Systran, the European Commission's multilingual machine translation system, is a fast service which is available to all Commission officials. The computer cannot match the skills of the professional translator, who must continue to be responsible for all texts which are legally binding or which are for publication. But machine translation can deal, in a matter of minutes, with short-lived documents, designed, say, for information or preparatory work, and which are required urgently. It can also give a broad view of a paper in an unfamiliar language, so that an official can decide how much, if any, of it needs to go to translators

Copestake, A.: Acquisition of lexical translation relations from MRDs (1994/95) 0.00

0.0014618665 = product of:
  0.010233065 = sum of:
    0.010233065 = weight(_text_:information in 4073) [ClassicSimilarity], result of:
      0.010233065 = score(doc=4073,freq=2.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.1551638 = fieldWeight in 4073, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0625 = fieldNorm(doc=4073)
  0.14285715 = coord(1/7)

Abstract: Presents a methodology for extracting information about lexical translation equivalences from the machine readable versions of conventional dictionaries (MRDs), and describes a series of experiments on semi-automatic construction of a linked multilingual lexical knowledge base for English, Dutch and Spanish. Discusses the advantages and limitations of using MRDs that this has revealed, and some strategies developed to cover gaps where no direct translation can be found

Zimmermann, H.H.: Überlegungen zu einem multilingualen Thesaurus-Konzept (1995) 0.00

0.0014618665 = product of:
  0.010233065 = sum of:
    0.010233065 = weight(_text_:information in 2076) [ClassicSimilarity], result of:
      0.010233065 = score(doc=2076,freq=2.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.1551638 = fieldWeight in 2076, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0625 = fieldNorm(doc=2076)
  0.14285715 = coord(1/7)

Abstract: The topic of thesauri is first placed in the context of the full range of indexing and retrieval options offered by an information retrieval system. On this basis, a multilingual thesaurus concept is developed. Key elements are: enabling access through the user's own vocabulary, a systematic and transparent differentiation of meanings, and a basic set of relations anchored in a single ("designated") natural language.

Oard, D.W.: Alternative approaches for cross-language text retrieval (1997) 0.00
```
0.0014301144 = product of:
  0.0100108 = sum of:
    0.0100108 = weight(_text_:information in 1164) [ClassicSimilarity], result of:
      0.0100108 = score(doc=1164,freq=10.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.1517936 = fieldWeight in 1164, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.02734375 = fieldNorm(doc=1164)
  0.14285715 = coord(1/7)
```
Abstract

The explosive growth of the Internet and other sources of networked information has made automatic mediation of access to networked information sources an increasingly important problem. Much of this information is expressed as electronic text, and it is becoming practical to automatically convert some printed documents and recorded speech to electronic text as well. Thus, automated systems capable of detecting useful documents are finding widespread application. With even a small number of languages it can be inconvenient to issue the same query repeatedly in every language, so users who are able to read more than one language will likely prefer a multilingual text retrieval system over a collection of monolingual systems. And since reading ability in a language does not always imply fluent writing ability in that language, such users will likely find cross-language text retrieval particularly useful for languages in which they are less confident of their ability to express their information needs effectively. The use of such systems can also be beneficial if the user is able to read only a single language. For example, when only a small portion of the document collection will ever be examined by the user, performing retrieval before translation can be significantly more economical than performing translation before retrieval. So when the application is sufficiently important to justify the time and effort required for translation, those costs can be minimized if an effective cross-language text retrieval system is available. Even when translation is not available, there are circumstances in which cross-language text retrieval could be useful to a monolingual user. For example, a researcher might find a paper published in an unfamiliar language useful if that paper contains references to works by the same author that are in the researcher's native language.
Multilingual text retrieval can be defined as selection of useful documents from collections that may contain several languages (English, French, Chinese, etc.). This formulation allows for the possibility that individual documents might contain more than one language, a common occurrence in some applications. Both cross-language and within-language retrieval are included in this formulation, but it is the cross-language aspect of the problem which distinguishes multilingual text retrieval from its well studied monolingual counterpart. At the SIGIR 96 workshop on "Cross-Linguistic Information Retrieval" the participants discussed the proliferation of terminology being used to describe the field and settled on "Cross-Language" as the best single description of the salient aspect of the problem. "Multilingual" was felt to be too broad, since that term has also been used to describe systems able to perform within-language retrieval in more than one language but that lack any cross-language capability. "Cross-lingual" and "cross-linguistic" were felt to be equally good descriptions of the field, but "cross-language" was selected as the preferred term in the interest of standardization. Unfortunately, at about the same time the U.S. Defense Advanced Research Projects Agency (DARPA) introduced "translingual" as their preferred term, so we are still some distance from reaching consensus on this matter.

Soergel, D.: SemWeb: proposal for an open, multifunctional, multilingual system for integrated access to knowledge about concepts and terminology (1996) 0.00
```
0.0012921195 = product of:
  0.009044836 = sum of:
    0.009044836 = weight(_text_:information in 3575) [ClassicSimilarity], result of:
      0.009044836 = score(doc=3575,freq=4.0), product of:
        0.06595008 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.037568163 = queryNorm
        0.13714671 = fieldWeight in 3575, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3575)
  0.14285715 = coord(1/7)
```
Abstract

Presents a proposal for the long-range development of an open, multifunctional, multilingual system for integrated access to many kinds of knowledge about concepts and terminology. The system would draw on existing knowledge bases that are accessible through the Internet or on CD-ROM and on a common integrated distributed knowledge base that would grow incrementally over time. Existing knowledge bases would be accessed through a common interface that would search several knowledge bases, collate the data into a common format, and present them to the user. The common integrated distributed knowledge base would provide an environment in which many contributors could carry out classification and terminological projects more efficiently, with the results available in a common format. Over time, data from other knowledge bases could be incorporated into the common knowledge base, either by actual transfer (provided the knowledge base producers are willing) or by reference through a link. Either way, such incorporation requires intellectual work but allows for tighter integration than common interface access to multiple knowledge bases. Each piece of information in the common knowledge base will have all its sources attached, providing an acknowledgment mechanism that gives due credit to all contributors. The whole system would be designed to be usable by many levels of users for improved information exchange.
