Search (269 results, page 1 of 14)

Davis, M.W.: On the effective use of large parallel corpora in cross-language text retrieval (1998) 0.04

0.0434326 = product of:
  0.1302978 = sum of:
    0.07462976 = weight(_text_:retrieval in 6302) [ClassicSimilarity], result of:
      0.07462976 = score(doc=6302,freq=6.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.6946405 = fieldWeight in 6302, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.09375 = fieldNorm(doc=6302)
    0.044153035 = weight(_text_:use in 6302) [ClassicSimilarity], result of:
      0.044153035 = score(doc=6302,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.40597942 = fieldWeight in 6302, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.09375 = fieldNorm(doc=6302)
    0.011514995 = weight(_text_:of in 6302) [ClassicSimilarity], result of:
      0.011514995 = score(doc=6302,freq=2.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.20732689 = fieldWeight in 6302, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.09375 = fieldNorm(doc=6302)
  0.33333334 = coord(3/9)
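
The arithmetic in the explanation above can be reproduced by hand. The sketch below assumes the usual ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and takes queryNorm, fieldNorm and the frequencies directly from the listing for doc 6302. The function and variable names are illustrative only and are not part of the catalogue software.

```python
import math

MAX_DOCS = 44218          # maxDocs, as shown in the listing
QUERY_NORM = 0.035517205  # queryNorm, as shown in the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    """score = queryWeight * fieldWeight for a single matching term."""
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm                    # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# (term, freq, docFreq) copied from the explanation for doc 6302
terms = [("retrieval", 6.0, 5836), ("use", 2.0, 5623), ("of", 2.0, 25162)]
field_norm = 0.09375

partial = sum(term_score(freq, df, field_norm) for _, freq, df in terms)
total = partial * (3 / 9)  # coord(3/9): three of nine query clauses matched
print(f"sum={partial:.7f} score={total:.7f}")
```

The result agrees with the listed 0.0434326 up to single-precision rounding; the catalogue's own values are 32-bit floats, so the last digits may differ slightly.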

Series: The Kluwer International series on information retrieval
Source: Cross-language information retrieval. Ed.: G. Grefenstette

Fluhr, C.: Crosslingual access to photo databases (2012) 0.04

0.038084555 = product of:
  0.08569025 = sum of:
    0.021543756 = weight(_text_:retrieval in 93) [ClassicSimilarity], result of:
      0.021543756 = score(doc=93,freq=2.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.20052543 = fieldWeight in 93, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=93)
    0.038237654 = weight(_text_:use in 93) [ClassicSimilarity], result of:
      0.038237654 = score(doc=93,freq=6.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.35158852 = fieldWeight in 93, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.046875 = fieldNorm(doc=93)
    0.016284661 = weight(_text_:of in 93) [ClassicSimilarity], result of:
      0.016284661 = score(doc=93,freq=16.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.2932045 = fieldWeight in 93, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=93)
    0.009624182 = product of:
      0.028872546 = sum of:
        0.028872546 = weight(_text_:22 in 93) [ClassicSimilarity], result of:
          0.028872546 = score(doc=93,freq=2.0), product of:
            0.1243752 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.035517205 = queryNorm
            0.23214069 = fieldWeight in 93, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=93)
      0.33333334 = coord(1/3)
  0.44444445 = coord(4/9)
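
This record has a nested clause: the match on the term "22" is scaled by its own coord(1/3) before it joins the outer sum, which is then scaled by coord(4/9). A minimal sketch of that two-level arithmetic, using the per-term scores exactly as listed for doc 93 (the helper name `coord` is chosen here for illustration):

```python
def coord(matched, total):
    """Coordination factor: fraction of the query clauses that matched."""
    return matched / total


# Inner clause: only one of its three sub-clauses (the term "22") matched.
inner = 0.028872546 * coord(1, 3)

# Outer query: three direct term matches plus the nested clause.
clauses = [
    0.021543756,  # retrieval
    0.038237654,  # use
    0.016284661,  # of
    inner,        # nested clause containing "22"
]
score = sum(clauses) * coord(4, 9)  # four of nine top-level clauses matched
print(f"inner={inner:.9f} score={score:.9f}")
```

The nested coord explains why the "22" match, with a raw score of 0.0289, adds less than 0.01 to the sum.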

Abstract: This paper is about search of photos in photo databases of agencies which sell photos over the Internet. The problem is far from the behavior of photo databases managed by librarians and also far from the corpora generally used for research purposes. The descriptions use mainly single words and it is well known that it is not the best way to have a good search. This increases the problem of semantic ambiguity. This problem of semantic ambiguity is crucial for cross-language querying. On the other hand, users are not aware of documentation techniques and generally use very simple queries but want to get precise answers. This paper gives the experience gained in three years of use (2006-2008) of cross-language access to several of the main international commercial photo databases. The languages used were French, English, and German.
Date: 17. 4.2012 14:25:22
Source: Next generation search engines: advanced models for information retrieval. Eds.: C. Jouis et al.

Frâncu, V.; Sabo, C.-N.: Implementation of a UDC-based multilingual thesaurus in a library catalogue : the case of BiblioPhil (2010) 0.04

0.03791122 = product of:
  0.085300244 = sum of:
    0.03731488 = weight(_text_:retrieval in 3697) [ClassicSimilarity], result of:
      0.03731488 = score(doc=3697,freq=6.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.34732026 = fieldWeight in 3697, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=3697)
    0.022076517 = weight(_text_:use in 3697) [ClassicSimilarity], result of:
      0.022076517 = score(doc=3697,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.20298971 = fieldWeight in 3697, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.046875 = fieldNorm(doc=3697)
    0.016284661 = weight(_text_:of in 3697) [ClassicSimilarity], result of:
      0.016284661 = score(doc=3697,freq=16.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.2932045 = fieldWeight in 3697, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=3697)
    0.009624182 = product of:
      0.028872546 = sum of:
        0.028872546 = weight(_text_:22 in 3697) [ClassicSimilarity], result of:
          0.028872546 = score(doc=3697,freq=2.0), product of:
            0.1243752 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.035517205 = queryNorm
            0.23214069 = fieldWeight in 3697, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=3697)
      0.33333334 = coord(1/3)
  0.44444445 = coord(4/9)

Abstract: In order to enhance the use of Universal Decimal Classification (UDC) numbers in information retrieval, the authors have represented classification with multilingual thesaurus descriptors and implemented this solution in an automated way. The authors illustrate a solution implemented in a BiblioPhil library system. The standard formats used are UNIMARC for subject authority records (i.e. the UDC-based multilingual thesaurus) and MARC XML support for data transfer. The multilingual thesaurus was built according to existing standards, the constituent parts of the classification notations being used as the basis for search terms in the multilingual information retrieval. The verbal equivalents, descriptors and non-descriptors, are used to expand the number of concepts and are given in Romanian, English and French. This approach saves the time of the indexer and provides more user-friendly and easier access to the bibliographic information. The multilingual aspect of the thesaurus enhances information access for a greater number of online users.
Date: 22. 7.2010 20:40:56
Theme: Klassifikationssysteme im Online-Retrieval

Tsai, M.-F.; Chen, H.-H.; Wang, Y.-T.: Learning a merge model for multilingual information retrieval (2011) 0.03

0.033748705 = product of:
  0.07593458 = sum of:
    0.031095734 = weight(_text_:retrieval in 2750) [ClassicSimilarity], result of:
      0.031095734 = score(doc=2750,freq=6.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.28943354 = fieldWeight in 2750, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2750)
    0.026017427 = weight(_text_:use in 2750) [ClassicSimilarity], result of:
      0.026017427 = score(doc=2750,freq=4.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.23922569 = fieldWeight in 2750, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2750)
    0.0107284635 = weight(_text_:of in 2750) [ClassicSimilarity], result of:
      0.0107284635 = score(doc=2750,freq=10.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.19316542 = fieldWeight in 2750, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2750)
    0.008092954 = product of:
      0.02427886 = sum of:
        0.02427886 = weight(_text_:29 in 2750) [ClassicSimilarity], result of:
          0.02427886 = score(doc=2750,freq=2.0), product of:
            0.12493842 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.035517205 = queryNorm
            0.19432661 = fieldWeight in 2750, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2750)
      0.33333334 = coord(1/3)
  0.44444445 = coord(4/9)

Abstract: This paper proposes a learning approach for the merging process in multilingual information retrieval (MLIR). To conduct the learning approach, we present a number of features that may influence the MLIR merging process. These features are mainly extracted from three levels: query, document, and translation. After the feature extraction, we then use the FRank ranking algorithm to construct a merge model. To the best of our knowledge, this practice is the first attempt to use a learning-based ranking algorithm to construct a merge model for MLIR merging. In our experiments, three test collections for the task of crosslingual information retrieval (CLIR) in NTCIR3, 4, and 5 are employed to assess the performance of our proposed method. Moreover, several merging methods are also carried out for a comparison, including traditional merging methods, the 2-step merging strategy, and the merging method based on logistic regression. The experimental results show that our proposed method can significantly improve merging quality on two different types of datasets. In addition to the effectiveness, through the merge model generated by FRank, our method can further identify key factors that influence the merging process. This information might provide us more insight and understanding into MLIR merging.
Date: 29. 1.2016 20:34:33

Larkey, L.S.; Connell, M.E.: Structured queries, language modelling, and relevance modelling in cross-language information retrieval (2005) 0.03

0.03334116 = product of:
  0.0750176 = sum of:
    0.03590626 = weight(_text_:retrieval in 1022) [ClassicSimilarity], result of:
      0.03590626 = score(doc=1022,freq=8.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.33420905 = fieldWeight in 1022, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1022)
    0.018397098 = weight(_text_:use in 1022) [ClassicSimilarity], result of:
      0.018397098 = score(doc=1022,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.1691581 = fieldWeight in 1022, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1022)
    0.012694089 = weight(_text_:of in 1022) [ClassicSimilarity], result of:
      0.012694089 = score(doc=1022,freq=14.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.22855641 = fieldWeight in 1022, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1022)
    0.008020152 = product of:
      0.024060456 = sum of:
        0.024060456 = weight(_text_:22 in 1022) [ClassicSimilarity], result of:
          0.024060456 = score(doc=1022,freq=2.0), product of:
            0.1243752 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.035517205 = queryNorm
            0.19345059 = fieldWeight in 1022, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1022)
      0.33333334 = coord(1/3)
  0.44444445 = coord(4/9)

Abstract: Two probabilistic approaches to cross-lingual retrieval are in wide use today, those based on probabilistic models of relevance, as exemplified by INQUERY, and those based on language modeling. INQUERY, as a query net model, allows the easy incorporation of query operators, including a synonym operator, which has proven to be extremely useful in cross-language information retrieval (CLIR), in an approach often called structured query translation. In contrast, language models incorporate translation probabilities into a unified framework. We compare the two approaches on Arabic and Spanish data sets, using two kinds of bilingual dictionaries--one derived from a conventional dictionary, and one derived from a parallel corpus. We find that structured query processing gives slightly better results when queries are not expanded. On the other hand, when queries are expanded, language modeling gives better results, but only when using a probabilistic dictionary derived from a parallel corpus. We pursue two additional issues inherent in the comparison of structured query processing with language modeling. The first concerns query expansion, and the second is the role of translation probabilities. We compare conventional expansion techniques (pseudo-relevance feedback) with relevance modeling, a new IR approach which fits into the formal framework of language modeling. We find that relevance modeling and pseudo-relevance feedback achieve comparable levels of retrieval and that good translation probabilities confer a small but significant advantage.
Date: 26.12.2007 20:22:11

Ménard, E.: Ordinary image retrieval in a multilingual context : a comparison of two indexing vocabularies (2010) 0.03
```
0.033235926 = product of:
  0.07478084 = sum of:
    0.035180803 = weight(_text_:retrieval in 3946) [ClassicSimilarity], result of:
      0.035180803 = score(doc=3946,freq=12.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.32745665 = fieldWeight in 3946, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.03125 = fieldNorm(doc=3946)
    0.014717679 = weight(_text_:use in 3946) [ClassicSimilarity], result of:
      0.014717679 = score(doc=3946,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.13532647 = fieldWeight in 3946, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.03125 = fieldNorm(doc=3946)
    0.018407993 = weight(_text_:of in 3946) [ClassicSimilarity], result of:
      0.018407993 = score(doc=3946,freq=46.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.33143494 = fieldWeight in 3946, product of:
          6.78233 = tf(freq=46.0), with freq of:
            46.0 = termFreq=46.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03125 = fieldNorm(doc=3946)
    0.006474362 = product of:
      0.019423086 = sum of:
        0.019423086 = weight(_text_:29 in 3946) [ClassicSimilarity], result of:
          0.019423086 = score(doc=3946,freq=2.0), product of:
            0.12493842 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.035517205 = queryNorm
            0.15546128 = fieldWeight in 3946, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03125 = fieldNorm(doc=3946)
      0.33333334 = coord(1/3)
  0.44444445 = coord(4/9)
```
Abstract: Purpose - This paper seeks to examine image retrieval within two different contexts: a monolingual context where the language of the query is the same as the indexing language and a multilingual context where the language of the query is different from the indexing language. The study also aims to compare two different approaches for the indexing of ordinary images representing common objects: traditional image indexing with the use of a controlled vocabulary and free image indexing using uncontrolled vocabulary. Design/methodology/approach - This research uses three data collection methods. An analysis of the indexing terms was employed in order to examine the multiplicity of term types assigned to images. A simulation of the retrieval process involving a set of 30 images was performed with 60 participants. The quantification of the retrieval performance of each indexing approach was based on the usability measures, that is, effectiveness, efficiency and satisfaction of the user. Finally, a questionnaire was used to gather information on searcher satisfaction during and after the retrieval process. Findings - The results of this research are twofold. The analysis of indexing terms associated with all the 3,950 images provides a comprehensive description of the characteristics of the four non-combined indexing forms used for the study. Also, the retrieval simulation results offer information about the relative performance of the six indexing forms (combined and non-combined) in terms of their effectiveness, efficiency (temporal and human) and the image searcher's satisfaction. Originality/value - The findings of the study suggest that, in the near future, information systems could benefit from allowing an increased coexistence of controlled vocabularies and uncontrolled vocabularies, resulting from collaborative image tagging, for example, and giving the users the possibility to dynamically participate in the image-indexing process, in a more user-centred way.
Date: 29. 8.2010 10:51:07
Footnote: Contribution to a special issue: Content architecture: exploiting and managing diverse resources: proceedings of the first national conference of the United Kingdom chapter of the International Society for Knowledge Organization (ISKO)

Martinez Arellano, F.F.: Subject searching in online catalogs including Spanish and English material (1999) 0.03

0.029266862 = product of:
  0.087800585 = sum of:
    0.030467471 = weight(_text_:retrieval in 5350) [ClassicSimilarity], result of:
      0.030467471 = score(doc=5350,freq=4.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.2835858 = fieldWeight in 5350, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=5350)
    0.038237654 = weight(_text_:use in 5350) [ClassicSimilarity], result of:
      0.038237654 = score(doc=5350,freq=6.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.35158852 = fieldWeight in 5350, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.046875 = fieldNorm(doc=5350)
    0.01909546 = weight(_text_:of in 5350) [ClassicSimilarity], result of:
      0.01909546 = score(doc=5350,freq=22.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.34381276 = fieldWeight in 5350, product of:
          4.690416 = tf(freq=22.0), with freq of:
            22.0 = termFreq=22.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=5350)
  0.33333334 = coord(3/9)

Abstract: The use of title words, the combination of these through the use of logic operators, and the possibility of truncating them when carrying out subject searches, are some of the search options that have been incorporated into the online catalog. Several arguments in favor of these options have been expressed which state that they represent an approach for the use of natural language and that they facilitate information retrieval. However, expressed arguments against them that support the necessity of using controlled language to obtain more precision in search results also exist. This paper reports the main results from a study whose objective was to compare advantages and disadvantages of retrieval by keywords from the title and by subject headings included in the records of LIBRUNAM, an online catalog containing records for English and Spanish items at the National Autonomous University of Mexico.

Lin, W.-C.; Chang, Y.-C.; Chen, H.-H.: Integrating textual and visual information for cross-language image retrieval : a trans-media dictionary approach (2007) 0.03

0.028787574 = product of:
  0.08636272 = sum of:
    0.052771207 = weight(_text_:retrieval in 904) [ClassicSimilarity], result of:
      0.052771207 = score(doc=904,freq=12.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.49118498 = fieldWeight in 904, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=904)
    0.022076517 = weight(_text_:use in 904) [ClassicSimilarity], result of:
      0.022076517 = score(doc=904,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.20298971 = fieldWeight in 904, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.046875 = fieldNorm(doc=904)
    0.011514995 = weight(_text_:of in 904) [ClassicSimilarity], result of:
      0.011514995 = score(doc=904,freq=8.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.20732689 = fieldWeight in 904, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=904)
  0.33333334 = coord(3/9)

Abstract: This paper explores the integration of textual and visual information for cross-language image retrieval. An approach which automatically transforms textual queries into visual representations is proposed. First, we mine the relationships between text and images and employ the mined relationships to construct visual queries from textual ones. Then, the retrieval results of textual and visual queries are combined. To evaluate the proposed approach, we conduct English monolingual and Chinese-English cross-language retrieval experiments. The selection of suitable textual query terms to construct visual queries is the major issue. Experimental results show that the proposed approach improves retrieval performance, and use of nouns is appropriate to generate visual queries.
Footnote: Contribution in: Special issue on AIRS2005: Information Retrieval Research in Asia

Ata, B.M.A.: SISDOM: a multilingual document retrieval system (1995) 0.03

0.02662248 = product of:
  0.07986744 = sum of:
    0.049753174 = weight(_text_:retrieval in 895) [ClassicSimilarity], result of:
      0.049753174 = score(doc=895,freq=6.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.46309367 = fieldWeight in 895, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0625 = fieldNorm(doc=895)
    0.017165542 = weight(_text_:of in 895) [ClassicSimilarity], result of:
      0.017165542 = score(doc=895,freq=10.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.3090647 = fieldWeight in 895, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=895)
    0.012948724 = product of:
      0.038846172 = sum of:
        0.038846172 = weight(_text_:29 in 895) [ClassicSimilarity], result of:
          0.038846172 = score(doc=895,freq=2.0), product of:
            0.12493842 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.035517205 = queryNorm
            0.31092256 = fieldWeight in 895, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0625 = fieldNorm(doc=895)
      0.33333334 = coord(1/3)
  0.33333334 = coord(3/9)
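
Four of the records listed so far (docs 6302, 895, 3697 and 2750) contain "retrieval" exactly six times, yet their per-term scores differ. With freq, idf and queryNorm identical, the only varying factor is fieldNorm, a length normalisation that favours shorter fields. The sketch below assumes the same score formula shown in the explanations and uses only values copied from them; the names are illustrative.

```python
import math

IDF_RETRIEVAL = 3.024915  # idf(docFreq=5836, maxDocs=44218), from the listing
QUERY_NORM = 0.035517205  # queryNorm, from the listing

# fieldNorm per document id, copied from the explanations above
field_norms = {6302: 0.09375, 895: 0.0625, 3697: 0.046875, 2750: 0.0390625}


def retrieval_score(freq, field_norm):
    """queryWeight * fieldWeight for the term 'retrieval'."""
    query_weight = IDF_RETRIEVAL * QUERY_NORM
    field_weight = math.sqrt(freq) * IDF_RETRIEVAL * field_norm
    return query_weight * field_weight


scores = {doc: retrieval_score(6.0, norm) for doc, norm in field_norms.items()}
for doc, value in sorted(scores.items(), key=lambda item: -item[1]):
    print(f"doc={doc:5d} fieldNorm={field_norms[doc]:.7f} score={value:.8f}")
```

Because the term score is linear in fieldNorm, doc 6302 (fieldNorm 0.09375) scores twice as high on this term as doc 3697 (fieldNorm 0.046875).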

Abstract: The Malay language is widely used in Malaysia, Indonesia and Brunei. The growth in the number of documents written in Malay justifies the need for a document retrieval system for that language. Describes the implementation of a bilingual Malay and English full text document retrieval system: SIStem capaian DOkumen Multilingua (SISDOM), by the Kebangsaan University Malaysia. The system incorporates many facilities for users, including the choice of search techniques, browsing of retrieved documents, and ranking of documents.
Date: 31. 7.1996 9:29:12

Chen, K.-H.: Evaluating Chinese text retrieval with multilingual queries (2002) 0.03

0.02601744 = product of:
  0.07805232 = sum of:
    0.05026876 = weight(_text_:retrieval in 1851) [ClassicSimilarity], result of:
      0.05026876 = score(doc=1851,freq=8.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.46789268 = fieldWeight in 1851, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1851)
    0.016453419 = weight(_text_:of in 1851) [ClassicSimilarity], result of:
      0.016453419 = score(doc=1851,freq=12.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.29624295 = fieldWeight in 1851, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1851)
    0.011330134 = product of:
      0.0339904 = sum of:
        0.0339904 = weight(_text_:29 in 1851) [ClassicSimilarity], result of:
          0.0339904 = score(doc=1851,freq=2.0), product of:
            0.12493842 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.035517205 = queryNorm
            0.27205724 = fieldWeight in 1851, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1851)
      0.33333334 = coord(1/3)
  0.33333334 = coord(3/9)

Abstract: This paper reports the design of a Chinese test collection with multilingual queries and the application of this test collection to evaluate information retrieval systems. The effective indexing units, IR models, translation techniques, and query expansion for Chinese text retrieval are identified. The collaboration of East Asian countries for construction of test collections for cross-language multilingual text retrieval is also discussed in this paper. As well, a tool is designed to help assessors judge relevance and gather the events of relevance judgment. The log file created by this tool will be used to analyze the behaviors of assessors in the future.
Source: Knowledge organization. 29(2002) nos.3/4, S.156-170

Lassalle, E.: Text retrieval : from a monolingual system to a multilingual system (1993) 0.03

0.025918245 = product of:
  0.077754736 = sum of:
    0.03554538 = weight(_text_:retrieval in 7403) [ClassicSimilarity], result of:
      0.03554538 = score(doc=7403,freq=4.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.33085006 = fieldWeight in 7403, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7403)
    0.025755936 = weight(_text_:use in 7403) [ClassicSimilarity], result of:
      0.025755936 = score(doc=7403,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.23682132 = fieldWeight in 7403, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7403)
    0.016453419 = weight(_text_:of in 7403) [ClassicSimilarity], result of:
      0.016453419 = score(doc=7403,freq=12.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.29624295 = fieldWeight in 7403, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7403)
  0.33333334 = coord(3/9)

Abstract: Describes the TELMI monolingual text retrieval system and its future extension, a multilingual system. TELMI is designed for medium-sized databases containing short texts. The characteristics of the system are fine-grained natural language processing (NLP); an open domain and a large-scale knowledge base; automated indexing based on conceptual representation of texts; and reusability of the NLP tools. Discusses the French MINITEL service, the MGS information service and the TELMI research system, covering the full text system; NLP architecture; the lexical level; the syntactic level; the semantic level; and an example of the use of a generic system.
Source: Journal of document and text management. 1(1993) no.1, S.65-74
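
The score breakdown shown for this record (doc 7403) can be checked by hand. The following is a minimal sketch, assuming the classic TF-IDF arithmetic that the explain tree itself displays: tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), each term contributes queryWeight × fieldWeight, and the sum is scaled by the coord factor. The function names are illustrative and are not part of any search library; queryNorm and fieldNorm are copied from the listing rather than recomputed.

```python
import math

# Values copied from the explain output for doc 7403.
MAX_DOCS = 44218
QUERY_NORM = 0.035517205
FIELD_NORM = 0.0546875


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as it appears in the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm=FIELD_NORM, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, matched, total_clauses):
    """Sum the term scores and apply coord(matched/total_clauses)."""
    return sum(term_score(f, df) for f, df in terms) * matched / total_clauses


# (termFreq, docFreq) for "retrieval", "use" and "of" in doc 7403.
terms_7403 = [(4.0, 5836), (2.0, 5623), (12.0, 25162)]
score = document_score(terms_7403, matched=3, total_clauses=9)
print(round(score, 6))  # approximately 0.025918, matching the listed 0.025918245
```

Small differences in the last digits are expected, because the listing was produced with single-precision floats while this sketch uses double precision.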

Oard, D.W.: Alternative approaches for cross-language text retrieval (1997) 0.03
```
0.025396675 = product of:
  0.076190025 = sum of:
    0.048672516 = weight(_text_:retrieval in 1164) [ClassicSimilarity], result of:
      0.048672516 = score(doc=1164,freq=30.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.45303512 = fieldWeight in 1164, product of:
          5.477226 = tf(freq=30.0), with freq of:
            30.0 = termFreq=30.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.02734375 = fieldNorm(doc=1164)
    0.012877968 = weight(_text_:use in 1164) [ClassicSimilarity], result of:
      0.012877968 = score(doc=1164,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.11841066 = fieldWeight in 1164, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.02734375 = fieldNorm(doc=1164)
    0.014639538 = weight(_text_:of in 1164) [ClassicSimilarity], result of:
      0.014639538 = score(doc=1164,freq=38.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.2635841 = fieldWeight in 1164, product of:
          6.164414 = tf(freq=38.0), with freq of:
            38.0 = termFreq=38.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.02734375 = fieldNorm(doc=1164)
  0.33333334 = coord(3/9)
```
Abstract

The explosive growth of the Internet and other sources of networked information has made automatic mediation of access to networked information sources an increasingly important problem. Much of this information is expressed as electronic text, and it is becoming practical to automatically convert some printed documents and recorded speech to electronic text as well. Thus, automated systems capable of detecting useful documents are finding widespread application. With even a small number of languages it can be inconvenient to issue the same query repeatedly in every language, so users who are able to read more than one language will likely prefer a multilingual text retrieval system over a collection of monolingual systems. And since reading ability in a language does not always imply fluent writing ability in that language, such users will likely find cross-language text retrieval particularly useful for languages in which they are less confident of their ability to express their information needs effectively. The use of such systems can also be beneficial if the user is able to read only a single language. For example, when only a small portion of the document collection will ever be examined by the user, performing retrieval before translation can be significantly more economical than performing translation before retrieval. So when the application is sufficiently important to justify the time and effort required for translation, those costs can be minimized if an effective cross-language text retrieval system is available. Even when translation is not available, there are circumstances in which cross-language text retrieval could be useful to a monolingual user. For example, a researcher might find a paper published in an unfamiliar language useful if that paper contains references to works by the same author that are in the researcher's native language.
Multilingual text retrieval can be defined as selection of useful documents from collections that may contain several languages (English, French, Chinese, etc.). This formulation allows for the possibility that individual documents might contain more than one language, a common occurrence in some applications. Both cross-language and within-language retrieval are included in this formulation, but it is the cross-language aspect of the problem which distinguishes multilingual text retrieval from its well-studied monolingual counterpart. At the SIGIR 96 workshop on "Cross-Linguistic Information Retrieval" the participants discussed the proliferation of terminology being used to describe the field and settled on "Cross-Language" as the best single description of the salient aspect of the problem. "Multilingual" was felt to be too broad, since that term has also been used to describe systems that are able to perform within-language retrieval in more than one language but lack any cross-language capability. "Cross-lingual" and "cross-linguistic" were felt to be equally good descriptions of the field, but "cross-language" was selected as the preferred term in the interest of standardization. Unfortunately, at about the same time the U.S. Defense Advanced Research Projects Agency (DARPA) introduced "translingual" as their preferred term, so we are still some distance from reaching consensus on this matter.
I will not attempt to draw a sharp distinction between retrieval and filtering in this survey. Although my own work on adaptive cross-language text filtering has led me to make this distinction fairly carefully in other presentations (cf. (Oard 1997b)), such an approach does little to help understand the fundamental techniques which have been applied or the results that have been obtained in this case. Since it is still common to view filtering (detection of useful documents in dynamic document streams) as a kind of retrieval, I will simply adopt that perspective here.

Theme

Semantic context in indexing and retrieval

Diaz, P.: Multilingual tools for accessing a Spanish library catalogue (1997) 0.02

0.024911826 = product of:
  0.07473548 = sum of:
    0.03554538 = weight(_text_:retrieval in 1163) [ClassicSimilarity], result of:
      0.03554538 = score(doc=1163,freq=4.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.33085006 = fieldWeight in 1163, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1163)
    0.025755936 = weight(_text_:use in 1163) [ClassicSimilarity], result of:
      0.025755936 = score(doc=1163,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.23682132 = fieldWeight in 1163, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1163)
    0.013434161 = weight(_text_:of in 1163) [ClassicSimilarity], result of:
      0.013434161 = score(doc=1163,freq=8.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.24188137 = fieldWeight in 1163, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1163)
  0.33333334 = coord(3/9)

Abstract: The use of library resources will no longer be restricted to the physical location of libraries thanks to networking technologies and standard protocols for information retrieval. These technical achievements allow users to access geographically scattered libraries but they do not ease their intellectual access. Indeed, users need a certain command of different languages to find publications whose records are written in a single language. Multilingual facilities, including multilingual presentation and retrieval, can intellectually open the library catalogue to a wider range of international users. Describes an attempt at using multilingual resources with a view to improving user OPAC interaction through the TRANSLIB project, which provides library users with advanced tools that support multilingual access.

McCulloch, E.: Multiple terminologies : an obstacle to information retrieval (2004) 0.02

0.024911826 = product of:
  0.07473548 = sum of:
    0.03554538 = weight(_text_:retrieval in 2798) [ClassicSimilarity], result of:
      0.03554538 = score(doc=2798,freq=4.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.33085006 = fieldWeight in 2798, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2798)
    0.025755936 = weight(_text_:use in 2798) [ClassicSimilarity], result of:
      0.025755936 = score(doc=2798,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.23682132 = fieldWeight in 2798, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2798)
    0.013434161 = weight(_text_:of in 2798) [ClassicSimilarity], result of:
      0.013434161 = score(doc=2798,freq=8.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.24188137 = fieldWeight in 2798, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2798)
  0.33333334 = coord(3/9)

Abstract: An issue currently at the forefront of digital library research is the prevalence of disparate terminologies and the associated limitations imposed on user searching. It is thought that semantic interoperability is achievable by improving the compatibility between terminologies and classification schemes, enabling users to search multiple resources simultaneously and improve retrieval effectiveness through the use of associated terms drawn from several schemes. This column considers the terminology issue before outlining various proposed methods of tackling it, with a particular focus on terminology mapping.

Vassilakaki, E.; Garoufallou, E.; Johnson, F.; Hartley, R.J.: An exploration of users' needs for multilingual information retrieval and access (2015) 0.02

0.024854181 = product of:
  0.07456254 = sum of:
    0.030467471 = weight(_text_:retrieval in 2394) [ClassicSimilarity], result of:
      0.030467471 = score(doc=2394,freq=4.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.2835858 = fieldWeight in 2394, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=2394)
    0.031220913 = weight(_text_:use in 2394) [ClassicSimilarity], result of:
      0.031220913 = score(doc=2394,freq=4.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.2870708 = fieldWeight in 2394, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.046875 = fieldNorm(doc=2394)
    0.012874156 = weight(_text_:of in 2394) [ClassicSimilarity], result of:
      0.012874156 = score(doc=2394,freq=10.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.23179851 = fieldWeight in 2394, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2394)
  0.33333334 = coord(3/9)

Abstract: The need for promoting Multilingual Information Retrieval (MLIR) and Access (MLIA) has become evident, now more than ever, given the increase of the online information produced daily in languages other than English. This study aims to explore users' information needs when searching for information across languages. Specifically, a questionnaire was employed to shed light on Library and Information Science (LIS) undergraduate students' use of search engines, databases, and digital libraries when searching, as well as their needs for multilingual access. This study contributes to informing the design of MLIR systems by focusing on the reasons and situations under which users would search and use information in multiple languages.

Subirats, I.; Prasad, A.R.D.; Keizer, J.; Bagdanov, A.: Implementation of rich metadata formats and semantic tools using DSpace (2008) 0.02
```
0.024395086 = product of:
  0.05488894 = sum of:
    0.014362504 = weight(_text_:retrieval in 2656) [ClassicSimilarity], result of:
      0.014362504 = score(doc=2656,freq=2.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.13368362 = fieldWeight in 2656, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.03125 = fieldNorm(doc=2656)
    0.020813942 = weight(_text_:use in 2656) [ClassicSimilarity], result of:
      0.020813942 = score(doc=2656,freq=4.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.19138055 = fieldWeight in 2656, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.03125 = fieldNorm(doc=2656)
    0.013296372 = weight(_text_:of in 2656) [ClassicSimilarity], result of:
      0.013296372 = score(doc=2656,freq=24.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.23940048 = fieldWeight in 2656, product of:
          4.8989797 = tf(freq=24.0), with freq of:
            24.0 = termFreq=24.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03125 = fieldNorm(doc=2656)
    0.0064161215 = product of:
      0.019248364 = sum of:
        0.019248364 = weight(_text_:22 in 2656) [ClassicSimilarity], result of:
          0.019248364 = score(doc=2656,freq=2.0), product of:
            0.1243752 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.035517205 = queryNorm
            0.15476047 = fieldWeight in 2656, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=2656)
      0.33333334 = coord(1/3)
  0.44444445 = coord(4/9)
```
Abstract

This poster explores the customization of DSpace to allow the use of the AGRIS Application Profile metadata standard and the AGROVOC thesaurus. The objective is the adaptation of DSpace, through the least invasive code changes either in the form of plug-ins or add-ons, to the specific needs of the Agricultural Sciences and Technology community. Metadata standards such as AGRIS AP, and Knowledge Organization Systems such as the AGROVOC thesaurus, provide mechanisms for sharing information in a standardized manner by recommending the use of common semantics and interoperable syntax (Subirats et al., 2007). AGRIS AP was created to enhance the description, exchange and subsequent retrieval of agricultural Document-like Information Objects (DLIOs). It is a metadata schema which draws from metadata standards such as Dublin Core (DC), the Australian Government Locator Service Metadata (AGLS) and the Agricultural Metadata Element Set (AgMES) namespaces. It allows sharing of information across dispersed bibliographic systems (FAO, 2005). AGROVOC is a multilingual structured thesaurus covering agricultural and related domains. Its main role is to standardize the indexing process in order to make searching simpler and more efficient. AGROVOC is developed by FAO (Lauser et al., 2006). The customization of DSpace is taking place in several phases. First, the AGRIS AP metadata schema was mapped onto the DSpace metadata model, with several enhancements implemented to support AGRIS AP elements. Next, AGROVOC will be integrated as a controlled vocabulary accessed through a local SKOS or OWL file. Eventually the system will be configurable to access AGROVOC through local files or remotely via web services. Finally, spell checking and tooltips will be incorporated in the user interface to support metadata editing.
Adapting DSpace to support AGRIS AP and annotation using the semantically rich AGROVOC thesaurus transforms DSpace into a powerful, domain-specific system for annotation and exchange of bibliographic metadata in the agricultural domain.

Source

Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas

Oard, D.W.; He, D.; Wang, J.: User-assisted query translation for interactive cross-language information retrieval (2008) 0.02

0.024088517 = product of:
  0.07226555 = sum of:
    0.03731488 = weight(_text_:retrieval in 2030) [ClassicSimilarity], result of:
      0.03731488 = score(doc=2030,freq=6.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.34732026 = fieldWeight in 2030, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=2030)
    0.022076517 = weight(_text_:use in 2030) [ClassicSimilarity], result of:
      0.022076517 = score(doc=2030,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.20298971 = fieldWeight in 2030, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.046875 = fieldNorm(doc=2030)
    0.012874156 = weight(_text_:of in 2030) [ClassicSimilarity], result of:
      0.012874156 = score(doc=2030,freq=10.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.23179851 = fieldWeight in 2030, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2030)
  0.33333334 = coord(3/9)

Abstract: Interactive Cross-Language Information Retrieval (CLIR), a process in which searcher and system collaborate to find documents that satisfy an information need regardless of the language in which those documents are written, calls for designs in which synergies between searcher and system can be leveraged so that the strengths of one can cover weaknesses of the other. This paper describes an approach that employs user-assisted query translation to help searchers better understand the system's operation. Supporting interaction and interface designs are introduced, and results from three user studies are presented. The results indicate that experienced searchers presented with this new system evolve new search strategies that make effective use of the new capabilities, that they achieve retrieval effectiveness comparable to results obtained using fully automatic techniques, and that reported satisfaction with support for cross-language searching increased. The paper concludes with a description of a freely available interactive CLIR system that incorporates lessons learned from this research.

Ye, Z.; Huang, J.X.; He, B.; Lin, H.: Mining a multilingual association dictionary from Wikipedia for cross-language information retrieval (2012) 0.02

0.023835635 = product of:
  0.0715069 = sum of:
    0.031095734 = weight(_text_:retrieval in 513) [ClassicSimilarity], result of:
      0.031095734 = score(doc=513,freq=6.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.28943354 = fieldWeight in 513, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=513)
    0.026017427 = weight(_text_:use in 513) [ClassicSimilarity], result of:
      0.026017427 = score(doc=513,freq=4.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.23922569 = fieldWeight in 513, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.0390625 = fieldNorm(doc=513)
    0.014393743 = weight(_text_:of in 513) [ClassicSimilarity], result of:
      0.014393743 = score(doc=513,freq=18.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.25915858 = fieldWeight in 513, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=513)
  0.33333334 = coord(3/9)

Abstract: Wikipedia is characterized by its dense link structure and a large number of articles in different languages, which make it a notable Web corpus for knowledge extraction and mining, in particular for mining the multilingual associations. In this paper, motivated by a psychological theory of word meaning, we propose a graph-based approach to constructing a cross-language association dictionary (CLAD) from Wikipedia, which can be used in a variety of cross-language accessing and processing applications. In order to evaluate the quality of the mined CLAD, and to demonstrate how the mined CLAD can be used in practice, we explore two different applications of the mined CLAD to cross-language information retrieval (CLIR). First, we use the mined CLAD to conduct cross-language query expansion; and, second, we use it to filter out translation candidates with low translation probabilities. Experimental results on a variety of standard CLIR test collections show that the CLIR retrieval performance can be substantially improved with the above two applications of CLAD, which indicates that the mined CLAD is of sound quality.
Source: Journal of the American Society for Information Science and Technology. 63(2012) no.12, S.2474-2487

Menard, E.: Study on the influence of vocabularies used for image indexing in a multilingual retrieval environment : reflections on scribbles (2007) 0.02

0.023158565 = product of:
  0.069475695 = sum of:
    0.03590626 = weight(_text_:retrieval in 1089) [ClassicSimilarity], result of:
      0.03590626 = score(doc=1089,freq=8.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.33420905 = fieldWeight in 1089, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1089)
    0.018397098 = weight(_text_:use in 1089) [ClassicSimilarity], result of:
      0.018397098 = score(doc=1089,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.1691581 = fieldWeight in 1089, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1089)
    0.015172338 = weight(_text_:of in 1089) [ClassicSimilarity], result of:
      0.015172338 = score(doc=1089,freq=20.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.27317715 = fieldWeight in 1089, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1089)
  0.33333334 = coord(3/9)

Abstract: For many years, the Web has been an important medium for the diffusion of multilingual resources. Linguistic differences still form a major obstacle to scientific, cultural, and educational exchange. Besides this linguistic diversity, a multitude of databases and collections now contain documents in various formats, which may also adversely affect the retrieval process. This paper describes a research project aiming to verify the existing relations between two indexing approaches: traditional image indexing recommending the use of controlled vocabularies or free image indexing using uncontrolled vocabulary, and their respective performance for image retrieval, in a multilingual context. This research also compares image retrieval within two contexts: a monolingual context where the language of the query is the same as the indexing language; and a multilingual context where the language of the query is different from the indexing language. This research will indicate whether one of these indexing approaches surpasses the other, in terms of effectiveness, efficiency, and satisfaction of the image searchers. This paper presents the context and the problem statement of the research project. The experiment carried out is also described, as well as the data collection methods.

Pollitt, A.S.; Ellis, G.: Multilingual access to document databases (1993) 0.02

0.023121227 = product of:
  0.069363676 = sum of:
    0.03731488 = weight(_text_:retrieval in 1302) [ClassicSimilarity], result of:
      0.03731488 = score(doc=1302,freq=6.0), product of:
        0.10743652 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.035517205 = queryNorm
        0.34732026 = fieldWeight in 1302, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=1302)
    0.022076517 = weight(_text_:use in 1302) [ClassicSimilarity], result of:
      0.022076517 = score(doc=1302,freq=2.0), product of:
        0.10875683 = queryWeight, product of:
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.035517205 = queryNorm
        0.20298971 = fieldWeight in 1302, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.0620887 = idf(docFreq=5623, maxDocs=44218)
          0.046875 = fieldNorm(doc=1302)
    0.009972278 = weight(_text_:of in 1302) [ClassicSimilarity], result of:
      0.009972278 = score(doc=1302,freq=6.0), product of:
        0.05554029 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035517205 = queryNorm
        0.17955035 = fieldWeight in 1302, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=1302)
  0.33333334 = coord(3/9)

Abstract: This paper examines the reasons why approaches to facilitate document retrieval which apply AI (Artificial Intelligence) or Expert Systems techniques, relying on so-called "natural language" query statements from the end-user, will result in sub-optimal solutions. It does so by reflecting on the nature of language and the fundamental problems in document retrieval. Support is given to the work of thesaurus builders and indexers with illustrations of how their work may be utilised in a generally applicable computer-based document retrieval system using Multilingual MenUSE software. The EuroMenUSE interface, providing multilingual document access to EPOQUE, the European Parliament's Online Query System, is described.
Source: Information as a Global Commodity - Communication, Processing and Use (CAIS/ACSI '93) : 21st Annual Conference Canadian Association for Information Science, Antigonish, Nova Scotia, Canada. July 1993
