Search (188 results, page 1 of 10)

  • × language_ss:"e"
  • × theme_ss:"Multilinguale Probleme"
  1. Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.10
    0.099763565 = product of:
      0.12470446 = sum of:
        0.015349891 = weight(_text_:information in 4157) [ClassicSimilarity], result of:
          0.015349891 = score(doc=4157,freq=2.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.19395474 = fieldWeight in 4157, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
        0.02446797 = weight(_text_:und in 4157) [ClassicSimilarity], result of:
          0.02446797 = score(doc=4157,freq=2.0), product of:
            0.09991972 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.045082662 = queryNorm
            0.24487628 = fieldWeight in 4157, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
        0.054346204 = product of:
          0.10869241 = sum of:
            0.10869241 = weight(_text_:dokumentation in 4157) [ClassicSimilarity], result of:
              0.10869241 = score(doc=4157,freq=2.0), product of:
                0.21059684 = queryWeight, product of:
                  4.671349 = idf(docFreq=1124, maxDocs=44218)
                  0.045082662 = queryNorm
                0.516116 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.671349 = idf(docFreq=1124, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
          0.5 = coord(1/2)
        0.030540392 = product of:
          0.061080784 = sum of:
            0.061080784 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
              0.061080784 = score(doc=4157,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.38690117 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
          0.5 = coord(1/2)
      0.8 = coord(4/5)
    
    Source
    Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill
  2. Stiller, J.; Gäde, M.; Petras, V.: Multilingual access to digital libraries : the Europeana use case (2013) 0.02
    0.019617343 = product of:
      0.049043357 = sum of:
        0.010744925 = weight(_text_:information in 902) [ClassicSimilarity], result of:
          0.010744925 = score(doc=902,freq=2.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.13576832 = fieldWeight in 902, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=902)
        0.03829843 = weight(_text_:und in 902) [ClassicSimilarity], result of:
          0.03829843 = score(doc=902,freq=10.0), product of:
            0.09991972 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.045082662 = queryNorm
            0.38329202 = fieldWeight in 902, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.0546875 = fieldNorm(doc=902)
      0.4 = coord(2/5)
    
    Abstract
    Der Artikel fasst Komponenten für einen mehrsprachigen Zugang in digitalen Bibliotheken zusammen. Dabei wird der Fokus auf Bibliotheken für das digitale Kulturerbe gelegt. Eine Analyse aktueller (existierender) Informationssysteme im sogenannten GLAM-Bereich (Galerien, Bibliotheken, Archive, Museen) beschreibt angewandte Lösungen für die Recherche (Suchen und Blättern) von und die Interaktion mit mehrsprachigen Inhalten. Europeana, die europäische digitale Bibliothek für Kulturerbe, wird als Fallbeispiel hervorgehoben und es werden beispielhaft Interaktionsszenarios für die mehrsprachige Recherche vorgestellt. Die Herausforderungen in der Implementierung von Komponenten für den mehrsprachigen Informationszugang sowie Empfehlungen für den verbesserten Einsatz werden vorgestellt und diskutiert.
    Source
    Information - Wissenschaft und Praxis. 64(2013) H.2/3, S.86-95
  3. Celli, F. et al.: Enabling multilingual search through controlled vocabularies : the AGRIS approach (2016) 0.02
    0.018356113 = product of:
      0.045890283 = sum of:
        0.015349891 = weight(_text_:information in 3278) [ClassicSimilarity], result of:
          0.015349891 = score(doc=3278,freq=2.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.19395474 = fieldWeight in 3278, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.078125 = fieldNorm(doc=3278)
        0.030540392 = product of:
          0.061080784 = sum of:
            0.061080784 = weight(_text_:22 in 3278) [ClassicSimilarity], result of:
              0.061080784 = score(doc=3278,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.38690117 = fieldWeight in 3278, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3278)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Series
    Communications in computer and information science; 672
    Source
    Metadata and semantics research: 10th International Conference, MTSR 2016, Göttingen, Germany, November 22-25, 2016, Proceedings. Eds.: E. Garoufallou
  4. Park, J.-r.: Cross-lingual name and subject access : mechanisms and challenge (2007) 0.02
    0.016719494 = product of:
      0.041798733 = sum of:
        0.01736642 = weight(_text_:information in 255) [ClassicSimilarity], result of:
          0.01736642 = score(doc=255,freq=4.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.21943474 = fieldWeight in 255, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0625 = fieldNorm(doc=255)
        0.024432313 = product of:
          0.048864625 = sum of:
            0.048864625 = weight(_text_:22 in 255) [ClassicSimilarity], result of:
              0.048864625 = score(doc=255,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.30952093 = fieldWeight in 255, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=255)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    This paper considers issues surrounding name and subject access across languages and cultures, particularly mechanisms and knowledge organization tools (e.g., cataloging, metadata) for cross-lingual information access. The author examines current mechanisms for cross-lingual name and subject access and identifies major factors that hinder cross-lingual information access. The author provides examples from the Korean language that demonstrate the problems with cross-language name and subject access.
    Date
    10. 9.2000 17:38:22
  5. Chen, H.-H.; Lin, W.-C.; Yang, C.; Lin, W.-H.: Translating-transliterating named entities for multilingual information access (2006) 0.02
    0.015995612 = product of:
      0.039989028 = sum of:
        0.018610755 = weight(_text_:information in 1080) [ClassicSimilarity], result of:
          0.018610755 = score(doc=1080,freq=6.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.23515764 = fieldWeight in 1080, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1080)
        0.021378273 = product of:
          0.042756546 = sum of:
            0.042756546 = weight(_text_:22 in 1080) [ClassicSimilarity], result of:
              0.042756546 = score(doc=1080,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.2708308 = fieldWeight in 1080, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1080)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    4. 6.2006 19:52:22
    Footnote
    Beitrag einer special topic section on multilingual information systems
    Source
    Journal of the American Society for Information Science and Technology. 57(2006) no.5, S.645-659
  6. De Luca, E.W.; Dahlberg, I.: Including knowledge domains from the ICC into the multilingual lexical linked data cloud (2014) 0.01
    0.014778084 = product of:
      0.03694521 = sum of:
        0.015349891 = weight(_text_:information in 1493) [ClassicSimilarity], result of:
          0.015349891 = score(doc=1493,freq=8.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.19395474 = fieldWeight in 1493, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1493)
        0.021595318 = product of:
          0.043190636 = sum of:
            0.043190636 = weight(_text_:22 in 1493) [ClassicSimilarity], result of:
              0.043190636 = score(doc=1493,freq=4.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.27358043 = fieldWeight in 1493, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1493)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    A lot of information that is already available on the Web, or retrieved from local information systems and social networks is structured in data silos that are not semantically related. Semantic technologies make it emerge that the use of typed links that directly express their relations are an advantage for every application that can reuse the incorporated knowledge about the data. For this reason, data integration, through reengineering (e.g. triplify), or querying (e.g. D2R) is an important task in order to make information available for everyone. Thus, in order to build a semantic map of the data, we need knowledge about data items itself and the relation between heterogeneous data items. In this paper, we present our work of providing Lexical Linked Data (LLD) through a meta-model that contains all the resources and gives the possibility to retrieve and navigate them from different perspectives. We combine the existing work done on knowledge domains (based on the Information Coding Classification) within the Multilingual Lexical Linked Data Cloud (based on the RDF/OWL EurowordNet and the related integrated lexical resources (MultiWordNet, EuroWordNet, MEMODATA Lexicon, Hamburg Methaphor DB).
    Date
    22. 9.2014 19:01:18
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  7. Frâncu, V.; Sabo, C.-N.: Implementation of a UDC-based multilingual thesaurus in a library catalogue : the case of BiblioPhil (2010) 0.01
    0.014697641 = product of:
      0.036744103 = sum of:
        0.018419871 = weight(_text_:information in 3697) [ClassicSimilarity], result of:
          0.018419871 = score(doc=3697,freq=8.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.23274569 = fieldWeight in 3697, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=3697)
        0.018324234 = product of:
          0.036648467 = sum of:
            0.036648467 = weight(_text_:22 in 3697) [ClassicSimilarity], result of:
              0.036648467 = score(doc=3697,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.23214069 = fieldWeight in 3697, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3697)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    In order to enhance the use of Universal Decimal Classification (UDC) numbers in information retrieval, the authors have represented classification with multilingual thesaurus descriptors and implemented this solution in an automated way. The authors illustrate a solution implemented in a BiblioPhil library system. The standard formats used are UNIMARC for subject authority records (i.e. the UDC-based multilingual thesaurus) and MARC XML support for data transfer. The multilingual thesaurus was built according to existing standards, the constituent parts of the classification notations being used as the basis for search terms in the multilingual information retrieval. The verbal equivalents, descriptors and non-descriptors, are used to expand the number of concepts and are given in Romanian, English and French. This approach saves the time of the indexer and provides more user-friendly and easier access to the bibliographic information. The multilingual aspect of the thesaurus enhances information access for a greater number of online users
    Date
    22. 7.2010 20:40:56
  8. Sieglerschmidt, J.: Convergence of internet services in the cultural heritage sector : the long way to common vocabularies, metadata formats, ontologies (2008) 0.01
    0.014109929 = product of:
      0.035274822 = sum of:
        0.020594042 = weight(_text_:information in 1686) [ClassicSimilarity], result of:
          0.020594042 = score(doc=1686,freq=10.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.2602176 = fieldWeight in 1686, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=1686)
        0.014680781 = weight(_text_:und in 1686) [ClassicSimilarity], result of:
          0.014680781 = score(doc=1686,freq=2.0), product of:
            0.09991972 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.045082662 = queryNorm
            0.14692576 = fieldWeight in 1686, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.046875 = fieldNorm(doc=1686)
      0.4 = coord(2/5)
    
    Abstract
    Since several years it has been observed that information offered by different knowledge producing institutions on the internet is more and more interlinked. This tendency will increase, because the fragmented information offers on the internet make the retrieval of information difficult as even impossible. At the same time the quantity of information offered on the internet grows exponentially in Europe - and elsewhere - due to many digitization projects. Insofar as funding institutions base the acceptance of projects on the observation of certain documentation standards the knowledge created will be retrievable and will remain so for a longer time. Otherwise the retrieval of information will become a matter of chance due to the limits of fragmented, knowledge producing social groups.
    Source
    Kompatibilität, Medien und Ethik in der Wissensorganisation - Compatibility, Media and Ethics in Knowledge Organization: Proceedings der 10. Tagung der Deutschen Sektion der Internationalen Gesellschaft für Wissensorganisation Wien, 3.-5. Juli 2006 - Proceedings of the 10th Conference of the German Section of the International Society of Knowledge Organization Vienna, 3-5 July 2006. Ed.: H.P. Ohly, S. Netscher u. K. Mitgutsch
  9. Bian, G.-W.; Chen, H.-H.: Cross-language information access to multilingual collections on the Internet (2000) 0.01
    0.013710524 = product of:
      0.03427631 = sum of:
        0.015952077 = weight(_text_:information in 4436) [ClassicSimilarity], result of:
          0.015952077 = score(doc=4436,freq=6.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.20156369 = fieldWeight in 4436, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=4436)
        0.018324234 = product of:
          0.036648467 = sum of:
            0.036648467 = weight(_text_:22 in 4436) [ClassicSimilarity], result of:
              0.036648467 = score(doc=4436,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.23214069 = fieldWeight in 4436, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4436)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Language barrier is the major problem that people face in searching for, retrieving, and understanding multilingual collections on the Internet. This paper deals with query translation and document translation in a Chinese-English information retrieval system called MTIR. Bilingual dictionary and monolingual corpus-based approaches are adopted to select suitable tranlated query terms. A machine transliteration algorithm is introduced to resolve proper name searching. We consider several design issues for document translation, including which material is translated, what roles the HTML tags play in translation, what the tradeoff is between the speed performance and the translation performance, and what from the translated result is presented in. About 100.000 Web pages translated in the last 4 months of 1997 are used for quantitative study of online and real-time Web page translation
    Date
    16. 2.2000 14:22:39
    Source
    Journal of the American Society for Information Science. 51(2000) no.3, S.281-296
  10. Seo, H.-C.; Kim, S.-B.; Rim, H.-C.; Myaeng, S.-H.: lmproving query translation in English-Korean Cross-language information retrieval (2005) 0.01
    0.013710524 = product of:
      0.03427631 = sum of:
        0.015952077 = weight(_text_:information in 1023) [ClassicSimilarity], result of:
          0.015952077 = score(doc=1023,freq=6.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.20156369 = fieldWeight in 1023, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=1023)
        0.018324234 = product of:
          0.036648467 = sum of:
            0.036648467 = weight(_text_:22 in 1023) [ClassicSimilarity], result of:
              0.036648467 = score(doc=1023,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.23214069 = fieldWeight in 1023, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1023)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Query translation is a viable method for cross-language information retrieval (CLIR), but it suffers from translation ambiguities caused by multiple translations of individual query terms. Previous research has employed various methods for disambiguation, including the method of selecting an individual target query term from multiple candidates by comparing their statistical associations with the candidate translations of other query terms. This paper proposes a new method where we examine all combinations of target query term translations corresponding to the source query terms, instead of looking at the candidates for each query term and selecting the best one at a time. The goodness value for a combination of target query terms is computed based on the association value between each pair of the terms in the combination. We tested our method using the NTCIR-3 English-Korean CLIR test collection. The results show some improvements regardless of the association measures we used.
    Date
    26.12.2007 20:22:38
    Source
    Information processing and management. 41(2005) no.3, S.507-522
  11. Freitas-Junior, H.R.; Ribeiro-Neto, B.A.; Freitas-Vale, R. de; Laender, A.H.F.; Lima, L.R.S. de: Categorization-driven cross-language retrieval of medical information (2006) 0.01
    0.0136279585 = product of:
      0.034069896 = sum of:
        0.018799702 = weight(_text_:information in 5282) [ClassicSimilarity], result of:
          0.018799702 = score(doc=5282,freq=12.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.23754507 = fieldWeight in 5282, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5282)
        0.015270196 = product of:
          0.030540392 = sum of:
            0.030540392 = weight(_text_:22 in 5282) [ClassicSimilarity], result of:
              0.030540392 = score(doc=5282,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.19345059 = fieldWeight in 5282, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5282)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    The Web has become a large repository of documents (or pages) written in many different languages. In this context, traditional information retrieval (IR) techniques cannot be used whenever the user query and the documents being retrieved are in different languages. To address this problem, new cross-language information retrieval (CLIR) techniques have been proposed. In this work, we describe a method for cross-language retrieval of medical information. This method combines query terms and related medical concepts obtained automatically through a categorization procedure. The medical concepts are used to create a linguistic abstraction that allows retrieval of information in a language-independent way, minimizing linguistic problems such as polysemy. To evaluate our method, we carried out experiments using the OHSUMED test collection, whose documents are written in English, with queries expressed in Portuguese, Spanish, and French. The results indicate that our cross-language retrieval method is as effective as a standard vector space model algorithm operating on queries and documents in the same language. Further, our results are better than previous results in the literature.
    Date
    22. 7.2006 16:46:36
    Source
    Journal of the American Society for Information Science and Technology. 57(2006) no.4, S.501-510
  12. Levergood, B.; Farrenkopf, S.; Frasnelli, E.: ¬The specification of the language of the field and interoperability : cross-language access to catalogues and online libraries (CACAO) (2008) 0.01
    0.01253962 = product of:
      0.031349048 = sum of:
        0.013024815 = weight(_text_:information in 2646) [ClassicSimilarity], result of:
          0.013024815 = score(doc=2646,freq=4.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.16457605 = fieldWeight in 2646, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=2646)
        0.018324234 = product of:
          0.036648467 = sum of:
            0.036648467 = weight(_text_:22 in 2646) [ClassicSimilarity], result of:
              0.036648467 = score(doc=2646,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.23214069 = fieldWeight in 2646, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2646)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    The CACAO Project (Cross-language Access to Catalogues and Online Libraries) has been designed to implement natural language processing and cross-language information retrieval techniques to provide cross-language access to information in libraries, a critical issue in the linguistically diverse European Union. This project report addresses two metadata-related challenges for the library community in this context: "false friends" (identical words having different meanings in different languages) and term ambiguity. The possible solutions involve enriching the metadata with attributes specifying language or the source authority file, or associating potential search terms to classes in a classification system. The European Library will evaluate an early implementation of this work in late 2008.
    Source
    Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas
  13. Larkey, L.S.; Connell, M.E.: Structured queries, language modelling, and relevance modelling in cross-language information retrieval (2005) 0.01
    0.011425437 = product of:
      0.028563593 = sum of:
        0.013293398 = weight(_text_:information in 1022) [ClassicSimilarity], result of:
          0.013293398 = score(doc=1022,freq=6.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.16796975 = fieldWeight in 1022, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1022)
        0.015270196 = product of:
          0.030540392 = sum of:
            0.030540392 = weight(_text_:22 in 1022) [ClassicSimilarity], result of:
              0.030540392 = score(doc=1022,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.19345059 = fieldWeight in 1022, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1022)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Two probabilistic approaches to cross-lingual retrieval are in wide use today, those based on probabilistic models of relevance, as exemplified by INQUERY, and those based on language modeling. INQUERY, as a query net model, allows the easy incorporation of query operators, including a synonym operator, which has proven to be extremely useful in cross-language information retrieval (CLIR), in an approach often called structured query translation. In contrast, language models incorporate translation probabilities into a unified framework. We compare the two approaches on Arabic and Spanish data sets, using two kinds of bilingual dictionaries--one derived from a conventional dictionary, and one derived from a parallel corpus. We find that structured query processing gives slightly better results when queries are not expanded. On the other hand, when queries are expanded, language modeling gives better results, but only when using a probabilistic dictionary derived from a parallel corpus. We pursue two additional issues inherent in the comparison of structured query processing with language modeling. The first concerns query expansion, and the second is the role of translation probabilities. We compare conventional expansion techniques (pseudo-relevance feedback) with relevance modeling, a new IR approach which fits into the formal framework of language modeling. We find that relevance modeling and pseudo-relevance feedback achieve comparable levels of retrieval and that good translation probabilities confer a small but significant advantage.
    Date
    26.12.2007 20:22:11
    Source
    Information processing and management. 41(2005) no.3, S.457-474
  14. Li, K.W.; Yang, C.C.: Automatic crosslingual thesaurus generated from the Hong Kong SAR Police Department Web Corpus for Crime Analysis (2005) 0.01
    0.011282823 = product of:
      0.028207056 = sum of:
        0.01841987 = weight(_text_:information in 3391) [ClassicSimilarity], result of:
          0.01841987 = score(doc=3391,freq=18.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.23274568 = fieldWeight in 3391, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.03125 = fieldNorm(doc=3391)
        0.009787188 = weight(_text_:und in 3391) [ClassicSimilarity], result of:
          0.009787188 = score(doc=3391,freq=2.0), product of:
            0.09991972 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.045082662 = queryNorm
            0.09795051 = fieldWeight in 3391, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.03125 = fieldNorm(doc=3391)
      0.4 = coord(2/5)
    
    Abstract
    For the sake of national security, very large volumes of data and information are generated and gathered daily. Much of this data and information is written in different languages, stored in different locations, and may be seemingly unconnected. Crosslingual semantic interoperability is a major challenge to generate an overview of this disparate data and information so that it can be analyzed, shared, searched, and summarized. The recent terrorist attacks and the tragic events of September 11, 2001 have prompted increased attention an national security and criminal analysis. Many Asian countries and cities, such as Japan, Taiwan, and Singapore, have been advised that they may become the next targets of terrorist attacks. Semantic interoperability has been a focus in digital library research. Traditional information retrieval (IR) approaches normally require a document to share some common keywords with the query. Generating the associations for the related terms between the two term spaces of users and documents is an important issue. The problem can be viewed as the creation of a thesaurus. Apart from this, terrorists and criminals may communicate through letters, e-mails, and faxes in languages other than English. The translation ambiguity significantly exacerbates the retrieval problem. The problem is expanded to crosslingual semantic interoperability. In this paper, we focus an the English/Chinese crosslingual semantic interoperability problem. However, the developed techniques are not limited to English and Chinese languages but can be applied to many other languages. English and Chinese are popular languages in the Asian region. Much information about national security or crime is communicated in these languages. An efficient automatically generated thesaurus between these languages is important to crosslingual information retrieval between English and Chinese languages. To facilitate crosslingual information retrieval, a corpus-based approach uses the term co-occurrence statistics in parallel or comparable corpora to construct a statistical translation model to cross the language boundary. In this paper, the text based approach to align English/Chinese Hong Kong Police press release documents from the Web is first presented. We also introduce an algorithmic approach to generate a robust knowledge base based an statistical correlation analysis of the semantics (knowledge) embedded in the bilingual press release corpus. The research output consisted of a thesaurus-like, semantic network knowledge base, which can aid in semanticsbased crosslingual information management and retrieval.
    Source
    Journal of the American Society for Information Science and Technology. 56(2005) no.3, S.272-281
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  15. Hudon, M.: Multilingual thesaurus construction : integrating the views of different cultures in one gateway to knowledge and concepts (1997) 0.01
    0.011149001 = product of:
      0.027872503 = sum of:
        0.010744925 = weight(_text_:information in 1804) [ClassicSimilarity], result of:
          0.010744925 = score(doc=1804,freq=2.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.13576832 = fieldWeight in 1804, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1804)
        0.01712758 = weight(_text_:und in 1804) [ClassicSimilarity], result of:
          0.01712758 = score(doc=1804,freq=2.0), product of:
            0.09991972 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.045082662 = queryNorm
            0.17141339 = fieldWeight in 1804, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1804)
      0.4 = coord(2/5)
    
    Source
    Information services and use. 17(1997) nos.2/3, S.111-123
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  16. Keränen, S.: Equivalence and focus of translation in multicultural thesaurus construction (2006) 0.01
    0.011149001 = product of:
      0.027872503 = sum of:
        0.010744925 = weight(_text_:information in 237) [ClassicSimilarity], result of:
          0.010744925 = score(doc=237,freq=2.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.13576832 = fieldWeight in 237, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=237)
        0.01712758 = weight(_text_:und in 237) [ClassicSimilarity], result of:
          0.01712758 = score(doc=237,freq=2.0), product of:
            0.09991972 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.045082662 = queryNorm
            0.17141339 = fieldWeight in 237, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.0546875 = fieldNorm(doc=237)
      0.4 = coord(2/5)
    
    Abstract
    This paper reports a part of an on-going PhD study on problems related to multicultural social science thesaurus construction in the general frame of information science. The main analysis methods used are discourse analysis and co-word analysis. In theoretical framework the emphasis is on communicative equivalence theories and different aims of thesaurus translation are discussed. Some examples are given how co-word analysis can be used to study contextual equivalence.
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  17. Hudon, M.: Relationships in multilingual thesauri (2001) 0.01
    0.0110822385 = product of:
      0.027705597 = sum of:
        0.013024815 = weight(_text_:information in 1147) [ClassicSimilarity], result of:
          0.013024815 = score(doc=1147,freq=4.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.16457605 = fieldWeight in 1147, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=1147)
        0.014680781 = weight(_text_:und in 1147) [ClassicSimilarity], result of:
          0.014680781 = score(doc=1147,freq=2.0), product of:
            0.09991972 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.045082662 = queryNorm
            0.14692576 = fieldWeight in 1147, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.046875 = fieldNorm(doc=1147)
      0.4 = coord(2/5)
    
    Abstract
    Because the multilingual thesaurus has a critical role to play in the global networked information world, its relational structure must come under close scrutiny. Traditionally, identity of relational structures has been sought for the different language versions of a multilingual thesaurus, often leading to the artificialization of all target languages. The various types of cross-lingual and intralingual relations found in thesauri are examined in the context of two questions: Are all types of thesaural relations transferable from one language to another? and Are the two members of a valid relation in a source language always the same in the target language(s)? Two options for resolving semantic conflicts in multilingual thesauri are presented.
    Series
    Information science and knowledge management; vol.2
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  18. Jorna, K.; Davies, S.: Multilingual thesauri for the modern world : no ideal solution? (2001) 0.01
    0.0110822385 = product of:
      0.027705597 = sum of:
        0.013024815 = weight(_text_:information in 4486) [ClassicSimilarity], result of:
          0.013024815 = score(doc=4486,freq=4.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.16457605 = fieldWeight in 4486, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=4486)
        0.014680781 = weight(_text_:und in 4486) [ClassicSimilarity], result of:
          0.014680781 = score(doc=4486,freq=2.0), product of:
            0.09991972 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.045082662 = queryNorm
            0.14692576 = fieldWeight in 4486, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.046875 = fieldNorm(doc=4486)
      0.4 = coord(2/5)
    
    Abstract
    In the 21st century, multilingual tools are gaining importance as increasingly diverse user groups from different cultural and linguistic backgrounds seek access to equally diverse pieces of information. The authors of this paper believe that most current forms of multilingual information access are inadequate for this role, and that a new form of multilingual thesaurus is required. The core of this paper introduces their pilot thesaurus InfoDEFT as a possible model for new online thesauri, which are semantically structured, encyclopedic and multilingual. The authors conclude that while the manual construction of such thesauri is labour intensive and hence costly, pilot thesauri can be used as training sets for artificial learning programmes, thus increasing their volume considerably at relatively little extra cost.
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  19. Fluhr, C.: Crosslingual access to photo databases (2012) 0.01
    0.011013667 = product of:
      0.027534168 = sum of:
        0.009209936 = weight(_text_:information in 93) [ClassicSimilarity], result of:
          0.009209936 = score(doc=93,freq=2.0), product of:
            0.07914162 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.045082662 = queryNorm
            0.116372846 = fieldWeight in 93, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=93)
        0.018324234 = product of:
          0.036648467 = sum of:
            0.036648467 = weight(_text_:22 in 93) [ClassicSimilarity], result of:
              0.036648467 = score(doc=93,freq=2.0), product of:
                0.1578718 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045082662 = queryNorm
                0.23214069 = fieldWeight in 93, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=93)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Date
    17. 4.2012 14:25:22
    Source
    Next generation search engines: advanced models for information retrieval. Eds.: C. Jouis, u.a
  20. Lehtinen, R.; Clavel-Merrin, G.: Mehrsprachige und verschiedenartige Daten in Bibliothekssystemen und -netzen : Erfahrungen und Perspektiven aus der Schweiz und Finnland (1998) 0.01
    0.009787188 = product of:
      0.04893594 = sum of:
        0.04893594 = weight(_text_:und in 4186) [ClassicSimilarity], result of:
          0.04893594 = score(doc=4186,freq=8.0), product of:
            0.09991972 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.045082662 = queryNorm
            0.48975256 = fieldWeight in 4186, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.078125 = fieldNorm(doc=4186)
      0.2 = coord(1/5)
    

Years

Types

  • a 174
  • el 14
  • m 2
  • r 2
  • s 2
  • x 2
  • n 1
  • More… Less…