Search (5 results, page 1 of 1)

  • theme_ss:"Multilinguale Probleme"
  • theme_ss:"Semantische Interoperabilität"
  • year_i:[2000 TO 2010}
  1. Li, K.W.; Yang, C.C.: Automatic crosslingual thesaurus generated from the Hong Kong SAR Police Department Web Corpus for Crime Analysis (2005) 0.05
    0.05124729 = product of:
      0.15374187 = sum of:
        0.12688403 = weight(_text_:english in 3391) [ClassicSimilarity], result of:
          0.12688403 = score(doc=3391,freq=12.0), product of:
            0.21787451 = queryWeight, product of:
              5.3797226 = idf(docFreq=553, maxDocs=44218)
              0.04049921 = queryNorm
            0.58237207 = fieldWeight in 3391, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              5.3797226 = idf(docFreq=553, maxDocs=44218)
              0.03125 = fieldNorm(doc=3391)
        0.026857845 = product of:
          0.05371569 = sum of:
            0.05371569 = weight(_text_:countries in 3391) [ClassicSimilarity], result of:
              0.05371569 = score(doc=3391,freq=2.0), product of:
                0.22186631 = queryWeight, product of:
                  5.478287 = idf(docFreq=501, maxDocs=44218)
                  0.04049921 = queryNorm
                0.24210837 = fieldWeight in 3391, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.478287 = idf(docFreq=501, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3391)
          0.5 = coord(1/2)
      0.33333334 = coord(2/6)
    
    Abstract
    For the sake of national security, very large volumes of data and information are generated and gathered daily. Much of this data and information is written in different languages, stored in different locations, and may be seemingly unconnected. Crosslingual semantic interoperability is a major challenge in generating an overview of this disparate data and information so that it can be analyzed, shared, searched, and summarized. The recent terrorist attacks and the tragic events of September 11, 2001 have prompted increased attention on national security and criminal analysis. Many Asian countries and cities, such as Japan, Taiwan, and Singapore, have been advised that they may become the next targets of terrorist attacks. Semantic interoperability has been a focus in digital library research. Traditional information retrieval (IR) approaches normally require a document to share some common keywords with the query. Generating the associations for the related terms between the two term spaces of users and documents is an important issue. The problem can be viewed as the creation of a thesaurus. Apart from this, terrorists and criminals may communicate through letters, e-mails, and faxes in languages other than English. The translation ambiguity significantly exacerbates the retrieval problem. The problem is expanded to crosslingual semantic interoperability. In this paper, we focus on the English/Chinese crosslingual semantic interoperability problem. However, the developed techniques are not limited to English and Chinese languages but can be applied to many other languages. English and Chinese are popular languages in the Asian region. Much information about national security or crime is communicated in these languages. An efficient automatically generated thesaurus between these languages is important to crosslingual information retrieval between English and Chinese languages.
To facilitate crosslingual information retrieval, a corpus-based approach uses the term co-occurrence statistics in parallel or comparable corpora to construct a statistical translation model to cross the language boundary. In this paper, the text-based approach to align English/Chinese Hong Kong Police press release documents from the Web is first presented. We also introduce an algorithmic approach to generate a robust knowledge base based on statistical correlation analysis of the semantics (knowledge) embedded in the bilingual press release corpus. The research output consisted of a thesaurus-like, semantic network knowledge base, which can aid in semantics-based crosslingual information management and retrieval.
  2. Wang, S.; Isaac, A.; Schopman, B.; Schlobach, S.; Meij, L. van der: Matching multilingual subject vocabularies (2009) 0.01
    0.012950047 = product of:
      0.07770028 = sum of:
        0.07770028 = weight(_text_:english in 3035) [ClassicSimilarity], result of:
          0.07770028 = score(doc=3035,freq=2.0), product of:
            0.21787451 = queryWeight, product of:
              5.3797226 = idf(docFreq=553, maxDocs=44218)
              0.04049921 = queryNorm
            0.3566286 = fieldWeight in 3035, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.3797226 = idf(docFreq=553, maxDocs=44218)
              0.046875 = fieldNorm(doc=3035)
      0.16666667 = coord(1/6)
    
    Abstract
    Most libraries and other cultural heritage institutions use controlled knowledge organisation systems, such as thesauri, to describe their collections. Unfortunately, as most of these institutions use different such systems, united access to heterogeneous collections is difficult. Things are even worse in an international context when concepts have labels in different languages. In order to overcome the multilingual interoperability problem between European Libraries, extensive work has been done to manually map concepts from different knowledge organisation systems, which is a tedious and expensive process. Within the TELplus project, we developed and evaluated methods to automatically discover these mappings, using different ontology matching techniques. In experiments on major French, English and German subject heading lists Rameau, LCSH and SWD, we show that we can automatically produce mappings of surprisingly good quality, even when using relatively naive translation and matching methods.
  3. Dini, L.: CACAO : multilingual access to bibliographic records (2007) 0.01
    0.005487084 = product of:
      0.032922503 = sum of:
        0.032922503 = product of:
          0.065845005 = sum of:
            0.065845005 = weight(_text_:22 in 126) [ClassicSimilarity], result of:
              0.065845005 = score(doc=126,freq=2.0), product of:
                0.14182134 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04049921 = queryNorm
                0.46428138 = fieldWeight in 126, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=126)
          0.5 = coord(1/2)
      0.16666667 = coord(1/6)
    
    Content
    Presentation given at the workshop: "Extending the multilingual capacity of The European Library in the EDL project Stockholm, Swedish National Library, 22-23 November 2007".
  4. Landry, P.: MACS: multilingual access to subject and link management : Extending the Multilingual Capacity of TEL in the EDL Project (2007) 0.00
    0.0045725703 = product of:
      0.027435422 = sum of:
        0.027435422 = product of:
          0.054870844 = sum of:
            0.054870844 = weight(_text_:22 in 1287) [ClassicSimilarity], result of:
              0.054870844 = score(doc=1287,freq=2.0), product of:
                0.14182134 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04049921 = queryNorm
                0.38690117 = fieldWeight in 1287, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1287)
          0.5 = coord(1/2)
      0.16666667 = coord(1/6)
    
    Content
    Presentation given at the workshop: "Extending the multilingual capacity of The European Library in the EDL project Stockholm, Swedish National Library, 22-23 November 2007".
  5. Levergood, B.; Farrenkopf, S.; Frasnelli, E.: The specification of the language of the field and interoperability : cross-language access to catalogues and online libraries (CACAO) (2008) 0.00
    0.002743542 = product of:
      0.016461251 = sum of:
        0.016461251 = product of:
          0.032922503 = sum of:
            0.032922503 = weight(_text_:22 in 2646) [ClassicSimilarity], result of:
              0.032922503 = score(doc=2646,freq=2.0), product of:
                0.14182134 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04049921 = queryNorm
                0.23214069 = fieldWeight in 2646, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2646)
          0.5 = coord(1/2)
      0.16666667 = coord(1/6)
    
    Source
    Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas
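
Note on the score breakdowns: the explain trees above can be checked by hand. The sketch below recomputes the scores of results 1 and 3 from the values printed in the listing. It assumes the usual ClassicSimilarity formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the tf and idf values shown above. The function and variable names are illustrative only and are not part of the search system.

```python
import math

MAX_DOCS = 44218          # maxDocs, as printed in the explain output
QUERY_NORM = 0.04049921   # queryNorm, as printed in the explain output


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency (assumed ClassicSimilarity form)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_weight(freq, doc_freq, field_norm):
    """Score of one term in one document: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                        # tf(freq)
    term_idf = idf(doc_freq)                    # idf(docFreq, maxDocs)
    query_weight = term_idf * QUERY_NORM        # queryWeight
    field_weight = tf * term_idf * field_norm   # fieldWeight
    return query_weight * field_weight


# Result 1 (doc 3391): "english" plus "countries", the latter scaled by coord(1/2),
# and the sum scaled by coord(2/6).
english = term_weight(12.0, 553, 0.03125)
countries = term_weight(2.0, 501, 0.03125) * (1 / 2)
score_result_1 = (english + countries) * (2 / 6)

# Result 3 (doc 126): the single term "22", scaled by coord(1/2) and coord(1/6).
score_result_3 = term_weight(2.0, 3622, 0.09375) * (1 / 2) * (1 / 6)

print(f"result 1: {score_result_1:.4f}")
print(f"result 3: {score_result_3:.4f}")
```

The recomputed values agree with the listed scores (0.05124729 and 0.005487084) to within rounding; small differences in the last digits are expected because the search engine works in single-precision floats. Results 3 to 5 match the same term with the same frequency, so their ranking is determined by fieldNorm alone, which is larger for shorter fields.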