Search (8 results, page 1 of 1)

  • × theme_ss:"Konzeption und Anwendung des Prinzips Thesaurus"
  • × theme_ss:"Semantische Interoperabilität"
  1. Huckstorf, A.; Petras, V.: Mind the lexical gap : EuroVoc Building Block of the Semantic Web (2011) 0.07
    0.06901542 = product of:
      0.18404113 = sum of:
        0.108516045 = weight(_text_:ontologie in 2782) [ClassicSimilarity], result of:
          0.108516045 = score(doc=2782,freq=2.0), product of:
            0.2339713 = queryWeight, product of:
              6.996407 = idf(docFreq=109, maxDocs=44218)
              0.033441637 = queryNorm
            0.46380067 = fieldWeight in 2782, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.996407 = idf(docFreq=109, maxDocs=44218)
              0.046875 = fieldNorm(doc=2782)
        0.06638107 = weight(_text_:semantic in 2782) [ClassicSimilarity], result of:
          0.06638107 = score(doc=2782,freq=6.0), product of:
            0.13904566 = queryWeight, product of:
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.033441637 = queryNorm
            0.47740483 = fieldWeight in 2782, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.046875 = fieldNorm(doc=2782)
        0.0091440175 = product of:
          0.02743205 = sum of:
            0.02743205 = weight(_text_:29 in 2782) [ClassicSimilarity], result of:
              0.02743205 = score(doc=2782,freq=2.0), product of:
                0.117637224 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.033441637 = queryNorm
                0.23319192 = fieldWeight in 2782, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2782)
          0.33333334 = coord(1/3)
      0.375 = coord(3/8)
    
    Abstract
    Ein Konferenzereignis der besonderen Art fand am 18. und 19. November 2010 in Luxemburg statt. Initiiert durch das Amt für Veröffentlichungen der Europäischen Union (http://publications.europa.eu) waren Bibliothekare und Information Professionals eingeladen, um über die Zukunft mehrsprachiger kontrollierter Vokabulare in Informationssystemen und insbesondere deren Beitrag zum Semantic Web zu diskutieren. Organisiert wurde die Konferenz durch das EuroVoc-Team, das den Thesaurus der Europäischen Union bearbeitet. Die letzte EuroVoc-Konferenz fand im Jahr 2006 statt. In der Zwischenzeit ist EuroVoc zu einem ontologie-basierten Thesaurusmanagementsystem übergegangen und hat systematisch begonnen, Semantic-Web-Technologien für die Bearbeitung und Repräsentation einzusetzen und sich mit anderen Vokabularen zu vernetzen. Ein produktiver Austausch fand mit den Produzenten anderer europäischer und internationaler Vokabulare (z.B. United Nations oder FAO) sowie Vertretern aus Projekten, die an Themen über automatische Indexierung (hier insbesondere parlamentarische und rechtliche Dokumente) sowie Interoperabilitiät zwischen Vokabularen arbeiten, statt.
    Date
    29. 3.2013 17:46:08
  2. Li, K.W.; Yang, C.C.: Automatic crosslingual thesaurus generated from the Hong Kong SAR Police Department Web Corpus for Crime Analysis (2005) 0.01
    0.007141466 = product of:
      0.057131726 = sum of:
        0.057131726 = weight(_text_:semantic in 3391) [ClassicSimilarity], result of:
          0.057131726 = score(doc=3391,freq=10.0), product of:
            0.13904566 = queryWeight, product of:
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.033441637 = queryNorm
            0.41088465 = fieldWeight in 3391, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.03125 = fieldNorm(doc=3391)
      0.125 = coord(1/8)
    
    Abstract
    For the sake of national security, very large volumes of data and information are generated and gathered daily. Much of this data and information is written in different languages, stored in different locations, and may be seemingly unconnected. Crosslingual semantic interoperability is a major challenge to generate an overview of this disparate data and information so that it can be analyzed, shared, searched, and summarized. The recent terrorist attacks and the tragic events of September 11, 2001 have prompted increased attention an national security and criminal analysis. Many Asian countries and cities, such as Japan, Taiwan, and Singapore, have been advised that they may become the next targets of terrorist attacks. Semantic interoperability has been a focus in digital library research. Traditional information retrieval (IR) approaches normally require a document to share some common keywords with the query. Generating the associations for the related terms between the two term spaces of users and documents is an important issue. The problem can be viewed as the creation of a thesaurus. Apart from this, terrorists and criminals may communicate through letters, e-mails, and faxes in languages other than English. The translation ambiguity significantly exacerbates the retrieval problem. The problem is expanded to crosslingual semantic interoperability. In this paper, we focus an the English/Chinese crosslingual semantic interoperability problem. However, the developed techniques are not limited to English and Chinese languages but can be applied to many other languages. English and Chinese are popular languages in the Asian region. Much information about national security or crime is communicated in these languages. An efficient automatically generated thesaurus between these languages is important to crosslingual information retrieval between English and Chinese languages. To facilitate crosslingual information retrieval, a corpus-based approach uses the term co-occurrence statistics in parallel or comparable corpora to construct a statistical translation model to cross the language boundary. In this paper, the text based approach to align English/Chinese Hong Kong Police press release documents from the Web is first presented. We also introduce an algorithmic approach to generate a robust knowledge base based an statistical correlation analysis of the semantics (knowledge) embedded in the bilingual press release corpus. The research output consisted of a thesaurus-like, semantic network knowledge base, which can aid in semanticsbased crosslingual information management and retrieval.
  3. Andrade, J. de; Lopes Ginez de Lara, M.: Interoperability and mapping between knowledge organization systems : metathesaurus - Unified Medical Language System of the National Library of Medicine (2016) 0.01
    0.0067749894 = product of:
      0.054199915 = sum of:
        0.054199915 = weight(_text_:semantic in 2826) [ClassicSimilarity], result of:
          0.054199915 = score(doc=2826,freq=4.0), product of:
            0.13904566 = queryWeight, product of:
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.033441637 = queryNorm
            0.38979942 = fieldWeight in 2826, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.046875 = fieldNorm(doc=2826)
      0.125 = coord(1/8)
    
    Abstract
    This paper is aimed at assessing the potential of interoperable knowledge organization systems to respond to search strategies in order to retrieve information from databases in the areas of health and biomedicine. An analysis was done on the semantic consistency of synonym grouping of a term selected from the Metathesaurus, the Unified Medical Language System of the National Library of Medicine, based on the characteristics of equivalence proposed in ISO 25964: 2: 2011 and based on the following categories: semantic, morphological, syntactic and typographical variations. This paper highlights the importance of understanding the results of automatic mapping as well as the need for characterization, evaluation and selection of equivalences for preparation of consistent search strategies and presentation of search results in scientific work methodologies.
  4. Doerr, M.: Semantic problems of thesaurus mapping (2001) 0.01
    0.005645824 = product of:
      0.045166593 = sum of:
        0.045166593 = weight(_text_:semantic in 5902) [ClassicSimilarity], result of:
          0.045166593 = score(doc=5902,freq=4.0), product of:
            0.13904566 = queryWeight, product of:
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.033441637 = queryNorm
            0.32483283 = fieldWeight in 5902, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5902)
      0.125 = coord(1/8)
    
    Abstract
    With networked information access to heterogeneous data sources, the problem of terminology provision and interoperability of controlled vocabulary schemes such as thesauri becomes increasingly urgent. Solutions are needed to improve the performance of full-text retrieval systems and to guide the design of controlled terminology schemes for use in structured data, including metadata. Thesauri are created in different languages, with different scope and points of view and at different levels of abstraction and detail, to accomodate access to a specific group of collections. In any wider search accessing distributed collections, the user would like to start with familiar terminology and let the system find out the correspondences to other terminologies in order to retrieve equivalent results from all addressed collections. This paper investigates possible semantic differences that may hinder the unambiguous mapping and transition from one thesaurus to another. It focusses on the differences of meaning of terms and their relations as intended by their creators for indexing and querying a specific collection, in contrast to methods investigating the statistical relevance of terms for objects in a collection. It develops a notion of optimal mapping, paying particular attention to the intellectual quality of mappings between terms from different vocabularies and to problems of polysemy. Proposals are made to limit the vagueness introduced by the transition from one vocabulary to another. The paper shows ways in which thesaurus creators can improve their methodology to meet the challenges of networked access of distributed collections created under varying conditions. For system implementers, the discussion will lead to a better understanding of the complexity of the problem
  5. García-Marco, F.-J.: Enhancing the visibility and relevance of thesauri in the Web : searching for a hub in the linked data environment (2016) 0.01
    0.005645824 = product of:
      0.045166593 = sum of:
        0.045166593 = weight(_text_:semantic in 2916) [ClassicSimilarity], result of:
          0.045166593 = score(doc=2916,freq=4.0), product of:
            0.13904566 = queryWeight, product of:
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.033441637 = queryNorm
            0.32483283 = fieldWeight in 2916, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2916)
      0.125 = coord(1/8)
    
    Abstract
    Thesauri have triumphed in many domains that require precise and exhaustive information because of their representational power, their capability to integrate the concept-based and alphabetical approaches to organizing information, and their standardization and, more recently, formalization. Nevertheless, there is room to improve their relevance in the digital age by embracing the open linked data initiatives and by taking advantage of their structural and functional proximity to some of the big collaborative knowledge repositories in the Internet, notably the Wikipedia environment. With a focus on its implications for enhanced interoperability, this structural proximity is analysed, and the benefits of such collaboration for the different potential stakeholders are considered. It is proposed that better devices for ensuring semantic browsing are provided when necessary, and that an open hub for thesauri interconnection is developed, perhaps using existing big open Internet semantic facilities, such as Wikipedia.
  6. Bandholtz, T.; Schulte-Coerne, T.; Glaser, R.; Fock, J.; Keller, T.: iQvoc - open source SKOS(XL) maintenance and publishing tool (2010) 0.01
    0.0055890805 = product of:
      0.044712644 = sum of:
        0.044712644 = weight(_text_:semantic in 604) [ClassicSimilarity], result of:
          0.044712644 = score(doc=604,freq=2.0), product of:
            0.13904566 = queryWeight, product of:
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.033441637 = queryNorm
            0.32156807 = fieldWeight in 604, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.0546875 = fieldNorm(doc=604)
      0.125 = coord(1/8)
    
    Source
    Proceedings of the Sixth Workshop on Scripting and Development for the Semantic Web, Crete, Greece, May 31, 2010, CEUR Workshop Proceedings, SFSW - http://ceur-ws.org/Vol-699/Paper2.pdf
  7. Baca, M.; Gill, M.: Encoding multilingual knowledge systems in the digital age : the Getty vocabularies (2015) 0.00
    0.0047906404 = product of:
      0.038325123 = sum of:
        0.038325123 = weight(_text_:semantic in 2203) [ClassicSimilarity], result of:
          0.038325123 = score(doc=2203,freq=2.0), product of:
            0.13904566 = queryWeight, product of:
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.033441637 = queryNorm
            0.2756298 = fieldWeight in 2203, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.046875 = fieldNorm(doc=2203)
      0.125 = coord(1/8)
    
    Abstract
    This paper gives an overview of the history, development, and structure of the electronic thesauri produced and maintained by the Getty Research Institute (GRI). We describe the evolution of the Art & Architecture Thesaurus (AAT®), the Getty Thesaurus of Geographic Names (TGN®), and the Union List of Artist Names (ULAN®) as multilingual, cross-cultural knowledge organization systems (KOS); the factors that make them unique; and their potential, when expressed as Linked Open Data (LOD) to play a key role in the Semantic Web.
  8. ISO 25964-2: Der Standard für die Interoperabilität von Thesauri (2013) 0.00
    0.0027945403 = product of:
      0.022356322 = sum of:
        0.022356322 = weight(_text_:semantic in 772) [ClassicSimilarity], result of:
          0.022356322 = score(doc=772,freq=2.0), product of:
            0.13904566 = queryWeight, product of:
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.033441637 = queryNorm
            0.16078404 = fieldWeight in 772, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.1578603 = idf(docFreq=1879, maxDocs=44218)
              0.02734375 = fieldNorm(doc=772)
      0.125 = coord(1/8)
    
    Content
    "Interoperabilität bedeutet, dass zwei oder mehrere Systeme oder Systemkomponenten Informationen austauschen und nutzen können. Diese Fähigkeit ist von grundsätzlicher Bedeutung für Informationsnetzwerke, auf die wir alle angewiesen sind, und mehr noch für die Entwicklung des Semantischen Web. Bei Thesauri gibt es zwei Ebenen der Interoperabilität: 1. Erstellung des Thesaurus in einer standardisierten Form, um den Import und die Nutzung des Thesaurus in anderen Systemen zu gewährleisten (dies wurde in ISO 25964-1 behandelt und 2011 publiziert). 2. Ergänzende Nutzung von weiteren Vokabularien und zwar durch die Erstellung von Mappings zwischen den Begriffen eines Thesaurus und denen eines anderen Vokabulars (Thema von ISO 25964-2). Nur wenn Mappings mit der empfohlenen Sorgfalt von ISO 25964-2 durchgeführt werden, kann sichergestellt werden, dass die auf ihnen basierenden Verknüpfungen im Semantic Web den nichtsahnenden Netzsurfer nicht zu irreführenden Ergebnissen führen.