Search (11 results, page 1 of 1)

Li, K.W.; Yang, C.C.: Automatic crosslingual thesaurus generated from the Hong Kong SAR Police Department Web Corpus for Crime Analysis (2005) 0.00
```
0.0035694435 = product of:
  0.014277774 = sum of:
    0.014277774 = weight(_text_:information in 3391) [ClassicSimilarity], result of:
      0.014277774 = score(doc=3391,freq=18.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.23274568 = fieldWeight in 3391, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03125 = fieldNorm(doc=3391)
  0.25 = coord(1/4)
```
Abstract

For the sake of national security, very large volumes of data and information are generated and gathered daily. Much of this data and information is written in different languages, stored in different locations, and may be seemingly unconnected. Crosslingual semantic interoperability is a major challenge to generate an overview of this disparate data and information so that it can be analyzed, shared, searched, and summarized. The recent terrorist attacks and the tragic events of September 11, 2001 have prompted increased attention an national security and criminal analysis. Many Asian countries and cities, such as Japan, Taiwan, and Singapore, have been advised that they may become the next targets of terrorist attacks. Semantic interoperability has been a focus in digital library research. Traditional information retrieval (IR) approaches normally require a document to share some common keywords with the query. Generating the associations for the related terms between the two term spaces of users and documents is an important issue. The problem can be viewed as the creation of a thesaurus. Apart from this, terrorists and criminals may communicate through letters, e-mails, and faxes in languages other than English. The translation ambiguity significantly exacerbates the retrieval problem. The problem is expanded to crosslingual semantic interoperability. In this paper, we focus an the English/Chinese crosslingual semantic interoperability problem. However, the developed techniques are not limited to English and Chinese languages but can be applied to many other languages. English and Chinese are popular languages in the Asian region. Much information about national security or crime is communicated in these languages. An efficient automatically generated thesaurus between these languages is important to crosslingual information retrieval between English and Chinese languages. To facilitate crosslingual information retrieval, a corpus-based approach uses the term co-occurrence statistics in parallel or comparable corpora to construct a statistical translation model to cross the language boundary. In this paper, the text based approach to align English/Chinese Hong Kong Police press release documents from the Web is first presented. We also introduce an algorithmic approach to generate a robust knowledge base based an statistical correlation analysis of the semantics (knowledge) embedded in the bilingual press release corpus. The research output consisted of a thesaurus-like, semantic network knowledge base, which can aid in semanticsbased crosslingual information management and retrieval.

Source

Journal of the American Society for Information Science and Technology. 56(2005) no.3, S.272-281

Kempf, A.O.: Thesauri und Interoperabilität mit anderen Vokabularen : Die neue Thesaurusnorm ISO 25964 (2013) 0.00

0.0029745363 = product of:
  0.011898145 = sum of:
    0.011898145 = weight(_text_:information in 1144) [ClassicSimilarity], result of:
      0.011898145 = score(doc=1144,freq=2.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.19395474 = fieldWeight in 1144, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.078125 = fieldNorm(doc=1144)
  0.25 = coord(1/4)

Source: Information - Wissenschaft und Praxis. 64(2013) H.6, S.365-368

ISO 25964-2: Thesauri and interoperability with other vocabularies : Part 2: Interoperability with other vocabularies (2013) 0.00
```
0.0029446408 = product of:
  0.011778563 = sum of:
    0.011778563 = weight(_text_:information in 4832) [ClassicSimilarity], result of:
      0.011778563 = score(doc=4832,freq=4.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.1920054 = fieldWeight in 4832, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4832)
  0.25 = coord(1/4)
```
Abstract

ISO 25964-2:2013 is applicable to thesauri and other types of vocabulary that are commonly used for information retrieval. It describes, compares and contrasts the elements and features of these vocabularies that are implicated when interoperability is needed. It gives recommendations for the establishment and maintenance of mappings between multiple thesauri, or between thesauri and other types of vocabularies.

Content

Part 1: Thesauri for information retrieval.
García-Marco, F.-J.: Enhancing the visibility and relevance of thesauri in the Web : searching for a hub in the linked data environment (2016) 0.00
```
0.0025760243 = product of:
  0.010304097 = sum of:
    0.010304097 = weight(_text_:information in 2916) [ClassicSimilarity], result of:
      0.010304097 = score(doc=2916,freq=6.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.16796975 = fieldWeight in 2916, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2916)
  0.25 = coord(1/4)
```
Abstract

Thesauri have triumphed in many domains that require precise and exhaustive information because of their representational power, their capability to integrate the concept-based and alphabetical approaches to organizing information, and their standardization and, more recently, formalization. Nevertheless, there is room to improve their relevance in the digital age by embracing the open linked data initiatives and by taking advantage of their structural and functional proximity to some of the big collaborative knowledge repositories in the Internet, notably the Wikipedia environment. With a focus on its implications for enhanced interoperability, this structural proximity is analysed, and the benefits of such collaboration for the different potential stakeholders are considered. It is proposed that better devices for ensuring semantic browsing are provided when necessary, and that an open hub for thesauri interconnection is developed, perhaps using existing big open Internet semantic facilities, such as Wikipedia.

Content

Beitrag in einem Special issue: The Great Debate: "This House Believes that the Traditional Thesaurus has no Place in Modern Information Retrieval." [19 February 2015, 14:00-17:30 preceded by ISKO UK AGM and followed by networking, wine and nibbles; vgl.: http://www.iskouk.org/content/great-debate].
Huckstorf, A.; Petras, V.: Mind the lexical gap : EuroVoc Building Block of the Semantic Web (2011) 0.00
```
0.0025239778 = product of:
  0.010095911 = sum of:
    0.010095911 = weight(_text_:information in 2782) [ClassicSimilarity], result of:
      0.010095911 = score(doc=2782,freq=4.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.16457605 = fieldWeight in 2782, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2782)
  0.25 = coord(1/4)
```
Abstract

Ein Konferenzereignis der besonderen Art fand am 18. und 19. November 2010 in Luxemburg statt. Initiiert durch das Amt für Veröffentlichungen der Europäischen Union (http://publications.europa.eu) waren Bibliothekare und Information Professionals eingeladen, um über die Zukunft mehrsprachiger kontrollierter Vokabulare in Informationssystemen und insbesondere deren Beitrag zum Semantic Web zu diskutieren. Organisiert wurde die Konferenz durch das EuroVoc-Team, das den Thesaurus der Europäischen Union bearbeitet. Die letzte EuroVoc-Konferenz fand im Jahr 2006 statt. In der Zwischenzeit ist EuroVoc zu einem ontologie-basierten Thesaurusmanagementsystem übergegangen und hat systematisch begonnen, Semantic-Web-Technologien für die Bearbeitung und Repräsentation einzusetzen und sich mit anderen Vokabularen zu vernetzen. Ein produktiver Austausch fand mit den Produzenten anderer europäischer und internationaler Vokabulare (z.B. United Nations oder FAO) sowie Vertretern aus Projekten, die an Themen über automatische Indexierung (hier insbesondere parlamentarische und rechtliche Dokumente) sowie Interoperabilitiät zwischen Vokabularen arbeiten, statt.

Source

Information - Wissenschaft und Praxis. 62(2011) H.2/3, S.125-126
Doerr, M.: Semantic problems of thesaurus mapping (2001) 0.00
```
0.0021033147 = product of:
  0.008413259 = sum of:
    0.008413259 = weight(_text_:information in 5902) [ClassicSimilarity], result of:
      0.008413259 = score(doc=5902,freq=4.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.13714671 = fieldWeight in 5902, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5902)
  0.25 = coord(1/4)
```
Abstract

With networked information access to heterogeneous data sources, the problem of terminology provision and interoperability of controlled vocabulary schemes such as thesauri becomes increasingly urgent. Solutions are needed to improve the performance of full-text retrieval systems and to guide the design of controlled terminology schemes for use in structured data, including metadata. Thesauri are created in different languages, with different scope and points of view and at different levels of abstraction and detail, to accomodate access to a specific group of collections. In any wider search accessing distributed collections, the user would like to start with familiar terminology and let the system find out the correspondences to other terminologies in order to retrieve equivalent results from all addressed collections. This paper investigates possible semantic differences that may hinder the unambiguous mapping and transition from one thesaurus to another. It focusses on the differences of meaning of terms and their relations as intended by their creators for indexing and querying a specific collection, in contrast to methods investigating the statistical relevance of terms for objects in a collection. It develops a notion of optimal mapping, paying particular attention to the intellectual quality of mappings between terms from different vocabularies and to problems of polysemy. Proposals are made to limit the vagueness introduced by the transition from one vocabulary to another. The paper shows ways in which thesaurus creators can improve their methodology to meet the challenges of networked access of distributed collections created under varying conditions. For system implementers, the discussion will lead to a better understanding of the complexity of the problem

Source

Journal of digital information. 1(2001) no.8,
Bandholtz, T.; Schulte-Coerne, T.; Glaser, R.; Fock, J.; Keller, T.: iQvoc - open source SKOS(XL) maintenance and publishing tool (2010) 0.00
```
0.0020821756 = product of:
  0.008328702 = sum of:
    0.008328702 = weight(_text_:information in 604) [ClassicSimilarity], result of:
      0.008328702 = score(doc=604,freq=2.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.13576832 = fieldWeight in 604, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=604)
  0.25 = coord(1/4)
```
Abstract

iQvoc is a new open source SKOS-XL vocabulary management tool developed by the Federal Environment Agency, Germany, and innoQ Deutschland GmbH. Its immediate purpose is maintaining and publishing reference vocabularies in the upcoming Linked Data cloud of environmental information, but it may be easily adapted to host any SKOS- XL compliant vocabulary. iQvoc is implemented as a Ruby on Rails application running on top of JRuby - the Java implementation of the Ruby Programming Language. To increase the user experience when editing content, iQvoc uses heavily the JavaScript library jQuery.

Dextre Clarke, S.G.: Overview of ISO NP 25964 : structured vocabularies for information retrieval (2007) 0.00

0.0017847219 = product of:
  0.0071388874 = sum of:
    0.0071388874 = weight(_text_:information in 535) [ClassicSimilarity], result of:
      0.0071388874 = score(doc=535,freq=2.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.116372846 = fieldWeight in 535, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=535)
  0.25 = coord(1/4)

Andrade, J. de; Lopes Ginez de Lara, M.: Interoperability and mapping between knowledge organization systems : metathesaurus - Unified Medical Language System of the National Library of Medicine (2016) 0.00
```
0.0017847219 = product of:
  0.0071388874 = sum of:
    0.0071388874 = weight(_text_:information in 2826) [ClassicSimilarity], result of:
      0.0071388874 = score(doc=2826,freq=2.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.116372846 = fieldWeight in 2826, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2826)
  0.25 = coord(1/4)
```
Abstract

This paper is aimed at assessing the potential of interoperable knowledge organization systems to respond to search strategies in order to retrieve information from databases in the areas of health and biomedicine. An analysis was done on the semantic consistency of synonym grouping of a term selected from the Metathesaurus, the Unified Medical Language System of the National Library of Medicine, based on the characteristics of equivalence proposed in ISO 25964: 2: 2011 and based on the following categories: semantic, morphological, syntactic and typographical variations. This paper highlights the importance of understanding the results of automatic mapping as well as the need for characterization, evaluation and selection of equivalences for preparation of consistent search strategies and presentation of search results in scientific work methodologies.

Dunckel, P.: Zusammenführung mehrerer Thesauri zu einem gemeinsamen Begriffssystem : Probleme und Lösungsansätze (2017) 0.00

0.0017847219 = product of:
  0.0071388874 = sum of:
    0.0071388874 = weight(_text_:information in 4233) [ClassicSimilarity], result of:
      0.0071388874 = score(doc=4233,freq=2.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.116372846 = fieldWeight in 4233, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=4233)
  0.25 = coord(1/4)

Source: Information - Wissenschaft und Praxis. 68(2017) H.4, S.253-262

ISO 25964-2: Der Standard für die Interoperabilität von Thesauri (2013) 0.00
```
0.0010410878 = product of:
  0.004164351 = sum of:
    0.004164351 = weight(_text_:information in 772) [ClassicSimilarity], result of:
      0.004164351 = score(doc=772,freq=2.0), product of:
        0.06134496 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.034944877 = queryNorm
        0.06788416 = fieldWeight in 772, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.02734375 = fieldNorm(doc=772)
  0.25 = coord(1/4)
```
Content

Der vollständige Titel von Teil 2 lautet "Information and documentation - Thesauri and interoperability with other vocabularies - Teil 2: Interoperability with other vocabularies". Wichtige Themen, die der Standard behandelt, sind Strukturmodelle für das Mapping, Richtlinien für Mappingtypen und der Umgang mit Präkombination, die besonders bei Klassifikationen, Taxonomien und Schlagwortsystemen vorkommt. Das primäre Augenmerk von ISO 25964 gilt den Thesauri, und mit Ausnahme von Terminologien existieren keine vergleichbaren Standards für die anderen Vokabulartypen. Statt zu versuchen, diese normativ darzustellen, behandelt Teil 2 ausschließlich die Interoperabilität zwischen ihnen und den Thesauri. Die Kapitel für die einzelnen Vokabulartypen decken jeweils folgende Sachverhalte ab: - Schlüsseleigenschaften des Vokabulars (deskriptiv, nicht normativ) - Semantische Komponenten/Beziehungen (deskriptiv, nicht normativ) Sofern anwendbar, Empfehlungen für das Mapping zwischen Vokabular und Thesaurus (normativ).

Search (11 results, page 1 of 1)

Authors

Years

Languages

Types

Themes