Search (16 results, page 1 of 1)

Berti, Jr., D.W.; Lima, G.; Maculan, B.; Soergel, D.: Computer-assisted checking of conceptual relationships in a large thesaurus (2018) 0.05

0.04713631 = product of:
  0.09427262 = sum of:
    0.09427262 = sum of:
      0.037610095 = weight(_text_:research in 4721) [ClassicSimilarity], result of:
        0.037610095 = score(doc=4721,freq=2.0), product of:
          0.1491455 = queryWeight, product of:
            2.8529835 = idf(docFreq=6931, maxDocs=44218)
            0.05227703 = queryNorm
          0.2521705 = fieldWeight in 4721, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.8529835 = idf(docFreq=6931, maxDocs=44218)
            0.0625 = fieldNorm(doc=4721)
      0.056662526 = weight(_text_:22 in 4721) [ClassicSimilarity], result of:
        0.056662526 = score(doc=4721,freq=2.0), product of:
          0.18306525 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05227703 = queryNorm
          0.30952093 = fieldWeight in 4721, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=4721)
  0.5 = coord(1/2)

Date: 17. 1.2019 19:04:22
Source: Challenges and opportunities for knowledge organization in the digital age: proceedings of the Fifteenth International ISKO Conference, 9-11 July 2018, Porto, Portugal / organized by: International Society for Knowledge Organization (ISKO), ISKO Spain and Portugal Chapter, University of Porto - Faculty of Arts and Humanities, Research Centre in Communication, Information and Digital Culture (CIC.digital) - Porto. Eds.: F. Ribeiro u. M.E. Cerveira

Boteram, F.: Semantische Relationen in Dokumentationssprachen : vom Thesaurus zum semantischen Netz (2010) 0.01

0.012394927 = product of:
  0.024789855 = sum of:
    0.024789855 = product of:
      0.04957971 = sum of:
        0.04957971 = weight(_text_:22 in 4792) [ClassicSimilarity], result of:
          0.04957971 = score(doc=4792,freq=2.0), product of:
            0.18306525 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05227703 = queryNorm
            0.2708308 = fieldWeight in 4792, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4792)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Wissensspeicher in digitalen Räumen: Nachhaltigkeit - Verfügbarkeit - semantische Interoperabilität. Proceedings der 11. Tagung der Deutschen Sektion der Internationalen Gesellschaft für Wissensorganisation, Konstanz, 20. bis 22. Februar 2008. Hrsg.: J. Sieglerschmidt u. H.P.Ohly

Amirhosseini, M.: Quantitative evaluation of the movement from complexity toward simplicity in the structure of thesaurus descriptors (2015) 0.01
```
0.0101785315 = product of:
  0.020357063 = sum of:
    0.020357063 = product of:
      0.040714126 = sum of:
        0.040714126 = weight(_text_:research in 3695) [ClassicSimilarity], result of:
          0.040714126 = score(doc=3695,freq=6.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.2729826 = fieldWeight in 3695, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3695)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

The concepts of simplicity and complexity play major roles in information storage and retrieval in knowledge organizations. This paper reports an investigation of these concepts in the structure of descriptors. The main purpose of simplicity is to decrease the number of words in the construction of descriptors as this idea affects semantic relations, recall and precision. ISO 25964 has affirmed the purpose of simplicity by requiring splitting compound terms into simpler concepts. This work aims to elaborate the standard methods of evaluation by providing a more detailed evaluation of the descriptors structure and identifying effective factors in simplicity and complexity results in the structure of thesauri descriptors. The research population is taken from the descriptors of the Commonwealth Agricultural Bureaux (CAB) Thesaurus, the Persian Cultural Thesaurus (ASFA) and the Chemical Thesaurus. This research was conducted using the statistical and content analysis method. In this research we propose a new quantitative approach as well as novel indicators and indices involving Simplicity and Factoring Ratios to evaluate the descriptors structure. The results will be useful in the verification, selection and maintenance purposes in knowledge organizations and the inquiry method can be further developed in the field of ontology evaluation.
Kempf, A.O.; Baum, K.: Thesaurus-based indexing of research data in the social sciences : opportunities and difficulties of internationalization efforts (2013) 0.01
```
0.009972882 = product of:
  0.019945765 = sum of:
    0.019945765 = product of:
      0.03989153 = sum of:
        0.03989153 = weight(_text_:research in 1656) [ClassicSimilarity], result of:
          0.03989153 = score(doc=1656,freq=4.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.2674672 = fieldWeight in 1656, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=1656)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Efforts towards internationalization have become increasingly important in scientific environments. As for content-based indexing of scientific research data, however, standards leading to internationally coherent indexing which is vital for retrieval purposes are not yet sufficiently developed. Even concerning the concrete use of indexing instruments, launched by initiatives on an international scale, there are still no binding policies and guidelines. Against this backdrop, essential criteria which internationally applicable indexing systems should meet will be outlined. These will be illustrated through the multilingual European Language Social Science Thesaurus (ELSST), originally based on the UK Data Archive's (UKDA) Humanities and Social Science Electronic Thesaurus (HASSET) and ultimately developed by the Council of European Social Science Data Archives (CESSDA). Additionally, the general pros and cons of using international versus national indexing languages will be weighed using the ELSST and the Thesaurus for the Social Sciences (TSS) developed by GESIS - Leibniz-Institute for the Social Sciences. In this light, the benefit of vocabulary crosswalks for supporting a combined use of international and national indexing systems will be discussed.

Miranda Guedes, R. de; Aparecida Moura, M.: Semantic warrant, cultural hospitality and knowledge representation in multicultural contexts : experiments with the use of the EuroVoc and UNBIS thesauri (2018) 0.01

0.009402524 = product of:
  0.018805047 = sum of:
    0.018805047 = product of:
      0.037610095 = sum of:
        0.037610095 = weight(_text_:research in 4778) [ClassicSimilarity], result of:
          0.037610095 = score(doc=4778,freq=2.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.2521705 = fieldWeight in 4778, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0625 = fieldNorm(doc=4778)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Challenges and opportunities for knowledge organization in the digital age: proceedings of the Fifteenth International ISKO Conference, 9-11 July 2018, Porto, Portugal / organized by: International Society for Knowledge Organization (ISKO), ISKO Spain and Portugal Chapter, University of Porto - Faculty of Arts and Humanities, Research Centre in Communication, Information and Digital Culture (CIC.digital) - Porto. Eds.: F. Ribeiro u. M.E. Cerveira

White, M.: ¬The value of taxonomies, thesauri and metadata in enterprise search (2016) 0.01
```
0.008310735 = product of:
  0.01662147 = sum of:
    0.01662147 = product of:
      0.03324294 = sum of:
        0.03324294 = weight(_text_:research in 2964) [ClassicSimilarity], result of:
          0.03324294 = score(doc=2964,freq=4.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.22288933 = fieldWeight in 2964, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2964)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Although the technical, mathematical and linguistic principles of search date back to the early 1960s and enterprise search applications have been commercially available since the 1980s; it is only since the launch of Microsoft SharePoint 2010 and the integration of the Apache Lucene and Solr projects in 2010 that there has been a wider adoption of enterprise search applications. Surveys carried out over the last five years indicate that although enterprises accept that search applications are essential in locating information, there has not been any significant investment in search teams to support these applications. Where taxonomies, thesauri and metadata have been used to improve the search user interface and enhance the search experience, the indications are that levels of search satisfaction are significantly higher. The challenges faced by search managers in developing and maintaining these tools include a lack of published research on the use of these tools and difficulty in recruiting search team members with the requisite skills and experience. There would seem to be an important and immediate opportunity to bring together the research, knowledge organization and enterprise search communities to explore how good practice in the use of taxonomies, thesauri and metadata in enterprise search can be established, enhanced and promoted.
Assem, M. van: Converting and integrating vocabularies for the Semantic Web (2010) 0.01
```
0.008142825 = product of:
  0.01628565 = sum of:
    0.01628565 = product of:
      0.0325713 = sum of:
        0.0325713 = weight(_text_:research in 4639) [ClassicSimilarity], result of:
          0.0325713 = score(doc=4639,freq=6.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.21838607 = fieldWeight in 4639, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.03125 = fieldNorm(doc=4639)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This thesis focuses on conversion of vocabularies for representation and integration of collections on the Semantic Web. A secondary focus is how to represent metadata schemas (RDF Schemas representing metadata element sets) such that they interoperate with vocabularies. The primary domain in which we operate is that of cultural heritage collections. The background worldview in which a solution is sought is that of the Semantic Web research paradigmwith its associated theories, methods, tools and use cases. In other words, we assume the SemanticWeb is in principle able to provide the context to realize interoperable collections. Interoperability is dependent on the interplay between representations and the applications that use them. We mean applications in the widest sense, such as "search" and "annotation". These applications or tasks are often present in software applications, such as the E-Culture application. It is therefore necessary that applications requirements on the vocabulary representation are met. This leads us to formulate the following problem statement: HOW CAN EXISTING VOCABULARIES BE MADE AVAILABLE TO SEMANTIC WEB APPLICATIONS?
We refine the problem statement into three research questions. The first two focus on the problem of conversion of a vocabulary to a Semantic Web representation from its original format. Conversion of a vocabulary to a representation in a Semantic Web language is necessary to make the vocabulary available to SemanticWeb applications. In the last question we focus on integration of collection metadata schemas in a way that allows for vocabulary representations as produced by our methods. Academisch proefschrift ter verkrijging van de graad Doctor aan de Vrije Universiteit Amsterdam, Dutch Research School for Information and Knowledge Systems.

Kless, D.; Milton, S.: Comparison of thesauri and ontologies from a semiotic perspective (2010) 0.01

0.007051893 = product of:
  0.014103786 = sum of:
    0.014103786 = product of:
      0.028207572 = sum of:
        0.028207572 = weight(_text_:research in 756) [ClassicSimilarity], result of:
          0.028207572 = score(doc=756,freq=2.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.18912788 = fieldWeight in 756, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=756)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Footnote: Preprint. To be published as Vol 122 in the Conferences in Research and Practice in Information Technology Series by the Australian Computer Society Inc. http://crpit.com/.

Welhouse, Z.; Lee, J.H.; Bancroft, J.: "What am I fighting for?" : creating a controlled vocabulary for video game plot metadata (2015) 0.01
```
0.007051893 = product of:
  0.014103786 = sum of:
    0.014103786 = product of:
      0.028207572 = sum of:
        0.028207572 = weight(_text_:research in 2015) [ClassicSimilarity], result of:
          0.028207572 = score(doc=2015,freq=2.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.18912788 = fieldWeight in 2015, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=2015)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

A video game's plot is one of its defining features, and prior research confirms the importance of plot metadata to users through persona analysis, interviews, and surveys. However, existing organizational systems, including library catalogs, game-related websites, and traditional plot classification systems, do not adequately describe the plot information of video games, in other words, what the game is really about. We attempt to address the issue by creating a controlled vocabulary based on a domain analysis involving a review of relevant literature and existing data structures. The controlled vocabulary is constructed in a pair structure for maximizing flexibility and extensibility. Adopting this controlled vocabulary for describing plot information of games will allow for useful search and collocation of video games.

Baca, M.; Gill, M.: Encoding multilingual knowledge systems in the digital age : the Getty vocabularies (2015) 0.01

0.007051893 = product of:
  0.014103786 = sum of:
    0.014103786 = product of:
      0.028207572 = sum of:
        0.028207572 = weight(_text_:research in 2203) [ClassicSimilarity], result of:
          0.028207572 = score(doc=2203,freq=2.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.18912788 = fieldWeight in 2203, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=2203)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Abstract: This paper gives an overview of the history, development, and structure of the electronic thesauri produced and maintained by the Getty Research Institute (GRI). We describe the evolution of the Art & Architecture Thesaurus (AAT®), the Getty Thesaurus of Geographic Names (TGN®), and the Union List of Artist Names (ULAN®) as multilingual, cross-cultural knowledge organization systems (KOS); the factors that make them unique; and their potential, when expressed as Linked Open Data (LOD) to play a key role in the Semantic Web.

Engerer, V.: Control and syntagmatization : vocabulary requirements in information retrieval thesauri and natural language lexicons (2017) 0.01
```
0.007051893 = product of:
  0.014103786 = sum of:
    0.014103786 = product of:
      0.028207572 = sum of:
        0.028207572 = weight(_text_:research in 3678) [ClassicSimilarity], result of:
          0.028207572 = score(doc=3678,freq=2.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.18912788 = fieldWeight in 3678, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=3678)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This paper explores the relationships between natural language lexicons in lexical semantics and thesauri in information retrieval research. These different areas of knowledge have different restrictions on use of vocabulary; thesauri are used only in information search and retrieval contexts, whereas lexicons are mental systems and generally applicable in all domains of life. A set of vocabulary requirements that defines the more concrete characteristics of vocabulary items in the 2 contexts can be derived from this framework: lexicon items have to be learnable, complex, transparent, etc., whereas thesaurus terms must be effective, current and relevant, searchable, etc. The differences in vocabulary properties correlate with 2 other factors, the well-known dimension of Control (deliberate, social activities of building and maintaining vocabularies), and Syntagmatization, which is less known and describes vocabulary items' varying formal preparedness to exit the thesaurus/lexicon, enter into linear syntactic constructions, and, finally, acquire communicative functionality. It is proposed that there is an inverse relationship between Control and Syntagmatization.
Dextre Clarke, S.G.: ¬The Information Retrieval Thesaurus (2019) 0.01
```
0.007051893 = product of:
  0.014103786 = sum of:
    0.014103786 = product of:
      0.028207572 = sum of:
        0.028207572 = weight(_text_:research in 5210) [ClassicSimilarity], result of:
          0.028207572 = score(doc=5210,freq=2.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.18912788 = fieldWeight in 5210, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.046875 = fieldNorm(doc=5210)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

In the post-war period before computers were readily available, urgent demand for scientific and industrial develop-ment stimulated research and development (R&D) that led to the birth of the information retrieval thesaurus. This article traces the early history, speciation and progressive improvement of the thesaurus to reach the state now conveyed by guidelines in inter-national and national standards. Despite doubts about the effec-tiveness of the thesaurus throughout this period, and notwith-standing the dominance of Google and other search engines in the information retrieval (IR) scene today, the thesaurus still plays a complementary part in the organization of knowledge and in-formation resources. Success today depends on interoperability, and is opening up opportunities in linked data applications. At the same time, the IR demand from workers in the knowledge society drives interest in hybrid forms of knowledge organization system (KOS) that may pool the genes of thesauri with those of ontologies and classification schemes.
Dextre Clarke, S.G.: Origins and trajectory of the long thesaurus debate (2016) 0.01
```
0.005876578 = product of:
  0.011753156 = sum of:
    0.011753156 = product of:
      0.023506312 = sum of:
        0.023506312 = weight(_text_:research in 2913) [ClassicSimilarity], result of:
          0.023506312 = score(doc=2913,freq=2.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.15760657 = fieldWeight in 2913, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2913)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

The information retrieval thesaurus emerged in the 1950s, settled down to a more-or-less standard format in the 1970s and has continued to evolve marginally since then. Throughout its whole lifetime, doubts have been expressed about its efficacy with emphasis latterly on cost-effectiveness. Prolonged testing of different styles of index language in the 1970s failed to settle the doubts. The arena occupied by the debate has moved from small isolated databases in the post-war era to diverse situations nowadays with the whole Internet at one extreme and small in-house collections at the other. Sophisticated statistical techniques now dominate the retrieval landscape on the Internet but leave opportunities for the thesaurus and other knowledge organization techniques in niches such as image libraries and corporate intranets. The promise of an ontology-driven semantic web with linked data resources opens another opportunity. Thus much scope remains for research to establish the usefulness of the thesaurus in these places and to inspire its continuing evolution.
Losee, R.: Thesaurus structure, descriptive parameters, and scale (2016) 0.01
```
0.005876578 = product of:
  0.011753156 = sum of:
    0.011753156 = product of:
      0.023506312 = sum of:
        0.023506312 = weight(_text_:research in 3087) [ClassicSimilarity], result of:
          0.023506312 = score(doc=3087,freq=2.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.15760657 = fieldWeight in 3087, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3087)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

A thesaurus contains a set of terms or features that may be used to represent recorded information, including prose documents or scientific data sets. The focus of this work is on the basic structural nature of a thesaurus itself, not on how people develop a thesaurus or how a thesaurus effects retrieval performance. Thesauri in this research are automatically developed in a simulation from sets of randomly or exhaustively generated documents. Each thesaurus is generated by the Thesaurus Generator software from a set of several hundred documents, and thousands of different document sets are used as input to the Thesaurus Generator, producing thousands of thesauri. Thus, thousands of thesauri are generated for each data point in accompanying graphs. The characteristics of this large number of thesauri are studied so that the relationships between thesaurus parameters can be determined. Some rules governing these relationships are suggested, addressing factors such as tree height and width, number of tree roots in thesauri, and number of terms available for the vocabulary. How these parameters scale as vocabularies grow is addressed. These results apply to various information systems that contain features with hierarchical relationships, including many thesauri and ontologies.
Amirhosseini, M.: Theoretical base of quantitative evaluation of unity in a thesaurus term network based on Kant's epistemology (2010) 0.01
```
0.005876578 = product of:
  0.011753156 = sum of:
    0.011753156 = product of:
      0.023506312 = sum of:
        0.023506312 = weight(_text_:research in 5854) [ClassicSimilarity], result of:
          0.023506312 = score(doc=5854,freq=2.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.15760657 = fieldWeight in 5854, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5854)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

The quantitative evaluation of thesauri has been carried out much further since 1976. This type of evaluation is based on counting of special factors in thesaurus structure, some of which are counting preferred terms, non preferred terms, cross reference terms and so on. Therefore, various statistical tests have been proposed and applied for evaluation of thesauri. In this article, we try to explain some ratios in the field of unity quantitative evaluation in a thesaurus term network. Theoretical base of the ratios' indicators and indices construction, and epistemological thought in this type of quantitative evaluation, are discussed in this article. The theoretical base of quantitative evaluation is the epistemological thought of Immanuel Kant's Critique of pure reason. The cognition states of transcendental understanding are divided into three steps, the first is perception, the second combination and the third, relation making. Terms relation domains and conceptual relation domains can be analyzed with ratios. The use of quantitative evaluations in current research in the field of thesaurus construction prepares a basis for a restoration period. In modern thesaurus construction, traditional term relations are analyzed in detail in the form of new conceptual relations. Hence, the new domains of hierarchical and associative relations are constructed in the form of relations between concepts. The newly formed conceptual domains can be a suitable basis for quantitative evaluation analysis in conceptual relations.
Willis, C.; Losee, R.M.: ¬A random walk on an ontology : using thesaurus structure for automatic subject indexing (2013) 0.00
```
0.004701262 = product of:
  0.009402524 = sum of:
    0.009402524 = product of:
      0.018805047 = sum of:
        0.018805047 = weight(_text_:research in 1016) [ClassicSimilarity], result of:
          0.018805047 = score(doc=1016,freq=2.0), product of:
            0.1491455 = queryWeight, product of:
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.05227703 = queryNorm
            0.12608525 = fieldWeight in 1016, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.8529835 = idf(docFreq=6931, maxDocs=44218)
              0.03125 = fieldNorm(doc=1016)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Relationships between terms and features are an essential component of thesauri, ontologies, and a range of controlled vocabularies. In this article, we describe ways to identify important concepts in documents using the relationships in a thesaurus or other vocabulary structures. We introduce a methodology for the analysis and modeling of the indexing process based on a weighted random walk algorithm. The primary goal of this research is the analysis of the contribution of thesaurus structure to the indexing process. The resulting models are evaluated in the context of automatic subject indexing using four collections of documents pre-indexed with 4 different thesauri (AGROVOC [UN Food and Agriculture Organization], high-energy physics taxonomy [HEP], National Agricultural Library Thesaurus [NALT], and medical subject headings [MeSH]). We also introduce a thesaurus-centric matching algorithm intended to improve the quality of candidate concepts. In all cases, the weighted random walk improves automatic indexing performance over matching alone with an increase in average precision (AP) of 9% for HEP, 11% for MeSH, 35% for NALT, and 37% for AGROVOC. The results of the analysis support our hypothesis that subject indexing is in part a browsing process, and that using the vocabulary and its structure in a thesaurus contributes to the indexing process. The amount that the vocabulary structure contributes was found to differ among the 4 thesauri, possibly due to the vocabulary used in the corresponding thesauri and the structural relationships between the terms. Each of the thesauri and the manual indexing associated with it is characterized using the methods developed here.

Search (16 results, page 1 of 1)

Authors

Languages

Themes