Search (155 results, page 1 of 8)

Nielsen, M.L.: Thesaurus construction : key issues and selected readings (2004) 0.03

0.033408046 = product of:
  0.05011207 = sum of:
    0.028516391 = weight(_text_:to in 5006) [ClassicSimilarity], result of:
      0.028516391 = score(doc=5006,freq=12.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.34441712 = fieldWeight in 5006, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5006)
    0.021595677 = product of:
      0.043191355 = sum of:
        0.043191355 = weight(_text_:22 in 5006) [ClassicSimilarity], result of:
          0.043191355 = score(doc=5006,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.2708308 = fieldWeight in 5006, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5006)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The purpose of this selected bibliography is to introduce issues and problems in relation to thesaurus construction and to present a set of readings that may be used in practical thesaurus design. The concept of thesaurus is discussed, the purpose of the thesaurus and how the concept has evolved over the years according to new IR technologies. Different approaches to thesaurus construction are introduced, and readings dealing with specific problems and developments in the collection, formation and organisation of thesaurus concepts and terms are presented. Primarily manual construction methods are discussed, but the bibliography also refers to research about techniques for automatic thesaurus construction.
Date: 18. 5.2006 20:06:22

Crouch, C.J.: ¬An approach to the automatic construction of global thesauri (1990) 0.03

0.031751644 = product of:
  0.047627464 = sum of:
    0.026031785 = weight(_text_:to in 4042) [ClassicSimilarity], result of:
      0.026031785 = score(doc=4042,freq=10.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.3144084 = fieldWeight in 4042, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4042)
    0.021595677 = product of:
      0.043191355 = sum of:
        0.043191355 = weight(_text_:22 in 4042) [ClassicSimilarity], result of:
          0.043191355 = score(doc=4042,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.2708308 = fieldWeight in 4042, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4042)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The benefits of a well constructed thesaurus to an information retrieval system have long been recognised by both researchers and practitioners in the field. Examines both early and current approaches to automatic thesaurus construction and describes an approach to the automatic generation of global thesauri based on the term discrimination value model of Salton Yang, and Yu and on an appropriate clustering algorithm. This method has been implemented and applied to 2 document collections. Preliminary results indicate that this method, which produces improvements in retrieval performance in excess of 10 and 15% in the test collections, is viable and worthy of continued investigation.
Date: 22. 4.1996 3:39:53

Eastman, C.M.: Overlaps in postings to thesaurus terms : a preliminary study (1988) 0.03

0.027839875 = product of:
  0.04175981 = sum of:
    0.020164136 = weight(_text_:to in 3555) [ClassicSimilarity], result of:
      0.020164136 = score(doc=3555,freq=6.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.24353972 = fieldWeight in 3555, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3555)
    0.021595677 = product of:
      0.043191355 = sum of:
        0.043191355 = weight(_text_:22 in 3555) [ClassicSimilarity], result of:
          0.043191355 = score(doc=3555,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.2708308 = fieldWeight in 3555, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3555)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The patterns of overlap between terms which are closely related in a thesaurus are considered. The relationships considered are parent/child, in which one term is a broader term of the other, and sibling in which to 2 terms share the same broader term. The patterns of overlap observed in the MeSH thesaurus with respect to selected MEDLINE postings are examined. The implications of the overlap patterns are discussed, in particular, the impact of the overlap patterns on the potential effectiveness of a proposed algorithm for handling negation is considered.
Date: 25.12.1995 22:52:34

Schneider, J.W.; Borlund, P.: ¬A bibliometric-based semiautomatic approach to identification of candidate thesaurus terms : parsing and filtering of noun phrases from citation contexts (2005) 0.03

0.027839875 = product of:
  0.04175981 = sum of:
    0.020164136 = weight(_text_:to in 156) [ClassicSimilarity], result of:
      0.020164136 = score(doc=156,freq=6.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.24353972 = fieldWeight in 156, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0546875 = fieldNorm(doc=156)
    0.021595677 = product of:
      0.043191355 = sum of:
        0.043191355 = weight(_text_:22 in 156) [ClassicSimilarity], result of:
          0.043191355 = score(doc=156,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.2708308 = fieldWeight in 156, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=156)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The present study investigates the ability of a bibliometric based semi-automatic method to select candidate thesaurus terms from citation contexts. The method consists of document co-citation analysis, citation context analysis, and noun phrase parsing. The investigation is carried out within the specialty area of periodontology. The results clearly demonstrate that the method is able to select important candidate thesaurus terms within the chosen specialty area.
Date: 8. 3.2007 19:55:22

Dextre Clarke, S.G.: Thesaural relationships (2001) 0.03

0.025373083 = product of:
  0.038059622 = sum of:
    0.016463947 = weight(_text_:to in 1149) [ClassicSimilarity], result of:
      0.016463947 = score(doc=1149,freq=4.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.19884932 = fieldWeight in 1149, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1149)
    0.021595677 = product of:
      0.043191355 = sum of:
        0.043191355 = weight(_text_:22 in 1149) [ClassicSimilarity], result of:
          0.043191355 = score(doc=1149,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.2708308 = fieldWeight in 1149, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1149)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: A thesaurus in the controlled vocabulary environment is a tool designed to support effective infonnation retrieval (IR) by guiding indexers and searchers consistently to choose the same terms for expressing a given concept or combination of concepts. Terms in the thesaurus are linked by relationships of three well-known types: equivalence, hierarchical, and associative. The functions and properties of these three basic types and some subcategories are described, as well as some additional relationship types conunonly found in thesauri. Progressive automation of IR processes and the capability for simultaneous searching of vast networked resources are creating some pressures for change in the categorization and consistency of relationships.
Date: 22. 9.2007 15:45:57

Lambert, N.: Of thesauri and computers : reflections on the need for thesauri (1995) 0.03

0.02532377 = product of:
  0.037985653 = sum of:
    0.013304878 = weight(_text_:to in 3734) [ClassicSimilarity], result of:
      0.013304878 = score(doc=3734,freq=2.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.16069452 = fieldWeight in 3734, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0625 = fieldNorm(doc=3734)
    0.024680775 = product of:
      0.04936155 = sum of:
        0.04936155 = weight(_text_:22 in 3734) [ClassicSimilarity], result of:
          0.04936155 = score(doc=3734,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.30952093 = fieldWeight in 3734, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=3734)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: Most indexed databases now include their thesauri and/or coding in their bibliographic files, searchable at the databases' online connect rates. Assesses the searchability of these on the different hosts. Thesauri and classifications are also available as diskette or CD-ROM products. Describes a number of these, highlighting the diskette thesaurus from IFI/Plenum Data for its flexible databases, the CLAIMS Uniterm and Comprehensive indexes to US chemical patents
Source: Searcher. 3(1995) no.8, S.18-22

Aitchison, J.; Dextre Clarke, S.G.: ¬The Thesaurus : a historical viewpoint, with a look to the future (2004) 0.02

0.023862753 = product of:
  0.035794128 = sum of:
    0.017283546 = weight(_text_:to in 5005) [ClassicSimilarity], result of:
      0.017283546 = score(doc=5005,freq=6.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.20874833 = fieldWeight in 5005, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.046875 = fieldNorm(doc=5005)
    0.018510582 = product of:
      0.037021164 = sum of:
        0.037021164 = weight(_text_:22 in 5005) [ClassicSimilarity], result of:
          0.037021164 = score(doc=5005,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.23214069 = fieldWeight in 5005, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=5005)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: After a period of experiment and evolution in the 1950s and 1960s, a fairly standard format for thesauri was established with the publication of the influential Thesaurus of Engineering and Scientific Terms (TEST) in 1967. This and other early thesauri relied primarily an the presentation of terms in alphabetical order. The value of a classified presentation was subsequently realised, and in particular the technique of facet analysis has profoundly influenced thesaurus evolution. Thesaurofacet and the Art & Architecture Thesaurus have acted as models for two distinct breeds of thesaurus using faceted displays of terms. As of the 1990s, the expansion of end-user access to vast networked resources is imposing further requirements an the style and structure of controlled vocabularies. The international standards for thesauri, first conceived in a print-based era, are badly in need of updating. Work is in hand in the UK and the USA to revise and develop standards in support of electronic thesauri.
Date: 22. 9.2007 15:46:13

Bagheri, M.: Development of thesauri in Iran (2006) 0.02

0.023862753 = product of:
  0.035794128 = sum of:
    0.017283546 = weight(_text_:to in 260) [ClassicSimilarity], result of:
      0.017283546 = score(doc=260,freq=6.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.20874833 = fieldWeight in 260, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.046875 = fieldNorm(doc=260)
    0.018510582 = product of:
      0.037021164 = sum of:
        0.037021164 = weight(_text_:22 in 260) [ClassicSimilarity], result of:
          0.037021164 = score(doc=260,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.23214069 = fieldWeight in 260, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=260)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The need for Persian thesauri became apparent during the late 1960s with the advent of documentation centres in Iran. The first Persian controlled vocabulary was published by IRANDOC in 1977. Other centres worked on translations of existing thesauri, but it was soon realised that these efforts did not meet the needs of the centres. After the Islamic revolution in 1979, the foundation of new centres intensified the need for Persian thesauri, especially in the fields of history and government documents. Also, during the Iran-Iraq war, Iranian research centres produced reports in scientific and technical fields, both to support military requirements and to meet society's needs. In order to provide a comprehensive thesaurus, the Council of Scientific Research of Iran approved a project for the compilation of such a work. Nowadays, 12 Persian thesauri are available and others are being prepared, based on the literary corpus and conformity with characteristics of Iranian culture.
Source: Indexer. 25(2006) no.1, S.19-22

Milstead, J.L.: Thesauri in a full-text world (1998) 0.02

0.022679746 = product of:
  0.03401962 = sum of:
    0.018594133 = weight(_text_:to in 2337) [ClassicSimilarity], result of:
      0.018594133 = score(doc=2337,freq=10.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.22457743 = fieldWeight in 2337, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2337)
    0.015425485 = product of:
      0.03085097 = sum of:
        0.03085097 = weight(_text_:22 in 2337) [ClassicSimilarity], result of:
          0.03085097 = score(doc=2337,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.19345059 = fieldWeight in 2337, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2337)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: Despite early claims to the contemporary, thesauri continue to find use as access tools for information in the full-text environment. Their mode of use is changing, but this change actually represents an expansion rather than a contrdiction of their utility. Thesauri and similar vocabulary tools can complement full-text access by aiding users in focusing their searches, by supplementing the linguistic analysis of the text search engine, and even by serving as one of the tools used by the linguistic engine for its analysis. While human indexing contunues to be used for many databases, the trend is to increase the use of machine aids for this purpose. All machine-aided indexing (MAI) systems rely on thesauri as the basis for term selection. In the 21st century, the balance of effort between human and machine will change at both input and output, but thesauri will continue to play an important role for the foreseeable future
Date: 22. 9.1997 19:16:05

Busch, J.A.: Building and accessing vocabulary resources for networked resource discovery and navigation (1998) 0.02

0.022158299 = product of:
  0.033237446 = sum of:
    0.011641769 = weight(_text_:to in 2346) [ClassicSimilarity], result of:
      0.011641769 = score(doc=2346,freq=2.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.14060771 = fieldWeight in 2346, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2346)
    0.021595677 = product of:
      0.043191355 = sum of:
        0.043191355 = weight(_text_:22 in 2346) [ClassicSimilarity], result of:
          0.043191355 = score(doc=2346,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.2708308 = fieldWeight in 2346, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2346)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The Getty has a lenghty history in the research and development of thesauri and other structured vocabulary tools to make the use and exchange of electronic information easier
Date: 22. 9.1997 19:16:05

Schmitz-Esser, W.: Thesaurus and beyond : an advanced formula for linguistic engineering and information retrieval (1999) 0.02

0.02174836 = product of:
  0.03262254 = sum of:
    0.014111955 = weight(_text_:to in 4116) [ClassicSimilarity], result of:
      0.014111955 = score(doc=4116,freq=4.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.17044228 = fieldWeight in 4116, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.046875 = fieldNorm(doc=4116)
    0.018510582 = product of:
      0.037021164 = sum of:
        0.037021164 = weight(_text_:22 in 4116) [ClassicSimilarity], result of:
          0.037021164 = score(doc=4116,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.23214069 = fieldWeight in 4116, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=4116)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: This paper describes a proposal for a new approach to thesaurus design and construction that could have significant implicantions for change in the way multilingual thesauri are handled and integrated with each other. The formula presented here has its origin in the work of the German Thesaurus Committee and has had input from a number of scientists and practitioners int he field. The emphasis is on the various types of relationships found among concepts, notiions and universals in languages. These relationships are analysed and refined beyond the approach taken in existing thesauri. This proposal is very much at the discussion stage and the author invites the assistance of interested readers through criticisms, discussion and dialogue. Applications of the proposed thesaurus are included and the major goal of this proposal is to provide the basis for improved desing and integration of multilingual thesauri
Source: Knowledge organization. 26(1999) no.1, S.10-22

Mooers, C.N.: ¬The indexing language of an information retrieval system (1985) 0.02
```
0.021190222 = product of:
  0.03178533 = sum of:
    0.020987494 = weight(_text_:to in 3644) [ClassicSimilarity], result of:
      0.020987494 = score(doc=3644,freq=26.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.25348413 = fieldWeight in 3644, product of:
          5.0990195 = tf(freq=26.0), with freq of:
            26.0 = termFreq=26.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.02734375 = fieldNorm(doc=3644)
    0.010797839 = product of:
      0.021595677 = sum of:
        0.021595677 = weight(_text_:22 in 3644) [ClassicSimilarity], result of:
          0.021595677 = score(doc=3644,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.1354154 = fieldWeight in 3644, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=3644)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

Calvin Mooers' work toward the resolution of the problem of ambiguity in indexing went unrecognized for years. At the time he introduced the "descriptor" - a term with a very distinct meaning-indexers were, for the most part, taking index terms directly from the document, without either rationalizing them with context or normalizing them with some kind of classification. It is ironic that Mooers' term came to be attached to the popular but unsophisticated indexing methods which he was trying to root out. Simply expressed, what Mooers did was to take the dictionary definitions of terms and redefine them so clearly that they could not be used in any context except that provided by the new definition. He did, at great pains, construct such meanings for over four hundred words; disambiguation and specificity were sought after and found for these words. He proposed that all indexers adopt this method so that when the index supplied a term, it also supplied the exact meaning for that term as used in the indexed document. The same term used differently in another document would be defined differently and possibly renamed to avoid ambiguity. The disambiguation was achieved by using unabridged dictionaries and other sources of defining terminology. In practice, this tends to produce circularity in definition, that is, word A refers to word B which refers to word C which refers to word A. It was necessary, therefore, to break this chain by creating a new, definitive meaning for each word. Eventually, means such as those used by Austin (q.v.) for PRECIS achieved the same purpose, but by much more complex means than just creating a unique definition of each term. Mooers, however, was probably the first to realize how confusing undefined terminology could be. Early automatic indexers dealt with distinct disciplines and, as long as they did not stray beyond disciplinary boundaries, a quick and dirty keyword approach was satisfactory. The trouble came when attempts were made to make a combined index for two or more distinct disciplines. A number of processes have since been developed, mostly involving tagging of some kind or use of strings. Mooers' solution has rarely been considered seriously and probably would be extremely difficult to apply now because of so much interdisciplinarity. But for a specific, weIl defined field, it is still weIl worth considering. Mooers received training in mathematics and physics from the University of Minnesota and the Massachusetts Institute of Technology. He was the founder of Zator Company, which developed and marketed a coded card information retrieval system, and of Rockford Research, Inc., which engages in research in information science. He is the inventor of the TRAC computer language.

Footnote

Original in: Information retrieval today: papers presented at an Institute conducted by the Library School and the Center for Continuation Study, University of Minnesota, Sept. 19-22, 1962. Ed. by Wesley Simonton. Minneapolis, Minn.: The Center, 1963. S.21-36.

Byrne, C.C.; McCracken, S.A.: ¬An adaptive thesaurus employing semantic distance, relational inheritance and nominal compound interpretation for linguistic support of information retrieval (1999) 0.01

0.012340388 = product of:
  0.037021164 = sum of:
    0.037021164 = product of:
      0.07404233 = sum of:
        0.07404233 = weight(_text_:22 in 4483) [ClassicSimilarity], result of:
          0.07404233 = score(doc=4483,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.46428138 = fieldWeight in 4483, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=4483)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 15. 3.2000 10:22:37

Qin, J.; Paling, S.: Converting a controlled vocabulary into an ontology : the case of GEM (2001) 0.01

0.012340388 = product of:
  0.037021164 = sum of:
    0.037021164 = product of:
      0.07404233 = sum of:
        0.07404233 = weight(_text_:22 in 3895) [ClassicSimilarity], result of:
          0.07404233 = score(doc=3895,freq=2.0), product of:
            0.15947726 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045541126 = queryNorm
            0.46428138 = fieldWeight in 3895, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=3895)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 24. 8.2005 19:20:22

Doerr, M.: Semantic problems of thesaurus mapping (2001) 0.01
```
0.011087399 = product of:
  0.033262197 = sum of:
    0.033262197 = weight(_text_:to in 5902) [ClassicSimilarity], result of:
      0.033262197 = score(doc=5902,freq=32.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.40173632 = fieldWeight in 5902, product of:
          5.656854 = tf(freq=32.0), with freq of:
            32.0 = termFreq=32.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5902)
  0.33333334 = coord(1/3)
```
Abstract

With networked information access to heterogeneous data sources, the problem of terminology provision and interoperability of controlled vocabulary schemes such as thesauri becomes increasingly urgent. Solutions are needed to improve the performance of full-text retrieval systems and to guide the design of controlled terminology schemes for use in structured data, including metadata. Thesauri are created in different languages, with different scope and points of view and at different levels of abstraction and detail, to accomodate access to a specific group of collections. In any wider search accessing distributed collections, the user would like to start with familiar terminology and let the system find out the correspondences to other terminologies in order to retrieve equivalent results from all addressed collections. This paper investigates possible semantic differences that may hinder the unambiguous mapping and transition from one thesaurus to another. It focusses on the differences of meaning of terms and their relations as intended by their creators for indexing and querying a specific collection, in contrast to methods investigating the statistical relevance of terms for objects in a collection. It develops a notion of optimal mapping, paying particular attention to the intellectual quality of mappings between terms from different vocabularies and to problems of polysemy. Proposals are made to limit the vagueness introduced by the transition from one vocabulary to another. The paper shows ways in which thesaurus creators can improve their methodology to meet the challenges of networked access of distributed collections created under varying conditions. For system implementers, the discussion will lead to a better understanding of the complexity of the problem
Hudon, M.: Multilingual thesaurus construction : integrating the views of different cultures in one gateway to knowledge and concepts (1997) 0.01
```
0.009505464 = product of:
  0.028516391 = sum of:
    0.028516391 = weight(_text_:to in 1804) [ClassicSimilarity], result of:
      0.028516391 = score(doc=1804,freq=12.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.34441712 = fieldWeight in 1804, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1804)
  0.33333334 = coord(1/3)
```
Abstract

Based on the premise that in a multilingual thesaurus all languages are equal, reviews the options and solutions offered by the guidelines to the developer of specialized thesauri. Introduces other problems of a sociocultural, and even of a truly political nature, which are a prominent features in the daily life of the thesaurus designer but with which the theory and the guidelines do not deal very well. Focuses in turn on semantic, managerial, and technological aspects of multilingual thesaurus construction, from the perspective of giving equal treatment to all languages involved

Footnote

Contribution to a special issue devoted to papers read at the 1996 Electronic Access to Fiction research seminar at Copenhagen, Denmark
Nielsen, M.L.: ¬A framework for work task based thesaurus design (2001) 0.01
```
0.00940797 = product of:
  0.02822391 = sum of:
    0.02822391 = weight(_text_:to in 4477) [ClassicSimilarity], result of:
      0.02822391 = score(doc=4477,freq=16.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.34088457 = fieldWeight in 4477, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.046875 = fieldNorm(doc=4477)
  0.33333334 = coord(1/3)
```
Abstract

Design and construction of indexing languages require thorough knowledge and understanding of the information environment. This empirical study investigated a mixed set of methods (group interviews, recollection of information needs and word association tests to collect data; content analysis and discourse analysis to analyse data) to evaluate whether these methods collected the data needed for work domain oriented thesaurus design. The findings showed that the study methods together provided the domain knowledge needed to define the role of the thesaurus and design its content and structure. The study was carried out from a person-insituation perspective. The findings reflected the information environment and made it possible to develop a thesaurus according to the characteristics of the work domain. It seemed more difficult to capture the needs of the individual user and adapt the thesaurus to individual characteristics.
Youlin, Z.; Baptista Nunes, J.M.; Zhonghua, D.: Construction and evolution of a Chinese Information Science and Information Service (CIS&IS) onto-thesaurus (2014) 0.01
```
0.00940797 = product of:
  0.02822391 = sum of:
    0.02822391 = weight(_text_:to in 1376) [ClassicSimilarity], result of:
      0.02822391 = score(doc=1376,freq=16.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.34088457 = fieldWeight in 1376, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.046875 = fieldNorm(doc=1376)
  0.33333334 = coord(1/3)
```
Abstract

Thesauri are the most important tools for information and knowledge organization, and they undergo regular improvements according to the rapid development of new requirements and affordances of emerging information techniques. This paper attempts to integrate ontology into the conceptual organization scheme of thesauri and proposes a new solution to extend the functionality of thesauri based on ontological features, which is termed here as an onto-thesaurus. In this study, a prototype system named the Chinese Information Science and Information Service onto-thesaurus system (CIS&IS), was developed to analyze ontothesaurus with the category of information science and information service in the Chinese Topic Classification Dictionary with a two-stage approach. The first stage aims to define and construct the onto-thesaurus. The second stage aims to realize the evolution function of onto-thesaurus. The main purpose of this system was to achieve the function of self-learning and auto-evolution and to enable a much more effective conceptual retrieval by the newly proposed onto-thesaurus.
Little, K.: Constructing a thesaurus of environmental protection terms (1993) 0.01
```
0.008869919 = product of:
  0.026609756 = sum of:
    0.026609756 = weight(_text_:to in 8200) [ClassicSimilarity], result of:
      0.026609756 = score(doc=8200,freq=8.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.32138905 = fieldWeight in 8200, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0625 = fieldNorm(doc=8200)
  0.33333334 = coord(1/3)
```
Abstract

Discusses the background to and the process of thesaurus construction in an interdisciplinary soft science subject, environmental protection, for a specific government organization, the Environmental Protection Authority, Western Australia. Discusses and recommends methods of construction and develops a decision making checklists to aid in managing the task. Concentrates on the difficulties of creating a thesaurus in the real world compared to textual advice on thesaurus construction

Footnote

Paper presented at the 10th National Cataloguing Conference on Subject to change: subject access and the role of the cataloguer, Freemantle, Western Australia, 4-6 Nov 93
Diaz, I.: Semi-automatic construction of thesaurus applying domain analysis techniques (1998) 0.01
```
0.008869919 = product of:
  0.026609756 = sum of:
    0.026609756 = weight(_text_:to in 3744) [ClassicSimilarity], result of:
      0.026609756 = score(doc=3744,freq=8.0), product of:
        0.08279609 = queryWeight, product of:
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.045541126 = queryNorm
        0.32138905 = fieldWeight in 3744, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.818051 = idf(docFreq=19512, maxDocs=44218)
          0.0625 = fieldNorm(doc=3744)
  0.33333334 = coord(1/3)
```
Abstract

Describes a specific application of domain analysis to the construction of thesauri to exploit domain analysis' ability to construct valid domain representations and determine fuzzy limits that normally define specific domains. The system employs a structure, called a Software Thesaurus (developed from a descriptor thesaurus), as a repository to store the information regarding specific domains. The domain representation is constructued semi automatically and can be used as a means of semiautomatic thesaurus generation

Search (155 results, page 1 of 8)

Authors

Years

Types

Themes