Document (#40088)

Author
Losee, R.
Title
Thesaurus structure, descriptive parameters, and scale
Source
Journal of the Association for Information Science and Technology. 67(2016) no.9, S.2156-2165
Year
2016
Abstract
A thesaurus contains a set of terms or features that may be used to represent recorded information, including prose documents or scientific data sets. The focus of this work is on the basic structural nature of a thesaurus itself, not on how people develop a thesaurus or how a thesaurus effects retrieval performance. Thesauri in this research are automatically developed in a simulation from sets of randomly or exhaustively generated documents. Each thesaurus is generated by the Thesaurus Generator software from a set of several hundred documents, and thousands of different document sets are used as input to the Thesaurus Generator, producing thousands of thesauri. Thus, thousands of thesauri are generated for each data point in accompanying graphs. The characteristics of this large number of thesauri are studied so that the relationships between thesaurus parameters can be determined. Some rules governing these relationships are suggested, addressing factors such as tree height and width, number of tree roots in thesauri, and number of terms available for the vocabulary. How these parameters scale as vocabularies grow is addressed. These results apply to various information systems that contain features with hierarchical relationships, including many thesauri and ontologies.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23544/full.
Theme
Konzeption und Anwendung des Prinzips Thesaurus

Similar documents (author)

  1. Losee, R.M.: ¬A Gray code based ordering for documents on shelves : classification for browsing and retrieval (1992) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 2335) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 2335, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=2335)
    
  2. Losee, R.M.: ¬The relative shelf location of circulated books : a study of classification, users, and browsing (1993) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 4485) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 4485, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=4485)
    
  3. Losee, R.M.: Seven fundamental questions for the science of library classification (1993) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 4508) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 4508, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=4508)
    
  4. Losee, R.M.: Term dependence : truncating the Bahadur Lazarsfeld expansion (1994) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 7390) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 7390, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=7390)
    
  5. Losee, R.M.: Upper bounds for retrieval performance and their user measuring performance and generating optimal queries : can it get any better than this? (1994) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 7418) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 7418, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=7418)
    

Similar documents (content)

  1. Srinivasan, P.: Thesaurus construction (1992) 0.27
    0.2670631 = sum of:
      0.2670631 = product of:
        0.95379686 = sum of:
          0.020981299 = weight(abstract_txt:terms in 3504) [ClassicSimilarity], result of:
            0.020981299 = score(doc=3504,freq=1.0), product of:
              0.06641184 = queryWeight, product of:
                1.0837425 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.015153835 = queryNorm
              0.3159271 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.022168463 = weight(abstract_txt:each in 3504) [ClassicSimilarity], result of:
            0.022168463 = score(doc=3504,freq=1.0), product of:
              0.06889393 = queryWeight, product of:
                1.1038089 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.015153835 = queryNorm
              0.32177672 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.029673308 = weight(abstract_txt:features in 3504) [ClassicSimilarity], result of:
            0.029673308 = score(doc=3504,freq=1.0), product of:
              0.08367606 = queryWeight, product of:
                1.2164773 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.015153835 = queryNorm
              0.35462123 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.021810012 = weight(abstract_txt:these in 3504) [ClassicSimilarity], result of:
            0.021810012 = score(doc=3504,freq=2.0), product of:
              0.061917722 = queryWeight, product of:
                1.2816118 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.015153835 = queryNorm
              0.35224184 = fieldWeight in 3504, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.03331476 = weight(abstract_txt:documents in 3504) [ClassicSimilarity], result of:
            0.03331476 = score(doc=3504,freq=1.0), product of:
              0.10346945 = queryWeight, product of:
                1.6567427 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.015153835 = queryNorm
              0.32197678 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.30505145 = weight(abstract_txt:thesauri in 3504) [ClassicSimilarity], result of:
            0.30505145 = score(doc=3504,freq=4.0), product of:
              0.35944003 = queryWeight, product of:
                4.36694 = boost
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.015153835 = queryNorm
              0.84868526 = fieldWeight in 3504, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.52079755 = weight(abstract_txt:thesaurus in 3504) [ClassicSimilarity], result of:
            0.52079755 = score(doc=3504,freq=7.0), product of:
              0.4877247 = queryWeight, product of:
                6.2301283 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.015153835 = queryNorm
              1.0678105 = fieldWeight in 3504, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
        0.28 = coord(7/25)
    
  2. Rada, R.: Connecting and evaluating thesauri : issues and cases (1987) 0.26
    0.260246 = sum of:
      0.260246 = product of:
        1.0843583 = sum of:
          0.035606444 = weight(abstract_txt:terms in 823) [ClassicSimilarity], result of:
            0.035606444 = score(doc=823,freq=2.0), product of:
              0.06641184 = queryWeight, product of:
                1.0837425 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.015153835 = queryNorm
              0.53614604 = fieldWeight in 823, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=823)
          0.01850641 = weight(abstract_txt:these in 823) [ClassicSimilarity], result of:
            0.01850641 = score(doc=823,freq=1.0), product of:
              0.061917722 = queryWeight, product of:
                1.2816118 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.015153835 = queryNorm
              0.29888713 = fieldWeight in 823, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.09375 = fieldNorm(doc=823)
          0.03997771 = weight(abstract_txt:documents in 823) [ClassicSimilarity], result of:
            0.03997771 = score(doc=823,freq=1.0), product of:
              0.10346945 = queryWeight, product of:
                1.6567427 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.015153835 = queryNorm
              0.38637212 = fieldWeight in 823, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=823)
          0.063447855 = weight(abstract_txt:relationships in 823) [ClassicSimilarity], result of:
            0.063447855 = score(doc=823,freq=1.0), product of:
              0.14078125 = queryWeight, product of:
                1.9325085 = boost
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.015153835 = queryNorm
              0.45068398 = fieldWeight in 823, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.09375 = fieldNorm(doc=823)
          0.51768947 = weight(abstract_txt:thesauri in 823) [ClassicSimilarity], result of:
            0.51768947 = score(doc=823,freq=8.0), product of:
              0.35944003 = queryWeight, product of:
                4.36694 = boost
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.015153835 = queryNorm
              1.4402666 = fieldWeight in 823, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.09375 = fieldNorm(doc=823)
          0.40913048 = weight(abstract_txt:thesaurus in 823) [ClassicSimilarity], result of:
            0.40913048 = score(doc=823,freq=3.0), product of:
              0.4877247 = queryWeight, product of:
                6.2301283 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.015153835 = queryNorm
              0.8388554 = fieldWeight in 823, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.09375 = fieldNorm(doc=823)
        0.24 = coord(6/25)
    
  3. Aitchison, J.: ¬A classification as a source for a thesaurus : the bibliographic classification of H.E. Bliss as a source of thesaurus terms and structure (1986) 0.21
    0.21069723 = sum of:
      0.21069723 = product of:
        0.8779052 = sum of:
          0.029672036 = weight(abstract_txt:terms in 1569) [ClassicSimilarity], result of:
            0.029672036 = score(doc=1569,freq=2.0), product of:
              0.06641184 = queryWeight, product of:
                1.0837425 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.015153835 = queryNorm
              0.44678837 = fieldWeight in 1569, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1569)
          0.015422009 = weight(abstract_txt:these in 1569) [ClassicSimilarity], result of:
            0.015422009 = score(doc=1569,freq=1.0), product of:
              0.061917722 = queryWeight, product of:
                1.2816118 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.015153835 = queryNorm
              0.24907261 = fieldWeight in 1569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.078125 = fieldNorm(doc=1569)
          0.03359067 = weight(abstract_txt:number in 1569) [ClassicSimilarity], result of:
            0.03359067 = score(doc=1569,freq=1.0), product of:
              0.10403995 = queryWeight, product of:
                1.6613039 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.015153835 = queryNorm
              0.3228632 = fieldWeight in 1569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.078125 = fieldNorm(doc=1569)
          0.052873217 = weight(abstract_txt:relationships in 1569) [ClassicSimilarity], result of:
            0.052873217 = score(doc=1569,freq=1.0), product of:
              0.14078125 = queryWeight, product of:
                1.9325085 = boost
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.015153835 = queryNorm
              0.37557 = fieldWeight in 1569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.078125 = fieldNorm(doc=1569)
          0.26418233 = weight(abstract_txt:thesauri in 1569) [ClassicSimilarity], result of:
            0.26418233 = score(doc=1569,freq=3.0), product of:
              0.35944003 = queryWeight, product of:
                4.36694 = boost
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.015153835 = queryNorm
              0.734983 = fieldWeight in 1569, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.078125 = fieldNorm(doc=1569)
          0.48216492 = weight(abstract_txt:thesaurus in 1569) [ClassicSimilarity], result of:
            0.48216492 = score(doc=1569,freq=6.0), product of:
              0.4877247 = queryWeight, product of:
                6.2301283 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.015153835 = queryNorm
              0.9886006 = fieldWeight in 1569, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.078125 = fieldNorm(doc=1569)
        0.24 = coord(6/25)
    
  4. Evens, M.: Thesaural relations in information retrieval (2002) 0.20
    0.19839516 = sum of:
      0.19839516 = product of:
        0.8266465 = sum of:
          0.043608807 = weight(abstract_txt:terms in 1201) [ClassicSimilarity], result of:
            0.043608807 = score(doc=1201,freq=3.0), product of:
              0.06641184 = queryWeight, product of:
                1.0837425 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.015153835 = queryNorm
              0.6566421 = fieldWeight in 1201, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
          0.01850641 = weight(abstract_txt:these in 1201) [ClassicSimilarity], result of:
            0.01850641 = score(doc=1201,freq=1.0), product of:
              0.061917722 = queryWeight, product of:
                1.2816118 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.015153835 = queryNorm
              0.29888713 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
          0.03997771 = weight(abstract_txt:documents in 1201) [ClassicSimilarity], result of:
            0.03997771 = score(doc=1201,freq=1.0), product of:
              0.10346945 = queryWeight, product of:
                1.6567427 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.015153835 = queryNorm
              0.38637212 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
          0.22949725 = weight(abstract_txt:thousands in 1201) [ClassicSimilarity], result of:
            0.22949725 = score(doc=1201,freq=1.0), product of:
              0.33173034 = queryWeight, product of:
                2.9664812 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.015153835 = queryNorm
              0.6918187 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
          0.25884473 = weight(abstract_txt:thesauri in 1201) [ClassicSimilarity], result of:
            0.25884473 = score(doc=1201,freq=2.0), product of:
              0.35944003 = queryWeight, product of:
                4.36694 = boost
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.015153835 = queryNorm
              0.7201333 = fieldWeight in 1201, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
          0.23621158 = weight(abstract_txt:thesaurus in 1201) [ClassicSimilarity], result of:
            0.23621158 = score(doc=1201,freq=1.0), product of:
              0.4877247 = queryWeight, product of:
                6.2301283 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.015153835 = queryNorm
              0.48431337 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
        0.24 = coord(6/25)
    
  5. Willis, C.; Losee, R.M.: ¬A random walk on an ontology : using thesaurus structure for automatic subject indexing (2013) 0.20
    0.19627818 = sum of:
      0.19627818 = product of:
        0.70099354 = sum of:
          0.020770425 = weight(abstract_txt:terms in 1016) [ClassicSimilarity], result of:
            0.020770425 = score(doc=1016,freq=2.0), product of:
              0.06641184 = queryWeight, product of:
                1.0837425 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.015153835 = queryNorm
              0.31275186 = fieldWeight in 1016, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1016)
          0.015517924 = weight(abstract_txt:each in 1016) [ClassicSimilarity], result of:
            0.015517924 = score(doc=1016,freq=1.0), product of:
              0.06889393 = queryWeight, product of:
                1.1038089 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.015153835 = queryNorm
              0.2252437 = fieldWeight in 1016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1016)
          0.020771315 = weight(abstract_txt:features in 1016) [ClassicSimilarity], result of:
            0.020771315 = score(doc=1016,freq=1.0), product of:
              0.08367606 = queryWeight, product of:
                1.2164773 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.015153835 = queryNorm
              0.24823485 = fieldWeight in 1016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1016)
          0.032979928 = weight(abstract_txt:documents in 1016) [ClassicSimilarity], result of:
            0.032979928 = score(doc=1016,freq=2.0), product of:
              0.10346945 = queryWeight, product of:
                1.6567427 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.015153835 = queryNorm
              0.31874073 = fieldWeight in 1016, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1016)
          0.06410537 = weight(abstract_txt:relationships in 1016) [ClassicSimilarity], result of:
            0.06410537 = score(doc=1016,freq=3.0), product of:
              0.14078125 = queryWeight, product of:
                1.9325085 = boost
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.015153835 = queryNorm
              0.45535442 = fieldWeight in 1016, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1016)
          0.23874055 = weight(abstract_txt:thesauri in 1016) [ClassicSimilarity], result of:
            0.23874055 = score(doc=1016,freq=5.0), product of:
              0.35944003 = queryWeight, product of:
                4.36694 = boost
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.015153835 = queryNorm
              0.6642013 = fieldWeight in 1016, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1016)
          0.30810803 = weight(abstract_txt:thesaurus in 1016) [ClassicSimilarity], result of:
            0.30810803 = score(doc=1016,freq=5.0), product of:
              0.4877247 = queryWeight, product of:
                6.2301283 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.015153835 = queryNorm
              0.6317253 = fieldWeight in 1016, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1016)
        0.28 = coord(7/25)