Document (#35697)

Author
Amirhosseini, M.
Title
Quantitative evaluation of the movement from complexity toward simplicity in the structure of thesaurus descriptors
Source
Malaysian journal of library and information science. 20(2015), no.3, p.47-62
Year
2015
Abstract
The concepts of simplicity and complexity play major roles in information storage and retrieval in knowledge organizations. This paper reports an investigation of these concepts in the structure of descriptors. The main purpose of simplicity is to decrease the number of words used to construct descriptors, since this affects semantic relations, recall and precision. ISO 25964 has affirmed the purpose of simplicity by requiring that compound terms be split into simpler concepts. This work aims to elaborate the standard methods of evaluation by providing a more detailed evaluation of descriptor structure and by identifying the factors that affect simplicity and complexity in the structure of thesaurus descriptors. The research population is taken from the descriptors of the Commonwealth Agricultural Bureaux (CAB) Thesaurus, the Persian Cultural Thesaurus (ASFA) and the Chemical Thesaurus. The research was conducted using statistical and content analysis methods. We propose a new quantitative approach, together with novel indicators and indices involving Simplicity and Factoring Ratios, to evaluate descriptor structure. The results will be useful for verification, selection and maintenance purposes in knowledge organizations, and the inquiry method can be further developed in the field of ontology evaluation.
Content
See also: https://www.researchgate.net/publication/285228543_Quantitative_evaluation_of_the_movement_from_complexity_toward_simplicity_in_the_structure_of_thesaurus_descriptors.
Theme
Design and application of the thesaurus principle
Knowledge representation

Similar documents (content)

  1. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.21
    0.20657073 = sum of:
      0.20657073 = product of:
        0.64553356 = sum of:
          0.053870745 = weight(abstract_txt:compound in 2176) [ClassicSimilarity], result of:
            0.053870745 = score(doc=2176,freq=2.0), product of:
              0.09286715 = queryWeight, product of:
                7.500458 = idf(docFreq=64, maxDocs=43254)
                0.012381531 = queryNorm
              0.58008397 = fieldWeight in 2176, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.500458 = idf(docFreq=64, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.016238932 = weight(abstract_txt:purpose in 2176) [ClassicSimilarity], result of:
            0.016238932 = score(doc=2176,freq=1.0), product of:
              0.06627531 = queryWeight, product of:
                1.1947026 = boost
                4.480408 = idf(docFreq=1331, maxDocs=43254)
                0.012381531 = queryNorm
              0.24502233 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.480408 = idf(docFreq=1331, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.012515424 = weight(abstract_txt:research in 2176) [ClassicSimilarity], result of:
            0.012515424 = score(doc=2176,freq=2.0), product of:
              0.05061714 = queryWeight, product of:
                1.2787285 = boost
                3.1970165 = idf(docFreq=4806, maxDocs=43254)
                0.012381531 = queryNorm
              0.24725664 = fieldWeight in 2176, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1970165 = idf(docFreq=4806, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.0064885 = weight(abstract_txt:this in 2176) [ClassicSimilarity], result of:
            0.0064885 = score(doc=2176,freq=1.0), product of:
              0.048796616 = queryWeight, product of:
                1.6208723 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.012381531 = queryNorm
              0.13297029 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.036572896 = weight(abstract_txt:concepts in 2176) [ClassicSimilarity], result of:
            0.036572896 = score(doc=2176,freq=2.0), product of:
              0.10346024 = queryWeight, product of:
                1.8281687 = boost
                4.570701 = idf(docFreq=1216, maxDocs=43254)
                0.012381531 = queryNorm
              0.35349712 = fieldWeight in 2176, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.570701 = idf(docFreq=1216, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.1317243 = weight(abstract_txt:thesaurus in 2176) [ClassicSimilarity], result of:
            0.1317243 = score(doc=2176,freq=7.0), product of:
              0.17622523 = queryWeight, product of:
                2.755072 = boost
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.012381531 = queryNorm
              0.747477 = fieldWeight in 2176, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.06513984 = weight(abstract_txt:structure in 2176) [ClassicSimilarity], result of:
            0.06513984 = score(doc=2176,freq=3.0), product of:
              0.15745297 = queryWeight, product of:
                2.9115844 = boost
                4.367643 = idf(docFreq=1490, maxDocs=43254)
                0.012381531 = queryNorm
              0.4137098 = fieldWeight in 2176, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.367643 = idf(docFreq=1490, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
          0.3229829 = weight(abstract_txt:descriptors in 2176) [ClassicSimilarity], result of:
            0.3229829 = score(doc=2176,freq=3.0), product of:
              0.51217157 = queryWeight, product of:
                6.213348 = boost
                6.657565 = idf(docFreq=150, maxDocs=43254)
                0.012381531 = queryNorm
              0.6306147 = fieldWeight in 2176, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.657565 = idf(docFreq=150, maxDocs=43254)
                0.0546875 = fieldNorm(doc=2176)
        0.32 = coord(8/25)
    
  2. Deokattey, S.; Dixit, D.K.; Bhanumurthy, K.: Co-word and facet analysis as tools for conceptualization in ontologies : a preliminary study of a micro-domain (2012) 0.17
    0.17490996 = sum of:
      0.17490996 = product of:
        0.72879153 = sum of:
          0.01895977 = weight(abstract_txt:method in 2306) [ClassicSimilarity], result of:
            0.01895977 = score(doc=2306,freq=1.0), product of:
              0.06722656 = queryWeight, product of:
                1.2032459 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.012381531 = queryNorm
              0.28202796 = fieldWeight in 2306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.0625 = fieldNorm(doc=2306)
          0.010113989 = weight(abstract_txt:research in 2306) [ClassicSimilarity], result of:
            0.010113989 = score(doc=2306,freq=1.0), product of:
              0.05061714 = queryWeight, product of:
                1.2787285 = boost
                3.1970165 = idf(docFreq=4806, maxDocs=43254)
                0.012381531 = queryNorm
              0.19981353 = fieldWeight in 2306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1970165 = idf(docFreq=4806, maxDocs=43254)
                0.0625 = fieldNorm(doc=2306)
          0.010487 = weight(abstract_txt:this in 2306) [ClassicSimilarity], result of:
            0.010487 = score(doc=2306,freq=2.0), product of:
              0.048796616 = queryWeight, product of:
                1.6208723 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.012381531 = queryNorm
              0.21491244 = fieldWeight in 2306, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=2306)
          0.029555364 = weight(abstract_txt:concepts in 2306) [ClassicSimilarity], result of:
            0.029555364 = score(doc=2306,freq=1.0), product of:
              0.10346024 = queryWeight, product of:
                1.8281687 = boost
                4.570701 = idf(docFreq=1216, maxDocs=43254)
                0.012381531 = queryNorm
              0.28566882 = fieldWeight in 2306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.570701 = idf(docFreq=1216, maxDocs=43254)
                0.0625 = fieldNorm(doc=2306)
          0.056899555 = weight(abstract_txt:thesaurus in 2306) [ClassicSimilarity], result of:
            0.056899555 = score(doc=2306,freq=1.0), product of:
              0.17622523 = queryWeight, product of:
                2.755072 = boost
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.012381531 = queryNorm
              0.32287973 = fieldWeight in 2306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.0625 = fieldNorm(doc=2306)
          0.6027759 = weight(abstract_txt:descriptors in 2306) [ClassicSimilarity], result of:
            0.6027759 = score(doc=2306,freq=8.0), product of:
              0.51217157 = queryWeight, product of:
                6.213348 = boost
                6.657565 = idf(docFreq=150, maxDocs=43254)
                0.012381531 = queryNorm
              1.1769023 = fieldWeight in 2306, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.657565 = idf(docFreq=150, maxDocs=43254)
                0.0625 = fieldNorm(doc=2306)
        0.24 = coord(6/25)
    
  3. Amirhosseini, M.: Theoretical base of quantitative evaluation of unity in a thesaurus term network based on Kant's epistemology (2010) 0.17
    0.16914539 = sum of:
      0.16914539 = product of:
        0.52857935 = sum of:
          0.108048156 = weight(abstract_txt:ratios in 855) [ClassicSimilarity], result of:
            0.108048156 = score(doc=855,freq=3.0), product of:
              0.11803569 = queryWeight, product of:
                1.1273937 = boost
                8.455969 = idf(docFreq=24, maxDocs=43254)
                0.012381531 = queryNorm
              0.9153855 = fieldWeight in 855, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.455969 = idf(docFreq=24, maxDocs=43254)
                0.0625 = fieldNorm(doc=855)
          0.010113989 = weight(abstract_txt:research in 855) [ClassicSimilarity], result of:
            0.010113989 = score(doc=855,freq=1.0), product of:
              0.05061714 = queryWeight, product of:
                1.2787285 = boost
                3.1970165 = idf(docFreq=4806, maxDocs=43254)
                0.012381531 = queryNorm
              0.19981353 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1970165 = idf(docFreq=4806, maxDocs=43254)
                0.0625 = fieldNorm(doc=855)
          0.110400625 = weight(abstract_txt:quantitative in 855) [ClassicSimilarity], result of:
            0.110400625 = score(doc=855,freq=6.0), product of:
              0.11974281 = queryWeight, product of:
                1.6058636 = boost
                6.022356 = idf(docFreq=284, maxDocs=43254)
                0.012381531 = queryNorm
              0.9219812 = fieldWeight in 855, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.022356 = idf(docFreq=284, maxDocs=43254)
                0.0625 = fieldNorm(doc=855)
          0.014830858 = weight(abstract_txt:this in 855) [ClassicSimilarity], result of:
            0.014830858 = score(doc=855,freq=4.0), product of:
              0.048796616 = queryWeight, product of:
                1.6208723 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.012381531 = queryNorm
              0.3039321 = fieldWeight in 855, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.0625 = fieldNorm(doc=855)
          0.029555364 = weight(abstract_txt:concepts in 855) [ClassicSimilarity], result of:
            0.029555364 = score(doc=855,freq=1.0), product of:
              0.10346024 = queryWeight, product of:
                1.8281687 = boost
                4.570701 = idf(docFreq=1216, maxDocs=43254)
                0.012381531 = queryNorm
              0.28566882 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.570701 = idf(docFreq=1216, maxDocs=43254)
                0.0625 = fieldNorm(doc=855)
          0.09885014 = weight(abstract_txt:evaluation in 855) [ClassicSimilarity], result of:
            0.09885014 = score(doc=855,freq=7.0), product of:
              0.13313156 = queryWeight, product of:
                2.3946357 = boost
                4.490216 = idf(docFreq=1318, maxDocs=43254)
                0.012381531 = queryNorm
              0.74249965 = fieldWeight in 855, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.490216 = idf(docFreq=1318, maxDocs=43254)
                0.0625 = fieldNorm(doc=855)
          0.11379911 = weight(abstract_txt:thesaurus in 855) [ClassicSimilarity], result of:
            0.11379911 = score(doc=855,freq=4.0), product of:
              0.17622523 = queryWeight, product of:
                2.755072 = boost
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.012381531 = queryNorm
              0.64575946 = fieldWeight in 855, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.0625 = fieldNorm(doc=855)
          0.042981148 = weight(abstract_txt:structure in 855) [ClassicSimilarity], result of:
            0.042981148 = score(doc=855,freq=1.0), product of:
              0.15745297 = queryWeight, product of:
                2.9115844 = boost
                4.367643 = idf(docFreq=1490, maxDocs=43254)
                0.012381531 = queryNorm
              0.27297768 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.367643 = idf(docFreq=1490, maxDocs=43254)
                0.0625 = fieldNorm(doc=855)
        0.32 = coord(8/25)
    
  4. Harter, S.P.; Cheng, Y.-R.: Colinked descriptors : improving vocabulary selection for end-user searching (1996) 0.14
    0.14324914 = sum of:
      0.14324914 = product of:
        0.71624565 = sum of:
          0.023699712 = weight(abstract_txt:method in 5285) [ClassicSimilarity], result of:
            0.023699712 = score(doc=5285,freq=1.0), product of:
              0.06722656 = queryWeight, product of:
                1.2032459 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.012381531 = queryNorm
              0.35253495 = fieldWeight in 5285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.078125 = fieldNorm(doc=5285)
          0.012642487 = weight(abstract_txt:research in 5285) [ClassicSimilarity], result of:
            0.012642487 = score(doc=5285,freq=1.0), product of:
              0.05061714 = queryWeight, product of:
                1.2787285 = boost
                3.1970165 = idf(docFreq=4806, maxDocs=43254)
                0.012381531 = queryNorm
              0.24976692 = fieldWeight in 5285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1970165 = idf(docFreq=4806, maxDocs=43254)
                0.078125 = fieldNorm(doc=5285)
          0.01310875 = weight(abstract_txt:this in 5285) [ClassicSimilarity], result of:
            0.01310875 = score(doc=5285,freq=2.0), product of:
              0.048796616 = queryWeight, product of:
                1.6208723 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.012381531 = queryNorm
              0.26864055 = fieldWeight in 5285, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.078125 = fieldNorm(doc=5285)
          0.07112445 = weight(abstract_txt:thesaurus in 5285) [ClassicSimilarity], result of:
            0.07112445 = score(doc=5285,freq=1.0), product of:
              0.17622523 = queryWeight, product of:
                2.755072 = boost
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.012381531 = queryNorm
              0.40359968 = fieldWeight in 5285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.078125 = fieldNorm(doc=5285)
          0.5956702 = weight(abstract_txt:descriptors in 5285) [ClassicSimilarity], result of:
            0.5956702 = score(doc=5285,freq=5.0), product of:
              0.51217157 = queryWeight, product of:
                6.213348 = boost
                6.657565 = idf(docFreq=150, maxDocs=43254)
                0.012381531 = queryNorm
              1.1630287 = fieldWeight in 5285, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.657565 = idf(docFreq=150, maxDocs=43254)
                0.078125 = fieldNorm(doc=5285)
        0.2 = coord(5/25)
    
  5. Riesthuis, G.J.A.: Information languages and multilingual subject access (2003) 0.12
    0.121030726 = sum of:
      0.121030726 = product of:
        0.75644207 = sum of:
          0.012977 = weight(abstract_txt:this in 5964) [ClassicSimilarity], result of:
            0.012977 = score(doc=5964,freq=1.0), product of:
              0.048796616 = queryWeight, product of:
                1.6208723 = boost
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.012381531 = queryNorm
              0.26594058 = fieldWeight in 5964, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4314568 = idf(docFreq=10335, maxDocs=43254)
                0.109375 = fieldNorm(doc=5964)
          0.14081922 = weight(abstract_txt:thesaurus in 5964) [ClassicSimilarity], result of:
            0.14081922 = score(doc=5964,freq=2.0), product of:
              0.17622523 = queryWeight, product of:
                2.755072 = boost
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.012381531 = queryNorm
              0.7990866 = fieldWeight in 5964, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.109375 = fieldNorm(doc=5964)
          0.07521701 = weight(abstract_txt:structure in 5964) [ClassicSimilarity], result of:
            0.07521701 = score(doc=5964,freq=1.0), product of:
              0.15745297 = queryWeight, product of:
                2.9115844 = boost
                4.367643 = idf(docFreq=1490, maxDocs=43254)
                0.012381531 = queryNorm
              0.47771093 = fieldWeight in 5964, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.367643 = idf(docFreq=1490, maxDocs=43254)
                0.109375 = fieldNorm(doc=5964)
          0.52742887 = weight(abstract_txt:descriptors in 5964) [ClassicSimilarity], result of:
            0.52742887 = score(doc=5964,freq=2.0), product of:
              0.51217157 = queryWeight, product of:
                6.213348 = boost
                6.657565 = idf(docFreq=150, maxDocs=43254)
                0.012381531 = queryNorm
              1.0297894 = fieldWeight in 5964, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.657565 = idf(docFreq=150, maxDocs=43254)
                0.109375 = fieldNorm(doc=5964)
        0.16 = coord(4/25)
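
The score trees above follow the TF-IDF pattern of Lucene's ClassicSimilarity: each term score is queryWeight times fieldWeight, the term scores are summed, and the sum is multiplied by the coord factor. The sketch below recomputes two values from the first tree, assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the numbers shown. The function names are hypothetical, and queryNorm and fieldNorm are taken from the output as given rather than derived.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency, assumed form: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # score = queryWeight * fieldWeight, as in each "weight(...)" node above.
    tf = math.sqrt(freq)                       # tf(freq) = sqrt(termFreq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

def document_score(term_scores, matched, total_clauses):
    # Final score = sum of term scores * coord(matched / total query clauses).
    return sum(term_scores) * (matched / total_clauses)

# Values copied from the tree for document 2176 (result 1).
QUERY_NORM = 0.012381531
MAX_DOCS = 43254

compound = term_score(2.0, 64, MAX_DOCS, QUERY_NORM, 0.0546875)
descriptors = term_score(3.0, 150, MAX_DOCS, QUERY_NORM, 0.0546875,
                         boost=6.213348)

# The eight term scores listed for document 2176, combined with coord(8/25).
listed = [0.053870745, 0.016238932, 0.012515424, 0.0064885,
          0.036572896, 0.1317243, 0.06513984, 0.3229829]
total = document_score(listed, matched=8, total_clauses=25)

print(compound, descriptors, total)
```

Under these assumptions the recomputed values agree with the listed ones (0.053870745, 0.3229829 and 0.20657073) up to floating-point rounding, since Lucene computes them in single precision.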