Document (#35696)

Author
Amirhosseini, M.
Title
Quantitative evaluation of the movement from complexity toward simplicity in the structure of thesaurus descriptors
Source
Malaysian journal of library and information science. 20(2015), no.3, S.47-62
Year
2015
Abstract
The concepts of simplicity and complexity play major roles in information storage and retrieval in knowledge organizations. This paper reports an investigation of these concepts in the structure of descriptors. The main purpose of simplicity is to decrease the number of words in the construction of descriptors as this idea affects semantic relations, recall and precision. ISO 25964 has affirmed the purpose of simplicity by requiring splitting compound terms into simpler concepts. This work aims to elaborate the standard methods of evaluation by providing a more detailed evaluation of the descriptors structure and identifying effective factors in simplicity and complexity results in the structure of thesauri descriptors. The research population is taken from the descriptors of the Commonwealth Agricultural Bureaux (CAB) Thesaurus, the Persian Cultural Thesaurus (ASFA) and the Chemical Thesaurus. This research was conducted using the statistical and content analysis method. In this research we propose a new quantitative approach as well as novel indicators and indices involving Simplicity and Factoring Ratios to evaluate the descriptors structure. The results will be useful in the verification, selection and maintenance purposes in knowledge organizations and the inquiry method can be further developed in the field of ontology evaluation.
Content
Vgl. auch: https://www.researchgate.net/publication/285228543_Quantitative_evaluation_of_the_movement_from_complexity_toward_simplicity_in_the_structure_of_thesaurus_descriptors.
Theme
Konzeption und Anwendung des Prinzips Thesaurus
Wissensrepräsentation

Similar documents (content)

  1. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.21
    0.20748134 = sum of:
      0.20748134 = product of:
        0.6483792 = sum of:
          0.054040615 = weight(abstract_txt:compound in 175) [ClassicSimilarity], result of:
            0.054040615 = score(doc=175,freq=2.0), product of:
              0.09326275 = queryWeight, product of:
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.012447988 = queryNorm
              0.5794448 = fieldWeight in 175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.016159015 = weight(abstract_txt:purpose in 175) [ClassicSimilarity], result of:
            0.016159015 = score(doc=175,freq=1.0), product of:
              0.06619999 = queryWeight, product of:
                1.1914885 = boost
                4.463432 = idf(docFreq=1384, maxDocs=44218)
                0.012447988 = queryNorm
              0.24409392 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.463432 = idf(docFreq=1384, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.012283754 = weight(abstract_txt:research in 175) [ClassicSimilarity], result of:
            0.012283754 = score(doc=175,freq=2.0), product of:
              0.050098244 = queryWeight, product of:
                1.2694564 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.012447988 = queryNorm
              0.2451933 = fieldWeight in 175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.0063830693 = weight(abstract_txt:this in 175) [ClassicSimilarity], result of:
            0.0063830693 = score(doc=175,freq=1.0), product of:
              0.048370548 = queryWeight, product of:
                1.6103542 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.012447988 = queryNorm
              0.1319619 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.0363902 = weight(abstract_txt:concepts in 175) [ClassicSimilarity], result of:
            0.0363902 = score(doc=175,freq=2.0), product of:
              0.10333752 = queryWeight, product of:
                1.8232052 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.012447988 = queryNorm
              0.35214895 = fieldWeight in 175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.1325723 = weight(abstract_txt:thesaurus in 175) [ClassicSimilarity], result of:
            0.1325723 = score(doc=175,freq=7.0), product of:
              0.17736197 = queryWeight, product of:
                2.7580755 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.012447988 = queryNorm
              0.7474674 = fieldWeight in 175, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.065128386 = weight(abstract_txt:structure in 175) [ClassicSimilarity], result of:
            0.065128386 = score(doc=175,freq=3.0), product of:
              0.15777364 = queryWeight, product of:
                2.9083598 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.012447988 = queryNorm
              0.41279635 = fieldWeight in 175, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.32542184 = weight(abstract_txt:descriptors in 175) [ClassicSimilarity], result of:
            0.32542184 = score(doc=175,freq=3.0), product of:
              0.5158555 = queryWeight, product of:
                6.2224145 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.012447988 = queryNorm
              0.63083917 = fieldWeight in 175, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
        0.32 = coord(8/25)
    
  2. Deokattey, S.; Dixit, D.K.; Bhanumurthy, K.: Co-word and facet analysis as tools for conceptualization in ontologies : a preliminary study of a micro-domain (2012) 0.18
    0.17596358 = sum of:
      0.17596358 = product of:
        0.7331816 = sum of:
          0.018937064 = weight(abstract_txt:method in 841) [ClassicSimilarity], result of:
            0.018937064 = score(doc=841,freq=1.0), product of:
              0.067317575 = queryWeight, product of:
                1.2015038 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.012447988 = queryNorm
              0.28130937 = fieldWeight in 841, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=841)
          0.009926773 = weight(abstract_txt:research in 841) [ClassicSimilarity], result of:
            0.009926773 = score(doc=841,freq=1.0), product of:
              0.050098244 = queryWeight, product of:
                1.2694564 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.012447988 = queryNorm
              0.19814612 = fieldWeight in 841, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=841)
          0.010316597 = weight(abstract_txt:this in 841) [ClassicSimilarity], result of:
            0.010316597 = score(doc=841,freq=2.0), product of:
              0.048370548 = queryWeight, product of:
                1.6103542 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.012447988 = queryNorm
              0.21328263 = fieldWeight in 841, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=841)
          0.029407723 = weight(abstract_txt:concepts in 841) [ClassicSimilarity], result of:
            0.029407723 = score(doc=841,freq=1.0), product of:
              0.10333752 = queryWeight, product of:
                1.8232052 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.012447988 = queryNorm
              0.28457934 = fieldWeight in 841, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0625 = fieldNorm(doc=841)
          0.057265848 = weight(abstract_txt:thesaurus in 841) [ClassicSimilarity], result of:
            0.057265848 = score(doc=841,freq=1.0), product of:
              0.17736197 = queryWeight, product of:
                2.7580755 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.012447988 = queryNorm
              0.3228756 = fieldWeight in 841, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0625 = fieldNorm(doc=841)
          0.6073276 = weight(abstract_txt:descriptors in 841) [ClassicSimilarity], result of:
            0.6073276 = score(doc=841,freq=8.0), product of:
              0.5158555 = queryWeight, product of:
                6.2224145 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.012447988 = queryNorm
              1.1773212 = fieldWeight in 841, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.0625 = fieldNorm(doc=841)
        0.24 = coord(6/25)
    
  3. Amirhosseini, M.: Theoretical base of quantitative evaluation of unity in a thesaurus term network based on Kant's epistemology (2010) 0.17
    0.1685757 = sum of:
      0.1685757 = product of:
        0.5267991 = sum of:
          0.10808647 = weight(abstract_txt:ratios in 5854) [ClassicSimilarity], result of:
            0.10808647 = score(doc=5854,freq=3.0), product of:
              0.11831791 = queryWeight, product of:
                1.1263442 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.012447988 = queryNorm
              0.9135258 = fieldWeight in 5854, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.0625 = fieldNorm(doc=5854)
          0.009926773 = weight(abstract_txt:research in 5854) [ClassicSimilarity], result of:
            0.009926773 = score(doc=5854,freq=1.0), product of:
              0.050098244 = queryWeight, product of:
                1.2694564 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.012447988 = queryNorm
              0.19814612 = fieldWeight in 5854, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=5854)
          0.10806808 = weight(abstract_txt:quantitative in 5854) [ClassicSimilarity], result of:
            0.10806808 = score(doc=5854,freq=6.0), product of:
              0.11830448 = queryWeight, product of:
                1.592801 = boost
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.012447988 = queryNorm
              0.9134741 = fieldWeight in 5854, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.0625 = fieldNorm(doc=5854)
          0.014589872 = weight(abstract_txt:this in 5854) [ClassicSimilarity], result of:
            0.014589872 = score(doc=5854,freq=4.0), product of:
              0.048370548 = queryWeight, product of:
                1.6103542 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.012447988 = queryNorm
              0.3016272 = fieldWeight in 5854, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=5854)
          0.029407723 = weight(abstract_txt:concepts in 5854) [ClassicSimilarity], result of:
            0.029407723 = score(doc=5854,freq=1.0), product of:
              0.10333752 = queryWeight, product of:
                1.8232052 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.012447988 = queryNorm
              0.28457934 = fieldWeight in 5854, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0625 = fieldNorm(doc=5854)
          0.09921489 = weight(abstract_txt:evaluation in 5854) [ClassicSimilarity], result of:
            0.09921489 = score(doc=5854,freq=7.0), product of:
              0.13374634 = queryWeight, product of:
                2.3950624 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.012447988 = queryNorm
              0.74181384 = fieldWeight in 5854, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=5854)
          0.114531696 = weight(abstract_txt:thesaurus in 5854) [ClassicSimilarity], result of:
            0.114531696 = score(doc=5854,freq=4.0), product of:
              0.17736197 = queryWeight, product of:
                2.7580755 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.012447988 = queryNorm
              0.6457512 = fieldWeight in 5854, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0625 = fieldNorm(doc=5854)
          0.04297359 = weight(abstract_txt:structure in 5854) [ClassicSimilarity], result of:
            0.04297359 = score(doc=5854,freq=1.0), product of:
              0.15777364 = queryWeight, product of:
                2.9083598 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.012447988 = queryNorm
              0.27237496 = fieldWeight in 5854, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=5854)
        0.32 = coord(8/25)
    
  4. Harter, S.P.; Cheng, Y.-R.: Colinked descriptors : improving vocabulary selection for end-user searching (1996) 0.14
    0.14414524 = sum of:
      0.14414524 = product of:
        0.7207262 = sum of:
          0.02367133 = weight(abstract_txt:method in 4216) [ClassicSimilarity], result of:
            0.02367133 = score(doc=4216,freq=1.0), product of:
              0.067317575 = queryWeight, product of:
                1.2015038 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.012447988 = queryNorm
              0.3516367 = fieldWeight in 4216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=4216)
          0.012408466 = weight(abstract_txt:research in 4216) [ClassicSimilarity], result of:
            0.012408466 = score(doc=4216,freq=1.0), product of:
              0.050098244 = queryWeight, product of:
                1.2694564 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.012447988 = queryNorm
              0.24768265 = fieldWeight in 4216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.078125 = fieldNorm(doc=4216)
          0.012895747 = weight(abstract_txt:this in 4216) [ClassicSimilarity], result of:
            0.012895747 = score(doc=4216,freq=2.0), product of:
              0.048370548 = queryWeight, product of:
                1.6103542 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.012447988 = queryNorm
              0.2666033 = fieldWeight in 4216, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=4216)
          0.07158231 = weight(abstract_txt:thesaurus in 4216) [ClassicSimilarity], result of:
            0.07158231 = score(doc=4216,freq=1.0), product of:
              0.17736197 = queryWeight, product of:
                2.7580755 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.012447988 = queryNorm
              0.4035945 = fieldWeight in 4216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.078125 = fieldNorm(doc=4216)
          0.60016835 = weight(abstract_txt:descriptors in 4216) [ClassicSimilarity], result of:
            0.60016835 = score(doc=4216,freq=5.0), product of:
              0.5158555 = queryWeight, product of:
                6.2224145 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.012447988 = queryNorm
              1.1634427 = fieldWeight in 4216, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.078125 = fieldNorm(doc=4216)
        0.2 = coord(5/25)
    
  5. Riesthuis, G.J.A.: Information languages and multilingual subject access (2003) 0.12
    0.12177717 = sum of:
      0.12177717 = product of:
        0.7611073 = sum of:
          0.012766139 = weight(abstract_txt:this in 3963) [ClassicSimilarity], result of:
            0.012766139 = score(doc=3963,freq=1.0), product of:
              0.048370548 = queryWeight, product of:
                1.6103542 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.012447988 = queryNorm
              0.2639238 = fieldWeight in 3963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.109375 = fieldNorm(doc=3963)
          0.14172575 = weight(abstract_txt:thesaurus in 3963) [ClassicSimilarity], result of:
            0.14172575 = score(doc=3963,freq=2.0), product of:
              0.17736197 = queryWeight, product of:
                2.7580755 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.012447988 = queryNorm
              0.7990763 = fieldWeight in 3963, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.109375 = fieldNorm(doc=3963)
          0.075203784 = weight(abstract_txt:structure in 3963) [ClassicSimilarity], result of:
            0.075203784 = score(doc=3963,freq=1.0), product of:
              0.15777364 = queryWeight, product of:
                2.9083598 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.012447988 = queryNorm
              0.47665617 = fieldWeight in 3963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.109375 = fieldNorm(doc=3963)
          0.53141165 = weight(abstract_txt:descriptors in 3963) [ClassicSimilarity], result of:
            0.53141165 = score(doc=3963,freq=2.0), product of:
              0.5158555 = queryWeight, product of:
                6.2224145 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.012447988 = queryNorm
              1.030156 = fieldWeight in 3963, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.109375 = fieldNorm(doc=3963)
        0.16 = coord(4/25)