Document (#40868)

Author
Szostak, R.
Title
Facet analysis using grammar
Source
http://www.iskocus.org/NASKO2017papers/NASKO2017_paper_4.pdf [NASKO 2017, June 15-16, 2017, Champaign, IL, USA]
Year
2017
Abstract
Basic grammar can achieve most/all of the goals of facet analysis without requiring the use of facet indicators. Facet analysis is thus rendered far simpler for classificationist, classifier, and user. We compare facet analysis and grammar, and show how various facets can be represented grammatically. We then address potential challenges in employing grammar as subject classification. A detailed review of basic grammar supports the hypothesis that it is feasible to usefully employ grammatical construction in subject classification. A manageable - and programmable - set of adjustments is required as classifiers move fairly directly from sentences in a document (or object or idea) description to formulating a subject classification. The user likewise can move fairly quickly from a query to the identification of relevant works. A review of theories in linguistics indicates that a grammatical approach should reduce ambiguity while encouraging ease of use. This paper applies the recommended approach to a small sample of recently published books. It finds that the approach is feasible and results in a more precise subject description than the subject headings assigned at present. It then explores PRECIS, an indexing system developed in the 1970s. Though our approach differs from PRECIS in many important ways, the experience of PRECIS supports our conclusions regarding both feasibility and precision.
Content
Beitrag bei: NASKO 2017: Visualizing Knowledge Organization: Bringing Focus to Abstract Realities. The sixth North American Symposium on Knowledge Organization (NASKO 2017), June 15-16, 2017, in Champaign, IL, USA.
Theme
Theorie verbaler Dokumentationssprachen
Universale Facettenklassifikationen
Object
PRECIS

Similar documents (author)

  1. Szostak, R.: Comment on Hjørland's concept theory (2010) 5.17
    5.1710296 = sum of:
      5.1710296 = weight(author_txt:szostak in 5107) [ClassicSimilarity], result of:
        5.1710296 = fieldWeight in 5107, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.273647 = idf(docFreq=29, maxDocs=43254)
          0.625 = fieldNorm(doc=5107)
    
  2. Szostak, R.: Classfying scholarly theories and methods (2003) 5.17
    5.1710296 = sum of:
      5.1710296 = weight(author_txt:szostak in 4105) [ClassicSimilarity], result of:
        5.1710296 = fieldWeight in 4105, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.273647 = idf(docFreq=29, maxDocs=43254)
          0.625 = fieldNorm(doc=4105)
    
  3. Szostak, R.: ¬A schema for unifying human science : interdisciplinary perspectives on culture (2003) 5.17
    5.1710296 = sum of:
      5.1710296 = weight(author_txt:szostak in 1929) [ClassicSimilarity], result of:
        5.1710296 = fieldWeight in 1929, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.273647 = idf(docFreq=29, maxDocs=43254)
          0.625 = fieldNorm(doc=1929)
    
  4. Szostak, R.: Interdisciplinarity and the classification of scholarly documents by phenomena, theories and methods (2007) 5.17
    5.1710296 = sum of:
      5.1710296 = weight(author_txt:szostak in 3136) [ClassicSimilarity], result of:
        5.1710296 = fieldWeight in 3136, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.273647 = idf(docFreq=29, maxDocs=43254)
          0.625 = fieldNorm(doc=3136)
    
  5. Szostak, R.: Classification, interdisciplinarity, and the study of science (2008) 5.17
    5.1710296 = sum of:
      5.1710296 = weight(author_txt:szostak in 3894) [ClassicSimilarity], result of:
        5.1710296 = fieldWeight in 3894, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.273647 = idf(docFreq=29, maxDocs=43254)
          0.625 = fieldNorm(doc=3894)
    

Similar documents (content)

  1. Austin, D.: PRECIS in a multilingual context : Pt.1: PRECIS: an overview (1976) 0.28
    0.28204983 = sum of:
      0.28204983 = product of:
        1.1752076 = sum of:
          0.029925946 = weight(abstract_txt:then in 2448) [ClassicSimilarity], result of:
            0.029925946 = score(doc=2448,freq=1.0), product of:
              0.08256187 = queryWeight, product of:
                1.2267568 = boost
                4.6395764 = idf(docFreq=1135, maxDocs=43254)
                0.014505835 = queryNorm
              0.3624669 = fieldWeight in 2448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6395764 = idf(docFreq=1135, maxDocs=43254)
                0.078125 = fieldNorm(doc=2448)
          0.034172192 = weight(abstract_txt:description in 2448) [ClassicSimilarity], result of:
            0.034172192 = score(doc=2448,freq=1.0), product of:
              0.09019785 = queryWeight, product of:
                1.2822325 = boost
                4.849385 = idf(docFreq=920, maxDocs=43254)
                0.014505835 = queryNorm
              0.37885818 = fieldWeight in 2448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.849385 = idf(docFreq=920, maxDocs=43254)
                0.078125 = fieldNorm(doc=2448)
          0.03482903 = weight(abstract_txt:review in 2448) [ClassicSimilarity], result of:
            0.03482903 = score(doc=2448,freq=1.0), product of:
              0.09135 = queryWeight, product of:
                1.2903959 = boost
                4.8802586 = idf(docFreq=892, maxDocs=43254)
                0.014505835 = queryNorm
              0.3812702 = fieldWeight in 2448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8802586 = idf(docFreq=892, maxDocs=43254)
                0.078125 = fieldNorm(doc=2448)
          0.029708419 = weight(abstract_txt:analysis in 2448) [ClassicSimilarity], result of:
            0.029708419 = score(doc=2448,freq=1.0), product of:
              0.10351675 = queryWeight, product of:
                1.9426252 = boost
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.014505835 = queryNorm
              0.28699142 = fieldWeight in 2448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.078125 = fieldNorm(doc=2448)
          0.044784185 = weight(abstract_txt:subject in 2448) [ClassicSimilarity], result of:
            0.044784185 = score(doc=2448,freq=1.0), product of:
              0.1466034 = queryWeight, product of:
                2.5847037 = boost
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.014505835 = queryNorm
              0.30547848 = fieldWeight in 2448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.078125 = fieldNorm(doc=2448)
          1.0017879 = weight(title_txt:precis in 2448) [ClassicSimilarity], result of:
            1.0017879 = score(doc=2448,freq=2.0), product of:
              0.3092041 = queryWeight, product of:
                2.9076154 = boost
                7.3310394 = idf(docFreq=76, maxDocs=43254)
                0.014505835 = queryNorm
              3.239892 = fieldWeight in 2448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3310394 = idf(docFreq=76, maxDocs=43254)
                0.3125 = fieldNorm(doc=2448)
        0.24 = coord(6/25)
    
  2. Fang, L.; Tuan, L.A.; Hui, S.C.; Wu, L.: Syntactic based approach for grammar question retrieval (2018) 0.28
    0.27518192 = sum of:
      0.27518192 = product of:
        1.1465913 = sum of:
          0.0077304225 = weight(abstract_txt:from in 87) [ClassicSimilarity], result of:
            0.0077304225 = score(doc=87,freq=1.0), product of:
              0.04448226 = queryWeight, product of:
                1.1028279 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.014505835 = queryNorm
              0.17378664 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.023940757 = weight(abstract_txt:then in 87) [ClassicSimilarity], result of:
            0.023940757 = score(doc=87,freq=1.0), product of:
              0.08256187 = queryWeight, product of:
                1.2267568 = boost
                4.6395764 = idf(docFreq=1135, maxDocs=43254)
                0.014505835 = queryNorm
              0.28997353 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6395764 = idf(docFreq=1135, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.033611238 = weight(abstract_txt:analysis in 87) [ClassicSimilarity], result of:
            0.033611238 = score(doc=87,freq=2.0), product of:
              0.10351675 = queryWeight, product of:
                1.9426252 = boost
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.014505835 = queryNorm
              0.3246937 = fieldWeight in 87, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.043996595 = weight(abstract_txt:approach in 87) [ClassicSimilarity], result of:
            0.043996595 = score(doc=87,freq=3.0), product of:
              0.10821062 = queryWeight, product of:
                1.9861802 = boost
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.014505835 = queryNorm
              0.40658295 = fieldWeight in 87, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.2489183 = weight(abstract_txt:grammatical in 87) [ClassicSimilarity], result of:
            0.2489183 = score(doc=87,freq=4.0), product of:
              0.24776436 = queryWeight, product of:
                2.1251428 = boost
                8.037259 = idf(docFreq=37, maxDocs=43254)
                0.014505835 = queryNorm
              1.0046574 = fieldWeight in 87, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.037259 = idf(docFreq=37, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
          0.788394 = weight(abstract_txt:grammar in 87) [ClassicSimilarity], result of:
            0.788394 = score(doc=87,freq=9.0), product of:
              0.553455 = queryWeight, product of:
                5.0220366 = boost
                7.5973077 = idf(docFreq=58, maxDocs=43254)
                0.014505835 = queryNorm
              1.4244952 = fieldWeight in 87, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.5973077 = idf(docFreq=58, maxDocs=43254)
                0.0625 = fieldNorm(doc=87)
        0.24 = coord(6/25)
    
  3. Richmond, P.A.: Classification from PRECIS : some possibilities (1976) 0.24
    0.23514347 = sum of:
      0.23514347 = product of:
        1.1757174 = sum of:
          0.01352824 = weight(abstract_txt:from in 1200) [ClassicSimilarity], result of:
            0.01352824 = score(doc=1200,freq=1.0), product of:
              0.04448226 = queryWeight, product of:
                1.1028279 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.014505835 = queryNorm
              0.30412662 = fieldWeight in 1200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.109375 = fieldNorm(doc=1200)
          0.040209673 = weight(abstract_txt:classification in 1200) [ClassicSimilarity], result of:
            0.040209673 = score(doc=1200,freq=1.0), product of:
              0.09195592 = queryWeight, product of:
                1.5856385 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.014505835 = queryNorm
              0.43727118 = fieldWeight in 1200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.109375 = fieldNorm(doc=1200)
          0.041591786 = weight(abstract_txt:analysis in 1200) [ClassicSimilarity], result of:
            0.041591786 = score(doc=1200,freq=1.0), product of:
              0.10351675 = queryWeight, product of:
                1.9426252 = boost
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.014505835 = queryNorm
              0.40178797 = fieldWeight in 1200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.109375 = fieldNorm(doc=1200)
          0.08866816 = weight(abstract_txt:subject in 1200) [ClassicSimilarity], result of:
            0.08866816 = score(doc=1200,freq=2.0), product of:
              0.1466034 = queryWeight, product of:
                2.5847037 = boost
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.014505835 = queryNorm
              0.6048165 = fieldWeight in 1200, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.109375 = fieldNorm(doc=1200)
          0.9917195 = weight(title_txt:precis in 1200) [ClassicSimilarity], result of:
            0.9917195 = score(doc=1200,freq=1.0), product of:
              0.3092041 = queryWeight, product of:
                2.9076154 = boost
                7.3310394 = idf(docFreq=76, maxDocs=43254)
                0.014505835 = queryNorm
              3.2073298 = fieldWeight in 1200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3310394 = idf(docFreq=76, maxDocs=43254)
                0.4375 = fieldNorm(doc=1200)
        0.2 = coord(5/25)
    
  4. Austin, D.; Digger, J.A.: PRECIS: The Preserved Context Index System (1985) 0.23
    0.22749731 = sum of:
      0.22749731 = product of:
        0.9479055 = sum of:
          0.009663029 = weight(abstract_txt:from in 5653) [ClassicSimilarity], result of:
            0.009663029 = score(doc=5653,freq=4.0), product of:
              0.04448226 = queryWeight, product of:
                1.1028279 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.014505835 = queryNorm
              0.2172333 = fieldWeight in 5653, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5653)
          0.018479833 = weight(abstract_txt:basic in 5653) [ClassicSimilarity], result of:
            0.018479833 = score(doc=5653,freq=1.0), product of:
              0.09503851 = queryWeight, product of:
                1.3161898 = boost
                4.977811 = idf(docFreq=809, maxDocs=43254)
                0.014505835 = queryNorm
              0.19444573 = fieldWeight in 5653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.977811 = idf(docFreq=809, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5653)
          0.024873285 = weight(abstract_txt:classification in 5653) [ClassicSimilarity], result of:
            0.024873285 = score(doc=5653,freq=3.0), product of:
              0.09195592 = queryWeight, product of:
                1.5856385 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.014505835 = queryNorm
              0.2704914 = fieldWeight in 5653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5653)
          0.022451917 = weight(abstract_txt:approach in 5653) [ClassicSimilarity], result of:
            0.022451917 = score(doc=5653,freq=2.0), product of:
              0.10821062 = queryWeight, product of:
                1.9861802 = boost
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.014505835 = queryNorm
              0.20748349 = fieldWeight in 5653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7558525 = idf(docFreq=2748, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5653)
          0.022392092 = weight(abstract_txt:subject in 5653) [ClassicSimilarity], result of:
            0.022392092 = score(doc=5653,freq=1.0), product of:
              0.1466034 = queryWeight, product of:
                2.5847037 = boost
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.014505835 = queryNorm
              0.15273924 = fieldWeight in 5653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5653)
          0.8500453 = weight(title_txt:precis in 5653) [ClassicSimilarity], result of:
            0.8500453 = score(doc=5653,freq=1.0), product of:
              0.3092041 = queryWeight, product of:
                2.9076154 = boost
                7.3310394 = idf(docFreq=76, maxDocs=43254)
                0.014505835 = queryNorm
              2.7491398 = fieldWeight in 5653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3310394 = idf(docFreq=76, maxDocs=43254)
                0.375 = fieldNorm(doc=5653)
        0.24 = coord(6/25)
    
  5. Foskett, D.J.: Facet analysis (2009) 0.22
    0.21739563 = sum of:
      0.21739563 = product of:
        0.77641296 = sum of:
          0.015460845 = weight(abstract_txt:from in 219) [ClassicSimilarity], result of:
            0.015460845 = score(doc=219,freq=1.0), product of:
              0.04448226 = queryWeight, product of:
                1.1028279 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.014505835 = queryNorm
              0.34757328 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.125 = fieldNorm(doc=219)
          0.19734944 = weight(abstract_txt:classificationist in 219) [ClassicSimilarity], result of:
            0.19734944 = score(doc=219,freq=1.0), product of:
              0.1684541 = queryWeight, product of:
                1.2390661 = boost
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.014505835 = queryNorm
              1.1715325 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.125 = fieldNorm(doc=219)
          0.05467551 = weight(abstract_txt:description in 219) [ClassicSimilarity], result of:
            0.05467551 = score(doc=219,freq=1.0), product of:
              0.09019785 = queryWeight, product of:
                1.2822325 = boost
                4.849385 = idf(docFreq=920, maxDocs=43254)
                0.014505835 = queryNorm
              0.6061731 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.849385 = idf(docFreq=920, maxDocs=43254)
                0.125 = fieldNorm(doc=219)
          0.06498864 = weight(abstract_txt:classification in 219) [ClassicSimilarity], result of:
            0.06498864 = score(doc=219,freq=2.0), product of:
              0.09195592 = queryWeight, product of:
                1.5856385 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.014505835 = queryNorm
              0.7067369 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.125 = fieldNorm(doc=219)
          0.067222476 = weight(abstract_txt:analysis in 219) [ClassicSimilarity], result of:
            0.067222476 = score(doc=219,freq=2.0), product of:
              0.10351675 = queryWeight, product of:
                1.9426252 = boost
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.014505835 = queryNorm
              0.6493874 = fieldWeight in 219, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.67349 = idf(docFreq=2984, maxDocs=43254)
                0.125 = fieldNorm(doc=219)
          0.0716547 = weight(abstract_txt:subject in 219) [ClassicSimilarity], result of:
            0.0716547 = score(doc=219,freq=1.0), product of:
              0.1466034 = queryWeight, product of:
                2.5847037 = boost
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.014505835 = queryNorm
              0.48876557 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101245 = idf(docFreq=2355, maxDocs=43254)
                0.125 = fieldNorm(doc=219)
          0.30506134 = weight(abstract_txt:facet in 219) [ClassicSimilarity], result of:
            0.30506134 = score(doc=219,freq=1.0), product of:
              0.38509902 = queryWeight, product of:
                4.1891403 = boost
                6.337307 = idf(docFreq=207, maxDocs=43254)
                0.014505835 = queryNorm
              0.7921634 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.337307 = idf(docFreq=207, maxDocs=43254)
                0.125 = fieldNorm(doc=219)
        0.28 = coord(7/25)