Document (#40867)

Author
Szostak, R.
Title
Facet analysis using grammar
Source
http://www.iskocus.org/NASKO2017papers/NASKO2017_paper_4.pdf [NASKO 2017, June 15-16, 2017, Champaign, IL, USA]
Year
2017
Abstract
Basic grammar can achieve most/all of the goals of facet analysis without requiring the use of facet indicators. Facet analysis is thus rendered far simpler for classificationist, classifier, and user. We compare facet analysis and grammar, and show how various facets can be represented grammatically. We then address potential challenges in employing grammar as subject classification. A detailed review of basic grammar supports the hypothesis that it is feasible to usefully employ grammatical construction in subject classification. A manageable - and programmable - set of adjustments is required as classifiers move fairly directly from sentences in a document (or object or idea) description to formulating a subject classification. The user likewise can move fairly quickly from a query to the identification of relevant works. A review of theories in linguistics indicates that a grammatical approach should reduce ambiguity while encouraging ease of use. This paper applies the recommended approach to a small sample of recently published books. It finds that the approach is feasible and results in a more precise subject description than the subject headings assigned at present. It then explores PRECIS, an indexing system developed in the 1970s. Though our approach differs from PRECIS in many important ways, the experience of PRECIS supports our conclusions regarding both feasibility and precision.
Content
Beitrag bei: NASKO 2017: Visualizing Knowledge Organization: Bringing Focus to Abstract Realities. The sixth North American Symposium on Knowledge Organization (NASKO 2017), June 15-16, 2017, in Champaign, IL, USA.
Theme
Theorie verbaler Dokumentationssprachen
Universale Facettenklassifikationen
Object
PRECIS

Similar documents (author)

  1. Szostak, R.: Comment on Hjørland's concept theory (2010) 5.16
    5.164313 = sum of:
      5.164313 = weight(author_txt:szostak in 5107) [ClassicSimilarity], result of:
        5.164313 = fieldWeight in 5107, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.2629 = idf(docFreq=30, maxDocs=44218)
          0.625 = fieldNorm(doc=5107)
    
  2. Szostak, R.: Classfying scholarly theories and methods (2003) 5.16
    5.164313 = sum of:
      5.164313 = weight(author_txt:szostak in 2104) [ClassicSimilarity], result of:
        5.164313 = fieldWeight in 2104, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.2629 = idf(docFreq=30, maxDocs=44218)
          0.625 = fieldNorm(doc=2104)
    
  3. Szostak, R.: ¬A schema for unifying human science : interdisciplinary perspectives on culture (2003) 5.16
    5.164313 = sum of:
      5.164313 = weight(author_txt:szostak in 803) [ClassicSimilarity], result of:
        5.164313 = fieldWeight in 803, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.2629 = idf(docFreq=30, maxDocs=44218)
          0.625 = fieldNorm(doc=803)
    
  4. Szostak, R.: Interdisciplinarity and the classification of scholarly documents by phenomena, theories and methods (2007) 5.16
    5.164313 = sum of:
      5.164313 = weight(author_txt:szostak in 1135) [ClassicSimilarity], result of:
        5.164313 = fieldWeight in 1135, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.2629 = idf(docFreq=30, maxDocs=44218)
          0.625 = fieldNorm(doc=1135)
    
  5. Szostak, R.: Classification, interdisciplinarity, and the study of science (2008) 5.16
    5.164313 = sum of:
      5.164313 = weight(author_txt:szostak in 1893) [ClassicSimilarity], result of:
        5.164313 = fieldWeight in 1893, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.2629 = idf(docFreq=30, maxDocs=44218)
          0.625 = fieldNorm(doc=1893)
    

Similar documents (content)

  1. Austin, D.: PRECIS in a multilingual context : Pt.1: PRECIS: an overview (1976) 0.28
    0.2844906 = sum of:
      0.2844906 = product of:
        1.1853775 = sum of:
          0.02956074 = weight(abstract_txt:then in 983) [ClassicSimilarity], result of:
            0.02956074 = score(doc=983,freq=1.0), product of:
              0.08195557 = queryWeight, product of:
                1.2274804 = boost
                4.616861 = idf(docFreq=1187, maxDocs=44218)
                0.014461625 = queryNorm
              0.36069226 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.616861 = idf(docFreq=1187, maxDocs=44218)
                0.078125 = fieldNorm(doc=983)
          0.03395563 = weight(abstract_txt:description in 983) [ClassicSimilarity], result of:
            0.03395563 = score(doc=983,freq=1.0), product of:
              0.0898896 = queryWeight, product of:
                1.2855237 = boost
                4.835176 = idf(docFreq=954, maxDocs=44218)
                0.014461625 = queryNorm
              0.37774813 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.835176 = idf(docFreq=954, maxDocs=44218)
                0.078125 = fieldNorm(doc=983)
          0.034449007 = weight(abstract_txt:review in 983) [ClassicSimilarity], result of:
            0.034449007 = score(doc=983,freq=1.0), product of:
              0.09075824 = queryWeight, product of:
                1.29172 = boost
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.014461625 = queryNorm
              0.3795689 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.078125 = fieldNorm(doc=983)
          0.0292987 = weight(abstract_txt:analysis in 983) [ClassicSimilarity], result of:
            0.0292987 = score(doc=983,freq=1.0), product of:
              0.10264643 = queryWeight, product of:
                1.9427292 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.014461625 = queryNorm
              0.2854332 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=983)
          0.044786945 = weight(abstract_txt:subject in 983) [ClassicSimilarity], result of:
            0.044786945 = score(doc=983,freq=1.0), product of:
              0.14672899 = queryWeight, product of:
                2.5968885 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.014461625 = queryNorm
              0.30523583 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.078125 = fieldNorm(doc=983)
          1.0133264 = weight(title_txt:precis in 983) [ClassicSimilarity], result of:
            1.0133264 = score(doc=983,freq=2.0), product of:
              0.3118279 = queryWeight, product of:
                2.9324355 = boost
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.014461625 = queryNorm
              3.2496336 = fieldWeight in 983, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.3125 = fieldNorm(doc=983)
        0.24 = coord(6/25)
    
  2. Fang, L.; Tuan, L.A.; Hui, S.C.; Wu, L.: Syntactic based approach for grammar question retrieval (2018) 0.28
    0.2752593 = sum of:
      0.2752593 = product of:
        1.1469138 = sum of:
          0.0076105315 = weight(abstract_txt:from in 5086) [ClassicSimilarity], result of:
            0.0076105315 = score(doc=5086,freq=1.0), product of:
              0.044057045 = queryWeight, product of:
                1.1022463 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.014461625 = queryNorm
              0.17274266 = fieldWeight in 5086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=5086)
          0.02364859 = weight(abstract_txt:then in 5086) [ClassicSimilarity], result of:
            0.02364859 = score(doc=5086,freq=1.0), product of:
              0.08195557 = queryWeight, product of:
                1.2274804 = boost
                4.616861 = idf(docFreq=1187, maxDocs=44218)
                0.014461625 = queryNorm
              0.2885538 = fieldWeight in 5086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.616861 = idf(docFreq=1187, maxDocs=44218)
                0.0625 = fieldNorm(doc=5086)
          0.033147696 = weight(abstract_txt:analysis in 5086) [ClassicSimilarity], result of:
            0.033147696 = score(doc=5086,freq=2.0), product of:
              0.10264643 = queryWeight, product of:
                1.9427292 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.014461625 = queryNorm
              0.3229308 = fieldWeight in 5086, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=5086)
          0.043734595 = weight(abstract_txt:approach in 5086) [ClassicSimilarity], result of:
            0.043734595 = score(doc=5086,freq=3.0), product of:
              0.10786849 = queryWeight, product of:
                1.9915336 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.014461625 = queryNorm
              0.40544364 = fieldWeight in 5086, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=5086)
          0.24681321 = weight(abstract_txt:grammatical in 5086) [ClassicSimilarity], result of:
            0.24681321 = score(doc=5086,freq=4.0), product of:
              0.2465664 = queryWeight, product of:
                2.1290815 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.014461625 = queryNorm
              1.001001 = fieldWeight in 5086, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=5086)
          0.7919592 = weight(abstract_txt:grammar in 5086) [ClassicSimilarity], result of:
            0.7919592 = score(doc=5086,freq=9.0), product of:
              0.55557495 = queryWeight, product of:
                5.053202 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.014461625 = queryNorm
              1.4254768 = fieldWeight in 5086, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.0625 = fieldNorm(doc=5086)
        0.24 = coord(6/25)
    
  3. Richmond, P.A.: Classification from PRECIS : some possibilities (1976) 0.24
    0.23725684 = sum of:
      0.23725684 = product of:
        1.1862842 = sum of:
          0.013318431 = weight(abstract_txt:from in 1200) [ClassicSimilarity], result of:
            0.013318431 = score(doc=1200,freq=1.0), product of:
              0.044057045 = queryWeight, product of:
                1.1022463 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.014461625 = queryNorm
              0.30229968 = fieldWeight in 1200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.109375 = fieldNorm(doc=1200)
          0.04013196 = weight(abstract_txt:classification in 1200) [ClassicSimilarity], result of:
            0.04013196 = score(doc=1200,freq=1.0), product of:
              0.09191229 = queryWeight, product of:
                1.5920539 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.014461625 = queryNorm
              0.43663323 = fieldWeight in 1200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.109375 = fieldNorm(doc=1200)
          0.04101818 = weight(abstract_txt:analysis in 1200) [ClassicSimilarity], result of:
            0.04101818 = score(doc=1200,freq=1.0), product of:
              0.10264643 = queryWeight, product of:
                1.9427292 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.014461625 = queryNorm
              0.3996065 = fieldWeight in 1200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.109375 = fieldNorm(doc=1200)
          0.08867362 = weight(abstract_txt:subject in 1200) [ClassicSimilarity], result of:
            0.08867362 = score(doc=1200,freq=2.0), product of:
              0.14672899 = queryWeight, product of:
                2.5968885 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.014461625 = queryNorm
              0.6043361 = fieldWeight in 1200, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.109375 = fieldNorm(doc=1200)
          1.003142 = weight(title_txt:precis in 1200) [ClassicSimilarity], result of:
            1.003142 = score(doc=1200,freq=1.0), product of:
              0.3118279 = queryWeight, product of:
                2.9324355 = boost
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.014461625 = queryNorm
              3.2169733 = fieldWeight in 1200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.4375 = fieldNorm(doc=1200)
        0.2 = coord(5/25)
    
  4. Austin, D.; Digger, J.A.: PRECIS: The Preserved Context Index System (1985) 0.23
    0.22975953 = sum of:
      0.22975953 = product of:
        0.9573314 = sum of:
          0.009513164 = weight(abstract_txt:from in 3652) [ClassicSimilarity], result of:
            0.009513164 = score(doc=3652,freq=4.0), product of:
              0.044057045 = queryWeight, product of:
                1.1022463 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.014461625 = queryNorm
              0.21592833 = fieldWeight in 3652, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3652)
          0.018445292 = weight(abstract_txt:basic in 3652) [ClassicSimilarity], result of:
            0.018445292 = score(doc=3652,freq=1.0), product of:
              0.09499746 = queryWeight, product of:
                1.3215431 = boost
                4.970654 = idf(docFreq=833, maxDocs=44218)
                0.014461625 = queryNorm
              0.19416617 = fieldWeight in 3652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.970654 = idf(docFreq=833, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3652)
          0.024825212 = weight(abstract_txt:classification in 3652) [ClassicSimilarity], result of:
            0.024825212 = score(doc=3652,freq=3.0), product of:
              0.09191229 = queryWeight, product of:
                1.5920539 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.014461625 = queryNorm
              0.27009675 = fieldWeight in 3652, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3652)
          0.022318216 = weight(abstract_txt:approach in 3652) [ClassicSimilarity], result of:
            0.022318216 = score(doc=3652,freq=2.0), product of:
              0.10786849 = queryWeight, product of:
                1.9915336 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.014461625 = queryNorm
              0.20690209 = fieldWeight in 3652, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3652)
          0.022393472 = weight(abstract_txt:subject in 3652) [ClassicSimilarity], result of:
            0.022393472 = score(doc=3652,freq=1.0), product of:
              0.14672899 = queryWeight, product of:
                2.5968885 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.014461625 = queryNorm
              0.15261792 = fieldWeight in 3652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3652)
          0.85983604 = weight(title_txt:precis in 3652) [ClassicSimilarity], result of:
            0.85983604 = score(doc=3652,freq=1.0), product of:
              0.3118279 = queryWeight, product of:
                2.9324355 = boost
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.014461625 = queryNorm
              2.7574058 = fieldWeight in 3652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.375 = fieldNorm(doc=3652)
        0.24 = coord(6/25)
    
  5. Foskett, D.J.: Facet analysis (2009) 0.22
    0.21703938 = sum of:
      0.21703938 = product of:
        0.77514064 = sum of:
          0.015221063 = weight(abstract_txt:from in 3754) [ClassicSimilarity], result of:
            0.015221063 = score(doc=3754,freq=1.0), product of:
              0.044057045 = queryWeight, product of:
                1.1022463 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.014461625 = queryNorm
              0.34548533 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.125 = fieldNorm(doc=3754)
          0.19923173 = weight(abstract_txt:classificationist in 3754) [ClassicSimilarity], result of:
            0.19923173 = score(doc=3754,freq=1.0), product of:
              0.16966176 = queryWeight, product of:
                1.2488272 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.014461625 = queryNorm
              1.1742878 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.125 = fieldNorm(doc=3754)
          0.054329004 = weight(abstract_txt:description in 3754) [ClassicSimilarity], result of:
            0.054329004 = score(doc=3754,freq=1.0), product of:
              0.0898896 = queryWeight, product of:
                1.2855237 = boost
                4.835176 = idf(docFreq=954, maxDocs=44218)
                0.014461625 = queryNorm
              0.604397 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.835176 = idf(docFreq=954, maxDocs=44218)
                0.125 = fieldNorm(doc=3754)
          0.06486304 = weight(abstract_txt:classification in 3754) [ClassicSimilarity], result of:
            0.06486304 = score(doc=3754,freq=2.0), product of:
              0.09191229 = queryWeight, product of:
                1.5920539 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.014461625 = queryNorm
              0.7057058 = fieldWeight in 3754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.125 = fieldNorm(doc=3754)
          0.06629539 = weight(abstract_txt:analysis in 3754) [ClassicSimilarity], result of:
            0.06629539 = score(doc=3754,freq=2.0), product of:
              0.10264643 = queryWeight, product of:
                1.9427292 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.014461625 = queryNorm
              0.6458616 = fieldWeight in 3754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.125 = fieldNorm(doc=3754)
          0.07165911 = weight(abstract_txt:subject in 3754) [ClassicSimilarity], result of:
            0.07165911 = score(doc=3754,freq=1.0), product of:
              0.14672899 = queryWeight, product of:
                2.5968885 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.014461625 = queryNorm
              0.48837733 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.125 = fieldNorm(doc=3754)
          0.30354133 = weight(abstract_txt:facet in 3754) [ClassicSimilarity], result of:
            0.30354133 = score(doc=3754,freq=1.0), product of:
              0.38413173 = queryWeight, product of:
                4.201801 = boost
                6.321609 = idf(docFreq=215, maxDocs=44218)
                0.014461625 = queryNorm
              0.7902011 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.321609 = idf(docFreq=215, maxDocs=44218)
                0.125 = fieldNorm(doc=3754)
        0.28 = coord(7/25)