Document (#39303)

Author
Piros, A.
Title
Automatic interpretation of complex UDC numbers : towards support for library systems
Source
Classification and authority control: expanding resource discovery: proceedings of the International UDC Seminar 2015, 29-30 October 2015, Lisbon, Portugal. Eds.: Slavic, A. and M.I. Cordeiro
Imprint
Würzburg : Ergon-Verlag
Year
2015
Pages
pp. 177-194
Abstract
Analytico-synthetic and faceted classifications, such as the Universal Decimal Classification (UDC), express the content of documents with complex, pre-combined classification codes. Without classification authority control to help manage and access structured notations, the use of UDC codes in searching and browsing is limited. Existing UDC parsing solutions are usually created for a particular database system or a specific task and are not widely applicable. The approach described in this paper analyses and interprets UDC notations automatically and stores the result in an intermediate format (in this case, XML) without any loss of data or information. Because this output is rich, it can be converted into different formats, such as standard mark-up and data exchange formats or simple lists of the recommended entry points of a UDC number. The program can also be used to create authority records containing complex UDC numbers, which can then be analysed comprehensively so that they can be retrieved effectively. The Java program and the schema definition it employs are under continuous development. The current version of the interpreter software is available online for testing at http://interpreter-eto.rhcloud.com. Future plans are to implement conversion methods for standard formats and to create standard online interfaces so that the software's features can be used as a service. The algorithm could then be employed in both existing and future library systems to analyse UDC numbers without significant programming effort.
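
The idea of storing an interpreted UDC number in XML without information loss can be illustrated with a minimal sketch. It is not the author's Java interpreter and does not follow the paper's schema: the element names (`udc`, `component`, `connector`), the connector labels and the function `udc_to_xml` are assumptions made for illustration, and only the three connecting symbols `+`, `/` and `:` are handled. Brackets, the double colon and the common auxiliaries are deliberately ignored.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical labels for three UDC connecting symbols (not the paper's schema).
CONNECTORS = {"+": "addition", "/": "extension", ":": "relation"}


def udc_to_xml(number: str) -> str:
    """Split a UDC number at +, / and : and serialise the pieces as XML."""
    root = ET.Element("udc", {"notation": number})
    # The capturing group makes re.split keep the connectors in the result.
    for token in re.split(r"([+/:])", number):
        if not token:
            continue  # empty strings appear next to leading or doubled symbols
        if token in CONNECTORS:
            node = ET.SubElement(root, "connector", {"type": CONNECTORS[token]})
        else:
            node = ET.SubElement(root, "component")
        node.text = token
    return ET.tostring(root, encoding="unicode")


def xml_to_udc(xml_text: str) -> str:
    """Rebuild the original string by concatenating all text nodes in order."""
    return "".join(ET.fromstring(xml_text).itertext())
```

Because every character of the input ends up in exactly one text node, `xml_to_udc(udc_to_xml(n))` returns `n` unchanged for inputs such as `17:7` or `622+669`, which is one simple way to check that an intermediate format of this kind loses nothing.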
Content
Presentation at: http://www.udcds.com/seminar/2015/media/slides/Piros_InternationalUDCSeminar2015.pdf.
Theme
Automatic classification
Object
UDC

Similar documents (content)

  1. Frâncu, V.; Sabo, C.-N.: Implementation of a UDC-based multilingual thesaurus in a library catalogue : the case of BiblioPhil (2010) 0.22
    0.22181214 = sum of:
      0.22181214 = product of:
        0.693163 = sum of:
          0.035460997 = weight(abstract_txt:order in 162) [ClassicSimilarity], result of:
            0.035460997 = score(doc=162,freq=1.0), product of:
              0.10201476 = queryWeight, product of:
                1.1978296 = boost
                4.4493637 = idf(docFreq=1373, maxDocs=43254)
                0.01914124 = queryNorm
              0.34760654 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4493637 = idf(docFreq=1373, maxDocs=43254)
                0.078125 = fieldNorm(doc=162)
          0.04146108 = weight(abstract_txt:existing in 162) [ClassicSimilarity], result of:
            0.04146108 = score(doc=162,freq=1.0), product of:
              0.1132199 = queryWeight, product of:
                1.2619 = boost
                4.6873546 = idf(docFreq=1082, maxDocs=43254)
                0.01914124 = queryNorm
              0.36619958 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6873546 = idf(docFreq=1082, maxDocs=43254)
                0.078125 = fieldNorm(doc=162)
          0.06057353 = weight(abstract_txt:authority in 162) [ClassicSimilarity], result of:
            0.06057353 = score(doc=162,freq=1.0), product of:
              0.14577542 = queryWeight, product of:
                1.4318769 = boost
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.01914124 = queryNorm
              0.41552636 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.078125 = fieldNorm(doc=162)
          0.066835545 = weight(abstract_txt:classification in 162) [ClassicSimilarity], result of:
            0.066835545 = score(doc=162,freq=3.0), product of:
              0.123544686 = queryWeight, product of:
                1.6144373 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.01914124 = queryNorm
              0.5409828 = fieldWeight in 162, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.078125 = fieldNorm(doc=162)
          0.06362422 = weight(abstract_txt:standard in 162) [ClassicSimilarity], result of:
            0.06362422 = score(doc=162,freq=1.0), product of:
              0.172428 = queryWeight, product of:
                1.9072739 = boost
                4.723073 = idf(docFreq=1044, maxDocs=43254)
                0.01914124 = queryNorm
              0.36899006 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.723073 = idf(docFreq=1044, maxDocs=43254)
                0.078125 = fieldNorm(doc=162)
          0.19610132 = weight(abstract_txt:notations in 162) [ClassicSimilarity], result of:
            0.19610132 = score(doc=162,freq=1.0), product of:
              0.31901866 = queryWeight, product of:
                2.1182225 = boost
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.01914124 = queryNorm
              0.61470175 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.078125 = fieldNorm(doc=162)
          0.10641561 = weight(abstract_txt:formats in 162) [ClassicSimilarity], result of:
            0.10641561 = score(doc=162,freq=1.0), product of:
              0.24295716 = queryWeight, product of:
                2.2639875 = boost
                5.6064196 = idf(docFreq=431, maxDocs=43254)
                0.01914124 = queryNorm
              0.4380015 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6064196 = idf(docFreq=431, maxDocs=43254)
                0.078125 = fieldNorm(doc=162)
          0.1226907 = weight(abstract_txt:numbers in 162) [ClassicSimilarity], result of:
            0.1226907 = score(doc=162,freq=1.0), product of:
              0.2671369 = queryWeight, product of:
                2.3739748 = boost
                5.878787 = idf(docFreq=328, maxDocs=43254)
                0.01914124 = queryNorm
              0.45928025 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.878787 = idf(docFreq=328, maxDocs=43254)
                0.078125 = fieldNorm(doc=162)
        0.32 = coord(8/25)
    
  2. Piros, A.: The thought behind the symbol : about the automatic interpretation and representation of UDC numbers (2017) 0.13
    0.13260837 = sum of:
      0.13260837 = product of:
        0.4736013 = sum of:
          0.08619948 = weight(abstract_txt:analytico in 5318) [ClassicSimilarity], result of:
            0.08619948 = score(doc=5318,freq=1.0), product of:
              0.1698617 = queryWeight, product of:
                1.0929399 = boost
                8.119497 = idf(docFreq=34, maxDocs=43254)
                0.01914124 = queryNorm
              0.5074686 = fieldWeight in 5318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.119497 = idf(docFreq=34, maxDocs=43254)
                0.0625 = fieldNorm(doc=5318)
          0.024914004 = weight(abstract_txt:future in 5318) [ClassicSimilarity], result of:
            0.024914004 = score(doc=5318,freq=1.0), product of:
              0.09355451 = queryWeight, product of:
                1.1470858 = boost
                4.2608747 = idf(docFreq=1658, maxDocs=43254)
                0.01914124 = queryNorm
              0.26630467 = fieldWeight in 5318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2608747 = idf(docFreq=1658, maxDocs=43254)
                0.0625 = fieldNorm(doc=5318)
          0.037534557 = weight(abstract_txt:software in 5318) [ClassicSimilarity], result of:
            0.037534557 = score(doc=5318,freq=2.0), product of:
              0.0975843 = queryWeight, product of:
                1.1715302 = boost
                4.351674 = idf(docFreq=1514, maxDocs=43254)
                0.01914124 = queryNorm
              0.38463727 = fieldWeight in 5318, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.351674 = idf(docFreq=1514, maxDocs=43254)
                0.0625 = fieldNorm(doc=5318)
          0.06390778 = weight(abstract_txt:automatic in 5318) [ClassicSimilarity], result of:
            0.06390778 = score(doc=5318,freq=2.0), product of:
              0.13914306 = queryWeight, product of:
                1.3989246 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.01914124 = queryNorm
              0.45929548 = fieldWeight in 5318, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.0625 = fieldNorm(doc=5318)
          0.030870017 = weight(abstract_txt:classification in 5318) [ClassicSimilarity], result of:
            0.030870017 = score(doc=5318,freq=1.0), product of:
              0.123544686 = queryWeight, product of:
                1.6144373 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.01914124 = queryNorm
              0.24986924 = fieldWeight in 5318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.0625 = fieldNorm(doc=5318)
          0.09136679 = weight(abstract_txt:complex in 5318) [ClassicSimilarity], result of:
            0.09136679 = score(doc=5318,freq=2.0), product of:
              0.2021382 = queryWeight, product of:
                2.0650632 = boost
                5.1138144 = idf(docFreq=706, maxDocs=43254)
                0.01914124 = queryNorm
              0.4520016 = fieldWeight in 5318, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1138144 = idf(docFreq=706, maxDocs=43254)
                0.0625 = fieldNorm(doc=5318)
          0.13880867 = weight(abstract_txt:numbers in 5318) [ClassicSimilarity], result of:
            0.13880867 = score(doc=5318,freq=2.0), product of:
              0.2671369 = queryWeight, product of:
                2.3739748 = boost
                5.878787 = idf(docFreq=328, maxDocs=43254)
                0.01914124 = queryNorm
              0.51961625 = fieldWeight in 5318, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.878787 = idf(docFreq=328, maxDocs=43254)
                0.0625 = fieldNorm(doc=5318)
        0.28 = coord(7/25)
    
  3. Classification Research Group: The need for a faceted classification as the basis of all methods of information retrieval (1985) 0.11
    0.11093546 = sum of:
      0.11093546 = product of:
        0.3466733 = sum of:
          0.015571252 = weight(abstract_txt:future in 5641) [ClassicSimilarity], result of:
            0.015571252 = score(doc=5641,freq=1.0), product of:
              0.09355451 = queryWeight, product of:
                1.1470858 = boost
                4.2608747 = idf(docFreq=1658, maxDocs=43254)
                0.01914124 = queryNorm
              0.16644043 = fieldWeight in 5641, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2608747 = idf(docFreq=1658, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5641)
          0.04146108 = weight(abstract_txt:existing in 5641) [ClassicSimilarity], result of:
            0.04146108 = score(doc=5641,freq=4.0), product of:
              0.1132199 = queryWeight, product of:
                1.2619 = boost
                4.6873546 = idf(docFreq=1082, maxDocs=43254)
                0.01914124 = queryNorm
              0.36619958 = fieldWeight in 5641, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6873546 = idf(docFreq=1082, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5641)
          0.028243516 = weight(abstract_txt:automatic in 5641) [ClassicSimilarity], result of:
            0.028243516 = score(doc=5641,freq=1.0), product of:
              0.13914306 = queryWeight, product of:
                1.3989246 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.01914124 = queryNorm
              0.20298184 = fieldWeight in 5641, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5641)
          0.066835545 = weight(abstract_txt:classification in 5641) [ClassicSimilarity], result of:
            0.066835545 = score(doc=5641,freq=12.0), product of:
              0.123544686 = queryWeight, product of:
                1.6144373 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.01914124 = queryNorm
              0.5409828 = fieldWeight in 5641, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5641)
          0.04942799 = weight(abstract_txt:would in 5641) [ClassicSimilarity], result of:
            0.04942799 = score(doc=5641,freq=2.0), product of:
              0.18359102 = queryWeight, product of:
                1.9680443 = boost
                4.873562 = idf(docFreq=898, maxDocs=43254)
                0.01914124 = queryNorm
              0.2692288 = fieldWeight in 5641, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.873562 = idf(docFreq=898, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5641)
          0.040378798 = weight(abstract_txt:complex in 5641) [ClassicSimilarity], result of:
            0.040378798 = score(doc=5641,freq=1.0), product of:
              0.2021382 = queryWeight, product of:
                2.0650632 = boost
                5.1138144 = idf(docFreq=706, maxDocs=43254)
                0.01914124 = queryNorm
              0.19975838 = fieldWeight in 5641, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1138144 = idf(docFreq=706, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5641)
          0.043409795 = weight(abstract_txt:without in 5641) [ClassicSimilarity], result of:
            0.043409795 = score(doc=5641,freq=1.0), product of:
              0.21213123 = queryWeight, product of:
                2.1154923 = boost
                5.2386947 = idf(docFreq=623, maxDocs=43254)
                0.01914124 = queryNorm
              0.20463651 = fieldWeight in 5641, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2386947 = idf(docFreq=623, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5641)
          0.06134535 = weight(abstract_txt:numbers in 5641) [ClassicSimilarity], result of:
            0.06134535 = score(doc=5641,freq=1.0), product of:
              0.2671369 = queryWeight, product of:
                2.3739748 = boost
                5.878787 = idf(docFreq=328, maxDocs=43254)
                0.01914124 = queryNorm
              0.22964013 = fieldWeight in 5641, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.878787 = idf(docFreq=328, maxDocs=43254)
                0.0390625 = fieldNorm(doc=5641)
        0.32 = coord(8/25)
    
  4. Riesthuis, G.J.A.: Decomposition of UDC-numbers and the text of the UDC Master Reference File (1998) 0.11
    0.1079722 = sum of:
      0.1079722 = product of:
        0.67482626 = sum of:
          0.046305027 = weight(abstract_txt:classification in 2400) [ClassicSimilarity], result of:
            0.046305027 = score(doc=2400,freq=1.0), product of:
              0.123544686 = queryWeight, product of:
                1.6144373 = boost
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.01914124 = queryNorm
              0.37480387 = fieldWeight in 2400, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9979079 = idf(docFreq=2157, maxDocs=43254)
                0.09375 = fieldNorm(doc=2400)
          0.08388208 = weight(abstract_txt:would in 2400) [ClassicSimilarity], result of:
            0.08388208 = score(doc=2400,freq=1.0), product of:
              0.18359102 = queryWeight, product of:
                1.9680443 = boost
                4.873562 = idf(docFreq=898, maxDocs=43254)
                0.01914124 = queryNorm
              0.45689642 = fieldWeight in 2400, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.873562 = idf(docFreq=898, maxDocs=43254)
                0.09375 = fieldNorm(doc=2400)
          0.13705018 = weight(abstract_txt:complex in 2400) [ClassicSimilarity], result of:
            0.13705018 = score(doc=2400,freq=2.0), product of:
              0.2021382 = queryWeight, product of:
                2.0650632 = boost
                5.1138144 = idf(docFreq=706, maxDocs=43254)
                0.01914124 = queryNorm
              0.6780024 = fieldWeight in 2400, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1138144 = idf(docFreq=706, maxDocs=43254)
                0.09375 = fieldNorm(doc=2400)
          0.407589 = weight(abstract_txt:notations in 2400) [ClassicSimilarity], result of:
            0.407589 = score(doc=2400,freq=3.0), product of:
              0.31901866 = queryWeight, product of:
                2.1182225 = boost
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.01914124 = queryNorm
              1.2776337 = fieldWeight in 2400, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.8681827 = idf(docFreq=44, maxDocs=43254)
                0.09375 = fieldNorm(doc=2400)
        0.16 = coord(4/25)
    
  5. Tillett, B.B.: Numbers to identify entities (ISADN's-International Standard Authority Data Numbers) (2007) 0.10
    0.102977775 = sum of:
      0.102977775 = product of:
        0.5148889 = sum of:
          0.043599505 = weight(abstract_txt:future in 2793) [ClassicSimilarity], result of:
            0.043599505 = score(doc=2793,freq=1.0), product of:
              0.09355451 = queryWeight, product of:
                1.1470858 = boost
                4.2608747 = idf(docFreq=1658, maxDocs=43254)
                0.01914124 = queryNorm
              0.46603316 = fieldWeight in 2793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2608747 = idf(docFreq=1658, maxDocs=43254)
                0.109375 = fieldNorm(doc=2793)
          0.05804551 = weight(abstract_txt:existing in 2793) [ClassicSimilarity], result of:
            0.05804551 = score(doc=2793,freq=1.0), product of:
              0.1132199 = queryWeight, product of:
                1.2619 = boost
                4.6873546 = idf(docFreq=1082, maxDocs=43254)
                0.01914124 = queryNorm
              0.5126794 = fieldWeight in 2793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6873546 = idf(docFreq=1082, maxDocs=43254)
                0.109375 = fieldNorm(doc=2793)
          0.11992947 = weight(abstract_txt:authority in 2793) [ClassicSimilarity], result of:
            0.11992947 = score(doc=2793,freq=2.0), product of:
              0.14577542 = queryWeight, product of:
                1.4318769 = boost
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.01914124 = queryNorm
              0.8227002 = fieldWeight in 2793, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3187375 = idf(docFreq=575, maxDocs=43254)
                0.109375 = fieldNorm(doc=2793)
          0.12154743 = weight(abstract_txt:without in 2793) [ClassicSimilarity], result of:
            0.12154743 = score(doc=2793,freq=1.0), product of:
              0.21213123 = queryWeight, product of:
                2.1154923 = boost
                5.2386947 = idf(docFreq=623, maxDocs=43254)
                0.01914124 = queryNorm
              0.57298225 = fieldWeight in 2793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2386947 = idf(docFreq=623, maxDocs=43254)
                0.109375 = fieldNorm(doc=2793)
          0.17176698 = weight(abstract_txt:numbers in 2793) [ClassicSimilarity], result of:
            0.17176698 = score(doc=2793,freq=1.0), product of:
              0.2671369 = queryWeight, product of:
                2.3739748 = boost
                5.878787 = idf(docFreq=328, maxDocs=43254)
                0.01914124 = queryNorm
              0.6429923 = fieldWeight in 2793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.878787 = idf(docFreq=328, maxDocs=43254)
                0.109375 = fieldNorm(doc=2793)
        0.2 = coord(5/25)
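
The score trees above all follow the same arithmetic, which the sketch below reproduces for result 1 (doc 162). The formulas are read off the listing itself: queryWeight = boost × idf × queryNorm, fieldWeight = tf × idf × fieldNorm with tf = sqrt(termFreq), and the document score is the sum of the term scores multiplied by coord (matched terms / query terms). The function and variable names are illustrative and are not Lucene API calls.

```python
import math


def classic_term_score(freq, idf, boost, query_norm, field_norm):
    """Score of one matching term, following the explain tree's layout."""
    query_weight = boost * idf * query_norm   # "queryWeight, product of"
    tf = math.sqrt(freq)                      # e.g. 1.7320508 = tf(freq=3.0)
    field_weight = tf * idf * field_norm      # "fieldWeight ..., product of"
    return query_weight * field_weight


def classic_doc_score(term_scores, matched, total):
    """Sum of term scores scaled by coord(matched/total)."""
    return sum(term_scores) * (matched / total)


QUERY_NORM = 0.01914124       # same for every term of the query
FIELD_NORM_DOC_162 = 0.078125

# (termFreq, idf, boost) for the eight matching terms of result 1
TERMS_DOC_162 = [
    (1.0, 4.4493637, 1.1978296),  # order
    (1.0, 4.6873546, 1.2619),     # existing
    (1.0, 5.3187375, 1.4318769),  # authority
    (3.0, 3.9979079, 1.6144373),  # classification
    (1.0, 4.723073, 1.9072739),   # standard
    (1.0, 7.8681827, 2.1182225),  # notations
    (1.0, 5.6064196, 2.2639875),  # formats
    (1.0, 5.878787, 2.3739748),   # numbers
]

scores_doc_162 = [
    classic_term_score(freq, idf, boost, QUERY_NORM, FIELD_NORM_DOC_162)
    for freq, idf, boost in TERMS_DOC_162
]

# 8 of the 25 query terms match, hence coord(8/25) = 0.32 in the listing.
total_doc_162 = classic_doc_score(scores_doc_162, matched=8, total=25)
```

The listing reports 0.035460997 for the term "order" and 0.22181214 for the document as a whole; the sketch agrees with both to within floating-point rounding, since Lucene computes in single precision while Python uses doubles. The same two functions apply to results 2 to 5 once their fieldNorm and coord values are substituted.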