Document (#39302)

Author
Piros, A.
Title
Automatic interpretation of complex UDC numbers : towards support for library systems
Source
Classification and authority control: expanding resource discovery: proceedings of the International UDC Seminar 2015, 29-30 October 2015, Lisbon, Portugal. Eds.: Slavic, A. and M.I. Cordeiro
Imprint
Würzburg : Ergon-Verlag
Year
2015
Pages
pp. 177-194
Abstract
Analytico-synthetic and faceted classifications, such as the Universal Decimal Classification (UDC), express the content of documents with complex, pre-combined classification codes. Without classification authority control to help manage and access structured notations, the use of UDC codes in searching and browsing is limited. Existing UDC parsing solutions are usually created for a particular database system or a specific task and are not widely applicable. The approach described in this paper provides a solution in which the analysis and interpretation of UDC notations is stored in an intermediate format (in this case, XML) by automatic means, without any loss of data or information. Because the output file is so rich, it can be converted into different formats, such as standard mark-up and data exchange formats or simple lists of the recommended entry points of a UDC number. The program can also be used to create authority records containing complex UDC numbers, which can be comprehensively analysed so that they can be retrieved effectively. The Java program, as well as the corresponding schema definition it employs, is under continuous development. The current version of the interpreter software is available online for testing purposes at the following web site: http://interpreter-eto.rhcloud.com. The future plan is to implement conversion methods for standard formats and to create standard online interfaces so that the software's features can be used as a service. This would allow the algorithm to be employed in both existing and future library systems to analyse UDC numbers without any significant programming effort.
Content
Presentation at: http://www.udcds.com/seminar/2015/media/slides/Piros_InternationalUDCSeminar2015.pdf.
Theme
Automatic classification
Object
UDC

Similar documents (content)

  1. Frâncu, V.; Sabo, C.-N.: Implementation of a UDC-based multilingual thesaurus in a library catalogue : the case of BiblioPhil (2010) 0.22
    0.22037297 = sum of:
      0.22037297 = product of:
        0.68866557 = sum of:
          0.035219222 = weight(abstract_txt:order in 3697) [ClassicSimilarity], result of:
            0.035219222 = score(doc=3697,freq=1.0), product of:
              0.10137393 = queryWeight, product of:
                1.194073 = boost
                4.446962 = idf(docFreq=1407, maxDocs=44218)
                0.019091146 = queryNorm
              0.3474189 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.446962 = idf(docFreq=1407, maxDocs=44218)
                0.078125 = fieldNorm(doc=3697)
          0.04029578 = weight(abstract_txt:existing in 3697) [ClassicSimilarity], result of:
            0.04029578 = score(doc=3697,freq=1.0), product of:
              0.110895224 = queryWeight, product of:
                1.2488899 = boost
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.019091146 = queryNorm
              0.36336803 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.078125 = fieldNorm(doc=3697)
          0.060248666 = weight(abstract_txt:authority in 3697) [ClassicSimilarity], result of:
            0.060248666 = score(doc=3697,freq=1.0), product of:
              0.14500114 = queryWeight, product of:
                1.428083 = boost
                5.318461 = idf(docFreq=588, maxDocs=44218)
                0.019091146 = queryNorm
              0.41550475 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.318461 = idf(docFreq=588, maxDocs=44218)
                0.078125 = fieldNorm(doc=3697)
          0.06619688 = weight(abstract_txt:classification in 3697) [ClassicSimilarity], result of:
            0.06619688 = score(doc=3697,freq=3.0), product of:
              0.12254291 = queryWeight, product of:
                1.6078942 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.019091146 = queryNorm
              0.5401935 = fieldWeight in 3697, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=3697)
          0.06337917 = weight(abstract_txt:standard in 3697) [ClassicSimilarity], result of:
            0.06337917 = score(doc=3697,freq=1.0), product of:
              0.17168587 = queryWeight, product of:
                1.9031835 = boost
                4.725219 = idf(docFreq=1065, maxDocs=44218)
                0.019091146 = queryNorm
              0.36915773 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.725219 = idf(docFreq=1065, maxDocs=44218)
                0.078125 = fieldNorm(doc=3697)
          0.19508472 = weight(abstract_txt:notations in 3697) [ClassicSimilarity], result of:
            0.19508472 = score(doc=3697,freq=1.0), product of:
              0.31736228 = queryWeight, product of:
                2.1127367 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.019091146 = queryNorm
              0.6147067 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.078125 = fieldNorm(doc=3697)
          0.105942 = weight(abstract_txt:formats in 3697) [ClassicSimilarity], result of:
            0.105942 = score(doc=3697,freq=1.0), product of:
              0.24181451 = queryWeight, product of:
                2.2586792 = boost
                5.6078424 = idf(docFreq=440, maxDocs=44218)
                0.019091146 = queryNorm
              0.43811268 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6078424 = idf(docFreq=440, maxDocs=44218)
                0.078125 = fieldNorm(doc=3697)
          0.12229914 = weight(abstract_txt:numbers in 3697) [ClassicSimilarity], result of:
            0.12229914 = score(doc=3697,freq=1.0), product of:
              0.26610467 = queryWeight, product of:
                2.3694067 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.019091146 = queryNorm
              0.45959038 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.078125 = fieldNorm(doc=3697)
        0.32 = coord(8/25)
    
  2. Piros, A.: Az ETO-jelzetek automatikus interpretálásának és elemzésének kérdései (2018) 0.17
    0.16871381 = sum of:
      0.16871381 = product of:
        0.6025493 = sum of:
          0.074442685 = weight(abstract_txt:richness in 855) [ClassicSimilarity], result of:
            0.074442685 = score(doc=855,freq=1.0), product of:
              0.15377456 = queryWeight, product of:
                1.0399082 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.019091146 = queryNorm
              0.48410273 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.02648247 = weight(abstract_txt:software in 855) [ClassicSimilarity], result of:
            0.02648247 = score(doc=855,freq=1.0), product of:
              0.09727147 = queryWeight, product of:
                1.1696622 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.019091146 = queryNorm
              0.27225322 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.045589466 = weight(abstract_txt:existing in 855) [ClassicSimilarity], result of:
            0.045589466 = score(doc=855,freq=2.0), product of:
              0.110895224 = queryWeight, product of:
                1.2488899 = boost
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.019091146 = queryNorm
              0.41110396 = fieldWeight in 855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.055781 = weight(abstract_txt:would in 855) [ClassicSimilarity], result of:
            0.055781 = score(doc=855,freq=1.0), product of:
              0.18296489 = queryWeight, product of:
                1.9647045 = boost
                4.877963 = idf(docFreq=914, maxDocs=44218)
                0.019091146 = queryNorm
              0.3048727 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.877963 = idf(docFreq=914, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.06358743 = weight(abstract_txt:complex in 855) [ClassicSimilarity], result of:
            0.06358743 = score(doc=855,freq=1.0), product of:
              0.19966 = queryWeight, product of:
                2.0523853 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.019091146 = queryNorm
              0.31847855 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.097009845 = weight(abstract_txt:without in 855) [ClassicSimilarity], result of:
            0.097009845 = score(doc=855,freq=2.0), product of:
              0.210012 = queryWeight, product of:
                2.1049192 = boost
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.019091146 = queryNorm
              0.46192524 = fieldWeight in 855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.2396564 = weight(abstract_txt:numbers in 855) [ClassicSimilarity], result of:
            0.2396564 = score(doc=855,freq=6.0), product of:
              0.26610467 = queryWeight, product of:
                2.3694067 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.019091146 = queryNorm
              0.90060955 = fieldWeight in 855, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
        0.28 = coord(7/25)
    
  3. Piros, A.: ¬The thought behind the symbol : about the automatic interpretation and representation of UDC numbers (2017) 0.13
    0.13182591 = sum of:
      0.13182591 = product of:
        0.4708068 = sum of:
          0.08645082 = weight(abstract_txt:analytico in 3853) [ClassicSimilarity], result of:
            0.08645082 = score(doc=3853,freq=1.0), product of:
              0.16989577 = queryWeight, product of:
                1.09306 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.019091146 = queryNorm
              0.5088462 = fieldWeight in 3853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.024489306 = weight(abstract_txt:future in 3853) [ClassicSimilarity], result of:
            0.024489306 = score(doc=3853,freq=1.0), product of:
              0.09232744 = queryWeight, product of:
                1.1395494 = boost
                4.243905 = idf(docFreq=1724, maxDocs=44218)
                0.019091146 = queryNorm
              0.26524407 = fieldWeight in 3853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.243905 = idf(docFreq=1724, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.03745187 = weight(abstract_txt:software in 3853) [ClassicSimilarity], result of:
            0.03745187 = score(doc=3853,freq=2.0), product of:
              0.09727147 = queryWeight, product of:
                1.1696622 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.019091146 = queryNorm
              0.3850242 = fieldWeight in 3853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.06354787 = weight(abstract_txt:automatic in 3853) [ClassicSimilarity], result of:
            0.06354787 = score(doc=3853,freq=2.0), product of:
              0.1383791 = queryWeight, product of:
                1.3950925 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.019091146 = queryNorm
              0.45923027 = fieldWeight in 3853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.030575031 = weight(abstract_txt:classification in 3853) [ClassicSimilarity], result of:
            0.030575031 = score(doc=3853,freq=1.0), product of:
              0.12254291 = queryWeight, product of:
                1.6078942 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.019091146 = queryNorm
              0.2495047 = fieldWeight in 3853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.089926206 = weight(abstract_txt:complex in 3853) [ClassicSimilarity], result of:
            0.089926206 = score(doc=3853,freq=2.0), product of:
              0.19966 = queryWeight, product of:
                2.0523853 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.019091146 = queryNorm
              0.4503967 = fieldWeight in 3853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.13836569 = weight(abstract_txt:numbers in 3853) [ClassicSimilarity], result of:
            0.13836569 = score(doc=3853,freq=2.0), product of:
              0.26610467 = queryWeight, product of:
                2.3694067 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.019091146 = queryNorm
              0.51996714 = fieldWeight in 3853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
        0.28 = coord(7/25)
    
  4. Classification Research Group: ¬The need for a faceted classification as the basis of all methods of information retrieval (1985) 0.11
    0.1097444 = sum of:
      0.1097444 = product of:
        0.34295127 = sum of:
          0.015305815 = weight(abstract_txt:future in 3640) [ClassicSimilarity], result of:
            0.015305815 = score(doc=3640,freq=1.0), product of:
              0.09232744 = queryWeight, product of:
                1.1395494 = boost
                4.243905 = idf(docFreq=1724, maxDocs=44218)
                0.019091146 = queryNorm
              0.16577753 = fieldWeight in 3640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.243905 = idf(docFreq=1724, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3640)
          0.04029578 = weight(abstract_txt:existing in 3640) [ClassicSimilarity], result of:
            0.04029578 = score(doc=3640,freq=4.0), product of:
              0.110895224 = queryWeight, product of:
                1.2488899 = boost
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.019091146 = queryNorm
              0.36336803 = fieldWeight in 3640, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3640)
          0.028084459 = weight(abstract_txt:automatic in 3640) [ClassicSimilarity], result of:
            0.028084459 = score(doc=3640,freq=1.0), product of:
              0.1383791 = queryWeight, product of:
                1.3950925 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.019091146 = queryNorm
              0.20295304 = fieldWeight in 3640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3640)
          0.06619688 = weight(abstract_txt:classification in 3640) [ClassicSimilarity], result of:
            0.06619688 = score(doc=3640,freq=12.0), product of:
              0.12254291 = queryWeight, product of:
                1.6078942 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.019091146 = queryNorm
              0.5401935 = fieldWeight in 3640, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3640)
          0.049303904 = weight(abstract_txt:would in 3640) [ClassicSimilarity], result of:
            0.049303904 = score(doc=3640,freq=2.0), product of:
              0.18296489 = queryWeight, product of:
                1.9647045 = boost
                4.877963 = idf(docFreq=914, maxDocs=44218)
                0.019091146 = queryNorm
              0.26947194 = fieldWeight in 3640, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.877963 = idf(docFreq=914, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3640)
          0.039742146 = weight(abstract_txt:complex in 3640) [ClassicSimilarity], result of:
            0.039742146 = score(doc=3640,freq=1.0), product of:
              0.19966 = queryWeight, product of:
                2.0523853 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.019091146 = queryNorm
              0.1990491 = fieldWeight in 3640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3640)
          0.0428727 = weight(abstract_txt:without in 3640) [ClassicSimilarity], result of:
            0.0428727 = score(doc=3640,freq=1.0), product of:
              0.210012 = queryWeight, product of:
                2.1049192 = boost
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.019091146 = queryNorm
              0.20414405 = fieldWeight in 3640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3640)
          0.06114957 = weight(abstract_txt:numbers in 3640) [ClassicSimilarity], result of:
            0.06114957 = score(doc=3640,freq=1.0), product of:
              0.26610467 = queryWeight, product of:
                2.3694067 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.019091146 = queryNorm
              0.22979519 = fieldWeight in 3640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3640)
        0.32 = coord(8/25)
    
  5. Riesthuis, G.J.A.: Decomposition of UDC-numbers and the text of the UDC Master Reference File (1998) 0.11
    0.10718388 = sum of:
      0.10718388 = product of:
        0.6698993 = sum of:
          0.045862548 = weight(abstract_txt:classification in 399) [ClassicSimilarity], result of:
            0.045862548 = score(doc=399,freq=1.0), product of:
              0.12254291 = queryWeight, product of:
                1.6078942 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.019091146 = queryNorm
              0.37425706 = fieldWeight in 399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.09375 = fieldNorm(doc=399)
          0.083671495 = weight(abstract_txt:would in 399) [ClassicSimilarity], result of:
            0.083671495 = score(doc=399,freq=1.0), product of:
              0.18296489 = queryWeight, product of:
                1.9647045 = boost
                4.877963 = idf(docFreq=914, maxDocs=44218)
                0.019091146 = queryNorm
              0.45730904 = fieldWeight in 399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.877963 = idf(docFreq=914, maxDocs=44218)
                0.09375 = fieldNorm(doc=399)
          0.1348893 = weight(abstract_txt:complex in 399) [ClassicSimilarity], result of:
            0.1348893 = score(doc=399,freq=2.0), product of:
              0.19966 = queryWeight, product of:
                2.0523853 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.019091146 = queryNorm
              0.67559505 = fieldWeight in 399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.09375 = fieldNorm(doc=399)
          0.40547594 = weight(abstract_txt:notations in 399) [ClassicSimilarity], result of:
            0.40547594 = score(doc=399,freq=3.0), product of:
              0.31736228 = queryWeight, product of:
                2.1127367 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.019091146 = queryNorm
              1.2776438 = fieldWeight in 399, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.09375 = fieldNorm(doc=399)
        0.16 = coord(4/25)
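
The score trees above all follow the same arithmetic, and a short sketch may make it easier to check a value by hand. The Python below is only an illustration: the function names are hypothetical, and the idf formula, 1 + ln(maxDocs / (docFreq + 1)), is an assumption inferred from the figures in the listing rather than taken from the search engine's source. The tf value is the square root of the term frequency, which matches the tf lines shown (for example 1.7320508 for freq=3.0). The boost, docFreq, freq and fieldNorm inputs are copied from entry 5 (Riesthuis).

```python
import math

QUERY_NORM = 0.019091146  # queryNorm as reported in the listing
MAX_DOCS = 44218          # maxDocs as reported in the listing
QUERY_TERMS = 25          # denominator of coord(n/25) in the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed formula; it reproduces the idf values shown in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    # score = queryWeight * fieldWeight, as in each "product of" node.
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm):
    # Sum the matching term scores, then scale by coord(matched / total).
    matched_sum = sum(term_score(b, df, f, field_norm) for b, df, f in terms)
    return matched_sum * len(terms) / QUERY_TERMS


# (boost, docFreq, freq) for the four terms matched in doc 399.
riesthuis_terms = [
    (1.6078942, 2218, 1.0),  # classification
    (1.9647045, 914, 1.0),   # would
    (2.0523853, 735, 2.0),   # complex
    (2.1127367, 45, 3.0),    # notations
]
score = document_score(riesthuis_terms, field_norm=0.09375)
print(score)  # should land close to the 0.10718388 reported for entry 5
```

Small differences in the last digits are to be expected, since the listing appears to use single-precision values while Python uses double precision.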