Document (#43857)

Author
Piros, A.
Title
Az ETO-jelzetek automatikus interpretálásának és elemzésének kérdései [Questions of the automatic interpretation and analysis of UDC numbers]
Imprint
Debrecen : Debreceni Egyetem, Matematika- és Számítástudományok Doktori Iskola
Year
2018
Pages
48 pp.
Abstract
Converting UDC numbers manually to a complex format such as the one mentioned above is an unrealistic expectation; support for building these representations, as automatically as possible, is a well-founded requirement. An additional advantage of this approach is that existing records could also be processed and converted. In my dissertation I also aim to prove that it is possible to design and implement an algorithm that converts pre-coordinated UDC numbers into the introduced format by identifying all their elements and revealing their whole syntactic structure. I also discuss a feasible way of building a UDC-specific XML schema for describing the most detailed and complicated UDC numbers (containing not only the common auxiliary signs and numbers, but also the different types of special auxiliaries). The schema definition is available online at: http://piros.udc-interpreter.hu#xsd. The primary goal of my research is to prove that it is possible to support building, retrieving, and analyzing UDC numbers without compromises, by exploiting the whole syntactic richness of the scheme and storing the UDC numbers in a way that preserves the meaning of pre-coordination. The research also included the implementation of software that parses UDC classmarks, intended to prove that such a solution can be applied automatically without any additional effort, even retrospectively on existing collections.
Content
See also: New automatic interpreter for complex UDC numbers. At: <https://udcc.org/files/AttilaPiros_EC_36-37_2014-2015.pdf>
Footnote
Theses of a university doctoral (PhD) dissertation.
Object
UDC

Similar documents (content)

  1. Piros, A.: ¬The thought behind the symbol : about the automatic interpretation and representation of UDC numbers (2017) 0.25
    0.24946094 = sum of:
      0.24946094 = product of:
        0.77956545 = sum of:
          0.20858207 = weight(abstract_txt:classmarks in 3853) [ClassicSimilarity], result of:
            0.20858207 = score(doc=3853,freq=3.0), product of:
              0.2051029 = queryWeight, product of:
                1.2023615 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.018158177 = queryNorm
              1.016963 = fieldWeight in 3853, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.054315172 = weight(abstract_txt:format in 3853) [ClassicSimilarity], result of:
            0.054315172 = score(doc=3853,freq=2.0), product of:
              0.12062621 = queryWeight, product of:
                1.3040222 = boost
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.018158177 = queryNorm
              0.4502767 = fieldWeight in 3853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.0597811 = weight(abstract_txt:whole in 3853) [ClassicSimilarity], result of:
            0.0597811 = score(doc=3853,freq=1.0), product of:
              0.16201188 = queryWeight, product of:
                1.5112543 = boost
                5.9038734 = idf(docFreq=327, maxDocs=44218)
                0.018158177 = queryNorm
              0.3689921 = fieldWeight in 3853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9038734 = idf(docFreq=327, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.11420616 = weight(abstract_txt:syntactic in 3853) [ClassicSimilarity], result of:
            0.11420616 = score(doc=3853,freq=2.0), product of:
              0.19797967 = queryWeight, product of:
                1.6706076 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.018158177 = queryNorm
              0.576858 = fieldWeight in 3853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.03224788 = weight(abstract_txt:also in 3853) [ClassicSimilarity], result of:
            0.03224788 = score(doc=3853,freq=2.0), product of:
              0.10735897 = queryWeight, product of:
                1.7397959 = boost
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.018158177 = queryNorm
              0.30037433 = fieldWeight in 3853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.043118894 = weight(abstract_txt:possible in 3853) [ClassicSimilarity], result of:
            0.043118894 = score(doc=3853,freq=1.0), product of:
              0.14915794 = queryWeight, product of:
                1.7759591 = boost
                4.6253138 = idf(docFreq=1177, maxDocs=44218)
                0.018158177 = queryNorm
              0.2890821 = fieldWeight in 3853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6253138 = idf(docFreq=1177, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.016396284 = weight(abstract_txt:that in 3853) [ClassicSimilarity], result of:
            0.016396284 = score(doc=3853,freq=2.0), product of:
              0.078288555 = queryWeight, product of:
                1.8195915 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018158177 = queryNorm
              0.20943399 = fieldWeight in 3853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
          0.2509179 = weight(abstract_txt:numbers in 3853) [ClassicSimilarity], result of:
            0.2509179 = score(doc=3853,freq=2.0), product of:
              0.48256496 = queryWeight, product of:
                4.5175467 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.018158177 = queryNorm
              0.51996714 = fieldWeight in 3853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.0625 = fieldNorm(doc=3853)
        0.32 = coord(8/25)
    
  2. Piros, A.: Automatic interpretation of complex UDC numbers : towards support for library systems (2015) 0.18
    0.18102989 = sum of:
      0.18102989 = product of:
        0.5028608 = sum of:
          0.015155116 = weight(abstract_txt:well in 2301) [ClassicSimilarity], result of:
            0.015155116 = score(doc=2301,freq=1.0), product of:
              0.07093682 = queryWeight, product of:
                3.9066048 = idf(docFreq=2416, maxDocs=44218)
                0.018158177 = queryNorm
              0.21364245 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9066048 = idf(docFreq=2416, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.03616978 = weight(abstract_txt:existing in 2301) [ClassicSimilarity], result of:
            0.03616978 = score(doc=2301,freq=2.0), product of:
              0.100550935 = queryWeight, product of:
                1.1905762 = boost
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.018158177 = queryNorm
              0.35971597 = fieldWeight in 2301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.0336058 = weight(abstract_txt:format in 2301) [ClassicSimilarity], result of:
            0.0336058 = score(doc=2301,freq=1.0), product of:
              0.12062621 = queryWeight, product of:
                1.3040222 = boost
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.018158177 = queryNorm
              0.2785945 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.06284222 = weight(abstract_txt:without in 2301) [ClassicSimilarity], result of:
            0.06284222 = score(doc=2301,freq=3.0), product of:
              0.12694807 = queryWeight, product of:
                1.3377569 = boost
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.018158177 = queryNorm
              0.495023 = fieldWeight in 2301, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.06770861 = weight(abstract_txt:schema in 2301) [ClassicSimilarity], result of:
            0.06770861 = score(doc=2301,freq=1.0), product of:
              0.192425 = queryWeight, product of:
                1.6470048 = boost
                6.434197 = idf(docFreq=192, maxDocs=44218)
                0.018158177 = queryNorm
              0.35187015 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.434197 = idf(docFreq=192, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.019952357 = weight(abstract_txt:also in 2301) [ClassicSimilarity], result of:
            0.019952357 = score(doc=2301,freq=1.0), product of:
              0.10735897 = queryWeight, product of:
                1.7397959 = boost
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.018158177 = queryNorm
              0.18584713 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.037729032 = weight(abstract_txt:possible in 2301) [ClassicSimilarity], result of:
            0.037729032 = score(doc=2301,freq=1.0), product of:
              0.14915794 = queryWeight, product of:
                1.7759591 = boost
                4.6253138 = idf(docFreq=1177, maxDocs=44218)
                0.018158177 = queryNorm
              0.25294685 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6253138 = idf(docFreq=1177, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.0101446835 = weight(abstract_txt:that in 2301) [ClassicSimilarity], result of:
            0.0101446835 = score(doc=2301,freq=1.0), product of:
              0.078288555 = queryWeight, product of:
                1.8195915 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018158177 = queryNorm
              0.12958068 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.21955319 = weight(abstract_txt:numbers in 2301) [ClassicSimilarity], result of:
            0.21955319 = score(doc=2301,freq=2.0), product of:
              0.48256496 = queryWeight, product of:
                4.5175467 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.018158177 = queryNorm
              0.45497125 = fieldWeight in 2301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
        0.36 = coord(9/25)
    
  3. Beall, J.: Representation DDC system in MARC 21 (2008) 0.15
    0.14729838 = sum of:
      0.14729838 = product of:
        0.73649186 = sum of:
          0.021650165 = weight(abstract_txt:well in 2167) [ClassicSimilarity], result of:
            0.021650165 = score(doc=2167,freq=1.0), product of:
              0.07093682 = queryWeight, product of:
                3.9066048 = idf(docFreq=2416, maxDocs=44218)
                0.018158177 = queryNorm
              0.3052035 = fieldWeight in 2167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9066048 = idf(docFreq=2416, maxDocs=44218)
                0.078125 = fieldNorm(doc=2167)
          0.04800828 = weight(abstract_txt:format in 2167) [ClassicSimilarity], result of:
            0.04800828 = score(doc=2167,freq=1.0), product of:
              0.12062621 = queryWeight, product of:
                1.3040222 = boost
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.018158177 = queryNorm
              0.39799213 = fieldWeight in 2167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.078125 = fieldNorm(doc=2167)
          0.06556046 = weight(abstract_txt:additional in 2167) [ClassicSimilarity], result of:
            0.06556046 = score(doc=2167,freq=1.0), product of:
              0.1484769 = queryWeight, product of:
                1.4467503 = boost
                5.6518817 = idf(docFreq=421, maxDocs=44218)
                0.018158177 = queryNorm
              0.44155326 = fieldWeight in 2167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6518817 = idf(docFreq=421, maxDocs=44218)
                0.078125 = fieldNorm(doc=2167)
          0.014492406 = weight(abstract_txt:that in 2167) [ClassicSimilarity], result of:
            0.014492406 = score(doc=2167,freq=1.0), product of:
              0.078288555 = queryWeight, product of:
                1.8195915 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018158177 = queryNorm
              0.18511525 = fieldWeight in 2167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2167)
          0.58678055 = weight(abstract_txt:numbers in 2167) [ClassicSimilarity], result of:
            0.58678055 = score(doc=2167,freq=7.0), product of:
              0.48256496 = queryWeight, product of:
                4.5175467 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.018158177 = queryNorm
              1.2159618 = fieldWeight in 2167, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.078125 = fieldNorm(doc=2167)
        0.2 = coord(5/25)
    
  4. Broadbent, E.: Classification access in the online catalog (1995) 0.12
    0.11541015 = sum of:
      0.11541015 = product of:
        0.57705075 = sum of:
          0.057609938 = weight(abstract_txt:format in 5571) [ClassicSimilarity], result of:
            0.057609938 = score(doc=5571,freq=1.0), product of:
              0.12062621 = queryWeight, product of:
                1.3040222 = boost
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.018158177 = queryNorm
              0.47759056 = fieldWeight in 5571, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.09375 = fieldNorm(doc=5571)
          0.03420404 = weight(abstract_txt:also in 5571) [ClassicSimilarity], result of:
            0.03420404 = score(doc=5571,freq=1.0), product of:
              0.10735897 = queryWeight, product of:
                1.7397959 = boost
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.018158177 = queryNorm
              0.31859508 = fieldWeight in 5571, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.09375 = fieldNorm(doc=5571)
          0.09146898 = weight(abstract_txt:possible in 5571) [ClassicSimilarity], result of:
            0.09146898 = score(doc=5571,freq=2.0), product of:
              0.14915794 = queryWeight, product of:
                1.7759591 = boost
                4.6253138 = idf(docFreq=1177, maxDocs=44218)
                0.018158177 = queryNorm
              0.6132358 = fieldWeight in 5571, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6253138 = idf(docFreq=1177, maxDocs=44218)
                0.09375 = fieldNorm(doc=5571)
          0.017390886 = weight(abstract_txt:that in 5571) [ClassicSimilarity], result of:
            0.017390886 = score(doc=5571,freq=1.0), product of:
              0.078288555 = queryWeight, product of:
                1.8195915 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018158177 = queryNorm
              0.22213829 = fieldWeight in 5571, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=5571)
          0.3763769 = weight(abstract_txt:numbers in 5571) [ClassicSimilarity], result of:
            0.3763769 = score(doc=5571,freq=2.0), product of:
              0.48256496 = queryWeight, product of:
                4.5175467 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.018158177 = queryNorm
              0.77995074 = fieldWeight in 5571, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.09375 = fieldNorm(doc=5571)
        0.2 = coord(5/25)
    
  5. Cortez, E.; Silva, A.S. da; Gonçalves, M.A.; Mesquita, F.; Moura, E.S. de: ¬A flexible approach for extracting metadata from bibliographic citations (2009) 0.11
    0.108863376 = sum of:
      0.108863376 = product of:
        0.38879776 = sum of:
          0.025575895 = weight(abstract_txt:existing in 2848) [ClassicSimilarity], result of:
            0.025575895 = score(doc=2848,freq=1.0), product of:
              0.100550935 = queryWeight, product of:
                1.1905762 = boost
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.018158177 = queryNorm
              0.2543576 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6511106 = idf(docFreq=1147, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.0336058 = weight(abstract_txt:format in 2848) [ClassicSimilarity], result of:
            0.0336058 = score(doc=2848,freq=1.0), product of:
              0.12062621 = queryWeight, product of:
                1.3040222 = boost
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.018158177 = queryNorm
              0.2785945 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.03628197 = weight(abstract_txt:without in 2848) [ClassicSimilarity], result of:
            0.03628197 = score(doc=2848,freq=1.0), product of:
              0.12694807 = queryWeight, product of:
                1.3377569 = boost
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.018158177 = queryNorm
              0.28580165 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.060360692 = weight(abstract_txt:automatically in 2848) [ClassicSimilarity], result of:
            0.060360692 = score(doc=2848,freq=2.0), product of:
              0.14146805 = queryWeight, product of:
                1.4121906 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.018158177 = queryNorm
              0.42667368 = fieldWeight in 2848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.017571107 = weight(abstract_txt:that in 2848) [ClassicSimilarity], result of:
            0.017571107 = score(doc=2848,freq=3.0), product of:
              0.078288555 = queryWeight, product of:
                1.8195915 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.018158177 = queryNorm
              0.22444029 = fieldWeight in 2848, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.060154736 = weight(abstract_txt:building in 2848) [ClassicSimilarity], result of:
            0.060154736 = score(doc=2848,freq=1.0), product of:
              0.20356785 = queryWeight, product of:
                2.0747433 = boost
                5.403468 = idf(docFreq=540, maxDocs=44218)
                0.018158177 = queryNorm
              0.29550216 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.403468 = idf(docFreq=540, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.15524755 = weight(abstract_txt:numbers in 2848) [ClassicSimilarity], result of:
            0.15524755 = score(doc=2848,freq=1.0), product of:
              0.48256496 = queryWeight, product of:
                4.5175467 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.018158177 = queryNorm
              0.32171327 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
        0.28 = coord(7/25)
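
The score explanations above all follow the same arithmetic, so one of them can be recomputed by hand. The sketch below reproduces the first entry (doc 3853) from the values printed in its tree. It assumes the usual ClassicSimilarity definitions, which are consistent with the listed numbers: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final multiplication by coord (matched terms / query terms). The function and variable names are illustrative only and are not part of the catalog system.

```python
import math

# Constants copied from the explanation trees above.
MAX_DOCS = 44218
QUERY_NORM = 0.018158177


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total):
    """Sum of the term scores, scaled by coord(matched/total)."""
    raw = sum(term_score(freq, doc_freq, boost, field_norm)
              for _term, freq, doc_freq, boost in terms)
    return raw * matched / total


# (term, freq, docFreq, boost) as listed for doc 3853.
TERMS_3853 = [
    ("classmarks", 3.0, 9, 1.2023615),
    ("format", 2.0, 736, 1.3040222),
    ("whole", 1.0, 327, 1.5112543),
    ("syntactic", 2.0, 175, 1.6706076),
    ("also", 2.0, 4017, 1.7397959),
    ("possible", 1.0, 1177, 1.7759591),
    ("that", 2.0, 11241, 1.8195915),
    ("numbers", 2.0, 334, 4.5175467),
]

# fieldNorm(doc=3853) = 0.0625, coord(8/25)
score = document_score(TERMS_3853, field_norm=0.0625, matched=8, total=25)
print(f"recomputed score for doc 3853: {score:.6f}")
```

The result should agree with the listed 0.24946094 up to rounding, since the search engine works in single precision while this sketch uses double precision. The same function applies to the other entries once their term lists, fieldNorm, and coord values are substituted.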