Document (#39516)

Author
Posch, L.
Schaer, P.
Bleier, A.
Strohmaier, M.
Title
¬A system for probabilistic linking of thesauri and classification systems
Source
Künstliche Intelligenz. 2015, pp. 1-4
Year
2015
Abstract
This paper presents a system which creates and visualizes probabilistic semantic links between concepts in a thesaurus and classes in a classification system. For creating the links, we build on the Polylingual Labeled Topic Model (PLL-TM) (Posch et al., in KI 2015: advances in artificial intelligence, 2015). PLL-TM identifies probable thesaurus descriptors for each class in the classification system by using information from the natural language text of documents, their assigned thesaurus descriptors and their designated classes. The links are then presented to users of the system in an interactive visualization, providing them with an automatically generated overview of the relations between the thesaurus and the classification system.
Content
Cf.: http://link.springer.com/article/10.1007%2Fs13218-015-0413-9.
Theme
Semantic interoperability

Similar documents (author)

  1. Schaer, P.: Integration von Open-Access-Repositorien in Fachportale (2010) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:schaer in 2320) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 2320, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=2320)
    
  2. Schaer, P.: Sprachmodelle und neuronale Netze im Information Retrieval (2023) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:schaer in 799) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 799, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=799)
    
  3. Munkelt, J.; Schaer, P.: Towards an IR test collection for the German National Library (2018) 4.46
    4.462149 = sum of:
      4.462149 = weight(author_txt:schaer in 5780) [ClassicSimilarity], result of:
        4.462149 = fieldWeight in 5780, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.5 = fieldNorm(doc=5780)
    
  4. Mayr, P.; Schaer, P.; Mutschke, P.: ¬A science model driven retrieval prototype (2011) 3.35
    3.346612 = sum of:
      3.346612 = weight(author_txt:schaer in 649) [ClassicSimilarity], result of:
        3.346612 = fieldWeight in 649, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.375 = fieldNorm(doc=649)
    
  5. Neumann, M.; Steinberg, J.; Schaer, P.: Web-scraping for non-programmers : introducing OXPath for digital library metadata harvesting (2017) 3.35
    3.346612 = sum of:
      3.346612 = weight(author_txt:schaer in 3895) [ClassicSimilarity], result of:
        3.346612 = fieldWeight in 3895, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.375 = fieldNorm(doc=3895)
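
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the figures listed here. The function names are illustrative and not part of any Lucene API; fieldNorm is taken directly from the listing rather than recomputed.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency (assumed form; matches the listing)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the breakdowns above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# The author term "schaer" occurs in 15 of 44218 documents.
# Only fieldNorm differs between the five hits (values copied from the listing).
for norm in (0.625, 0.5, 0.375):
    print(norm, round(field_weight(1.0, 15, 44218, norm), 4))
```

Because tf and idf are identical for all five hits, the ranking within this list is determined by fieldNorm alone, that is, by the length normalization of the author field.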
    

Similar documents (content)

  1. Loosjes, T.P.; Tichelaar, P.A.; Goossens, J.; Stuurman, P.: Ontsluiting op onderwerp (1977) 0.18
    0.18498223 = sum of:
      0.18498223 = product of:
        0.7707593 = sum of:
          0.021012357 = weight(abstract_txt:between in 910) [ClassicSimilarity], result of:
            0.021012357 = score(doc=910,freq=1.0), product of:
              0.07765762 = queryWeight, product of:
                1.2469767 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.017981464 = queryNorm
              0.2705769 = fieldWeight in 910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.078125 = fieldNorm(doc=910)
          0.09995603 = weight(abstract_txt:classes in 910) [ClassicSimilarity], result of:
            0.09995603 = score(doc=910,freq=1.0), product of:
              0.21965456 = queryWeight, product of:
                2.0971835 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.017981464 = queryNorm
              0.45506012 = fieldWeight in 910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.078125 = fieldNorm(doc=910)
          0.25878748 = weight(abstract_txt:descriptors in 910) [ClassicSimilarity], result of:
            0.25878748 = score(doc=910,freq=3.0), product of:
              0.28715914 = queryWeight, product of:
                2.397881 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.017981464 = queryNorm
              0.90119886 = fieldWeight in 910, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.078125 = fieldNorm(doc=910)
          0.111470394 = weight(abstract_txt:classification in 910) [ClassicSimilarity], result of:
            0.111470394 = score(doc=910,freq=3.0), product of:
              0.20635271 = queryWeight, product of:
                2.8746595 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.017981464 = queryNorm
              0.5401935 = fieldWeight in 910, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=910)
          0.08229831 = weight(abstract_txt:system in 910) [ClassicSimilarity], result of:
            0.08229831 = score(doc=910,freq=2.0), product of:
              0.2208811 = queryWeight, product of:
                3.642556 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.017981464 = queryNorm
              0.372591 = fieldWeight in 910, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=910)
          0.19723469 = weight(abstract_txt:thesaurus in 910) [ClassicSimilarity], result of:
            0.19723469 = score(doc=910,freq=2.0), product of:
              0.34555972 = queryWeight, product of:
                3.7199996 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017981464 = queryNorm
              0.5707688 = fieldWeight in 910, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.078125 = fieldNorm(doc=910)
        0.24 = coord(6/25)
    
  2. Williamson, N.J.: Deriving a thesaurus from a restructured UDC (1996) 0.17
    0.16846651 = sum of:
      0.16846651 = product of:
        0.7019438 = sum of:
          0.087775335 = weight(abstract_txt:class in 5194) [ClassicSimilarity], result of:
            0.087775335 = score(doc=5194,freq=2.0), product of:
              0.11236776 = queryWeight, product of:
                1.0606502 = boost
                5.8917522 = idf(docFreq=331, maxDocs=44218)
                0.017981464 = queryNorm
              0.7811434 = fieldWeight in 5194, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8917522 = idf(docFreq=331, maxDocs=44218)
                0.09375 = fieldNorm(doc=5194)
          0.01914295 = weight(abstract_txt:their in 5194) [ClassicSimilarity], result of:
            0.01914295 = score(doc=5194,freq=1.0), product of:
              0.06462779 = queryWeight, product of:
                1.1375643 = boost
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.017981464 = queryNorm
              0.29620308 = fieldWeight in 5194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.09375 = fieldNorm(doc=5194)
          0.17929323 = weight(abstract_txt:descriptors in 5194) [ClassicSimilarity], result of:
            0.17929323 = score(doc=5194,freq=1.0), product of:
              0.28715914 = queryWeight, product of:
                2.397881 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.017981464 = queryNorm
              0.62436885 = fieldWeight in 5194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.09375 = fieldNorm(doc=5194)
          0.10921823 = weight(abstract_txt:classification in 5194) [ClassicSimilarity], result of:
            0.10921823 = score(doc=5194,freq=2.0), product of:
              0.20635271 = queryWeight, product of:
                2.8746595 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.017981464 = queryNorm
              0.52927935 = fieldWeight in 5194, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.09375 = fieldNorm(doc=5194)
          0.06983243 = weight(abstract_txt:system in 5194) [ClassicSimilarity], result of:
            0.06983243 = score(doc=5194,freq=1.0), product of:
              0.2208811 = queryWeight, product of:
                3.642556 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.017981464 = queryNorm
              0.3161539 = fieldWeight in 5194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.09375 = fieldNorm(doc=5194)
          0.23668166 = weight(abstract_txt:thesaurus in 5194) [ClassicSimilarity], result of:
            0.23668166 = score(doc=5194,freq=2.0), product of:
              0.34555972 = queryWeight, product of:
                3.7199996 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017981464 = queryNorm
              0.6849226 = fieldWeight in 5194, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.09375 = fieldNorm(doc=5194)
        0.24 = coord(6/25)
    
  3. Francu, V.: Multilingual access to information using an intermediate language (2003) 0.16
    0.16023873 = sum of:
      0.16023873 = product of:
        0.66766137 = sum of:
          0.014708649 = weight(abstract_txt:between in 1742) [ClassicSimilarity], result of:
            0.014708649 = score(doc=1742,freq=1.0), product of:
              0.07765762 = queryWeight, product of:
                1.2469767 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.017981464 = queryNorm
              0.18940382 = fieldWeight in 1742, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1742)
          0.098951414 = weight(abstract_txt:classes in 1742) [ClassicSimilarity], result of:
            0.098951414 = score(doc=1742,freq=2.0), product of:
              0.21965456 = queryWeight, product of:
                2.0971835 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.017981464 = queryNorm
              0.4504865 = fieldWeight in 1742, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1742)
          0.10458772 = weight(abstract_txt:descriptors in 1742) [ClassicSimilarity], result of:
            0.10458772 = score(doc=1742,freq=1.0), product of:
              0.28715914 = queryWeight, product of:
                2.397881 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.017981464 = queryNorm
              0.36421517 = fieldWeight in 1742, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1742)
          0.11919169 = weight(abstract_txt:classification in 1742) [ClassicSimilarity], result of:
            0.11919169 = score(doc=1742,freq=7.0), product of:
              0.20635271 = queryWeight, product of:
                2.8746595 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.017981464 = queryNorm
              0.57761145 = fieldWeight in 1742, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1742)
          0.091087535 = weight(abstract_txt:system in 1742) [ClassicSimilarity], result of:
            0.091087535 = score(doc=1742,freq=5.0), product of:
              0.2208811 = queryWeight, product of:
                3.642556 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.017981464 = queryNorm
              0.41238263 = fieldWeight in 1742, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1742)
          0.23913437 = weight(abstract_txt:thesaurus in 1742) [ClassicSimilarity], result of:
            0.23913437 = score(doc=1742,freq=6.0), product of:
              0.34555972 = queryWeight, product of:
                3.7199996 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017981464 = queryNorm
              0.6920204 = fieldWeight in 1742, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1742)
        0.24 = coord(6/25)
    
  4. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.15
    0.15388964 = sum of:
      0.15388964 = product of:
        0.64120686 = sum of:
          0.014708649 = weight(abstract_txt:between in 175) [ClassicSimilarity], result of:
            0.014708649 = score(doc=175,freq=1.0), product of:
              0.07765762 = queryWeight, product of:
                1.2469767 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.017981464 = queryNorm
              0.18940382 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.06996922 = weight(abstract_txt:classes in 175) [ClassicSimilarity], result of:
            0.06996922 = score(doc=175,freq=1.0), product of:
              0.21965456 = queryWeight, product of:
                2.0971835 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.017981464 = queryNorm
              0.3185421 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.18115124 = weight(abstract_txt:descriptors in 175) [ClassicSimilarity], result of:
            0.18115124 = score(doc=175,freq=3.0), product of:
              0.28715914 = queryWeight, product of:
                2.397881 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.017981464 = queryNorm
              0.63083917 = fieldWeight in 175, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.076347545 = weight(abstract_txt:links in 175) [ClassicSimilarity], result of:
            0.076347545 = score(doc=175,freq=1.0), product of:
              0.26649925 = queryWeight, product of:
                2.829176 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.017981464 = queryNorm
              0.28648314 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.040735584 = weight(abstract_txt:system in 175) [ClassicSimilarity], result of:
            0.040735584 = score(doc=175,freq=1.0), product of:
              0.2208811 = queryWeight, product of:
                3.642556 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.017981464 = queryNorm
              0.18442312 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.2582946 = weight(abstract_txt:thesaurus in 175) [ClassicSimilarity], result of:
            0.2582946 = score(doc=175,freq=7.0), product of:
              0.34555972 = queryWeight, product of:
                3.7199996 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017981464 = queryNorm
              0.7474674 = fieldWeight in 175, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
        0.24 = coord(6/25)
    
  5. Doorn, M. van; Polman, K.: From classification to thesaurus ... and back? : subject indexing tools at the library of the Afrika-Studiecentrum Leiden (2010) 0.14
    0.14467654 = sum of:
      0.14467654 = product of:
        0.6028189 = sum of:
          0.045721103 = weight(abstract_txt:linking in 4062) [ClassicSimilarity], result of:
            0.045721103 = score(doc=4062,freq=1.0), product of:
              0.12009973 = queryWeight, product of:
                1.0965346 = boost
                6.091085 = idf(docFreq=271, maxDocs=44218)
                0.017981464 = queryNorm
              0.3806928 = fieldWeight in 4062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.091085 = idf(docFreq=271, maxDocs=44218)
                0.0625 = fieldNorm(doc=4062)
          0.06623162 = weight(abstract_txt:assigned in 4062) [ClassicSimilarity], result of:
            0.06623162 = score(doc=4062,freq=2.0), product of:
              0.122038774 = queryWeight, product of:
                1.1053511 = boost
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.017981464 = queryNorm
              0.54270965 = fieldWeight in 4062, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.0625 = fieldNorm(doc=4062)
          0.119528815 = weight(abstract_txt:descriptors in 4062) [ClassicSimilarity], result of:
            0.119528815 = score(doc=4062,freq=1.0), product of:
              0.28715914 = queryWeight, product of:
                2.397881 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.017981464 = queryNorm
              0.4162459 = fieldWeight in 4062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.0625 = fieldNorm(doc=4062)
          0.05148597 = weight(abstract_txt:classification in 4062) [ClassicSimilarity], result of:
            0.05148597 = score(doc=4062,freq=1.0), product of:
              0.20635271 = queryWeight, product of:
                2.8746595 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.017981464 = queryNorm
              0.2495047 = fieldWeight in 4062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=4062)
          0.046554953 = weight(abstract_txt:system in 4062) [ClassicSimilarity], result of:
            0.046554953 = score(doc=4062,freq=1.0), product of:
              0.2208811 = queryWeight, product of:
                3.642556 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.017981464 = queryNorm
              0.21076928 = fieldWeight in 4062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=4062)
          0.27329645 = weight(abstract_txt:thesaurus in 4062) [ClassicSimilarity], result of:
            0.27329645 = score(doc=4062,freq=6.0), product of:
              0.34555972 = queryWeight, product of:
                3.7199996 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.017981464 = queryNorm
              0.7908805 = fieldWeight in 4062, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0625 = fieldNorm(doc=4062)
        0.24 = coord(6/25)
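
The content scores above combine several matching terms. As a rough sketch under the same assumptions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm, and the sum is scaled by coord = matched terms / query terms. The boosts, queryNorm and fieldNorm below are copied from the first breakdown (doc 910); the function and variable names are illustrative only.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.017981464  # copied from the listing


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency (assumed form; matches the listing)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm: float, matched: int, query_terms: int) -> float:
    """Sum of the term scores, scaled by the coordination factor."""
    total = sum(term_score(freq, df, boost, field_norm) for _, freq, df, boost in terms)
    return (matched / query_terms) * total


# (term, freq, docFreq, boost) for doc 910, values taken from the breakdown.
TERMS_910 = [
    ("between", 1.0, 3764, 1.2469767),
    ("classes", 1.0, 354, 2.0971835),
    ("descriptors", 3.0, 153, 2.397881),
    ("classification", 3.0, 2218, 2.8746595),
    ("system", 2.0, 4123, 3.642556),
    ("thesaurus", 2.0, 685, 3.7199996),
]

score_910 = document_score(TERMS_910, field_norm=0.078125, matched=6, query_terms=25)
print(round(score_910, 4))
```

The coord factor of 6/25 explains why all five content scores stay well below 1: each hit matches only six of the 25 terms in the query derived from the abstract.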