Document (#39517)

Author
Posch, L.
Schaer, P.
Bleier, A.
Strohmaier, M.
Title
¬A system for probabilistic linking of thesauri and classification systems
Source
Künstliche Intelligenz. 2015, S.1-4
Year
2015
Abstract
This paper presents a system which creates and visualizes probabilistic semantic links between concepts in a thesaurus and classes in a classification system. For creating the links, we build on the Polylingual Labeled Topic Model (PLL-TM) (Posch et al., in KI 2015: advances in artificial intelligence, 2015). PLL-TM identifies probable thesaurus descriptors for each class in the classification system by using information from the natural language text of documents, their assigned thesaurus descriptors and their designated classes. The links are then presented to users of the system in an interactive visualization, providing them with an automatically generated overview of the relations between the thesaurus and the classification system.
Content
Vgl.: http://link.springer.com/article/10.1007%2Fs13218-015-0413-9.
Theme
Semantische Interoperabilität

Similar documents (author)

  1. Schaer, P.: Integration von Open-Access-Repositorien in Fachportale (2010) 5.73
    5.7298613 = sum of:
      5.7298613 = weight(author_txt:schaer in 140) [ClassicSimilarity], result of:
        5.7298613 = fieldWeight in 140, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.167778 = idf(docFreq=11, maxDocs=42306)
          0.625 = fieldNorm(doc=140)
    
  2. Munkelt, J.; Schaer, P.: Towards an IR test collection for the German National Library (2018) 4.58
    4.583889 = sum of:
      4.583889 = weight(author_txt:schaer in 781) [ClassicSimilarity], result of:
        4.583889 = fieldWeight in 781, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.167778 = idf(docFreq=11, maxDocs=42306)
          0.5 = fieldNorm(doc=781)
    
  3. Mayr, P.; Schaer, P.; Mutschke, P.: ¬A science model driven retrieval prototype (2011) 3.44
    3.4379168 = sum of:
      3.4379168 = weight(author_txt:schaer in 2650) [ClassicSimilarity], result of:
        3.4379168 = fieldWeight in 2650, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.167778 = idf(docFreq=11, maxDocs=42306)
          0.375 = fieldNorm(doc=2650)
    
  4. Neumann, M.; Steinberg, J.; Schaer, P.: Web-ccraping for non-programmers : introducing OXPath for digital library metadata harvesting (2017) 3.44
    3.4379168 = sum of:
      3.4379168 = weight(author_txt:schaer in 814) [ClassicSimilarity], result of:
        3.4379168 = fieldWeight in 814, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.167778 = idf(docFreq=11, maxDocs=42306)
          0.375 = fieldNorm(doc=814)
    
  5. Munkelt, J.; Schaer, P.; Lepsky, K.: Towards an IR test collection for the German National Library (2018) 3.44
    3.4379168 = sum of:
      3.4379168 = weight(author_txt:schaer in 1230) [ClassicSimilarity], result of:
        3.4379168 = fieldWeight in 1230, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.167778 = idf(docFreq=11, maxDocs=42306)
          0.375 = fieldNorm(doc=1230)
    

Similar documents (content)

  1. Loosjes, T.P.; Tichelaar, P.A.; Goossens, J.; Stuurman, P.: Ontsluiting op onderwerp (1977) 0.18
    0.18279487 = sum of:
      0.18279487 = product of:
        0.7616453 = sum of:
          0.021362266 = weight(abstract_txt:between in 979) [ClassicSimilarity], result of:
            0.021362266 = score(doc=979,freq=1.0), product of:
              0.07816542 = queryWeight, product of:
                1.2523535 = boost
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.017842062 = queryNorm
              0.2732956 = fieldWeight in 979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.078125 = fieldNorm(doc=979)
          0.10024482 = weight(abstract_txt:classes in 979) [ClassicSimilarity], result of:
            0.10024482 = score(doc=979,freq=1.0), product of:
              0.21909092 = queryWeight, product of:
                2.0966783 = boost
                5.8566265 = idf(docFreq=328, maxDocs=42306)
                0.017842062 = queryNorm
              0.45754895 = fieldWeight in 979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8566265 = idf(docFreq=328, maxDocs=42306)
                0.078125 = fieldNorm(doc=979)
          0.25481004 = weight(abstract_txt:descriptors in 979) [ClassicSimilarity], result of:
            0.25481004 = score(doc=979,freq=3.0), product of:
              0.28293523 = queryWeight, product of:
                2.3826659 = boost
                6.655472 = idf(docFreq=147, maxDocs=42306)
                0.017842062 = queryNorm
              0.90059495 = fieldWeight in 979, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.655472 = idf(docFreq=147, maxDocs=42306)
                0.078125 = fieldNorm(doc=979)
          0.111279875 = weight(abstract_txt:classification in 979) [ClassicSimilarity], result of:
            0.111279875 = score(doc=979,freq=3.0), product of:
              0.20519358 = queryWeight, product of:
                2.8695679 = boost
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.017842062 = queryNorm
              0.54231656 = fieldWeight in 979, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.078125 = fieldNorm(doc=979)
          0.08066288 = weight(abstract_txt:system in 979) [ClassicSimilarity], result of:
            0.08066288 = score(doc=979,freq=2.0), product of:
              0.21696813 = queryWeight, product of:
                3.6139174 = boost
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.017842062 = queryNorm
              0.37177292 = fieldWeight in 979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.078125 = fieldNorm(doc=979)
          0.1932855 = weight(abstract_txt:thesaurus in 979) [ClassicSimilarity], result of:
            0.1932855 = score(doc=979,freq=2.0), product of:
              0.33940318 = queryWeight, product of:
                3.6905627 = boost
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.017842062 = queryNorm
              0.5694864 = fieldWeight in 979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.078125 = fieldNorm(doc=979)
        0.24 = coord(6/25)
    
  2. Williamson, N.J.: Deriving a thesaurus from a restructured UDC (1996) 0.17
    0.16628265 = sum of:
      0.16628265 = product of:
        0.6928444 = sum of:
          0.08725131 = weight(abstract_txt:class in 5263) [ClassicSimilarity], result of:
            0.08725131 = score(doc=5263,freq=2.0), product of:
              0.111418396 = queryWeight, product of:
                1.0572631 = boost
                5.906481 = idf(docFreq=312, maxDocs=42306)
                0.017842062 = queryNorm
              0.78309613 = fieldWeight in 5263, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.906481 = idf(docFreq=312, maxDocs=42306)
                0.09375 = fieldNorm(doc=5263)
          0.019636616 = weight(abstract_txt:their in 5263) [ClassicSimilarity], result of:
            0.019636616 = score(doc=5263,freq=1.0), product of:
              0.06543951 = queryWeight, product of:
                1.1458813 = boost
                3.2007766 = idf(docFreq=4683, maxDocs=42306)
                0.017842062 = queryNorm
              0.3000728 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2007766 = idf(docFreq=4683, maxDocs=42306)
                0.09375 = fieldNorm(doc=5263)
          0.17653757 = weight(abstract_txt:descriptors in 5263) [ClassicSimilarity], result of:
            0.17653757 = score(doc=5263,freq=1.0), product of:
              0.28293523 = queryWeight, product of:
                2.3826659 = boost
                6.655472 = idf(docFreq=147, maxDocs=42306)
                0.017842062 = queryNorm
              0.6239505 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.655472 = idf(docFreq=147, maxDocs=42306)
                0.09375 = fieldNorm(doc=5263)
          0.109031565 = weight(abstract_txt:classification in 5263) [ClassicSimilarity], result of:
            0.109031565 = score(doc=5263,freq=2.0), product of:
              0.20519358 = queryWeight, product of:
                2.8695679 = boost
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.017842062 = queryNorm
              0.53135955 = fieldWeight in 5263, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.09375 = fieldNorm(doc=5263)
          0.068444714 = weight(abstract_txt:system in 5263) [ClassicSimilarity], result of:
            0.068444714 = score(doc=5263,freq=1.0), product of:
              0.21696813 = queryWeight, product of:
                3.6139174 = boost
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.017842062 = queryNorm
              0.31545976 = fieldWeight in 5263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.09375 = fieldNorm(doc=5263)
          0.23194258 = weight(abstract_txt:thesaurus in 5263) [ClassicSimilarity], result of:
            0.23194258 = score(doc=5263,freq=2.0), product of:
              0.33940318 = queryWeight, product of:
                3.6905627 = boost
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.017842062 = queryNorm
              0.68338364 = fieldWeight in 5263, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.09375 = fieldNorm(doc=5263)
        0.24 = coord(6/25)
    
  3. Francu, V.: Multilingual access to information using an intermediate language (2003) 0.16
    0.15834786 = sum of:
      0.15834786 = product of:
        0.65978277 = sum of:
          0.014953586 = weight(abstract_txt:between in 3743) [ClassicSimilarity], result of:
            0.014953586 = score(doc=3743,freq=1.0), product of:
              0.07816542 = queryWeight, product of:
                1.2523535 = boost
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.017842062 = queryNorm
              0.19130693 = fieldWeight in 3743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3743)
          0.09923731 = weight(abstract_txt:classes in 3743) [ClassicSimilarity], result of:
            0.09923731 = score(doc=3743,freq=2.0), product of:
              0.21909092 = queryWeight, product of:
                2.0966783 = boost
                5.8566265 = idf(docFreq=328, maxDocs=42306)
                0.017842062 = queryNorm
              0.45295033 = fieldWeight in 3743, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8566265 = idf(docFreq=328, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3743)
          0.10298025 = weight(abstract_txt:descriptors in 3743) [ClassicSimilarity], result of:
            0.10298025 = score(doc=3743,freq=1.0), product of:
              0.28293523 = queryWeight, product of:
                2.3826659 = boost
                6.655472 = idf(docFreq=147, maxDocs=42306)
                0.017842062 = queryNorm
              0.3639711 = fieldWeight in 3743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.655472 = idf(docFreq=147, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3743)
          0.118987985 = weight(abstract_txt:classification in 3743) [ClassicSimilarity], result of:
            0.118987985 = score(doc=3743,freq=7.0), product of:
              0.20519358 = queryWeight, product of:
                2.8695679 = boost
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.017842062 = queryNorm
              0.5798816 = fieldWeight in 3743, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3743)
          0.08927744 = weight(abstract_txt:system in 3743) [ClassicSimilarity], result of:
            0.08927744 = score(doc=3743,freq=5.0), product of:
              0.21696813 = queryWeight, product of:
                3.6139174 = boost
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.017842062 = queryNorm
              0.4114772 = fieldWeight in 3743, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3743)
          0.23434621 = weight(abstract_txt:thesaurus in 3743) [ClassicSimilarity], result of:
            0.23434621 = score(doc=3743,freq=6.0), product of:
              0.33940318 = queryWeight, product of:
                3.6905627 = boost
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.017842062 = queryNorm
              0.69046557 = fieldWeight in 3743, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3743)
        0.24 = coord(6/25)
    
  4. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.15
    0.15155375 = sum of:
      0.15155375 = product of:
        0.63147396 = sum of:
          0.014953586 = weight(abstract_txt:between in 2176) [ClassicSimilarity], result of:
            0.014953586 = score(doc=2176,freq=1.0), product of:
              0.07816542 = queryWeight, product of:
                1.2523535 = boost
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.017842062 = queryNorm
              0.19130693 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
          0.07017137 = weight(abstract_txt:classes in 2176) [ClassicSimilarity], result of:
            0.07017137 = score(doc=2176,freq=1.0), product of:
              0.21909092 = queryWeight, product of:
                2.0966783 = boost
                5.8566265 = idf(docFreq=328, maxDocs=42306)
                0.017842062 = queryNorm
              0.32028425 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8566265 = idf(docFreq=328, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
          0.17836702 = weight(abstract_txt:descriptors in 2176) [ClassicSimilarity], result of:
            0.17836702 = score(doc=2176,freq=3.0), product of:
              0.28293523 = queryWeight, product of:
                2.3826659 = boost
                6.655472 = idf(docFreq=147, maxDocs=42306)
                0.017842062 = queryNorm
              0.63041645 = fieldWeight in 2176, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.655472 = idf(docFreq=147, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
          0.074933074 = weight(abstract_txt:links in 2176) [ClassicSimilarity], result of:
            0.074933074 = score(doc=2176,freq=1.0), product of:
              0.26201764 = queryWeight, product of:
                2.808216 = boost
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.017842062 = queryNorm
              0.28598484 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
          0.039926086 = weight(abstract_txt:system in 2176) [ClassicSimilarity], result of:
            0.039926086 = score(doc=2176,freq=1.0), product of:
              0.21696813 = queryWeight, product of:
                3.6139174 = boost
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.017842062 = queryNorm
              0.1840182 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
          0.2531228 = weight(abstract_txt:thesaurus in 2176) [ClassicSimilarity], result of:
            0.2531228 = score(doc=2176,freq=7.0), product of:
              0.33940318 = queryWeight, product of:
                3.6905627 = boost
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.017842062 = queryNorm
              0.745788 = fieldWeight in 2176, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
        0.24 = coord(6/25)
    
  5. Doorn, M. van; Polman, K.: From classification to thesaurus ... and back? : subject indexing tools at the library of the Afrika-Studiecentrum Leiden (2010) 0.14
    0.1425681 = sum of:
      0.1425681 = product of:
        0.5940337 = sum of:
          0.04565026 = weight(abstract_txt:linking in 1063) [ClassicSimilarity], result of:
            0.04565026 = score(doc=1063,freq=1.0), product of:
              0.119437836 = queryWeight, product of:
                1.0946507 = boost
                6.11535 = idf(docFreq=253, maxDocs=42306)
                0.017842062 = queryNorm
              0.38220936 = fieldWeight in 1063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.11535 = idf(docFreq=253, maxDocs=42306)
                0.0625 = fieldNorm(doc=1063)
          0.06583969 = weight(abstract_txt:assigned in 1063) [ClassicSimilarity], result of:
            0.06583969 = score(doc=1063,freq=2.0), product of:
              0.12101196 = queryWeight, product of:
                1.1018406 = boost
                6.155516 = idf(docFreq=243, maxDocs=42306)
                0.017842062 = queryNorm
              0.5440759 = fieldWeight in 1063, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.155516 = idf(docFreq=243, maxDocs=42306)
                0.0625 = fieldNorm(doc=1063)
          0.11769172 = weight(abstract_txt:descriptors in 1063) [ClassicSimilarity], result of:
            0.11769172 = score(doc=1063,freq=1.0), product of:
              0.28293523 = queryWeight, product of:
                2.3826659 = boost
                6.655472 = idf(docFreq=147, maxDocs=42306)
                0.017842062 = queryNorm
              0.415967 = fieldWeight in 1063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.655472 = idf(docFreq=147, maxDocs=42306)
                0.0625 = fieldNorm(doc=1063)
          0.051397976 = weight(abstract_txt:classification in 1063) [ClassicSimilarity], result of:
            0.051397976 = score(doc=1063,freq=1.0), product of:
              0.20519358 = queryWeight, product of:
                2.8695679 = boost
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.017842062 = queryNorm
              0.2504853 = fieldWeight in 1063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.0625 = fieldNorm(doc=1063)
          0.04562981 = weight(abstract_txt:system in 1063) [ClassicSimilarity], result of:
            0.04562981 = score(doc=1063,freq=1.0), product of:
              0.21696813 = queryWeight, product of:
                3.6139174 = boost
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.017842062 = queryNorm
              0.21030651 = fieldWeight in 1063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.0625 = fieldNorm(doc=1063)
          0.26782423 = weight(abstract_txt:thesaurus in 1063) [ClassicSimilarity], result of:
            0.26782423 = score(doc=1063,freq=6.0), product of:
              0.33940318 = queryWeight, product of:
                3.6905627 = boost
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.017842062 = queryNorm
              0.7891035 = fieldWeight in 1063, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.0625 = fieldNorm(doc=1063)
        0.24 = coord(6/25)