Document (#38022)

Author
Kempf, A.O.
Zapilko, B.
Title
Normdatenpflege in Zeiten der Automatisierung : Erstellung und Evaluation automatisch aufgebauter Thesaurus-Crosskonkordanzen
Source
Information - Wissenschaft und Praxis. 64(2013) H.4, S.199-208
Year
2013
Abstract
Thesaurus-Crosskonkordanzen bilden eine wichtige Voraussetzung für die integrierte Suche in einer verteilten Datenstruktur. Ihr Aufbau erfordert allerdings erhebliche personelle Ressourcen. Der vorliegende Beitrag liefert Evaluationsergebnisse des Library Track 2012 der Ontology Alignment Evaluation Initiative (OAEI), in dem Crosskonkordanzen zwischen dem Thesaurus Sozialwissenschaften (TheSoz) und dem Standard Thesaurus Wirtschaft (STW) erstmals automatisch erstellt wurden. Die Evaluation weist auf deutliche Unterschiede in den getesteten Matching- Tools hin und stellt die qualitativen Unterschiede einer automatisch im Vergleich zu einer intellektuell erstellten Crosskonkordanz heraus. Die Ergebnisse sprechen für einen Einsatz automatisch generierter Thesaurus-Crosskonkordanzen, um Domänenexperten eine maschinell erzeugte Vorselektion von möglichen Äquivalenzrelationen anzubieten.
Content
Vgl.: http://www.degruyter.com/view/j/iwp.2013.64.issue-4/iwp-2013-0025/iwp-2013-0025.xml?format=INT.
Theme
Semantische Interoperabilität
Object
Thesaurus Sozialwissenschaften
Standard Thesaurus Wirtschaft

Similar documents (author)

  1. Kempf, A.O.; Ritze, D.; Eckert, K.; Zapilko, B.: New ways of mapping knowledge organization systems : using a semi­automatic matching­procedure for building up vocabulary crosswalks (2013) 3.90
    3.9032779 = sum of:
      3.9032779 = sum of:
        1.6442382 = weight(author_txt:kempf in 989) [ClassicSimilarity], result of:
          1.6442382 = score(doc=989,freq=1.0), product of:
            0.6290211 = queryWeight, product of:
              8.364683 = idf(docFreq=27, maxDocs=44218)
              0.075199634 = queryNorm
            2.6139636 = fieldWeight in 989, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.364683 = idf(docFreq=27, maxDocs=44218)
              0.3125 = fieldNorm(doc=989)
        2.2590396 = weight(author_txt:zapilko in 989) [ClassicSimilarity], result of:
          2.2590396 = score(doc=989,freq=1.0), product of:
            0.7773882 = queryWeight, product of:
              1.1116968 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.075199634 = queryNorm
            2.905935 = fieldWeight in 989, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.3125 = fieldNorm(doc=989)
    
  2. Kempf, A.O.; Ritze, D.; Eckert, K.; Zapilko, B.: New ways of mapping knowledge organization systems : using a semi-automatic matching procedure for building up vocabulary crosswalks (2014) 3.90
    3.9032779 = sum of:
      3.9032779 = sum of:
        1.6442382 = weight(author_txt:kempf in 1371) [ClassicSimilarity], result of:
          1.6442382 = score(doc=1371,freq=1.0), product of:
            0.6290211 = queryWeight, product of:
              8.364683 = idf(docFreq=27, maxDocs=44218)
              0.075199634 = queryNorm
            2.6139636 = fieldWeight in 1371, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.364683 = idf(docFreq=27, maxDocs=44218)
              0.3125 = fieldNorm(doc=1371)
        2.2590396 = weight(author_txt:zapilko in 1371) [ClassicSimilarity], result of:
          2.2590396 = score(doc=1371,freq=1.0), product of:
            0.7773882 = queryWeight, product of:
              1.1116968 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.075199634 = queryNorm
            2.905935 = fieldWeight in 1371, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.3125 = fieldNorm(doc=1371)
    
  3. Zapilko, B.: Dynamisches Browsing im Kontext von Informationsarchitekturen (2010) 2.26
    2.2590396 = sum of:
      2.2590396 = product of:
        4.5180793 = sum of:
          4.5180793 = weight(author_txt:zapilko in 3744) [ClassicSimilarity], result of:
            4.5180793 = score(doc=3744,freq=1.0), product of:
              0.7773882 = queryWeight, product of:
                1.1116968 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.075199634 = queryNorm
              5.81187 = fieldWeight in 3744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=3744)
        0.5 = coord(1/2)
    
  4. Zapilko, B.: InFoLiS (2017) 2.26
    2.2590396 = sum of:
      2.2590396 = product of:
        4.5180793 = sum of:
          4.5180793 = weight(author_txt:zapilko in 1031) [ClassicSimilarity], result of:
            4.5180793 = score(doc=1031,freq=1.0), product of:
              0.7773882 = queryWeight, product of:
                1.1116968 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.075199634 = queryNorm
              5.81187 = fieldWeight in 1031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=1031)
        0.5 = coord(1/2)
    
  5. Stempfhuber, M.; Zapilko, B.: Modelling text-fact-integration in digital libraries (2009) 1.81
    1.8072318 = sum of:
      1.8072318 = product of:
        3.6144636 = sum of:
          3.6144636 = weight(author_txt:zapilko in 3393) [ClassicSimilarity], result of:
            3.6144636 = score(doc=3393,freq=1.0), product of:
              0.7773882 = queryWeight, product of:
                1.1116968 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.075199634 = queryNorm
              4.649496 = fieldWeight in 3393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.5 = fieldNorm(doc=3393)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Mayr, P.; Petras, V.: Crosskonkordanzen : Terminologie Mapping und deren Effektivität für das Information Retrieval 0.29
    0.29306674 = sum of:
      0.29306674 = product of:
        1.4653337 = sum of:
          0.07484578 = weight(abstract_txt:sozialwissenschaften in 1996) [ClassicSimilarity], result of:
            0.07484578 = score(doc=1996,freq=1.0), product of:
              0.10634478 = queryWeight, product of:
                1.017323 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.013924431 = queryNorm
              0.70380306 = fieldWeight in 1996, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.09375 = fieldNorm(doc=1996)
          0.15736458 = weight(abstract_txt:crosskonkordanz in 1996) [ClassicSimilarity], result of:
            0.15736458 = score(doc=1996,freq=1.0), product of:
              0.17453237 = queryWeight, product of:
                1.3032831 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.013924431 = queryNorm
              0.9016355 = fieldWeight in 1996, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.09375 = fieldNorm(doc=1996)
          0.15736458 = weight(abstract_txt:evaluationsergebnisse in 1996) [ClassicSimilarity], result of:
            0.15736458 = score(doc=1996,freq=1.0), product of:
              0.17453237 = queryWeight, product of:
                1.3032831 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.013924431 = queryNorm
              0.9016355 = fieldWeight in 1996, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.09375 = fieldNorm(doc=1996)
          0.04791227 = weight(abstract_txt:evaluation in 1996) [ClassicSimilarity], result of:
            0.04791227 = score(doc=1996,freq=1.0), product of:
              0.113922514 = queryWeight, product of:
                1.8237536 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.013924431 = queryNorm
              0.42056894 = fieldWeight in 1996, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.09375 = fieldNorm(doc=1996)
          1.0278465 = weight(abstract_txt:crosskonkordanzen in 1996) [ClassicSimilarity], result of:
            1.0278465 = score(doc=1996,freq=4.0), product of:
              0.6098506 = queryWeight, product of:
                4.872395 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.013924431 = queryNorm
              1.6854069 = fieldWeight in 1996, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.09375 = fieldNorm(doc=1996)
        0.2 = coord(5/25)
    
  2. Schott, H.; Schroeder, A.: Crosskonkordanzen von Thesauri und Klassifikationen (2004) 0.23
    0.23000889 = sum of:
      0.23000889 = product of:
        1.1500444 = sum of:
          0.06477734 = weight(abstract_txt:verteilten in 3126) [ClassicSimilarity], result of:
            0.06477734 = score(doc=3126,freq=1.0), product of:
              0.10906219 = queryWeight, product of:
                1.0302387 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.013924431 = queryNorm
              0.59394866 = fieldWeight in 3126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.078125 = fieldNorm(doc=3126)
          0.07009328 = weight(abstract_txt:integrierte in 3126) [ClassicSimilarity], result of:
            0.07009328 = score(doc=3126,freq=1.0), product of:
              0.11495019 = queryWeight, product of:
                1.0576832 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.013924431 = queryNorm
              0.6097709 = fieldWeight in 3126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.078125 = fieldNorm(doc=3126)
          0.07642449 = weight(abstract_txt:intellektuell in 3126) [ClassicSimilarity], result of:
            0.07642449 = score(doc=3126,freq=1.0), product of:
              0.121771924 = queryWeight, product of:
                1.0886151 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.013924431 = queryNorm
              0.62760353 = fieldWeight in 3126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.078125 = fieldNorm(doc=3126)
          0.08221068 = weight(abstract_txt:maschinell in 3126) [ClassicSimilarity], result of:
            0.08221068 = score(doc=3126,freq=1.0), product of:
              0.1278432 = queryWeight, product of:
                1.115423 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.013924431 = queryNorm
              0.6430587 = fieldWeight in 3126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.078125 = fieldNorm(doc=3126)
          0.8565387 = weight(abstract_txt:crosskonkordanzen in 3126) [ClassicSimilarity], result of:
            0.8565387 = score(doc=3126,freq=4.0), product of:
              0.6098506 = queryWeight, product of:
                4.872395 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.013924431 = queryNorm
              1.4045058 = fieldWeight in 3126, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=3126)
        0.2 = coord(5/25)
    
  3. Mayr, P.: Re-Ranking auf Basis von Bradfordizing für die verteilte Suche in Digitalen Bibliotheken (2009) 0.19
    0.19377291 = sum of:
      0.19377291 = product of:
        0.8073872 = sum of:
          0.04366004 = weight(abstract_txt:sozialwissenschaften in 4302) [ClassicSimilarity], result of:
            0.04366004 = score(doc=4302,freq=1.0), product of:
              0.10634478 = queryWeight, product of:
                1.017323 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.013924431 = queryNorm
              0.4105518 = fieldWeight in 4302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4302)
          0.05299294 = weight(abstract_txt:qualitativen in 4302) [ClassicSimilarity], result of:
            0.05299294 = score(doc=4302,freq=1.0), product of:
              0.12100559 = queryWeight, product of:
                1.0851842 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.013924431 = queryNorm
              0.43793795 = fieldWeight in 4302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4302)
          0.053497143 = weight(abstract_txt:intellektuell in 4302) [ClassicSimilarity], result of:
            0.053497143 = score(doc=4302,freq=1.0), product of:
              0.121771924 = queryWeight, product of:
                1.0886151 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.013924431 = queryNorm
              0.43932247 = fieldWeight in 4302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4302)
          0.018134383 = weight(abstract_txt:einer in 4302) [ClassicSimilarity], result of:
            0.018134383 = score(doc=4302,freq=1.0), product of:
              0.085382536 = queryWeight, product of:
                1.5788684 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.013924431 = queryNorm
              0.21238984 = fieldWeight in 4302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4302)
          0.039525606 = weight(abstract_txt:evaluation in 4302) [ClassicSimilarity], result of:
            0.039525606 = score(doc=4302,freq=2.0), product of:
              0.113922514 = queryWeight, product of:
                1.8237536 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.013924431 = queryNorm
              0.34695166 = fieldWeight in 4302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4302)
          0.59957707 = weight(abstract_txt:crosskonkordanzen in 4302) [ClassicSimilarity], result of:
            0.59957707 = score(doc=4302,freq=4.0), product of:
              0.6098506 = queryWeight, product of:
                4.872395 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.013924431 = queryNorm
              0.98315406 = fieldWeight in 4302, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4302)
        0.24 = coord(6/25)
    
  4. Strötgen, R.; Kokkelink, S.: Metadatenextraktion aus Internetquellen : Heterogenitätsbehandlung im Projekt CARMEN (2001) 0.17
    0.17113906 = sum of:
      0.17113906 = product of:
        0.71307945 = sum of:
          0.04989719 = weight(abstract_txt:sozialwissenschaften in 5808) [ClassicSimilarity], result of:
            0.04989719 = score(doc=5808,freq=1.0), product of:
              0.10634478 = queryWeight, product of:
                1.017323 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.013924431 = queryNorm
              0.46920204 = fieldWeight in 5808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
          0.051821873 = weight(abstract_txt:verteilten in 5808) [ClassicSimilarity], result of:
            0.051821873 = score(doc=5808,freq=1.0), product of:
              0.10906219 = queryWeight, product of:
                1.0302387 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.013924431 = queryNorm
              0.47515893 = fieldWeight in 5808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
          0.086464435 = weight(abstract_txt:intellektuell in 5808) [ClassicSimilarity], result of:
            0.086464435 = score(doc=5808,freq=2.0), product of:
              0.121771924 = queryWeight, product of:
                1.0886151 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.013924431 = queryNorm
              0.7100523 = fieldWeight in 5808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
          0.02072501 = weight(abstract_txt:einer in 5808) [ClassicSimilarity], result of:
            0.02072501 = score(doc=5808,freq=1.0), product of:
              0.085382536 = queryWeight, product of:
                1.5788684 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.013924431 = queryNorm
              0.24273124 = fieldWeight in 5808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
          0.16155547 = weight(abstract_txt:automatisch in 5808) [ClassicSimilarity], result of:
            0.16155547 = score(doc=5808,freq=1.0), product of:
              0.36945927 = queryWeight, product of:
                3.7923992 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.013924431 = queryNorm
              0.43727544 = fieldWeight in 5808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
          0.3426155 = weight(abstract_txt:crosskonkordanzen in 5808) [ClassicSimilarity], result of:
            0.3426155 = score(doc=5808,freq=1.0), product of:
              0.6098506 = queryWeight, product of:
                4.872395 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.013924431 = queryNorm
              0.5618023 = fieldWeight in 5808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=5808)
        0.24 = coord(6/25)
    
  5. Mayr, P.; Zapilko, B.; Sure, Y.: ¬Ein Mehr-Thesauri-Szenario auf Basis von SKOS und Crosskonkordanzen (2010) 0.15
    0.14569679 = sum of:
      0.14569679 = product of:
        0.91060495 = sum of:
          0.062371485 = weight(abstract_txt:sozialwissenschaften in 3392) [ClassicSimilarity], result of:
            0.062371485 = score(doc=3392,freq=1.0), product of:
              0.10634478 = queryWeight, product of:
                1.017323 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.013924431 = queryNorm
              0.58650255 = fieldWeight in 3392, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.078125 = fieldNorm(doc=3392)
          0.066556945 = weight(abstract_txt:anzubieten in 3392) [ClassicSimilarity], result of:
            0.066556945 = score(doc=3392,freq=1.0), product of:
              0.111050636 = queryWeight, product of:
                1.0395881 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.013924431 = queryNorm
              0.5993387 = fieldWeight in 3392, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.078125 = fieldNorm(doc=3392)
          0.17601219 = weight(abstract_txt:thesaurus in 3392) [ClassicSimilarity], result of:
            0.17601219 = score(doc=3392,freq=3.0), product of:
              0.25178906 = queryWeight, product of:
                3.5002913 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.013924431 = queryNorm
              0.6990462 = fieldWeight in 3392, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.078125 = fieldNorm(doc=3392)
          0.6056643 = weight(abstract_txt:crosskonkordanzen in 3392) [ClassicSimilarity], result of:
            0.6056643 = score(doc=3392,freq=2.0), product of:
              0.6098506 = queryWeight, product of:
                4.872395 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.013924431 = queryNorm
              0.9931356 = fieldWeight in 3392, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=3392)
        0.16 = coord(4/25)