Document (#38023)

Author
Kempf, A.O.
Zapilko, B.
Title
Normdatenpflege in Zeiten der Automatisierung : Erstellung und Evaluation automatisch aufgebauter Thesaurus-Crosskonkordanzen
Source
Information - Wissenschaft und Praxis. 64(2013) no.4, pp.199-208
Year
2013
Abstract
Thesaurus cross-concordances are an important prerequisite for integrated search in a distributed data structure. Building them, however, requires considerable staff resources. This article presents evaluation results from the Library Track 2012 of the Ontology Alignment Evaluation Initiative (OAEI), in which cross-concordances between the Thesaurus Sozialwissenschaften (TheSoz) and the Standard Thesaurus Wirtschaft (STW) were created automatically for the first time. The evaluation points to clear differences among the matching tools tested and highlights the qualitative differences between an automatically created cross-concordance and an intellectually created one. The results support the use of automatically generated thesaurus cross-concordances to offer domain experts a machine-generated preselection of possible equivalence relations.
Content
Cf.: http://www.degruyter.com/view/j/iwp.2013.64.issue-4/iwp-2013-0025/iwp-2013-0025.xml?format=INT.
Theme
Semantic interoperability
Object
Thesaurus Sozialwissenschaften
Standard Thesaurus Wirtschaft

Similar documents (author)

  1. Kempf, A.O.; Ritze, D.; Eckert, K.; Zapilko, B.: New ways of mapping knowledge organization systems : using a semi-automatic matching-procedure for building up vocabulary crosswalks (2013) 3.92
    3.9177434 = sum of:
      3.9177434 = sum of:
        1.7016068 = weight(author_txt:kempf in 2990) [ClassicSimilarity], result of:
          1.7016068 = score(doc=2990,freq=1.0), product of:
            0.6425226 = queryWeight, product of:
              8.47463 = idf(docFreq=23, maxDocs=42306)
              0.07581718 = queryNorm
            2.648322 = fieldWeight in 2990, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.47463 = idf(docFreq=23, maxDocs=42306)
              0.3125 = fieldNorm(doc=2990)
        2.2161367 = weight(author_txt:zapilko in 2990) [ClassicSimilarity], result of:
          2.2161367 = score(doc=2990,freq=1.0), product of:
            0.7662667 = queryWeight, product of:
              1.0920582 = boost
              9.254789 = idf(docFreq=10, maxDocs=42306)
              0.07581718 = queryNorm
            2.8921218 = fieldWeight in 2990, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.254789 = idf(docFreq=10, maxDocs=42306)
              0.3125 = fieldNorm(doc=2990)
    
  2. Kempf, A.O.; Ritze, D.; Eckert, K.; Zapilko, B.: New ways of mapping knowledge organization systems : using a semi-automatic matching procedure for building up vocabulary crosswalks (2014) 3.92
    3.9177434 = sum of:
      3.9177434 = sum of:
        1.7016068 = weight(author_txt:kempf in 3372) [ClassicSimilarity], result of:
          1.7016068 = score(doc=3372,freq=1.0), product of:
            0.6425226 = queryWeight, product of:
              8.47463 = idf(docFreq=23, maxDocs=42306)
              0.07581718 = queryNorm
            2.648322 = fieldWeight in 3372, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.47463 = idf(docFreq=23, maxDocs=42306)
              0.3125 = fieldNorm(doc=3372)
        2.2161367 = weight(author_txt:zapilko in 3372) [ClassicSimilarity], result of:
          2.2161367 = score(doc=3372,freq=1.0), product of:
            0.7662667 = queryWeight, product of:
              1.0920582 = boost
              9.254789 = idf(docFreq=10, maxDocs=42306)
              0.07581718 = queryNorm
            2.8921218 = fieldWeight in 3372, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.254789 = idf(docFreq=10, maxDocs=42306)
              0.3125 = fieldNorm(doc=3372)
    
  3. Zapilko, B.: Dynamisches Browsing im Kontext von Informationsarchitekturen (2010) 2.22
    2.2161367 = sum of:
      2.2161367 = product of:
        4.4322734 = sum of:
          4.4322734 = weight(author_txt:zapilko in 745) [ClassicSimilarity], result of:
            4.4322734 = score(doc=745,freq=1.0), product of:
              0.7662667 = queryWeight, product of:
                1.0920582 = boost
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.07581718 = queryNorm
              5.7842436 = fieldWeight in 745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.625 = fieldNorm(doc=745)
        0.5 = coord(1/2)
    
  4. Zapilko, B.: InFoLiS (2017) 2.22
    2.2161367 = sum of:
      2.2161367 = product of:
        4.4322734 = sum of:
          4.4322734 = weight(author_txt:zapilko in 3032) [ClassicSimilarity], result of:
            4.4322734 = score(doc=3032,freq=1.0), product of:
              0.7662667 = queryWeight, product of:
                1.0920582 = boost
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.07581718 = queryNorm
              5.7842436 = fieldWeight in 3032, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.625 = fieldNorm(doc=3032)
        0.5 = coord(1/2)
    
  5. Stempfhuber, M.; Zapilko, B.: Modelling text-fact-integration in digital libraries (2009) 1.77
    1.7729093 = sum of:
      1.7729093 = product of:
        3.5458186 = sum of:
          3.5458186 = weight(author_txt:zapilko in 394) [ClassicSimilarity], result of:
            3.5458186 = score(doc=394,freq=1.0), product of:
              0.7662667 = queryWeight, product of:
                1.0920582 = boost
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.07581718 = queryNorm
              4.6273947 = fieldWeight in 394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.5 = fieldNorm(doc=394)
        0.5 = coord(1/2)
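
The score trees in the author listing follow Lucene's ClassicSimilarity, where tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). As a minimal sketch, the following Python recomputes some of the listed values from the raw parameters. The function names are hypothetical, and the boost, queryNorm and fieldNorm values are simply copied from the listing rather than derived.

```python
import math

MAX_DOCS = 42306          # maxDocs as shown in the listing
QUERY_NORM = 0.07581718   # queryNorm as shown in the author listing


def idf(doc_freq, max_docs=MAX_DOCS):
    """ClassicSimilarity inverse document frequency."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    """ClassicSimilarity term frequency factor."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, field_norm, boost=1.0, query_norm=QUERY_NORM):
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


# Entry 1 (doc 2990): both author terms match, so the two term scores are summed.
kempf = term_score(freq=1.0, doc_freq=23, field_norm=0.3125)
zapilko = term_score(freq=1.0, doc_freq=10, field_norm=0.3125, boost=1.0920582)
both_authors = kempf + zapilko

# Entry 3 (doc 745): only one of the two author terms matches,
# so the sum is scaled by coord(1/2) = 0.5.
one_author = 0.5 * term_score(freq=1.0, doc_freq=10, field_norm=0.625,
                              boost=1.0920582)

print(round(kempf, 4), round(zapilko, 4),
      round(both_authors, 4), round(one_author, 4))
```

The results should agree with the listed scores up to rounding, since Lucene computes them in single precision.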
    

Similar documents (content)

  1. Mayr, P.; Petras, V.: Crosskonkordanzen : Terminologie Mapping und deren Effektivität für das Information Retrieval 0.29
    0.29059348 = sum of:
      0.29059348 = product of:
        1.4529674 = sum of:
          0.075264126 = weight(abstract_txt:sozialwissenschaften in 3997) [ClassicSimilarity], result of:
            0.075264126 = score(doc=3997,freq=1.0), product of:
              0.1069062 = queryWeight, product of:
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.014236033 = queryNorm
              0.70402026 = fieldWeight in 3997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.09375 = fieldNorm(doc=3997)
          0.15592778 = weight(abstract_txt:evaluationsergebnisse in 3997) [ClassicSimilarity], result of:
            0.15592778 = score(doc=3997,freq=1.0), product of:
              0.17373735 = queryWeight, product of:
                1.2748091 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.014236033 = queryNorm
              0.89749146 = fieldWeight in 3997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.09375 = fieldNorm(doc=3997)
          0.15592778 = weight(abstract_txt:crosskonkordanz in 3997) [ClassicSimilarity], result of:
            0.15592778 = score(doc=3997,freq=1.0), product of:
              0.17373735 = queryWeight, product of:
                1.2748091 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.014236033 = queryNorm
              0.89749146 = fieldWeight in 3997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.09375 = fieldNorm(doc=3997)
          0.04837162 = weight(abstract_txt:evaluation in 3997) [ClassicSimilarity], result of:
            0.04837162 = score(doc=3997,freq=1.0), product of:
              0.11482727 = queryWeight, product of:
                1.7950712 = boost
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.014236033 = queryNorm
              0.42125553 = fieldWeight in 3997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.09375 = fieldNorm(doc=3997)
          1.0174761 = weight(abstract_txt:crosskonkordanzen in 3997) [ClassicSimilarity], result of:
            1.0174761 = score(doc=3997,freq=4.0), product of:
              0.60668087 = queryWeight, product of:
                4.7644053 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.014236033 = queryNorm
              1.677119 = fieldWeight in 3997, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.09375 = fieldNorm(doc=3997)
        0.2 = coord(5/25)
    
  2. Schott, H.; Schroeder, A.: Crosskonkordanzen von Thesauri und Klassifikationen (2004) 0.23
    0.22885533 = sum of:
      0.22885533 = product of:
        1.1442766 = sum of:
          0.06481494 = weight(abstract_txt:verteilten in 4127) [ClassicSimilarity], result of:
            0.06481494 = score(doc=4127,freq=1.0), product of:
              0.10927356 = queryWeight, product of:
                1.0110115 = boost
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.014236033 = queryNorm
              0.5931438 = fieldWeight in 4127, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.078125 = fieldNorm(doc=4127)
          0.06923016 = weight(abstract_txt:integrierte in 4127) [ClassicSimilarity], result of:
            0.06923016 = score(doc=4127,freq=1.0), product of:
              0.11418137 = queryWeight, product of:
                1.033466 = boost
                7.760864 = idf(docFreq=48, maxDocs=42306)
                0.014236033 = queryNorm
              0.60631746 = fieldWeight in 4127, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.760864 = idf(docFreq=48, maxDocs=42306)
                0.078125 = fieldNorm(doc=4127)
          0.07702283 = weight(abstract_txt:intellektuell in 4127) [ClassicSimilarity], result of:
            0.07702283 = score(doc=4127,freq=1.0), product of:
              0.12259647 = queryWeight, product of:
                1.070872 = boost
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.014236033 = queryNorm
              0.628263 = fieldWeight in 4127, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.078125 = fieldNorm(doc=4127)
          0.08531202 = weight(abstract_txt:maschinell in 4127) [ClassicSimilarity], result of:
            0.08531202 = score(doc=4127,freq=1.0), product of:
              0.1312417 = queryWeight, product of:
                1.1079865 = boost
                8.320479 = idf(docFreq=27, maxDocs=42306)
                0.014236033 = queryNorm
              0.65003747 = fieldWeight in 4127, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.320479 = idf(docFreq=27, maxDocs=42306)
                0.078125 = fieldNorm(doc=4127)
          0.84789664 = weight(abstract_txt:crosskonkordanzen in 4127) [ClassicSimilarity], result of:
            0.84789664 = score(doc=4127,freq=4.0), product of:
              0.60668087 = queryWeight, product of:
                4.7644053 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.014236033 = queryNorm
              1.3975991 = fieldWeight in 4127, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.078125 = fieldNorm(doc=4127)
        0.2 = coord(5/25)
    
  3. Mayr, P.: Re-Ranking auf Basis von Bradfordizing für die verteilte Suche in Digitalen Bibliotheken (2009) 0.19
    0.19290182 = sum of:
      0.19290182 = product of:
        0.8037576 = sum of:
          0.043904077 = weight(abstract_txt:sozialwissenschaften in 1303) [ClassicSimilarity], result of:
            0.043904077 = score(doc=1303,freq=1.0), product of:
              0.1069062 = queryWeight, product of:
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.014236033 = queryNorm
              0.4106785 = fieldWeight in 1303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1303)
          0.053915977 = weight(abstract_txt:intellektuell in 1303) [ClassicSimilarity], result of:
            0.053915977 = score(doc=1303,freq=1.0), product of:
              0.12259647 = queryWeight, product of:
                1.070872 = boost
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.014236033 = queryNorm
              0.43978408 = fieldWeight in 1303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1303)
          0.053915977 = weight(abstract_txt:qualitativen in 1303) [ClassicSimilarity], result of:
            0.053915977 = score(doc=1303,freq=1.0), product of:
              0.12259647 = queryWeight, product of:
                1.070872 = boost
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.014236033 = queryNorm
              0.43978408 = fieldWeight in 1303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1303)
          0.018589348 = weight(abstract_txt:einer in 1303) [ClassicSimilarity], result of:
            0.018589348 = score(doc=1303,freq=1.0), product of:
              0.086939305 = queryWeight, product of:
                1.5619504 = boost
                3.9098482 = idf(docFreq=2304, maxDocs=42306)
                0.014236033 = queryNorm
              0.21381983 = fieldWeight in 1303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9098482 = idf(docFreq=2304, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1303)
          0.039904553 = weight(abstract_txt:evaluation in 1303) [ClassicSimilarity], result of:
            0.039904553 = score(doc=1303,freq=2.0), product of:
              0.11482727 = queryWeight, product of:
                1.7950712 = boost
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.014236033 = queryNorm
              0.3475181 = fieldWeight in 1303, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1303)
          0.5935277 = weight(abstract_txt:crosskonkordanzen in 1303) [ClassicSimilarity], result of:
            0.5935277 = score(doc=1303,freq=4.0), product of:
              0.60668087 = queryWeight, product of:
                4.7644053 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.014236033 = queryNorm
              0.9783194 = fieldWeight in 1303, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1303)
        0.24 = coord(6/25)
    
  4. Strötgen, R.; Kokkelink, S.: Metadatenextraktion aus Internetquellen : Heterogenitätsbehandlung im Projekt CARMEN (2001) 0.17
    0.17057568 = sum of:
      0.17057568 = product of:
        0.710732 = sum of:
          0.050176088 = weight(abstract_txt:sozialwissenschaften in 809) [ClassicSimilarity], result of:
            0.050176088 = score(doc=809,freq=1.0), product of:
              0.1069062 = queryWeight, product of:
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.014236033 = queryNorm
              0.46934685 = fieldWeight in 809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.0625 = fieldNorm(doc=809)
          0.05185195 = weight(abstract_txt:verteilten in 809) [ClassicSimilarity], result of:
            0.05185195 = score(doc=809,freq=1.0), product of:
              0.10927356 = queryWeight, product of:
                1.0110115 = boost
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.014236033 = queryNorm
              0.47451508 = fieldWeight in 809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.0625 = fieldNorm(doc=809)
          0.08714137 = weight(abstract_txt:intellektuell in 809) [ClassicSimilarity], result of:
            0.08714137 = score(doc=809,freq=2.0), product of:
              0.12259647 = queryWeight, product of:
                1.070872 = boost
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.014236033 = queryNorm
              0.7107984 = fieldWeight in 809, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.0625 = fieldNorm(doc=809)
          0.021244967 = weight(abstract_txt:einer in 809) [ClassicSimilarity], result of:
            0.021244967 = score(doc=809,freq=1.0), product of:
              0.086939305 = queryWeight, product of:
                1.5619504 = boost
                3.9098482 = idf(docFreq=2304, maxDocs=42306)
                0.014236033 = queryNorm
              0.24436551 = fieldWeight in 809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9098482 = idf(docFreq=2304, maxDocs=42306)
                0.0625 = fieldNorm(doc=809)
          0.16115895 = weight(abstract_txt:automatisch in 809) [ClassicSimilarity], result of:
            0.16115895 = score(doc=809,freq=1.0), product of:
              0.36942643 = queryWeight, product of:
                3.7178557 = boost
                6.9798555 = idf(docFreq=106, maxDocs=42306)
                0.014236033 = queryNorm
              0.43624097 = fieldWeight in 809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9798555 = idf(docFreq=106, maxDocs=42306)
                0.0625 = fieldNorm(doc=809)
          0.33915865 = weight(abstract_txt:crosskonkordanzen in 809) [ClassicSimilarity], result of:
            0.33915865 = score(doc=809,freq=1.0), product of:
              0.60668087 = queryWeight, product of:
                4.7644053 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.014236033 = queryNorm
              0.55903965 = fieldWeight in 809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.0625 = fieldNorm(doc=809)
        0.24 = coord(6/25)
    
  5. Mayr, P.; Zapilko, B.; Sure, Y.: ¬Ein Mehr-Thesauri-Szenario auf Basis von SKOS und Crosskonkordanzen (2010) 0.14
    0.1445815 = sum of:
      0.1445815 = product of:
        0.90363437 = sum of:
          0.06272011 = weight(abstract_txt:sozialwissenschaften in 393) [ClassicSimilarity], result of:
            0.06272011 = score(doc=393,freq=1.0), product of:
              0.1069062 = queryWeight, product of:
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.014236033 = queryNorm
              0.5866836 = fieldWeight in 393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.078125 = fieldNorm(doc=393)
          0.06571783 = weight(abstract_txt:anzubieten in 393) [ClassicSimilarity], result of:
            0.06571783 = score(doc=393,freq=1.0), product of:
              0.11028603 = queryWeight, product of:
                1.0156845 = boost
                7.6273327 = idf(docFreq=55, maxDocs=42306)
                0.014236033 = queryNorm
              0.5958854 = fieldWeight in 393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6273327 = idf(docFreq=55, maxDocs=42306)
                0.078125 = fieldNorm(doc=393)
          0.17564295 = weight(abstract_txt:thesaurus in 393) [ClassicSimilarity], result of:
            0.17564295 = score(doc=393,freq=3.0), product of:
              0.25182667 = queryWeight, product of:
                3.4318984 = boost
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.014236033 = queryNorm
              0.69747555 = fieldWeight in 393, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.078125 = fieldNorm(doc=393)
          0.59955347 = weight(abstract_txt:crosskonkordanzen in 393) [ClassicSimilarity], result of:
            0.59955347 = score(doc=393,freq=2.0), product of:
              0.60668087 = queryWeight, product of:
                4.7644053 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.014236033 = queryNorm
              0.9882518 = fieldWeight in 393, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.078125 = fieldNorm(doc=393)
        0.16 = coord(4/25)
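
In the content listing, each document score is the sum of the matching term scores multiplied by the coordination factor coord(matched/total). As a rough sketch under the same ClassicSimilarity formulas, the following Python recomputes the score of entry 5 (doc 393) from the parameters shown in its tree. The helper names are hypothetical, and the per-term boosts are taken from the listing as given.

```python
import math

MAX_DOCS = 42306           # maxDocs as shown in the listing
QUERY_NORM = 0.014236033   # queryNorm as shown in the content listing


def term_score(freq, doc_freq, field_norm, boost=1.0):
    """queryWeight * fieldWeight for a single term."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(terms, matched, total):
    """Sum of the term scores, scaled by coord(matched/total)."""
    return sum(term_score(**t) for t in terms) * matched / total


# Parameters copied from the explain tree of doc 393.
doc_393_terms = [
    dict(freq=1.0, doc_freq=62, field_norm=0.078125),                    # sozialwissenschaften
    dict(freq=1.0, doc_freq=55, field_norm=0.078125, boost=1.0156845),   # anzubieten
    dict(freq=3.0, doc_freq=663, field_norm=0.078125, boost=3.4318984),  # thesaurus
    dict(freq=2.0, doc_freq=14, field_norm=0.078125, boost=4.7644053),   # crosskonkordanzen
]

# 4 of the 25 query terms match this document.
score_393 = document_score(doc_393_terms, matched=4, total=25)
print(round(score_393, 4))
```

Because coord scales with the share of query terms matched, a document that matches only a few of the 25 terms ranks low even when one rare term, such as crosskonkordanzen, contributes a large individual score.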