Document (#38589)

Author
Stiller, J.
Olensky, M.
Petras, V.
Title
¬A framework for the evaluation of automatic metadata enrichments
Source
Metadata and semantics research: 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings. Eds.: S. Closs et al
Imprint
Cham : Springer
Year
2014
Pages
S.238-249
Series
Communications in computer and information science; 478
Abstract
Automatic enrichment of collections connects data to vocabularies, which supports the contextualization of content and adds searchable text to metadata. The paper introduces a framework of four dimensions (frequency, coverage, relevance and error rate) that measure both the suitability of the enrichment for the object and the enrichments' contribution to search success. To verify the framework, it is applied to the evaluation of automatic enrichments in the digital library Europeana. The analysis of 100 result sets and their corresponding queries (1,121 documents total) shows the framework is a valuable tool for guiding enrichments and determining the value of enrichment efforts.
Theme
Metadaten
Object
Europeana

Similar documents (author)

  1. Petras, V.: Heterogenitätsbehandlung und Terminology Mapping durch Crosskonkordanzen : eine Fallstudie (2010) 1.87
    1.8734696 = sum of:
      1.8734696 = product of:
        3.7469392 = sum of:
          3.7469392 = weight(author_txt:petras in 731) [ClassicSimilarity], result of:
            3.7469392 = score(doc=731,freq=1.0), product of:
              0.6539319 = queryWeight, product of:
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.07132938 = queryNorm
              5.7298613 = fieldWeight in 731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.625 = fieldNorm(doc=731)
        0.5 = coord(1/2)
    
  2. Gradmann, S.; Olensky, M.: Semantische Kontextualisierung von Museumsbeständen in Europeana (2013) 1.87
    1.8650792 = sum of:
      1.8650792 = product of:
        3.7301583 = sum of:
          3.7301583 = weight(author_txt:olensky in 2940) [ClassicSimilarity], result of:
            3.7301583 = score(doc=2940,freq=1.0), product of:
              0.7565535 = queryWeight, product of:
                1.0756068 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.07132938 = queryNorm
              4.9304624 = fieldWeight in 2940, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.5 = fieldNorm(doc=2940)
        0.5 = coord(1/2)
    
  3. Petras, V.; Bank, M.: Vergleich der Suchmaschinen AltaVista und HotBot bezüglich Treffermengen und Aktualität (1998) 1.50
    1.4987756 = sum of:
      1.4987756 = product of:
        2.9975512 = sum of:
          2.9975512 = weight(author_txt:petras in 3515) [ClassicSimilarity], result of:
            2.9975512 = score(doc=3515,freq=1.0), product of:
              0.6539319 = queryWeight, product of:
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.07132938 = queryNorm
              4.583889 = fieldWeight in 3515, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.5 = fieldNorm(doc=3515)
        0.5 = coord(1/2)
    
  4. Mayr, P.; Petras, V.: Crosskonkordanzen : Terminologie Mapping und deren Effektivität für das Information Retrieval 1.50
    1.4987756 = sum of:
      1.4987756 = product of:
        2.9975512 = sum of:
          2.9975512 = weight(author_txt:petras in 3997) [ClassicSimilarity], result of:
            2.9975512 = score(doc=3997,freq=1.0), product of:
              0.6539319 = queryWeight, product of:
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.07132938 = queryNorm
              4.583889 = fieldWeight in 3997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.5 = fieldNorm(doc=3997)
        0.5 = coord(1/2)
    
  5. Mayr, P.; Petras, V.: Cross-concordances : terminology mapping and its effectiveness for information retrieval (2008) 1.50
    1.4987756 = sum of:
      1.4987756 = product of:
        2.9975512 = sum of:
          2.9975512 = weight(author_txt:petras in 143) [ClassicSimilarity], result of:
            2.9975512 = score(doc=143,freq=1.0), product of:
              0.6539319 = queryWeight, product of:
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.07132938 = queryNorm
              4.583889 = fieldWeight in 143, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.5 = fieldNorm(doc=143)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Nicholls, P.; Ridley, J.: ¬A context for evaluating for multimedia (1996) 0.09
    0.092269436 = sum of:
      0.092269436 = product of:
        0.576684 = sum of:
          0.07798128 = weight(abstract_txt:introduces in 5201) [ClassicSimilarity], result of:
            0.07798128 = score(doc=5201,freq=1.0), product of:
              0.10674066 = queryWeight, product of:
                5.8445415 = idf(docFreq=332, maxDocs=42306)
                0.018263308 = queryNorm
              0.7305677 = fieldWeight in 5201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8445415 = idf(docFreq=332, maxDocs=42306)
                0.125 = fieldNorm(doc=5201)
          0.17842467 = weight(abstract_txt:adds in 5201) [ClassicSimilarity], result of:
            0.17842467 = score(doc=5201,freq=1.0), product of:
              0.18534161 = queryWeight, product of:
                1.317715 = boost
                7.7014403 = idf(docFreq=51, maxDocs=42306)
                0.018263308 = queryNorm
              0.96268004 = fieldWeight in 5201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7014403 = idf(docFreq=51, maxDocs=42306)
                0.125 = fieldNorm(doc=5201)
          0.100232154 = weight(abstract_txt:evaluation in 5201) [ClassicSimilarity], result of:
            0.100232154 = score(doc=5201,freq=2.0), product of:
              0.12618499 = queryWeight, product of:
                1.5376372 = boost
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.018263308 = queryNorm
              0.7943271 = fieldWeight in 5201, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.125 = fieldNorm(doc=5201)
          0.2200459 = weight(abstract_txt:framework in 5201) [ClassicSimilarity], result of:
            0.2200459 = score(doc=5201,freq=2.0), product of:
              0.26854795 = queryWeight, product of:
                3.1723125 = boost
                4.635178 = idf(docFreq=1115, maxDocs=42306)
                0.018263308 = queryNorm
              0.8193914 = fieldWeight in 5201, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.635178 = idf(docFreq=1115, maxDocs=42306)
                0.125 = fieldNorm(doc=5201)
        0.16 = coord(4/25)
    
  2. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.09
    0.086552344 = sum of:
      0.086552344 = product of:
        0.72126955 = sum of:
          0.07958447 = weight(abstract_txt:determining in 3092) [ClassicSimilarity], result of:
            0.07958447 = score(doc=3092,freq=1.0), product of:
              0.13107334 = queryWeight, product of:
                1.1081339 = boost
                6.4765344 = idf(docFreq=176, maxDocs=42306)
                0.018263308 = queryNorm
              0.6071751 = fieldWeight in 3092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4765344 = idf(docFreq=176, maxDocs=42306)
                0.09375 = fieldNorm(doc=3092)
          0.21353455 = weight(abstract_txt:automatic in 3092) [ClassicSimilarity], result of:
            0.21353455 = score(doc=3092,freq=3.0), product of:
              0.25308958 = queryWeight, product of:
                2.6670601 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.018263308 = queryNorm
              0.8437114 = fieldWeight in 3092, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.09375 = fieldNorm(doc=3092)
          0.42815053 = weight(abstract_txt:enrichment in 3092) [ClassicSimilarity], result of:
            0.42815053 = score(doc=3092,freq=1.0), product of:
              0.5804082 = queryWeight, product of:
                4.038894 = boost
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.018263308 = queryNorm
              0.7376714 = fieldWeight in 3092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.09375 = fieldNorm(doc=3092)
        0.12 = coord(3/25)
    
  3. Rindflesch, T.C.; Fizsman, M.: The interaction of domain knowledge and linguistic structure in natural language processing : interpreting hypernymic propositions in biomedical text (2003) 0.07
    0.07426988 = sum of:
      0.07426988 = product of:
        0.30945787 = sum of:
          0.039111335 = weight(abstract_txt:contribution in 4098) [ClassicSimilarity], result of:
            0.039111335 = score(doc=4098,freq=1.0), product of:
              0.106960826 = queryWeight, product of:
                1.0010308 = boost
                5.850566 = idf(docFreq=330, maxDocs=42306)
                0.018263308 = queryNorm
              0.36566037 = fieldWeight in 4098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.850566 = idf(docFreq=330, maxDocs=42306)
                0.0625 = fieldNorm(doc=4098)
          0.041679796 = weight(abstract_txt:valuable in 4098) [ClassicSimilarity], result of:
            0.041679796 = score(doc=4098,freq=1.0), product of:
              0.1115938 = queryWeight, product of:
                1.0224806 = boost
                5.97593 = idf(docFreq=291, maxDocs=42306)
                0.018263308 = queryNorm
              0.37349564 = fieldWeight in 4098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.97593 = idf(docFreq=291, maxDocs=42306)
                0.0625 = fieldNorm(doc=4098)
          0.048526496 = weight(abstract_txt:supports in 4098) [ClassicSimilarity], result of:
            0.048526496 = score(doc=4098,freq=1.0), product of:
              0.123502456 = queryWeight, product of:
                1.0756546 = boost
                6.2867084 = idf(docFreq=213, maxDocs=42306)
                0.018263308 = queryNorm
              0.39291927 = fieldWeight in 4098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2867084 = idf(docFreq=213, maxDocs=42306)
                0.0625 = fieldNorm(doc=4098)
          0.06251332 = weight(abstract_txt:error in 4098) [ClassicSimilarity], result of:
            0.06251332 = score(doc=4098,freq=1.0), product of:
              0.1462193 = queryWeight, product of:
                1.1704082 = boost
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.018263308 = queryNorm
              0.42753124 = fieldWeight in 4098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8405 = idf(docFreq=122, maxDocs=42306)
                0.0625 = fieldNorm(doc=4098)
          0.035437416 = weight(abstract_txt:evaluation in 4098) [ClassicSimilarity], result of:
            0.035437416 = score(doc=4098,freq=1.0), product of:
              0.12618499 = queryWeight, product of:
                1.5376372 = boost
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.018263308 = queryNorm
              0.28083703 = fieldWeight in 4098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.0625 = fieldNorm(doc=4098)
          0.08218949 = weight(abstract_txt:automatic in 4098) [ClassicSimilarity], result of:
            0.08218949 = score(doc=4098,freq=1.0), product of:
              0.25308958 = queryWeight, product of:
                2.6670601 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.018263308 = queryNorm
              0.32474467 = fieldWeight in 4098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.0625 = fieldNorm(doc=4098)
        0.24 = coord(6/25)
    
  4. Hobson, S.P.; Dorr, B.J.; Monz, C.; Schwartz, R.: Task-based evaluation of text summarization using Relevance Prediction (2007) 0.07
    0.074103326 = sum of:
      0.074103326 = product of:
        0.37051663 = sum of:
          0.03899064 = weight(abstract_txt:introduces in 2939) [ClassicSimilarity], result of:
            0.03899064 = score(doc=2939,freq=1.0), product of:
              0.10674066 = queryWeight, product of:
                5.8445415 = idf(docFreq=332, maxDocs=42306)
                0.018263308 = queryNorm
              0.36528385 = fieldWeight in 2939, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8445415 = idf(docFreq=332, maxDocs=42306)
                0.0625 = fieldNorm(doc=2939)
          0.04999226 = weight(abstract_txt:corresponding in 2939) [ClassicSimilarity], result of:
            0.04999226 = score(doc=2939,freq=1.0), product of:
              0.12597707 = queryWeight, product of:
                1.0863776 = boost
                6.349379 = idf(docFreq=200, maxDocs=42306)
                0.018263308 = queryNorm
              0.3968362 = fieldWeight in 2939, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.349379 = idf(docFreq=200, maxDocs=42306)
                0.0625 = fieldNorm(doc=2939)
          0.061379407 = weight(abstract_txt:evaluation in 2939) [ClassicSimilarity], result of:
            0.061379407 = score(doc=2939,freq=3.0), product of:
              0.12618499 = queryWeight, product of:
                1.5376372 = boost
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.018263308 = queryNorm
              0.486424 = fieldWeight in 2939, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.0625 = fieldNorm(doc=2939)
          0.14235637 = weight(abstract_txt:automatic in 2939) [ClassicSimilarity], result of:
            0.14235637 = score(doc=2939,freq=3.0), product of:
              0.25308958 = queryWeight, product of:
                2.6670601 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.018263308 = queryNorm
              0.56247425 = fieldWeight in 2939, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.0625 = fieldNorm(doc=2939)
          0.07779797 = weight(abstract_txt:framework in 2939) [ClassicSimilarity], result of:
            0.07779797 = score(doc=2939,freq=1.0), product of:
              0.26854795 = queryWeight, product of:
                3.1723125 = boost
                4.635178 = idf(docFreq=1115, maxDocs=42306)
                0.018263308 = queryNorm
              0.28969863 = fieldWeight in 2939, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635178 = idf(docFreq=1115, maxDocs=42306)
                0.0625 = fieldNorm(doc=2939)
        0.2 = coord(5/25)
    
  5. Parinov, S.: Semantic enrichment of research outputs metadat : new CRIS facilities for authors (2014) 0.07
    0.07319458 = sum of:
      0.07319458 = product of:
        0.60995483 = sum of:
          0.035437416 = weight(abstract_txt:evaluation in 3585) [ClassicSimilarity], result of:
            0.035437416 = score(doc=3585,freq=1.0), product of:
              0.12618499 = queryWeight, product of:
                1.5376372 = boost
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.018263308 = queryNorm
              0.28083703 = fieldWeight in 3585, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.0625 = fieldNorm(doc=3585)
          0.08013183 = weight(abstract_txt:metadata in 3585) [ClassicSimilarity], result of:
            0.08013183 = score(doc=3585,freq=3.0), product of:
              0.15072869 = queryWeight, product of:
                1.6805367 = boost
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.018263308 = queryNorm
              0.53162956 = fieldWeight in 3585, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.0625 = fieldNorm(doc=3585)
          0.4943856 = weight(abstract_txt:enrichment in 3585) [ClassicSimilarity], result of:
            0.4943856 = score(doc=3585,freq=3.0), product of:
              0.5804082 = queryWeight, product of:
                4.038894 = boost
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.018263308 = queryNorm
              0.8517895 = fieldWeight in 3585, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.8684945 = idf(docFreq=43, maxDocs=42306)
                0.0625 = fieldNorm(doc=3585)
        0.12 = coord(3/25)