Document (#37045)

Author
Margaritopoulos, M.
Margaritopoulos, T.
Mavridis, I.
Manitsaris, A.
Title
Quantifying and measuring metadata completeness
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.4, pp. 724-737
Year
2012
Abstract
Completeness of metadata is one of the most essential characteristics of their quality. An incomplete metadata record is a record of degraded quality. Existing approaches to measuring metadata completeness limit their scope to counting the existence of values in fields, regardless of the metadata hierarchy as defined in international standards. Such a traditional approach overlooks several issues that need to be taken into account. This paper presents a fine-grained metrics system for measuring metadata completeness, based on field completeness. A metadata field is considered to be a container of multiple pieces of information. In this regard, the proposed system is capable of following the hierarchy of metadata as it is set by the metadata schema and measuring the effect of multiple values of multivalued fields. An application of the proposed metrics system, after being configured according to specific user requirements, to measure completeness of a real-world set of metadata is demonstrated. The results demonstrate its ability to assess the sufficiency of metadata to describe a resource and provide targeted measures of completeness throughout the metadata hierarchy.
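The idea in the abstract of a completeness score that follows the schema hierarchy and accounts for multivalued fields can be illustrated with a minimal sketch. This is not the formula from the paper: the weighting scheme, the `expected` value counts, the field names, and both function names are assumptions made only for illustration.

```python
def leaf_completeness(value, expected=1):
    """Fraction of the expected number of values actually present in a leaf field.

    Assumption: a multivalued field counts as complete once it holds
    `expected` non-empty values; extra values add nothing.
    """
    if value is None:
        values = []
    elif isinstance(value, (list, tuple)):
        values = list(value)
    else:
        values = [value]
    filled = [v for v in values if v is not None and str(v).strip()]
    return min(len(filled), expected) / expected


def completeness(schema, record):
    """Weighted average of child completeness, applied recursively down the hierarchy."""
    record = record or {}
    total_weight = sum(node["weight"] for node in schema.values())
    score = 0.0
    for name, node in schema.items():
        value = record.get(name)
        if "children" in node:
            # Composite field: descend into its subfields.
            sub_record = value if isinstance(value, dict) else None
            part = completeness(node["children"], sub_record)
        else:
            part = leaf_completeness(value, node.get("expected", 1))
        score += node["weight"] * part
    return score / total_weight


# Hypothetical schema and record, invented for this example.
schema = {
    "title": {"weight": 2, "expected": 1},
    "keyword": {"weight": 1, "expected": 3},
    "lifecycle": {
        "weight": 1,
        "children": {
            "version": {"weight": 1, "expected": 1},
            "contributor": {"weight": 1, "expected": 1},
        },
    },
}
record = {
    "title": "Quantifying and measuring metadata completeness",
    "keyword": ["metadata"],
    "lifecycle": {"version": "1.0"},
}

record_score = completeness(schema, record)
lifecycle_score = completeness(schema["lifecycle"]["children"], record["lifecycle"])
print(round(record_score, 4), lifecycle_score)
```

Calling `completeness` on a subtree, as with `lifecycle_score`, gives a targeted measure for one branch of the hierarchy rather than for the whole record.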
Theme
Metadata

Similar documents (content)

  1. Park, J.-r.: Metadata quality in digital repositories : a survey of the current state of the art (2009) 0.17
    0.17287372 = sum of:
      0.17287372 = product of:
        1.0804608 = sum of:
          0.068991385 = weight(abstract_txt:quality in 802) [ClassicSimilarity], result of:
            0.068991385 = score(doc=802,freq=7.0), product of:
              0.071261406 = queryWeight, product of:
                1.279395 = boost
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.01189182 = queryNorm
              0.9681452 = fieldWeight in 802, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.078125 = fieldNorm(doc=802)
          0.07597813 = weight(abstract_txt:measuring in 802) [ClassicSimilarity], result of:
            0.07597813 = score(doc=802,freq=1.0), product of:
              0.14537272 = queryWeight, product of:
                1.827338 = boost
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.01189182 = queryNorm
              0.5226437 = fieldWeight in 802, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.078125 = fieldNorm(doc=802)
          0.4254056 = weight(abstract_txt:completeness in 802) [ClassicSimilarity], result of:
            0.4254056 = score(doc=802,freq=1.0), product of:
              0.6959563 = queryWeight, product of:
                7.4800143 = boost
                7.824043 = idf(docFreq=45, maxDocs=42306)
                0.01189182 = queryNorm
              0.6112533 = fieldWeight in 802, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.824043 = idf(docFreq=45, maxDocs=42306)
                0.078125 = fieldNorm(doc=802)
          0.5100857 = weight(abstract_txt:metadata in 802) [ClassicSimilarity], result of:
            0.5100857 = score(doc=802,freq=8.0), product of:
              0.47004524 = queryWeight, product of:
                8.048647 = boost
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.01189182 = queryNorm
              1.0851843 = fieldWeight in 802, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.078125 = fieldNorm(doc=802)
        0.16 = coord(4/25)
    
  2. Foulonneau, M.: Information redundancy across metadata collections (2007) 0.16
    0.1622717 = sum of:
      0.1622717 = product of:
        1.0141981 = sum of:
          0.020861035 = weight(abstract_txt:quality in 2916) [ClassicSimilarity], result of:
            0.020861035 = score(doc=2916,freq=1.0), product of:
              0.071261406 = queryWeight, product of:
                1.279395 = boost
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.01189182 = queryNorm
              0.2927396 = fieldWeight in 2916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.0625 = fieldNorm(doc=2916)
          0.033542827 = weight(abstract_txt:record in 2916) [ClassicSimilarity], result of:
            0.033542827 = score(doc=2916,freq=1.0), product of:
              0.09780557 = queryWeight, product of:
                1.4988537 = boost
                5.4872665 = idf(docFreq=475, maxDocs=42306)
                0.01189182 = queryNorm
              0.34295416 = fieldWeight in 2916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4872665 = idf(docFreq=475, maxDocs=42306)
                0.0625 = fieldNorm(doc=2916)
          0.48129147 = weight(abstract_txt:completeness in 2916) [ClassicSimilarity], result of:
            0.48129147 = score(doc=2916,freq=2.0), product of:
              0.6959563 = queryWeight, product of:
                7.4800143 = boost
                7.824043 = idf(docFreq=45, maxDocs=42306)
                0.01189182 = queryNorm
              0.6915542 = fieldWeight in 2916, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.824043 = idf(docFreq=45, maxDocs=42306)
                0.0625 = fieldNorm(doc=2916)
          0.4785028 = weight(abstract_txt:metadata in 2916) [ClassicSimilarity], result of:
            0.4785028 = score(doc=2916,freq=11.0), product of:
              0.47004524 = queryWeight, product of:
                8.048647 = boost
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.01189182 = queryNorm
              1.0179931 = fieldWeight in 2916, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.0625 = fieldNorm(doc=2916)
        0.16 = coord(4/25)
    
  3. Margaritopoulos, T.; Margaritopoulos, M.; Mavridis, I.; Manitsaris, A.: A conceptual framework for metadata quality assessment (2008) 0.12
    0.11605928 = sum of:
      0.11605928 = product of:
        0.5802964 = sum of:
          0.05419856 = weight(abstract_txt:quality in 463) [ClassicSimilarity], result of:
            0.05419856 = score(doc=463,freq=3.0), product of:
              0.071261406 = queryWeight, product of:
                1.279395 = boost
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.01189182 = queryNorm
              0.7605598 = fieldWeight in 463, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.09375 = fieldNorm(doc=463)
          0.039964635 = weight(abstract_txt:fields in 463) [ClassicSimilarity], result of:
            0.039964635 = score(doc=463,freq=1.0), product of:
              0.0838855 = queryWeight, product of:
                1.3881004 = boost
                5.0818014 = idf(docFreq=713, maxDocs=42306)
                0.01189182 = queryNorm
              0.47641888 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0818014 = idf(docFreq=713, maxDocs=42306)
                0.09375 = fieldNorm(doc=463)
          0.05031424 = weight(abstract_txt:record in 463) [ClassicSimilarity], result of:
            0.05031424 = score(doc=463,freq=1.0), product of:
              0.09780557 = queryWeight, product of:
                1.4988537 = boost
                5.4872665 = idf(docFreq=475, maxDocs=42306)
                0.01189182 = queryNorm
              0.51443124 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4872665 = idf(docFreq=475, maxDocs=42306)
                0.09375 = fieldNorm(doc=463)
          0.06098406 = weight(abstract_txt:values in 463) [ClassicSimilarity], result of:
            0.06098406 = score(doc=463,freq=1.0), product of:
              0.11118526 = queryWeight, product of:
                1.5980893 = boost
                5.850566 = idf(docFreq=330, maxDocs=42306)
                0.01189182 = queryNorm
              0.5484905 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.850566 = idf(docFreq=330, maxDocs=42306)
                0.09375 = fieldNorm(doc=463)
          0.37483492 = weight(abstract_txt:metadata in 463) [ClassicSimilarity], result of:
            0.37483492 = score(doc=463,freq=3.0), product of:
              0.47004524 = queryWeight, product of:
                8.048647 = boost
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.01189182 = queryNorm
              0.79744434 = fieldWeight in 463, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.09375 = fieldNorm(doc=463)
        0.2 = coord(5/25)
    
  4. Arazy, O.; Kopak, R.: On the measurability of information quality (2011) 0.11
    0.114355884 = sum of:
      0.114355884 = product of:
        0.7147243 = sum of:
          0.036132373 = weight(abstract_txt:quality in 1136) [ClassicSimilarity], result of:
            0.036132373 = score(doc=1136,freq=3.0), product of:
              0.071261406 = queryWeight, product of:
                1.279395 = boost
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.01189182 = queryNorm
              0.50703984 = fieldWeight in 1136, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.0625 = fieldNorm(doc=1136)
          0.028350161 = weight(abstract_txt:multiple in 1136) [ClassicSimilarity], result of:
            0.028350161 = score(doc=1136,freq=1.0), product of:
              0.087431416 = queryWeight, product of:
                1.4171349 = boost
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.01189182 = queryNorm
              0.324256 = fieldWeight in 1136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.0625 = fieldNorm(doc=1136)
          0.060782507 = weight(abstract_txt:measuring in 1136) [ClassicSimilarity], result of:
            0.060782507 = score(doc=1136,freq=1.0), product of:
              0.14537272 = queryWeight, product of:
                1.827338 = boost
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.01189182 = queryNorm
              0.41811496 = fieldWeight in 1136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.0625 = fieldNorm(doc=1136)
          0.5894593 = weight(abstract_txt:completeness in 1136) [ClassicSimilarity], result of:
            0.5894593 = score(doc=1136,freq=3.0), product of:
              0.6959563 = queryWeight, product of:
                7.4800143 = boost
                7.824043 = idf(docFreq=45, maxDocs=42306)
                0.01189182 = queryNorm
              0.8469775 = fieldWeight in 1136, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.824043 = idf(docFreq=45, maxDocs=42306)
                0.0625 = fieldNorm(doc=1136)
        0.16 = coord(4/25)
    
  5. Dimitroff, A.: Mental models theory and search outcome in a bibliographic retrieval system (1992) 0.10
    0.1041855 = sum of:
      0.1041855 = product of:
        0.6511594 = sum of:
          0.065746404 = weight(abstract_txt:incomplete in 3315) [ClassicSimilarity], result of:
            0.065746404 = score(doc=3315,freq=1.0), product of:
              0.09278426 = queryWeight, product of:
                1.032285 = boost
                7.5583396 = idf(docFreq=59, maxDocs=42306)
                0.01189182 = queryNorm
              0.7085943 = fieldWeight in 3315, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5583396 = idf(docFreq=59, maxDocs=42306)
                0.09375 = fieldNorm(doc=3315)
          0.02461201 = weight(abstract_txt:system in 3315) [ClassicSimilarity], result of:
            0.02461201 = score(doc=3315,freq=2.0), product of:
              0.05516811 = queryWeight, product of:
                1.3786916 = boost
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.01189182 = queryNorm
              0.44612747 = fieldWeight in 3315, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3649042 = idf(docFreq=3974, maxDocs=42306)
                0.09375 = fieldNorm(doc=3315)
          0.05031424 = weight(abstract_txt:record in 3315) [ClassicSimilarity], result of:
            0.05031424 = score(doc=3315,freq=1.0), product of:
              0.09780557 = queryWeight, product of:
                1.4988537 = boost
                5.4872665 = idf(docFreq=475, maxDocs=42306)
                0.01189182 = queryNorm
              0.51443124 = fieldWeight in 3315, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4872665 = idf(docFreq=475, maxDocs=42306)
                0.09375 = fieldNorm(doc=3315)
          0.5104867 = weight(abstract_txt:completeness in 3315) [ClassicSimilarity], result of:
            0.5104867 = score(doc=3315,freq=1.0), product of:
              0.6959563 = queryWeight, product of:
                7.4800143 = boost
                7.824043 = idf(docFreq=45, maxDocs=42306)
                0.01189182 = queryNorm
              0.733504 = fieldWeight in 3315, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.824043 = idf(docFreq=45, maxDocs=42306)
                0.09375 = fieldNorm(doc=3315)
        0.16 = coord(4/25)
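
The score trees above follow the Lucene ClassicSimilarity (TF-IDF) pattern: each term score is queryWeight times fieldWeight, the term scores are summed, and the sum is scaled by the coord factor. As a rough check, the sketch below recomputes the score of the first entry (doc 802) from the values printed in its tree. The function names are illustrative and not part of Lucene; the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are the standard ClassicSimilarity definitions and agree with the printed numbers.

```python
import math

def idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    tf = math.sqrt(freq)                         # term frequency factor
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

# Values copied from the explanation tree for doc 802.
MAX_DOCS = 42306
QUERY_NORM = 0.01189182
FIELD_NORM = 0.078125
terms = [
    # (term, freq, docFreq, boost)
    ("quality", 7.0, 1062, 1.279395),
    ("measuring", 1.0, 142, 1.827338),
    ("completeness", 1.0, 45, 7.4800143),
    ("metadata", 8.0, 846, 8.048647),
]

term_sum = sum(
    term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for _, freq, doc_freq, boost in terms
)
coord = 4 / 25                 # 4 of the 25 query terms matched
doc_score = term_sum * coord
print(doc_score)
```

The result agrees with the listed 0.17287372 up to rounding in the printed single-precision values. The same calculation applies to the other entries once their own freq, fieldNorm, and coord values are substituted.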