Document (#35944)

Author
Wagger, S.
Park, R.
Bedford, D.A.D.
Title
Lessons learned in content architecture harmonization and metadata models
Source
Aslib proceedings. 62(2010) nos.4/5, S.387-405
Year
2010
Abstract
Purpose - This paper aims to review key content, architecture, and metadata model decisions and strategies in creation of a publication portal (on DVD to start), based on a 30+ year series of flagship reports from the World Bank. Design/methodology/approach - The paper describes and analyzes key considerations and aspects of the project, including content architecture, content analysis, DTD selection, retrospective conversion, vendor management, design of metadata architectures, use of automated profiling methods, user-information behavior, and search architectures supporting complex content architectures. It includes the challenges of applying an institutionally based taxonomy required to express subject-matter responsibilities and relationships within the World Bank. Findings - The team learned that the metadata behavior and architecture (inheritance, relationships, variations) are more complex than simple links between parent and child objects. The project also reinforced the importance of comprehensive and dynamic topic taxonomy for classifying content that is both historical and current. The approach to defining classes for each full report (parent) will be likely to change, given what has been learned. The team would recommend that parts be classified and the sum of the part classes be assigned to the whole report. As a result of this exploratory work, the Bank's approach to classification and indexing of report series is changing from a top-down to a bottom-up inheritance. Originality/value - The study provides insights into both general and World Bank-specific challenges in creating a publication portal and derives some best practices for content architecture, metadata architecture, and use of automated profiling methods.
Footnote
Beitrag in einem Special Issue: Content architecture: exploiting and managing diverse resources: proceedings of the first national conference of the United Kingdom chapter of the International Society for Knowedge Organization (ISKO)

Similar documents (author)

  1. Park, A.L.: ¬A comparison of a new OCLC/PRISM searches with earlier OCLC derived searches (1992) 4.65
    4.6463795 = sum of:
      4.6463795 = weight(author_txt:park in 4239) [ClassicSimilarity], result of:
        4.6463795 = fieldWeight in 4239, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4342074 = idf(docFreq=70, maxDocs=44218)
          0.625 = fieldNorm(doc=4239)
    
  2. Park, T.K.: ¬The nature of relevance in information retrieval : an empirical study (1993) 4.65
    4.6463795 = sum of:
      4.6463795 = weight(author_txt:park in 5336) [ClassicSimilarity], result of:
        4.6463795 = fieldWeight in 5336, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4342074 = idf(docFreq=70, maxDocs=44218)
          0.625 = fieldNorm(doc=5336)
    
  3. Park, T.K.: ¬The nature of relevance in information retrieval : an empirical study (1992) 4.65
    4.6463795 = sum of:
      4.6463795 = weight(author_txt:park in 5370) [ClassicSimilarity], result of:
        4.6463795 = fieldWeight in 5370, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4342074 = idf(docFreq=70, maxDocs=44218)
          0.625 = fieldNorm(doc=5370)
    
  4. Park, A.L.: Automated authority control : making the transition (1992) 4.65
    4.6463795 = sum of:
      4.6463795 = weight(author_txt:park in 5394) [ClassicSimilarity], result of:
        4.6463795 = fieldWeight in 5394, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4342074 = idf(docFreq=70, maxDocs=44218)
          0.625 = fieldNorm(doc=5394)
    
  5. Park, T.K.: Toward a theory of user-based relevance : a call for a new paradigm of inquiry (1994) 4.65
    4.6463795 = sum of:
      4.6463795 = weight(author_txt:park in 6926) [ClassicSimilarity], result of:
        4.6463795 = fieldWeight in 6926, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4342074 = idf(docFreq=70, maxDocs=44218)
          0.625 = fieldNorm(doc=6926)
    

Similar documents (content)

  1. Willis, C.; Greenberg, J.; White, H.: Analysis and synthesis of metadata goals for scientific data (2012) 0.12
    0.11939175 = sum of:
      0.11939175 = product of:
        0.4974656 = sum of:
          0.032525063 = weight(abstract_txt:relationships in 367) [ClassicSimilarity], result of:
            0.032525063 = score(doc=367,freq=2.0), product of:
              0.0874811 = queryWeight, product of:
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.018197568 = queryNorm
              0.37179533 = fieldWeight in 367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.0546875 = fieldNorm(doc=367)
          0.031383716 = weight(abstract_txt:publication in 367) [ClassicSimilarity], result of:
            0.031383716 = score(doc=367,freq=1.0), product of:
              0.10762547 = queryWeight, product of:
                1.1091759 = boost
                5.3321366 = idf(docFreq=580, maxDocs=44218)
                0.018197568 = queryNorm
              0.2916012 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3321366 = idf(docFreq=580, maxDocs=44218)
                0.0546875 = fieldNorm(doc=367)
          0.049091052 = weight(abstract_txt:report in 367) [ClassicSimilarity], result of:
            0.049091052 = score(doc=367,freq=1.0), product of:
              0.16601378 = queryWeight, product of:
                1.6871768 = boost
                5.4071717 = idf(docFreq=538, maxDocs=44218)
                0.018197568 = queryNorm
              0.2957047 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4071717 = idf(docFreq=538, maxDocs=44218)
                0.0546875 = fieldNorm(doc=367)
          0.12906069 = weight(abstract_txt:architectures in 367) [ClassicSimilarity], result of:
            0.12906069 = score(doc=367,freq=1.0), product of:
              0.31623155 = queryWeight, product of:
                2.32858 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.018197568 = queryNorm
              0.40812084 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0546875 = fieldNorm(doc=367)
          0.1805736 = weight(abstract_txt:metadata in 367) [ClassicSimilarity], result of:
            0.1805736 = score(doc=367,freq=9.0), product of:
              0.22548315 = queryWeight, product of:
                2.5384579 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.018197568 = queryNorm
              0.80082965 = fieldWeight in 367, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.0546875 = fieldNorm(doc=367)
          0.0748315 = weight(abstract_txt:content in 367) [ClassicSimilarity], result of:
            0.0748315 = score(doc=367,freq=2.0), product of:
              0.23148052 = queryWeight, product of:
                3.0432255 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.018197568 = queryNorm
              0.32327342 = fieldWeight in 367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.0546875 = fieldNorm(doc=367)
        0.24 = coord(6/25)
    
  2. Braun, S.: Manifold: a custom analytics platform to visualize research impact (2015) 0.12
    0.11919393 = sum of:
      0.11919393 = product of:
        0.49664137 = sum of:
          0.040532663 = weight(abstract_txt:challenges in 2906) [ClassicSimilarity], result of:
            0.040532663 = score(doc=2906,freq=1.0), product of:
              0.10062694 = queryWeight, product of:
                1.0725068 = boost
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.018197568 = queryNorm
              0.40280133 = fieldWeight in 2906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.078125 = fieldNorm(doc=2906)
          0.044833884 = weight(abstract_txt:publication in 2906) [ClassicSimilarity], result of:
            0.044833884 = score(doc=2906,freq=1.0), product of:
              0.10762547 = queryWeight, product of:
                1.1091759 = boost
                5.3321366 = idf(docFreq=580, maxDocs=44218)
                0.018197568 = queryNorm
              0.41657317 = fieldWeight in 2906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3321366 = idf(docFreq=580, maxDocs=44218)
                0.078125 = fieldNorm(doc=2906)
          0.052028127 = weight(abstract_txt:automated in 2906) [ClassicSimilarity], result of:
            0.052028127 = score(doc=2906,freq=1.0), product of:
              0.11885103 = queryWeight, product of:
                1.1655861 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.018197568 = queryNorm
              0.43775916 = fieldWeight in 2906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.078125 = fieldNorm(doc=2906)
          0.07013008 = weight(abstract_txt:report in 2906) [ClassicSimilarity], result of:
            0.07013008 = score(doc=2906,freq=1.0), product of:
              0.16601378 = queryWeight, product of:
                1.6871768 = boost
                5.4071717 = idf(docFreq=538, maxDocs=44218)
                0.018197568 = queryNorm
              0.42243528 = fieldWeight in 2906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4071717 = idf(docFreq=538, maxDocs=44218)
                0.078125 = fieldNorm(doc=2906)
          0.12562977 = weight(abstract_txt:learned in 2906) [ClassicSimilarity], result of:
            0.12562977 = score(doc=2906,freq=1.0), product of:
              0.2448704 = queryWeight, product of:
                2.0490694 = boost
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.018197568 = queryNorm
              0.51304597 = fieldWeight in 2906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.078125 = fieldNorm(doc=2906)
          0.16348687 = weight(abstract_txt:architecture in 2906) [ClassicSimilarity], result of:
            0.16348687 = score(doc=2906,freq=1.0), product of:
              0.3677391 = queryWeight, product of:
                3.551186 = boost
                5.690534 = idf(docFreq=405, maxDocs=44218)
                0.018197568 = queryNorm
              0.444573 = fieldWeight in 2906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.690534 = idf(docFreq=405, maxDocs=44218)
                0.078125 = fieldNorm(doc=2906)
        0.24 = coord(6/25)
    
  3. Kurth, M.; Ruddy, D.; Rupp, N.: Repurposing MARC metadata : using digital project experience to develop a metadata management design (2004) 0.11
    0.11105024 = sum of:
      0.11105024 = product of:
        0.5552512 = sum of:
          0.032855276 = weight(abstract_txt:relationships in 4748) [ClassicSimilarity], result of:
            0.032855276 = score(doc=4748,freq=1.0), product of:
              0.0874811 = queryWeight, product of:
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.018197568 = queryNorm
              0.37557 = fieldWeight in 4748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.078125 = fieldNorm(doc=4748)
          0.023305763 = weight(abstract_txt:approach in 4748) [ClassicSimilarity], result of:
            0.023305763 = score(doc=4748,freq=1.0), product of:
              0.079649575 = queryWeight, product of:
                1.1686387 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.018197568 = queryNorm
              0.29260373 = fieldWeight in 4748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=4748)
          0.12562977 = weight(abstract_txt:learned in 4748) [ClassicSimilarity], result of:
            0.12562977 = score(doc=4748,freq=1.0), product of:
              0.2448704 = queryWeight, product of:
                2.0490694 = boost
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.018197568 = queryNorm
              0.51304597 = fieldWeight in 4748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.078125 = fieldNorm(doc=4748)
          0.29786915 = weight(abstract_txt:metadata in 4748) [ClassicSimilarity], result of:
            0.29786915 = score(doc=4748,freq=12.0), product of:
              0.22548315 = queryWeight, product of:
                2.5384579 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.018197568 = queryNorm
              1.3210262 = fieldWeight in 4748, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.078125 = fieldNorm(doc=4748)
          0.07559124 = weight(abstract_txt:content in 4748) [ClassicSimilarity], result of:
            0.07559124 = score(doc=4748,freq=1.0), product of:
              0.23148052 = queryWeight, product of:
                3.0432255 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.018197568 = queryNorm
              0.3265555 = fieldWeight in 4748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.078125 = fieldNorm(doc=4748)
        0.2 = coord(5/25)
    
  4. Zimmermann, E.H.: CRIS-Cross : Current Research Information Systems at a Crossroads (2002) 0.10
    0.10462298 = sum of:
      0.10462298 = product of:
        0.4359291 = sum of:
          0.03912939 = weight(abstract_txt:complex in 3590) [ClassicSimilarity], result of:
            0.03912939 = score(doc=3590,freq=1.0), product of:
              0.0982908 = queryWeight, product of:
                1.0599841 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.018197568 = queryNorm
              0.3980982 = fieldWeight in 3590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.078125 = fieldNorm(doc=3590)
          0.040532663 = weight(abstract_txt:challenges in 3590) [ClassicSimilarity], result of:
            0.040532663 = score(doc=3590,freq=1.0), product of:
              0.10062694 = queryWeight, product of:
                1.0725068 = boost
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.018197568 = queryNorm
              0.40280133 = fieldWeight in 3590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.078125 = fieldNorm(doc=3590)
          0.07858482 = weight(abstract_txt:taxonomy in 3590) [ClassicSimilarity], result of:
            0.07858482 = score(doc=3590,freq=1.0), product of:
              0.15645997 = queryWeight, product of:
                1.3373483 = boost
                6.429029 = idf(docFreq=193, maxDocs=44218)
                0.018197568 = queryNorm
              0.5022679 = fieldWeight in 3590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.429029 = idf(docFreq=193, maxDocs=44218)
                0.078125 = fieldNorm(doc=3590)
          0.038604114 = weight(abstract_txt:world in 3590) [ClassicSimilarity], result of:
            0.038604114 = score(doc=3590,freq=1.0), product of:
              0.11150567 = queryWeight, product of:
                1.3827288 = boost
                4.4314575 = idf(docFreq=1429, maxDocs=44218)
                0.018197568 = queryNorm
              0.34620762 = fieldWeight in 3590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4314575 = idf(docFreq=1429, maxDocs=44218)
                0.078125 = fieldNorm(doc=3590)
          0.07559124 = weight(abstract_txt:content in 3590) [ClassicSimilarity], result of:
            0.07559124 = score(doc=3590,freq=1.0), product of:
              0.23148052 = queryWeight, product of:
                3.0432255 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.018197568 = queryNorm
              0.3265555 = fieldWeight in 3590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.078125 = fieldNorm(doc=3590)
          0.16348687 = weight(abstract_txt:architecture in 3590) [ClassicSimilarity], result of:
            0.16348687 = score(doc=3590,freq=1.0), product of:
              0.3677391 = queryWeight, product of:
                3.551186 = boost
                5.690534 = idf(docFreq=405, maxDocs=44218)
                0.018197568 = queryNorm
              0.444573 = fieldWeight in 3590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.690534 = idf(docFreq=405, maxDocs=44218)
                0.078125 = fieldNorm(doc=3590)
        0.24 = coord(6/25)
    
  5. Tsui, E.; Wang, W.M.; Cheung, C.F.; Lau, A.S.M.: ¬A concept-relationship acquisition and inference approach for hierarchical taxonomy construction from tags (2010) 0.10
    0.100631885 = sum of:
      0.100631885 = product of:
        0.41929954 = sum of:
          0.033443093 = weight(abstract_txt:behavior in 4220) [ClassicSimilarity], result of:
            0.033443093 = score(doc=4220,freq=1.0), product of:
              0.10272002 = queryWeight, product of:
                1.0836036 = boost
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.018197568 = queryNorm
              0.3255752 = fieldWeight in 4220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.0625 = fieldNorm(doc=4220)
          0.0416225 = weight(abstract_txt:automated in 4220) [ClassicSimilarity], result of:
            0.0416225 = score(doc=4220,freq=1.0), product of:
              0.11885103 = queryWeight, product of:
                1.1655861 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.018197568 = queryNorm
              0.35020733 = fieldWeight in 4220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0625 = fieldNorm(doc=4220)
          0.026367461 = weight(abstract_txt:approach in 4220) [ClassicSimilarity], result of:
            0.026367461 = score(doc=4220,freq=2.0), product of:
              0.079649575 = queryWeight, product of:
                1.1686387 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.018197568 = queryNorm
              0.33104333 = fieldWeight in 4220, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=4220)
          0.18860357 = weight(abstract_txt:taxonomy in 4220) [ClassicSimilarity], result of:
            0.18860357 = score(doc=4220,freq=9.0), product of:
              0.15645997 = queryWeight, product of:
                1.3373483 = boost
                6.429029 = idf(docFreq=193, maxDocs=44218)
                0.018197568 = queryNorm
              1.2054429 = fieldWeight in 4220, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.429029 = idf(docFreq=193, maxDocs=44218)
                0.0625 = fieldNorm(doc=4220)
          0.06878994 = weight(abstract_txt:metadata in 4220) [ClassicSimilarity], result of:
            0.06878994 = score(doc=4220,freq=1.0), product of:
              0.22548315 = queryWeight, product of:
                2.5384579 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.018197568 = queryNorm
              0.30507794 = fieldWeight in 4220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.0625 = fieldNorm(doc=4220)
          0.060472988 = weight(abstract_txt:content in 4220) [ClassicSimilarity], result of:
            0.060472988 = score(doc=4220,freq=1.0), product of:
              0.23148052 = queryWeight, product of:
                3.0432255 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.018197568 = queryNorm
              0.2612444 = fieldWeight in 4220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.0625 = fieldNorm(doc=4220)
        0.24 = coord(6/25)