Search (10 results, page 1 of 1)

  • × author_ss:"Lagoze, C."
  1. Payette, S.; Blanchi, C.; Lagoze, C.; Overly, E.A.: Interoperability for digital objects and repositories : the Cornell/CNRI experiments (1999) 0.10
    0.095209256 = product of:
      0.14281388 = sum of:
        0.12714498 = weight(_text_:digital in 1248) [ClassicSimilarity], result of:
          0.12714498 = score(doc=1248,freq=26.0), product of:
            0.20228553 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.051282257 = queryNorm
            0.6285421 = fieldWeight in 1248, product of:
              5.0990195 = tf(freq=26.0), with freq of:
                26.0 = termFreq=26.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.03125 = fieldNorm(doc=1248)
        0.015668897 = product of:
          0.031337794 = sum of:
            0.031337794 = weight(_text_:library in 1248) [ClassicSimilarity], result of:
              0.031337794 = score(doc=1248,freq=8.0), product of:
                0.13484047 = queryWeight, product of:
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.051282257 = queryNorm
                0.23240642 = fieldWeight in 1248, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1248)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    For several years the Digital Library Research Group at Cornell University and the Corporation for National Research Initiatives (CNRI) have been engaged in research focused on the design and development of infrastructures for open architecture, confederated digital libraries. The goal of this effort is to achieve interoperability and extensibility of digital library systems through the definition of key digital library services and their open interfaces, allowing flexible interaction of existing services and augmentation of the infrastructure with new services. Some aspects of this research have included the development and deployment of the Dienst software, the Handle System®, and the architecture of digital objects and repositories. In this paper, we describe the joint effort by Cornell and CNRI to prototype a rich and deployable architecture for interoperable digital objects and repositories. This effort has challenged us to move theories of interoperability closer to practice. The Cornell/CNRI collaboration builds on two existing projects focusing on the development of interoperable digital libraries. Details relating to the technology of these projects are described elsewhere. Both projects were strongly influenced by the fundamental abstractions of repositories and digital objects as articulated by Kahn and Wilensky in A Framework for Distributed Digital Object Services. Furthermore, both programs were influenced by the container architecture described in the Warwick Framework, and by the notions of distributed dynamic objects presented by Lagoze and Daniel in their Distributed Active Relationship work. With these common roots, one would expect that the CNRI and Cornell repositories would be at least theoretically interoperable. However, the actual test would be the extent to which our independently developed repositories were practically interoperable. This paper focuses on the definition of interoperability in the joint Cornell/CNRI work and the set of experiments conducted to formally test it. Our motivation for this work is the eventual deployment of formally tested reference implementations of the repository architecture for experimentation and development by fellow digital library researchers. In Section 2, we summarize the digital object and repository approach that was the focus of our interoperability experiments. In Section 3, we describe the set of experiments that progressively tested interoperability at increasing levels of functionality. In Section 4, we discuss general conclusions, and in Section 5, we give a preview of our future work, including our plans to evolve our experimentation to the point of defining a set of formal metrics for measuring interoperability for repositories and digital objects. This is still a work in progress that is expected to undergo additional refinements during its development.
  2. Daniel Jr., R.; Lagoze, C.: Extending the Warwick framework : from metadata containers to active digital objects (1997) 0.06
    0.0583028 = product of:
      0.0874542 = sum of:
        0.075580746 = weight(_text_:digital in 1264) [ClassicSimilarity], result of:
          0.075580746 = score(doc=1264,freq=12.0), product of:
            0.20228553 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.051282257 = queryNorm
            0.37363398 = fieldWeight in 1264, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1264)
        0.011873455 = product of:
          0.02374691 = sum of:
            0.02374691 = weight(_text_:library in 1264) [ClassicSimilarity], result of:
              0.02374691 = score(doc=1264,freq=6.0), product of:
                0.13484047 = queryWeight, product of:
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.051282257 = queryNorm
                0.17611115 = fieldWeight in 1264, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=1264)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    Defining metadata as "data about data" provokes more questions than it answers. What are the forms of the data and metadata? Can we be more specific about the manner in which the metadata is "about" the data? Are data and metadata distinguished only in the context of their relationship? Is the nature of the relationship between the datasets declarative or procedural? Can the metadata itself be described by other data? Over the past several years, we have been engaged in a number of efforts examining the role, format, composition, and architecture of metadata for networked resources. During this time, we have noticed the tendency to be led astray by comfortable, but somewhat inappropriate, models in the non-digital information environment. Rather than pursuing familiar models, there is the need for a new model that fully exploits the unique combination of computation and connectivity that characterizes the digital library. In this paper, we describe an extension of the Warwick Framework that we call Distributed Active Relationships (DARs). DARs provide a powerful model for representing data and metadata in digital library objects. They explicitly express the relationships between networked resources, and even allow those relationships to be dynamically downloadable and executable. The DAR model is based on the following principles, which our examination of the "data about data" definition has led us to regard as axiomatic: * There is no essential distinction between data and metadata. We can only make such a distinction in terms of a particular "about" relationship. As a result, what is metadata in the context of one "about" relationship may be data in another. * There is no single "about" relationship. There are many different and important relationships between data resources. * Resources can be related without regard for their location. The connectivity in networked information architectures makes it possible to have data in one repository describe data in another repository. * The computational power of the networked information environment makes it possible to consider active or dynamic relationships between data sets. This adds considerable power to the "data about data" definition. First, data about another data set may not physically exist, but may be automatically derived. Second, the "about" relationship may be an executable object -- in a sense interpretable metadata. As will be shown, this provides useful mechanisms for handling complex metadata problems such as rights management of digital objects. The remainder of this paper describes the development and consequences of the DAR model. Section 2 reviews the Warwick Framework, which is the basis for the model described in this paper. Section 3 examines the concept of the Warwick Framework Catalog, which provides a mechanism for expressing the relationships between the packages in a Warwick Framework container. With that background established, section 4 generalizes the Warwick Framework by removing the restriction that it only contains "metadata". This allows us to consider digital library objects that are aggregations of (possibly distributed) data sets, with the relationships between the data sets expressed using a Warwick Framework Catalog. Section 5 further extends the model by describing Distributed Active Relationships (DARs). DARs are the explicit relationships that have the potential to be executable, as alluded to earlier. Finally, section 6 describes two possible implementations of these concepts.
  3. Faaborg, A.; Lagoze, C.: Semantic browsing (2003) 0.06
    0.057353042 = product of:
      0.08602956 = sum of:
        0.061711423 = weight(_text_:digital in 1026) [ClassicSimilarity], result of:
          0.061711423 = score(doc=1026,freq=2.0), product of:
            0.20228553 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.051282257 = queryNorm
            0.30507088 = fieldWeight in 1026, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1026)
        0.024318133 = product of:
          0.048636265 = sum of:
            0.048636265 = weight(_text_:22 in 1026) [ClassicSimilarity], result of:
              0.048636265 = score(doc=1026,freq=2.0), product of:
                0.17958173 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.051282257 = queryNorm
                0.2708308 = fieldWeight in 1026, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1026)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Source
    Research and advanced technology for digital libraries : 7th European Conference, proceedings / ECDL 2003, Trondheim, Norway, August 17-22, 2003
  4. Davis, J.R.; Lagoze, C.: NCSTRL : design and deployment of a globally distributed digital library (2000) 0.05
    0.050281137 = product of:
      0.075421706 = sum of:
        0.061711423 = weight(_text_:digital in 4435) [ClassicSimilarity], result of:
          0.061711423 = score(doc=4435,freq=2.0), product of:
            0.20228553 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.051282257 = queryNorm
            0.30507088 = fieldWeight in 4435, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4435)
        0.013710285 = product of:
          0.02742057 = sum of:
            0.02742057 = weight(_text_:library in 4435) [ClassicSimilarity], result of:
              0.02742057 = score(doc=4435,freq=2.0), product of:
                0.13484047 = queryWeight, product of:
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.051282257 = queryNorm
                0.20335563 = fieldWeight in 4435, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=4435)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
  5. Hitchcock, S.; Bergmark, D.; Brody, T.; Gutteridge, C.; Carr, L.; Hall, W.; Lagoze, C.; Harnad, S.: Open citation linking : the way forward (2002) 0.05
    0.04808734 = product of:
      0.07213101 = sum of:
        0.06233795 = weight(_text_:digital in 1207) [ClassicSimilarity], result of:
          0.06233795 = score(doc=1207,freq=4.0), product of:
            0.20228553 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.051282257 = queryNorm
            0.3081681 = fieldWeight in 1207, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1207)
        0.00979306 = product of:
          0.01958612 = sum of:
            0.01958612 = weight(_text_:library in 1207) [ClassicSimilarity], result of:
              0.01958612 = score(doc=1207,freq=2.0), product of:
                0.13484047 = queryWeight, product of:
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.051282257 = queryNorm
                0.14525402 = fieldWeight in 1207, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1207)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    The speed of scientific communication - the rate of ideas affecting other researchers' ideas - is increasing dramatically. The factor driving this is free, unrestricted access to research papers. Measurements of user activity in mature eprint archives of research papers such as arXiv have shown, for the first time, the degree to which such services support an evolving network of texts commenting on, citing, classifying, abstracting, listing and revising other texts. The Open Citation project has built tools to measure this activity, to build new archives, and has been closely involved with the development of the infrastructure to support open access on which these new services depend. This is the story of the project, intertwined with the concurrent emergence of the Open Archives Initiative (OAI). The paper describes the broad scope of the project's work, showing how it has progressed from early demonstrators of reference linking to produce Citebase, a Web-based citation and impact-ranked search service, and how it has supported the development of the EPrints.org software for building OAI-compliant archives. The work has been underpinned by analysis and experiments on the semantics of documents (digital objects) to determine the features required for formally perfect linking - instantiated as an application programming interface (API) for reference linking - that will enable other applications to build on this work in broader digital library information environments.
  6. Lagoze, C.; Van de Sompel, H.: ¬The making of the Open Archives Initiative Protocol for Metadata Harvesting (2003) 0.05
    0.04634326 = product of:
      0.069514886 = sum of:
        0.05289551 = weight(_text_:digital in 4771) [ClassicSimilarity], result of:
          0.05289551 = score(doc=4771,freq=2.0), product of:
            0.20228553 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.051282257 = queryNorm
            0.26148933 = fieldWeight in 4771, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.046875 = fieldNorm(doc=4771)
        0.016619375 = product of:
          0.03323875 = sum of:
            0.03323875 = weight(_text_:library in 4771) [ClassicSimilarity], result of:
              0.03323875 = score(doc=4771,freq=4.0), product of:
                0.13484047 = queryWeight, product of:
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.051282257 = queryNorm
                0.24650425 = fieldWeight in 4771, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4771)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    The authors, who jointly serve as the Open Archives Initiative (OAI) executive, reflect an the three-year history of the OAI. Three years of technical work recently culminated in the release of a stabie production version 2 of the OAI Protocol for Metadata Harvesting (OAI-PMH). This technical product, the work that led up to it, and the process that made it possible have attracted some favor from the digital library and information community. The paper explores a number of factors in the history of the OAI that the authors believe have contributed to this positive response. The factors include focus an a defined problem Statement, an operational model in which strong leadership is balanced with solicited participation, a healthy dose of community building and Support, and sensible technical decisions.
    Source
    Library hi tech. 21(2003) no.2, S.118-128
  7. Arms, W.Y.; Dushay, N.; Fulker, D.; Lagoze, C.: ¬A case study in metadata harvesting : the NSDL (2003) 0.05
    0.04634326 = product of:
      0.069514886 = sum of:
        0.05289551 = weight(_text_:digital in 4780) [ClassicSimilarity], result of:
          0.05289551 = score(doc=4780,freq=2.0), product of:
            0.20228553 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.051282257 = queryNorm
            0.26148933 = fieldWeight in 4780, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.046875 = fieldNorm(doc=4780)
        0.016619375 = product of:
          0.03323875 = sum of:
            0.03323875 = weight(_text_:library in 4780) [ClassicSimilarity], result of:
              0.03323875 = score(doc=4780,freq=4.0), product of:
                0.13484047 = queryWeight, product of:
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.051282257 = queryNorm
                0.24650425 = fieldWeight in 4780, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4780)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    This paper describes the use of the Open Archives Initiative Protocol for Metadata Harvesting in the NSF's National Science Digital Library (NSDL). The protocol is used both as a method to ingest metadata into a central Metadata Repository and also as the means by which the repository exports metadata to service providers. The NSDL Search Service is used to illustrate this architecture. An early version of the Metadata Repository was an alpha test site for version 1 of the protocol and the production repository was a beta test site for version 2. This paper describes the implementation experience and early practical tests. Despite some teething troubles and the long-term difficulties of semantic compatibility, the overall conclusion is optimism that the Open Archive Initiative will be a successful part of the NSDL.
    Source
    Library hi tech. 21(2003) no.2, S.228-237
  8. Van de Sompel, H.; Nelson, M.L.; Lagoze, C.; Warner, S.: Resource harvesting within the OAI-PMH framework (2004) 0.04
    0.03562911 = product of:
      0.106887326 = sum of:
        0.106887326 = weight(_text_:digital in 4110) [ClassicSimilarity], result of:
          0.106887326 = score(doc=4110,freq=6.0), product of:
            0.20228553 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.051282257 = queryNorm
            0.5283983 = fieldWeight in 4110, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4110)
      0.33333334 = coord(1/3)
    
    Abstract
    Motivated by preservation and resource discovery, we examine how digital resources, and not just metadata about resources, can be harvested using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). We review and critique existing techniques for identifying and gathering digital resources using metadata harvested through the OAI-PMH. We introduce an alternative solution that builds on the introduction of complex object formats that provide a more accurate way to describe digital resources. We argue that the use of complex object formats as OAI-PMH metadata formats results in a reliable and attractive approach for incremental harvesting of resources using the OAI-PMH.
  9. Lagoze, C.; Hunter, J.: ¬The ABC Ontology and Model (2002) 0.04
    0.035263672 = product of:
      0.10579102 = sum of:
        0.10579102 = weight(_text_:digital in 1282) [ClassicSimilarity], result of:
          0.10579102 = score(doc=1282,freq=2.0), product of:
            0.20228553 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.051282257 = queryNorm
            0.52297866 = fieldWeight in 1282, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.09375 = fieldNorm(doc=1282)
      0.33333334 = coord(1/3)
    
    Source
    Journal of digital information. 2(2002) no.2,
  10. Lagoze, C.: Keeping Dublin Core simple : Cross-domain discovery or resource description? (2001) 0.03
    0.02539581 = product of:
      0.038093716 = sum of:
        0.031168975 = weight(_text_:digital in 1216) [ClassicSimilarity], result of:
          0.031168975 = score(doc=1216,freq=4.0), product of:
            0.20228553 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.051282257 = queryNorm
            0.15408406 = fieldWeight in 1216, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.01953125 = fieldNorm(doc=1216)
        0.006924739 = product of:
          0.013849478 = sum of:
            0.013849478 = weight(_text_:library in 1216) [ClassicSimilarity], result of:
              0.013849478 = score(doc=1216,freq=4.0), product of:
                0.13484047 = queryWeight, product of:
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.051282257 = queryNorm
                0.1027101 = fieldWeight in 1216, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.6293786 = idf(docFreq=8668, maxDocs=44218)
                  0.01953125 = fieldNorm(doc=1216)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    Reality is messy. Individuals perceive or define objects differently. Objects may change over time, morphing into new versions of their former selves or into things altogether different. A book can give rise to a translation, derivation, or edition, and these resulting objects are related in complex ways to each other and to the people and contexts in which they were created or transformed. Providing a normalized view of such a messy reality is a precondition for managing information. From the first library catalogs, through Melvil Dewey's Decimal Classification system in the nineteenth century, to today's MARC encoding of AACR2 cataloging rules, libraries have epitomized the process of what David Levy calls "order making", whereby catalogers impose a veneer of regularity on the natural disorder of the artifacts they encounter. The pre-digital library within which the Catalog and its standards evolved was relatively self-contained and controlled. Creating and maintaining catalog records was, and still is, the task of professionals. Today's Web, in contrast, has brought together a diversity of information management communities, with a variety of order-making standards, into what Stuart Weibel has called the Internet Commons. The sheer scale of this context has motivated a search for new ways to describe and index information. Second-generation search engines such as Google can yield astonishingly good search results, while tools such as ResearchIndex for automatic citation indexing and techniques for inferring "Web communities" from constellations of hyperlinks promise even better methods for focusing queries on information from authoritative sources. Such "automated digital libraries," according to Bill Arms, promise to radically reduce the cost of managing information. Alongside the development of such automated methods, there is increasing interest in metadata as a means of imposing pre-defined order on Web content. While the size and changeability of the Web makes professional cataloging impractical, a minimal amount of information ordering, such as that represented by the Dublin Core (DC), may vastly improve the quality of an automatic index at low cost; indeed, recent work suggests that some types of simple description may be generated with little or no human intervention.