Search (18 results, page 1 of 1)

  • × language_ss:"e"
  • × author_ss:"Koch, T."
  1. Koch, T.; Vizine-Goetz, D.: DDC and knowledge organization in the digital library : Research and development. Demonstration pages (1999) 0.01
    0.014525372 = product of:
      0.043576114 = sum of:
        0.015144923 = weight(_text_:in in 942) [ClassicSimilarity], result of:
          0.015144923 = score(doc=942,freq=16.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.25504774 = fieldWeight in 942, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.046875 = fieldNorm(doc=942)
        0.02843119 = weight(_text_:und in 942) [ClassicSimilarity], result of:
          0.02843119 = score(doc=942,freq=8.0), product of:
            0.09675359 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.043654136 = queryNorm
            0.29385152 = fieldWeight in 942, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.046875 = fieldNorm(doc=942)
      0.33333334 = coord(2/6)
    
    Abstract
    Der Workshop gibt einen Einblick in die aktuelle Forschung und Entwicklung zur Wissensorganisation in digitalen Bibliotheken. Diane Vizine-Goetz vom OCLC Office of Research in Dublin, Ohio, stellt die Forschungsprojekte von OCLC zur Anpassung und Weiterentwicklung der Dewey Decimal Classification als Wissensorganisationsinstrument fuer grosse digitale Dokumentensammlungen vor. Traugott Koch, NetLab, Universität Lund in Schweden, demonstriert die Ansätze und Lösungen des EU-Projekts DESIRE zum Einsatz von intellektueller und vor allem automatischer Klassifikation in Fachinformationsdiensten im Internet.
    Content
    1. Increased Importance of Knowledge Organization in Internet Services - 2. Quality Subject Service and the role of classification - 3. Developing the DDC into a knowledge organization instrument for the digital library. OCLC site - 4. DESIRE's Barefoot Solutions of Automatic Classification - 5. Advanced Classification Solutions in DESIRE and CORC - 6. Future directions of research and development - 7. General references
  2. Koch, T.: Searching the Web : systematic overview over indexes (1995) 0.01
    0.010872297 = product of:
      0.03261689 = sum of:
        0.008924231 = weight(_text_:in in 3205) [ClassicSimilarity], result of:
          0.008924231 = score(doc=3205,freq=2.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.15028831 = fieldWeight in 3205, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.078125 = fieldNorm(doc=3205)
        0.02369266 = weight(_text_:und in 3205) [ClassicSimilarity], result of:
          0.02369266 = score(doc=3205,freq=2.0), product of:
            0.09675359 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.043654136 = queryNorm
            0.24487628 = fieldWeight in 3205, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.078125 = fieldNorm(doc=3205)
      0.33333334 = coord(2/6)
    
    Source
    Wissen in elektronischen Netzwerken: Strukturierung, Erschließung und Retrieval von Informationsressourcen im Internet. Eine Auswahl von Vorträgen der 19. Jahrestagung der Gesellschaft für Klassifikation, Basel 1995. Hrsg.: H.-C. Hobohm u. H.-J. Wätjen
  3. Koch, T.; Ardö, A.; Falcoz, F.; Nielsen, M.; Dandfær, M.: Improving resource discovery and retrieval on the Internet : the Nordic WAIS/World Wide Web Project and the classification of WAIS databases (1995) 0.01
    0.010872297 = product of:
      0.03261689 = sum of:
        0.008924231 = weight(_text_:in in 3370) [ClassicSimilarity], result of:
          0.008924231 = score(doc=3370,freq=2.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.15028831 = fieldWeight in 3370, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.078125 = fieldNorm(doc=3370)
        0.02369266 = weight(_text_:und in 3370) [ClassicSimilarity], result of:
          0.02369266 = score(doc=3370,freq=2.0), product of:
            0.09675359 = queryWeight, product of:
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.043654136 = queryNorm
            0.24487628 = fieldWeight in 3370, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.216367 = idf(docFreq=13101, maxDocs=44218)
              0.078125 = fieldNorm(doc=3370)
      0.33333334 = coord(2/6)
    
    Source
    Wissen in elektronischen Netzwerken: Strukturierung, Erschließung und Retrieval von Informationsressourcen im Internet. Eine Auswahl von Vorträgen der 19. Jahrestagung der Gesellschaft für Klassifikation, Basel 1995. Hrsg.: H.-C. Hobohm u. H.-J. Wätjen
  4. Koch, T.: Quality-controlled subject gateways : definitions, typologies, empirical overview (2000) 0.01
    0.008982609 = product of:
      0.026947826 = sum of:
        0.006246961 = weight(_text_:in in 631) [ClassicSimilarity], result of:
          0.006246961 = score(doc=631,freq=2.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.10520181 = fieldWeight in 631, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.0546875 = fieldNorm(doc=631)
        0.020700864 = product of:
          0.04140173 = sum of:
            0.04140173 = weight(_text_:22 in 631) [ClassicSimilarity], result of:
              0.04140173 = score(doc=631,freq=2.0), product of:
                0.15286934 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043654136 = queryNorm
                0.2708308 = fieldWeight in 631, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=631)
          0.5 = coord(1/2)
      0.33333334 = coord(2/6)
    
    Content
    Mit einem Anhang, der in fachlicher Ordnung vorhandene Subject gateways auflistet; vgl. unter: http://www.lub.lu.se/tk/SBIGs.html
    Date
    22. 6.2002 19:37:55
  5. Koch, T.; Vizine-Goetz, D.: Automatic classification and content navigation support for Web services : DESIRE II cooperates with OCLC (1998) 0.00
    0.0023281053 = product of:
      0.013968632 = sum of:
        0.013968632 = weight(_text_:in in 1568) [ClassicSimilarity], result of:
          0.013968632 = score(doc=1568,freq=10.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.23523843 = fieldWeight in 1568, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1568)
      0.16666667 = coord(1/6)
    
    Abstract
    Emerging standards in knowledge representation and organization are preparing the way for distributed vocabulary support in Internet search services. NetLab researchers are exploring several innovative solutions for searching and browsing in the subject-based Internet gateway, Electronic Engineering Library, Sweden (EELS). The implementation of the EELS service is described, specifically, the generation of the robot-gathered database 'All' engineering and the automated application of the Ei thesaurus and classification scheme. NetLab and OCLC researchers are collaborating to investigate advanced solutions to automated classification in the DESIRE II context. A plan for furthering the development of distributed vocabulary support in Internet search services is offered.
  6. Ardö, A.; Koch, T.: Wide-area information server (WAIS) as the hub of an electronic library service at Lund University (1993) 0.00
    0.0017848461 = product of:
      0.010709076 = sum of:
        0.010709076 = weight(_text_:in in 8459) [ClassicSimilarity], result of:
          0.010709076 = score(doc=8459,freq=8.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.18034597 = fieldWeight in 8459, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.046875 = fieldNorm(doc=8459)
      0.16666667 = coord(1/6)
    
    Abstract
    Electronic information sources are being collected at the Lund University Library within the areas of computer science, Internet use and environmental studies. Within each area there are several different types of sources e.g. the environment area has bibliographic information, journal content pages, a local directory database on environmental related research projects and an archive of articles from relevant electronic conferences. A seed-bank database is planned in collaboration with the Nordic Gene Bank. The popularity of the wide area information server is growing and there are several hundred available sources today, however, improvements need to be done with the possibilities to select sources and in the search and relevance-ranking algorithms
    Source
    Opportunity 2000: understanding and serving users in an electronic library; 15th Int. Symp., 12.-15.10.1992; Festschrift in honour of Herbert S. White. Ed.: A.H. Helal
  7. Koch, T.: Suchmaschinen im Internet (1996) 0.00
    0.0017848461 = product of:
      0.010709076 = sum of:
        0.010709076 = weight(_text_:in in 5281) [ClassicSimilarity], result of:
          0.010709076 = score(doc=5281,freq=2.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.18034597 = fieldWeight in 5281, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.09375 = fieldNorm(doc=5281)
      0.16666667 = coord(1/6)
    
    Footnote
    Vortrag, gehalten auf der INETBIB-Tagung in Dortmund am 11.3.1996
  8. Hakala, J.; Husby, O.; Koch, T.: Warwick framework and Dublin core set provide a comprehensive infrastructure for network resource description (1996) 0.00
    0.0017848461 = product of:
      0.010709076 = sum of:
        0.010709076 = weight(_text_:in in 6921) [ClassicSimilarity], result of:
          0.010709076 = score(doc=6921,freq=8.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.18034597 = fieldWeight in 6921, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.046875 = fieldNorm(doc=6921)
      0.16666667 = coord(1/6)
    
    Abstract
    Defines metadata, which for librarians means catalogue records for printed publications, and the need for them in document retrieval. The existence of many different sources of metadata had led to a need for simpler metadata schemes and a stadardised exchange format. The 1st Metadata Workshop, held in Dublin, Ohio, reached consensus on a simple resource description record, known as the Dublin core set, with 13 elements, which can serve to unify various models. Lists institutions intending to use it. The 2nd workshop resulted in a proposal for a container record architecture comprising more and different types of metadata than a Dublin core record. The Nordic Metadata project, outlined here, aims to improve the knowledge transfer to the Nordic countries and allow them to paricipate actively in international developments
  9. Koch, T.: ¬The message is the medium : online all the time for everyone (1996) 0.00
    0.0017848461 = product of:
      0.010709076 = sum of:
        0.010709076 = weight(_text_:in in 1725) [ClassicSimilarity], result of:
          0.010709076 = score(doc=1725,freq=2.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.18034597 = fieldWeight in 1725, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.09375 = fieldNorm(doc=1725)
      0.16666667 = coord(1/6)
    
    Footnote
    Rez. in: Electronic library 16(1998) no.1, S.61 (D.J. Parkes); InfoManage 4(1997) no.10, S.8 (G. St.Clair)
  10. Ardö, A.; Koch, T.: Automatic classification applied to full-text Internet documents in a robot-generated subject index (1999) 0.00
    0.0017848461 = product of:
      0.010709076 = sum of:
        0.010709076 = weight(_text_:in in 382) [ClassicSimilarity], result of:
          0.010709076 = score(doc=382,freq=2.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.18034597 = fieldWeight in 382, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.09375 = fieldNorm(doc=382)
      0.16666667 = coord(1/6)
    
  11. Koch, T.; Ardö, A.; Brümmer, A.: ¬The building and maintenance of robot based internet search services : A review of current indexing and data collection methods. Prepared to meet the requirements of Work Package 3 of EU Telematics for Research, project DESIRE. Version D3.11v0.3 (Draft version 3) (1996) 0.00
    0.0017848461 = product of:
      0.010709076 = sum of:
        0.010709076 = weight(_text_:in in 1669) [ClassicSimilarity], result of:
          0.010709076 = score(doc=1669,freq=18.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.18034597 = fieldWeight in 1669, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.03125 = fieldNorm(doc=1669)
      0.16666667 = coord(1/6)
    
    Abstract
    After a short outline of problems, possibilities and difficulties of systematic information retrieval on the Internet and a description of efforts for development in this area, a specification of the terminology for this report is required. Although the process of retrieval is generally seen as an iterative process of browsing and information retrieval and several important services on the net have taken this fact into consideration, the emphasis of this report lays on the general retrieval tools for the whole of Internet. In order to be able to evaluate the differences, possibilities and restrictions of the different services it is necessary to begin with organizing the existing varieties in a typological/ taxonomical survey. The possibilities and weaknesses will be briefly compared and described for the most important services in the categories robot-based WWW-catalogues of different types, list- or form-based catalogues and simultaneous or collected search services respectively. It will however for different reasons not be possible to rank them in order of "best" services. Still more important are the weaknesses and problems common for all attempts of indexing the Internet. The problems of the quality of the input, the technical performance and the general problem of indexing virtual hypertext are shown to be at least as difficult as the different aspects of harvesting, indexing and information retrieval. Some of the attempts made in the area of further development of retrieval services will be mentioned in relation to descriptions of the contents of documents and standardization efforts. Internet harvesting and indexing technology and retrieval software is thoroughly reviewed. Details about all services and software are listed in analytical forms in Annex 1-3.
  12. Koch, T.; Golub, K.; Ardö, A.: Users browsing behaviour in a DDC-based Web service : a log analysis (2006) 0.00
    0.0015457221 = product of:
      0.009274333 = sum of:
        0.009274333 = weight(_text_:in in 2234) [ClassicSimilarity], result of:
          0.009274333 = score(doc=2234,freq=6.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.1561842 = fieldWeight in 2234, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.046875 = fieldNorm(doc=2234)
      0.16666667 = coord(1/6)
    
    Footnote
    Beitrag in einem Themenheft "Moving beyond the presentation layer: content and context in the Dewey Decimal Classification (DDC) System"
  13. Koch, T.: Information mapping : a glimpse at the future (1993) 0.00
    0.0014873719 = product of:
      0.008924231 = sum of:
        0.008924231 = weight(_text_:in in 4351) [ClassicSimilarity], result of:
          0.008924231 = score(doc=4351,freq=2.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.15028831 = fieldWeight in 4351, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.078125 = fieldNorm(doc=4351)
      0.16666667 = coord(1/6)
    
    Abstract
    Suggests information mapping as a technique that may be applied to online information retrieval to yield information rather than just data. The technique is illustrated using the example of subject areas in the field of health care
  14. Koch, T.; Neuroth, H.; Day, M.: Renardus: Cross-browsing European subject gateways via a common classification system (DDC) (2003) 0.00
    0.0014724231 = product of:
      0.008834538 = sum of:
        0.008834538 = weight(_text_:in in 3821) [ClassicSimilarity], result of:
          0.008834538 = score(doc=3821,freq=16.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.14877784 = fieldWeight in 3821, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.02734375 = fieldNorm(doc=3821)
      0.16666667 = coord(1/6)
    
    Abstract
    This paper presents the approach and first results of the classification mapping process in the EU project Renardus. The outcome in Renardus is a cross-browsing feature based an the Dewey Decimal Classification (DDC) and improved subject searching across distributed and heterogeneous European subject gateways. The paper presents the project's initial experiences and decisions, e.g. an investigation of the use of classification systems by Renardus partners' gateways, general mapping approaches and issues, the definition of mapping relationships and some information an technical solutions and the mapping tool. There is also a demonstration of the use of the mapping information in Renardus and the presentation of several features that have been implemented to aid end-user navigation in a large and deep browsing structure like the DDC. Classification mapping for crossbrowsing is a labour intensive and complex effort which at the moment raises many open questions and leaves many more future potential work tasks than completed useful solutions.
    Content
    "1. The EU projeet Renardus Renardus is a project funded by the European Commission as part of the Information Society Technologies (IST) programme, part of the European Union's 5th Framework Programme. Partners in Renardus include national libraries, research centres and subject gateway services from Denmark, Finland, Germany, The Netherlands, Sweden and the UK, co-ordinated by the National Library of the Netherlands. The project aims to develop a Web-based service to enable searching and browsing across a range of distributed European-based information services designed for the academic and research communities - and in particular those services known as subject gateways. These gateways are services that provide access to Internet resources. They tend to be selective with regard to the resources they give access to, and are usually based an the manual creation of descriptive metadata. Services typically provide users with both search and browse facilities, and offen offer hierarchical browse structures based an subject classification schemes (Koch & Day, 1997). Predecessor projects like the EU project DESIRE have already developed solutions for the description of individual resources and for automatic classification at the level of an individual subject gateway using established classification systems. Renardus intends to develop a service that can cross-search and cross-browse a number of distributed subject gateways through the use of a common metadata profile and by the mapping all locally-used classification schemes to a common scheme. A thorough review of existing data models (Becker, et al., 2000) was used as the basis for the agreement of a minimum set of Dublin Core-based metadata elements that could be utilised as a common data model. A comprehensive mapping effort from the individual gateways' metadata element sets and content encoding schemes to the common profile has taken place. This provides the infrastructure for interoperability between all participating databases and thus is the necessary prerequisite for cross-searching."
    Source
    Subject retrieval in a networked environment: Proceedings of the IFLA Satellite Meeting held in Dublin, OH, 14-16 August 2001 and sponsored by the IFLA Classification and Indexing Section, the IFLA Information Technology Section and OCLC. Ed.: I.C. McIlwaine
  15. Ardö, A.; Godby, J.; Houghton, A.; Koch, T.; Reighart, R.; Thompson, R.; Vizine-Goetz, D.: Browsing engineering resources on the Web : a general knowledge organization scheme (Dewey) vs. a special scheme (EI) (2000) 0.00
    0.0012620769 = product of:
      0.0075724614 = sum of:
        0.0075724614 = weight(_text_:in in 86) [ClassicSimilarity], result of:
          0.0075724614 = score(doc=86,freq=4.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.12752387 = fieldWeight in 86, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.046875 = fieldNorm(doc=86)
      0.16666667 = coord(1/6)
    
    Series
    Advances in knowledge organization; vol.7
    Source
    Dynamism and stability in knowledge organization: Proceedings of the 6th International ISKO-Conference, 10-13 July 2000, Toronto, Canada. Ed.: C. Beghtol et al
  16. Koch, T.; Ardö, A.: Automatic classification of full-text HTML-documents from one specific subject area : DESIRE II D3.6a, Working Paper 2 (2000) 0.00
    0.0011898974 = product of:
      0.0071393843 = sum of:
        0.0071393843 = weight(_text_:in in 1667) [ClassicSimilarity], result of:
          0.0071393843 = score(doc=1667,freq=2.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.120230645 = fieldWeight in 1667, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.0625 = fieldNorm(doc=1667)
      0.16666667 = coord(1/6)
    
    Content
    1 Introduction / 2 Method overview / 3 Ei thesaurus preprocessing / 4 Automatic classification process: 4.1 Matching -- 4.2 Weighting -- 4.3 Preparation for display / 5 Results of the classification process / 6 Evaluations / 7 Software / 8 Other applications / 9 Experiments with universal classification systems / References / Appendix A: Ei classification service: Software / Appendix B: Use of the classification software as subject filter in a WWW harvester.
  17. Koch, T.; Ardö, A.; Noodén, L.: ¬The construction of a robot-generated subject index : DESIRE II D3.6a, Working Paper 1 (1999) 0.00
    8.9242304E-4 = product of:
      0.005354538 = sum of:
        0.005354538 = weight(_text_:in in 1668) [ClassicSimilarity], result of:
          0.005354538 = score(doc=1668,freq=2.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.09017298 = fieldWeight in 1668, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.046875 = fieldNorm(doc=1668)
      0.16666667 = coord(1/6)
    
    Abstract
    This working paper describes the creation of a test database to carry out the automatic classification tasks of the DESIRE II work package D3.6a on. It is an improved version of NetLab's existing "All" Engineering database created after a comparative study of the outcome of two different approaches to collecting the documents. These two methods were selected from seven different general methodologies to build robot-generated subject indices, presented in this paper. We found a surprisingly low overlap between the Engineering link collections we used as seed pages for the robot and subsequently an even more surprisingly low overlap between the resources collected by the two different approaches. That inspite of using basically the same services to start the harvesting process from. A intellectual evaluation of the contents of both databases showed almost exactly the same percentage of relevant documents (77%), indicating that the main difference between those aproaches was the coverage of the resulting database.
  18. Weibel, S.L.; Koch, T.: ¬The Dublin Core Metatdata Initiative : mission, current activities, and future directions (2000) 0.00
    7.4368593E-4 = product of:
      0.0044621155 = sum of:
        0.0044621155 = weight(_text_:in in 1237) [ClassicSimilarity], result of:
          0.0044621155 = score(doc=1237,freq=2.0), product of:
            0.059380736 = queryWeight, product of:
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.043654136 = queryNorm
            0.07514416 = fieldWeight in 1237, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.3602545 = idf(docFreq=30841, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1237)
      0.16666667 = coord(1/6)
    
    Abstract
    Metadata is a keystone component for a broad spectrum of applications that are emerging on the Web to help stitch together content and services and make them more visible to users. The Dublin Core Metadata Initiative (DCMI) has led the development of structured metadata to support resource discovery. This international community has, over a period of 6 years and 8 workshops, brought forth: * A core standard that enhances cross-disciplinary discovery and has been translated into 25 languages to date; * A conceptual framework that supports the modular development of auxiliary metadata components; * An open consensus building process that has brought to fruition Australian, European and North American standards with promise as a global standard for resource discovery; * An open community of hundreds of practitioners and theorists who have found a common ground of principles, procedures, core semantics, and a framework to support interoperable metadata. The 8th Dublin Core Metadata Workshop capped an active year of progress that included standardization of the 15-element core foundation and approval of an initial array of Dublin Core Qualifiers. While there is important work to be done to promote stability and increased adoption of the Dublin Core, the time has come to look beyond the core elements towards a broader metadata agenda. This report describes the new mission statement of the Dublin Core Metadata Initiative (DCMI) that supports the agenda, recapitulates the important milestones of the year 2000, outlines activities of the 8th DCMI workshop in Ottawa, and summarizes the 2001 workplan.