Search (7 results, page 1 of 1)

Automatic classification research at OCLC (2002) 0.06

0.05542263 = product of:
  0.08313394 = sum of:
    0.059332274 = weight(_text_:electronic in 1563) [ClassicSimilarity], result of:
      0.059332274 = score(doc=1563,freq=2.0), product of:
        0.19623034 = queryWeight, product of:
          3.9095051 = idf(docFreq=2409, maxDocs=44218)
          0.05019314 = queryNorm
        0.30236036 = fieldWeight in 1563, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9095051 = idf(docFreq=2409, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1563)
    0.023801671 = product of:
      0.047603343 = sum of:
        0.047603343 = weight(_text_:22 in 1563) [ClassicSimilarity], result of:
          0.047603343 = score(doc=1563,freq=2.0), product of:
            0.17576782 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05019314 = queryNorm
            0.2708308 = fieldWeight in 1563, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1563)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: OCLC enlists the cooperation of the world's libraries to make the written record of humankind's cultural heritage more accessible through electronic media. Part of this goal can be accomplished through the application of the principles of knowledge organization. We believe that cultural artifacts are effectively lost unless they are indexed, cataloged and classified. Accordingly, OCLC has developed products, sponsored research projects, and encouraged the participation in international standards communities whose outcome has been improved library classification schemes, cataloging productivity tools, and new proposals for the creation and maintenance of metadata. Though cataloging and classification requires expert intellectual effort, we recognize that at least some of the work must be automated if we hope to keep pace with cultural change
Date: 5. 5.2003 9:22:09

Subramanian, S.; Shafer, K.E.: Clustering (1998) 0.03

0.028253464 = product of:
  0.08476039 = sum of:
    0.08476039 = weight(_text_:electronic in 1103) [ClassicSimilarity], result of:
      0.08476039 = score(doc=1103,freq=2.0), product of:
        0.19623034 = queryWeight, product of:
          3.9095051 = idf(docFreq=2409, maxDocs=44218)
          0.05019314 = queryNorm
        0.43194336 = fieldWeight in 1103, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9095051 = idf(docFreq=2409, maxDocs=44218)
          0.078125 = fieldNorm(doc=1103)
  0.33333334 = coord(1/3)

Abstract: This article presents our exploration of computer science clustering algorithms as they relate to the Scorpion system. Scorpion is a research project at OCLC that explores the indexing and cataloging of electronic resources. For a more complete description of the Scorpion, please visit the Scorpion Web site at <http://purl.oclc.org/scorpion>

Shafer, K.E.: Evaluating Scorpion results (1998) 0.03

0.028253464 = product of:
  0.08476039 = sum of:
    0.08476039 = weight(_text_:electronic in 1569) [ClassicSimilarity], result of:
      0.08476039 = score(doc=1569,freq=2.0), product of:
        0.19623034 = queryWeight, product of:
          3.9095051 = idf(docFreq=2409, maxDocs=44218)
          0.05019314 = queryNorm
        0.43194336 = fieldWeight in 1569, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9095051 = idf(docFreq=2409, maxDocs=44218)
          0.078125 = fieldNorm(doc=1569)
  0.33333334 = coord(1/3)

Abstract: Scorpion is a research project at OCLC that builds tools for automatic subject assignment by combining library science and information retrieval techniques. A thesis of Scorpion is that the Dewey Decimal Classification (Dewey) can be used to perform automatic subject assignment for electronic items.

Koch, T.; Vizine-Goetz, D.: Automatic classification and content navigation support for Web services : DESIRE II cooperates with OCLC (1998) 0.02
```
0.019777425 = product of:
  0.059332274 = sum of:
    0.059332274 = weight(_text_:electronic in 1568) [ClassicSimilarity], result of:
      0.059332274 = score(doc=1568,freq=2.0), product of:
        0.19623034 = queryWeight, product of:
          3.9095051 = idf(docFreq=2409, maxDocs=44218)
          0.05019314 = queryNorm
        0.30236036 = fieldWeight in 1568, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9095051 = idf(docFreq=2409, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1568)
  0.33333334 = coord(1/3)
```
Abstract

Emerging standards in knowledge representation and organization are preparing the way for distributed vocabulary support in Internet search services. NetLab researchers are exploring several innovative solutions for searching and browsing in the subject-based Internet gateway, Electronic Engineering Library, Sweden (EELS). The implementation of the EELS service is described, specifically, the generation of the robot-gathered database 'All' engineering and the automated application of the Ei thesaurus and classification scheme. NetLab and OCLC researchers are collaborating to investigate advanced solutions to automated classification in the DESIRE II context. A plan for furthering the development of distributed vocabulary support in Internet search services is offered.

Reiner, U.: Automatische DDC-Klassifizierung von bibliografischen Titeldatensätzen (2009) 0.01

0.011334131 = product of:
  0.03400239 = sum of:
    0.03400239 = product of:
      0.06800478 = sum of:
        0.06800478 = weight(_text_:22 in 611) [ClassicSimilarity], result of:
          0.06800478 = score(doc=611,freq=2.0), product of:
            0.17576782 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05019314 = queryNorm
            0.38690117 = fieldWeight in 611, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=611)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 22. 8.2009 12:54:24

Dolin, R.; Agrawal, D.; El Abbadi, A.; Pearlman, J.: Using automated classification for summarizing and selecting heterogeneous information sources (1998) 0.01
```
0.008476039 = product of:
  0.025428116 = sum of:
    0.025428116 = weight(_text_:electronic in 1253) [ClassicSimilarity], result of:
      0.025428116 = score(doc=1253,freq=2.0), product of:
        0.19623034 = queryWeight, product of:
          3.9095051 = idf(docFreq=2409, maxDocs=44218)
          0.05019314 = queryNorm
        0.129583 = fieldWeight in 1253, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9095051 = idf(docFreq=2409, maxDocs=44218)
          0.0234375 = fieldNorm(doc=1253)
  0.33333334 = coord(1/3)
```
Abstract

We are currently experimenting with newsgroups as collections. We have built an initial prototype which automatically classifies and summarizes newsgroups within the LCC. (The prototype can be tested below, and more details may be found at http://pharos.alexandria.ucsb.edu/). The prototype uses electronic library catalog records as a `training set' and Latent Semantic Indexing (LSI) for IR. We use the training set to build a rich set of classification terminology, and associate these terms with the relevant categories in the LCC. This association between terms and classification categories allows us to relate users' queries to nodes in the LCC so that users can select appropriate query categories. Newsgroups are similarly associated with classification categories. Pharos then matches the categories selected by users to relevant newsgroups. In principle, this approach allows users to exclude newsgroups that might have been selected based on an unintended meaning of a query term, and to include newsgroups with relevant content even though the exact query terms may not have been used. This work is extensible to other types of classification, including geographical, temporal, and image feature. Before discussing the methodology of the collection summarization and selection, we first present an online demonstration below. The demonstration is not intended to be a complete end-user interface. Rather, it is intended merely to offer a view of the process to suggest the "look and feel" of the prototype. The demo works as follows. First supply it with a few keywords of interest. The system will then use those terms to try to return to you the most relevant subject categories within the LCC. Assuming that the system recognizes any of your terms (it has over 400,000 terms indexed), it will give you a list of 15 LCC categories sorted by relevancy ranking. From there, you have two choices. The first choice, by clicking on the "News" links, is to get a list of newsgroups which the system has identified as relevant to the LCC category you select. The other choice, by clicking on the LCC ID links, is to enter the LCC hierarchy starting at the category of your choice and navigate the tree until you locate the best category for your query. From there, again, you can get a list of newsgroups by clicking on the "News" links. After having shown this demonstration to many people, we would like to suggest that you first give it easier examples before trying to break it. For example, "prostate cancer" (discussed below), "remote sensing", "investment banking", and "gershwin" all work reasonably well.

Reiner, U.: Automatische DDC-Klassifizierung bibliografischer Titeldatensätze der Deutschen Nationalbibliografie (2009) 0.00

0.0045336518 = product of:
  0.013600955 = sum of:
    0.013600955 = product of:
      0.02720191 = sum of:
        0.02720191 = weight(_text_:22 in 3284) [ClassicSimilarity], result of:
          0.02720191 = score(doc=3284,freq=2.0), product of:
            0.17576782 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05019314 = queryNorm
            0.15476047 = fieldWeight in 3284, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=3284)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Date: 22. 1.2010 14:41:24

Search (7 results, page 1 of 1)

Authors

Years

Languages

Themes