Search (8 results, page 1 of 1)

  • author_ss:"Humphrey, S.M."
  • type_ss:"a"
  1. Humphrey, S.M.: Knowledge-based systems for indexing (1994) 0.01
    0.005054501 = product of:
      0.035381503 = sum of:
        0.011415146 = weight(_text_:information in 2987) [ClassicSimilarity], result of:
          0.011415146 = score(doc=2987,freq=4.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.21943474 = fieldWeight in 2987, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0625 = fieldNorm(doc=2987)
        0.023966359 = weight(_text_:retrieval in 2987) [ClassicSimilarity], result of:
          0.023966359 = score(doc=2987,freq=2.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.26736724 = fieldWeight in 2987, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=2987)
      0.14285715 = coord(2/14)
    
    Abstract
    Human indexing for information retrieval is intellectually labor intensive. It requires maintaining a system of indexing rules and policies, which in turn require maintaining a controlled indexing vocabulary. These activities are being performed at the National Library of Medicine in support of indexing the MEDLINE database using the MeSH thesaurus. An additional requirement of the conventional indexing operation is maintaining and developing a user interface, known as the Automated Indexing and Management System (AIMS). Describes knowledge-based indexing, based on a unique prototype, called MedIndEx (Medical Indexing Expert)
    Imprint
    Medford, NJ : Learned Information
  2. Humphrey, S.M.: Use and management of classification systems for knowledge-based indexing (1992) 0.00
    0.0045768693 = product of:
      0.032038085 = sum of:
        0.008071727 = weight(_text_:information in 2094) [ClassicSimilarity], result of:
          0.008071727 = score(doc=2094,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.1551638 = fieldWeight in 2094, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0625 = fieldNorm(doc=2094)
        0.023966359 = weight(_text_:retrieval in 2094) [ClassicSimilarity], result of:
          0.023966359 = score(doc=2094,freq=2.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.26736724 = fieldWeight in 2094, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=2094)
      0.14285715 = coord(2/14)
    
    Abstract
    The MedIndEx (Medical Indexing Expert) research project combines artificial intelligence and information retrieval principles and methods to develop and test an interactive knowledge-based prototype for computer-assisted indexing of the MEDLINE database. By encoding the indexing scheme in a knowledge base, and designing a system for indexers to use in a workstation environment, the objective of this project is to facilitate "expert indexing" that is performed at the National Library of Medicine
  3. Humphrey, S.M.: Automatic indexing of documents from journal descriptors : a preliminary investigation (1999) 0.00
    0.0038537113 = product of:
      0.026975978 = sum of:
        0.020922182 = weight(_text_:web in 3769) [ClassicSimilarity], result of:
          0.020922182 = score(doc=3769,freq=2.0), product of:
            0.09670874 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029633347 = queryNorm
            0.21634221 = fieldWeight in 3769, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.046875 = fieldNorm(doc=3769)
        0.0060537956 = weight(_text_:information in 3769) [ClassicSimilarity], result of:
          0.0060537956 = score(doc=3769,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.116372846 = fieldWeight in 3769, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=3769)
      0.14285715 = coord(2/14)
    
    Abstract
    A new, fully automated approach for indexing documents is presented based on associating textwords in a training set of bibliographic citations with the indexing of journals. This journal-level indexing is in the form of a consistent, timely set of journal descriptors (JDs) indexing the individual journals themselves. This indexing is maintained in journal records in a serials authority database. The advantage of this novel approach is that the training set does not depend on previous manual indexing of thousands of documents (i.e., any such indexing already in the training set is not used), but rather the relatively small intellectual effort of indexing at the journal level, usually a matter of a few thousand unique journals for which retrospective indexing to maintain consistency and currency may be feasible. If successful, JD indexing would provide topical categorization of documents outside the training set, i.e., journal articles, monographs, Web documents, reports from the grey literature, etc., and therefore be applied in searching. Because JDs are quite general, corresponding to subject domains, their most probable use would be for improving or refining search results
    Source
    Journal of the American Society for Information Science. 50(1999) no.8, S.661-674
  4. Humphrey, S.M.; Névéol, A.; Browne, A.; Gobeil, J.; Ruch, P.; Darmoni, S.J.: Comparing a rule-based versus statistical system for automatic categorization of MEDLINE documents according to biomedical specialty (2009) 0.00
    0.0033881254 = product of:
      0.023716876 = sum of:
        0.008737902 = weight(_text_:information in 3300) [ClassicSimilarity], result of:
          0.008737902 = score(doc=3300,freq=6.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.16796975 = fieldWeight in 3300, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3300)
        0.014978974 = weight(_text_:retrieval in 3300) [ClassicSimilarity], result of:
          0.014978974 = score(doc=3300,freq=2.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.16710453 = fieldWeight in 3300, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3300)
      0.14285715 = coord(2/14)
    
    Abstract
    Automatic document categorization is an important research problem in Information Science and Natural Language Processing. Many applications, including Word Sense Disambiguation and Information Retrieval in large collections, can benefit from such categorization. This paper focuses on automatic categorization of documents from the biomedical literature into broad discipline-based categories. Two different systems are described and contrasted: CISMeF, which uses rules based on human indexing of the documents by the Medical Subject Headings (MeSH) controlled vocabulary in order to assign metaterms (MTs), and Journal Descriptor Indexing (JDI), based on human categorization of about 4,000 journals and statistical associations between journal descriptors (JDs) and textwords in the documents. We evaluate and compare the performance of these systems against a gold standard of humanly assigned categories for 100 MEDLINE documents, using six measures selected from trec_eval. The results show that for five of the measures performance is comparable, and for one measure JDI is superior. We conclude that these results favor JDI, given the significantly greater intellectual overhead involved in human indexing and maintaining a rule base for mapping MeSH terms to MTs. We also note a JDI method that associates JDs with MeSH indexing rather than textwords, and it may be worthwhile to investigate whether this JDI method (statistical) and CISMeF (rule-based) might be combined and then evaluated showing they are complementary to one another.
    Source
    Journal of the American Society for Information Science and Technology. 60(2009) no.12, S.2530-2539
  5. Humphrey, S.M.: Indexing biomedical documents : from thesaural to knowledge-based retrieval systems (1992) 0.00
    0.002118347 = product of:
      0.029656855 = sum of:
        0.029656855 = weight(_text_:retrieval in 7641) [ClassicSimilarity], result of:
          0.029656855 = score(doc=7641,freq=4.0), product of:
            0.08963835 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.029633347 = queryNorm
            0.33085006 = fieldWeight in 7641, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7641)
      0.071428575 = coord(1/14)
    
    Abstract
    Interactive knowledge-based indexing of the National Library of Medicine's MEDLINE database is advocated. It is established that in the current setting concept indexing is needed and cannot be fully automated. Compatibility between conventional and knowledge-based indexing is highlighted, followed by discussion of indexing as a cognitive process. The section of knowledge-based indexing describes how NLM's MedIndEx prototype addresses problems in conventional indexing and includes the contention that constructing a knowledge base adapted from a conventional classified thesaurus and indexing scheme is not as daunting as it may seem. Extension of the prototype to an intelligent search assistant illustrates use of the same knowledge base to integrate indexing and retrieval applications. Suggested are also future directions for knowledge-based indexing
  6. Humphrey, S.M.: ¬The MedIndEx prototype for computer assisted MEDLINE database indexing (1993) 0.00
    8.64828E-4 = product of:
      0.012107591 = sum of:
        0.012107591 = weight(_text_:information in 7819) [ClassicSimilarity], result of:
          0.012107591 = score(doc=7819,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.23274569 = fieldWeight in 7819, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.09375 = fieldNorm(doc=7819)
      0.071428575 = coord(1/14)
    
    Source
    Indexing, providing access to information: looking back, looking ahead. Proceedings of the 25th Annual Meeting of the American Society of Indexers. Ed.: N.C. Mulvany
  7. Lancaster, F.W.; Ulvila, J.W.; Humphrey, S.M.; Smith, L.C.; Allen, B.; Herner, S.: Evaluation of interactive knowledge-based systems : overview and design for empirical testing (1996) 0.00
    5.04483E-4 = product of:
      0.0070627616 = sum of:
        0.0070627616 = weight(_text_:information in 3000) [ClassicSimilarity], result of:
          0.0070627616 = score(doc=3000,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.13576832 = fieldWeight in 3000, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3000)
      0.071428575 = coord(1/14)
    
    Source
    Journal of the American Society for Information Science. 47(1996) no.1, S.57-69
  8. Humphrey, S.M.; Rogers, W.J.; Kilicoglu, H.; Demner-Fushman, D.; Rindflesch, T.C.: Word sense disambiguation by selecting the best semantic type based on journal descriptor indexing : preliminary experiment (2006) 0.00
    2.8827597E-4 = product of:
      0.0040358636 = sum of:
        0.0040358636 = weight(_text_:information in 4912) [ClassicSimilarity], result of:
          0.0040358636 = score(doc=4912,freq=2.0), product of:
            0.052020688 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.029633347 = queryNorm
            0.0775819 = fieldWeight in 4912, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.03125 = fieldNorm(doc=4912)
      0.071428575 = coord(1/14)
    
    Source
    Journal of the American Society for Information Science and Technology. 57(2006) no.1, S.96-113
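
The score breakdowns listed under each result can be re-derived by hand. The sketch below reproduces the arithmetic for the first result (doc 2987), assuming the usual Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The queryNorm, fieldNorm, and the clause count of 14 are copied from the explain output rather than derived, because the original query is not shown. The function names are illustrative and not part of any Lucene API.

```python
import math

MAX_DOCS = 44218          # maxDocs, as reported in the explain output
QUERY_NORM = 0.029633347  # queryNorm, copied from the output (not derived here)


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def tf(freq):
    """Term frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


def doc_score(term_scores, total_clauses):
    """Sum of the matching term scores, scaled by coord(matched/total)."""
    coord = len(term_scores) / total_clauses
    return sum(term_scores) * coord


# Values for doc 2987, taken from the first result's breakdown.
information = term_score(freq=4.0, doc_freq=20772, field_norm=0.0625)
retrieval = term_score(freq=2.0, doc_freq=5836, field_norm=0.0625)
total = doc_score([information, retrieval], total_clauses=14)

print(f"information={information:.9f} retrieval={retrieval:.9f} total={total:.9f}")
```

The same functions apply to the other results by substituting their freq, docFreq, and fieldNorm values. Results 5 to 8 match only one of the 14 clauses, so their coord factor is 1/14. Small differences in the last digits are expected, since Lucene computes these values in single precision.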