Search (6 results, page 1 of 1)

Search Engines and Beyond : Developing efficient knowledge management systems, April 19-20 1999, Boston, Mass (1999) 0.02
```
0.015483556 = product of:
  0.030967113 = sum of:
    0.023722902 = weight(_text_:management in 2596) [ClassicSimilarity], result of:
      0.023722902 = score(doc=2596,freq=2.0), product of:
        0.15925534 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.047248192 = queryNorm
        0.14896142 = fieldWeight in 2596, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.03125 = fieldNorm(doc=2596)
    0.0072442107 = product of:
      0.014488421 = sum of:
        0.014488421 = weight(_text_:science in 2596) [ClassicSimilarity], result of:
          0.014488421 = score(doc=2596,freq=2.0), product of:
            0.124457374 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.047248192 = queryNorm
            0.11641272 = fieldWeight in 2596, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.03125 = fieldNorm(doc=2596)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
Content

Ramana Rao (Inxight, Palo Alto, CA) 7 ± 2 Insights on achieving Effective Information Access Session One: Updates and a twelve month perspective Danny Sullivan (Search Engine Watch, US / England) Portalization and other search trends Carol Tenopir (University of Tennessee) Search realities faced by end users and professional searchers Session Two: Today's search engines and beyond Daniel Hoogterp (Retrieval Technologies, McLean, VA) Effective presentation and utilization of search techniques Rick Kenny (Fulcrum Technologies, Ontario, Canada) Beyond document clustering: The knowledge impact statement Gary Stock (Ingenius, Kalamazoo, MI) Automated change monitoring Gary Culliss (Direct Hit, Wellesley Hills, MA) User popularity ranked search engines Byron Dom (IBM, CA) Automatically finding the best pages on the World Wide Web (CLEVER) Peter Tomassi (LookSmart, San Francisco, CA) Adding human intellect to search technology Session Three: Panel discussion: Human v automated categorization and editing Ev Brenner (New York, NY)- Chairman James Callan (University of Massachusetts, MA) Marc Krellenstein (Northern Light Technology, Cambridge, MA) Dan Miller (Ask Jeeves, Berkeley, CA) Session Four: Updates and a twelve month perspective Steve Arnold (AIT, Harrods Creek, KY) Review: The leading edge in search and retrieval software Ellen Voorhees (NIST, Gaithersburg, MD) TREC update Session Five: Search engines now and beyond Intelligent Agents John Snyder (Muscat, Cambridge, England) Practical issues behind intelligent agents Text summarization Therese Firmin, (Dept of Defense, Ft George G. Meade, MD) The TIPSTER/SUMMAC evaluation of automatic text summarization systems Cross language searching Elizabeth Liddy (TextWise, Syracuse, NY) A conceptual interlingua approach to cross-language retrieval. Video search and retrieval Armon Amir (IBM, Almaden, CA) CueVideo: Modular system for automatic indexing and browsing of video/audio Speech recognition Michael Witbrock (Lycos, Waltham, MA) Retrieval of spoken documents Visualization James A. Wise (Integral Visuals, Richland, WA) Information visualization in the new millennium: Emerging science or passing fashion? Text mining David Evans (Claritech, Pittsburgh, PA) Text mining - towards decision support
Humphrey, S.M.; Névéol, A.; Browne, A.; Gobeil, J.; Ruch, P.; Darmoni, S.J.: Comparing a rule-based versus statistical system for automatic categorization of MEDLINE documents according to biomedical specialty (2009) 0.00
```
0.0032015191 = product of:
  0.012806077 = sum of:
    0.012806077 = product of:
      0.025612153 = sum of:
        0.025612153 = weight(_text_:science in 3300) [ClassicSimilarity], result of:
          0.025612153 = score(doc=3300,freq=4.0), product of:
            0.124457374 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.047248192 = queryNorm
            0.20579056 = fieldWeight in 3300, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3300)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

Automatic document categorization is an important research problem in Information Science and Natural Language Processing. Many applications, including, Word Sense Disambiguation and Information Retrieval in large collections, can benefit from such categorization. This paper focuses on automatic categorization of documents from the biomedical literature into broad discipline-based categories. Two different systems are described and contrasted: CISMeF, which uses rules based on human indexing of the documents by the Medical Subject Headings (MeSH) controlled vocabulary in order to assign metaterms (MTs), and Journal Descriptor Indexing (JDI), based on human categorization of about 4,000 journals and statistical associations between journal descriptors (JDs) and textwords in the documents. We evaluate and compare the performance of these systems against a gold standard of humanly assigned categories for 100 MEDLINE documents, using six measures selected from trec_eval. The results show that for five of the measures performance is comparable, and for one measure JDI is superior. We conclude that these results favor JDI, given the significantly greater intellectual overhead involved in human indexing and maintaining a rule base for mapping MeSH terms to MTs. We also note a JDI method that associates JDs with MeSH indexing rather than textwords, and it may be worthwhile to investigate whether this JDI method (statistical) and CISMeF (rule-based) might be combined and then evaluated showing they are complementary to one another.

Source

Journal of the American Society for Information Science and Technology. 60(2009) no.12, S.2530-2539

Golub, K.; Soergel, D.; Buchanan, G.; Tudhope, D.; Lykke, M.; Hiom, D.: ¬A framework for evaluating automatic indexing or classification in the context of retrieval (2016) 0.00

0.0032015191 = product of:
  0.012806077 = sum of:
    0.012806077 = product of:
      0.025612153 = sum of:
        0.025612153 = weight(_text_:science in 3311) [ClassicSimilarity], result of:
          0.025612153 = score(doc=3311,freq=4.0), product of:
            0.124457374 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.047248192 = queryNorm
            0.20579056 = fieldWeight in 3311, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3311)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Series: Advances in information science
Source: Journal of the Association for Information Science and Technology. 67(2016) no.1, S.3-16

Smiraglia, R.P.; Cai, X.: Tracking the evolution of clustering, machine learning, automatic indexing and automatic classification in knowledge organization (2017) 0.00
```
0.0032015191 = product of:
  0.012806077 = sum of:
    0.012806077 = product of:
      0.025612153 = sum of:
        0.025612153 = weight(_text_:science in 3627) [ClassicSimilarity], result of:
          0.025612153 = score(doc=3627,freq=4.0), product of:
            0.124457374 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.047248192 = queryNorm
            0.20579056 = fieldWeight in 3627, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3627)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

A very important extension of the traditional domain of knowledge organization (KO) arises from attempts to incorporate techniques devised in the computer science domain for automatic concept extraction and for grouping, categorizing, clustering and otherwise organizing knowledge using mechanical means. Four specific terms have emerged to identify the most prevalent techniques: machine learning, clustering, automatic indexing, and automatic classification. Our study presents three domain analytical case analyses in search of answers. The first case relies on citations located using the ISKO-supported "Knowledge Organization Bibliography." The second case relies on works in both Web of Science and SCOPUS. Case three applies co-word analysis and citation analysis to the contents of the papers in the present special issue. We observe scholars involved in "clustering" and "automatic classification" who share common thematic emphases. But we have found no coherence, no common activity and no social semantics. We have not found a research front, or a common teleology within the KO domain. We also have found a lively group of authors who have succeeded in submitting papers to this special issue, and their work quite interestingly aligns with the case studies we report. There is an emphasis on KO for information retrieval; there is much work on clustering (which involves conceptual points within texts) and automatic classification (which involves semantic groupings at the meta-document level).

Chung, Y.M.; Lee, J.Y.: ¬A corpus-based approach to comparative evaluation of statistical term association measures (2001) 0.00

0.0022638158 = product of:
  0.009055263 = sum of:
    0.009055263 = product of:
      0.018110527 = sum of:
        0.018110527 = weight(_text_:science in 5769) [ClassicSimilarity], result of:
          0.018110527 = score(doc=5769,freq=2.0), product of:
            0.124457374 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.047248192 = queryNorm
            0.1455159 = fieldWeight in 5769, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5769)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Source: Journal of the American Society for Information Science and technology. 52(2001) no.4, S.283-296

Vilares, D.; Alonso, M.A.; Gómez-Rodríguez, C.: On the usefulness of lexical and syntactic processing in polarity classification of Twitter messages (2015) 0.00

0.0022638158 = product of:
  0.009055263 = sum of:
    0.009055263 = product of:
      0.018110527 = sum of:
        0.018110527 = weight(_text_:science in 2161) [ClassicSimilarity], result of:
          0.018110527 = score(doc=2161,freq=2.0), product of:
            0.124457374 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.047248192 = queryNorm
            0.1455159 = fieldWeight in 2161, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2161)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Source: Journal of the Association for Information Science and Technology. 66(2015) no.9, S.1799-1816

Search (6 results, page 1 of 1)

Authors

Years

Types