Search (71 results, page 2 of 4)

  • × theme_ss:"Volltextretrieval"
  • × year_i:[1990 TO 2000}
  1. Paijmans, H.: Gravity wells of meaning : detecting information rich passages in scientific texts (1997) 0.01
    0.006654713 = product of:
      0.016636781 = sum of:
        0.00770594 = weight(_text_:a in 7444) [ClassicSimilarity], result of:
          0.00770594 = score(doc=7444,freq=4.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.14413087 = fieldWeight in 7444, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=7444)
        0.0089308405 = product of:
          0.017861681 = sum of:
            0.017861681 = weight(_text_:information in 7444) [ClassicSimilarity], result of:
              0.017861681 = score(doc=7444,freq=4.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.21943474 = fieldWeight in 7444, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7444)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Presents research in which 4 term weigthing schemes were used to detect information rich passages in texts and the results compared. Demonstrates that word categories and frequency derived weights have a close correlation but that weighting according to the first mention theory or the cue method shows no correlation with frequency based weights
    Type
    a
  2. Pirkola, A.; Jarvelin, K.: ¬The effect of anaphor and ellipsis resolution on proximity searching in a text database (1995) 0.01
    0.0065874713 = product of:
      0.016468678 = sum of:
        0.009632425 = weight(_text_:a in 4088) [ClassicSimilarity], result of:
          0.009632425 = score(doc=4088,freq=16.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.18016359 = fieldWeight in 4088, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4088)
        0.006836252 = product of:
          0.013672504 = sum of:
            0.013672504 = weight(_text_:information in 4088) [ClassicSimilarity], result of:
              0.013672504 = score(doc=4088,freq=6.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.16796975 = fieldWeight in 4088, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4088)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    So far, methods for ellipsis and anaphor resolution have been developed and the effects of anaphor resolution have been analyzed in the context of statistical information retrieval of scientific abstracts. No significant improvements has been observed. Analyzes the effects of ellipsis and anaphor resolution on proximity searching in a full text database. Anaphora and ellipsis are classified on the basis of the type of their correlates / antecedents rather than, as traditional, on the basis of their own linguistic type. The classification differentiates proper names and common nouns of basic words, compound words, and phrases. The study was carried out in a newspaper article database containing 55.000 full text articles. A set of 154 keyword pairs in different categories was created. Human resolution of keyword ellipsis and anaphora was performed to identify sentences and paragraphs which would match proximity searches after resolution. Findings indicate that ellipsis and anaphor resolution is most relevant for proper name phrases and only marginal in the other keyword categories. Therefore the recall effect of restricted resolution of proper name phrases only was analyzed for keyword pairs containing at least 1 proper name phrase. Findings indicate a recall increase of 38.2% in sentence searches, and 28.8% in paragraph searches when proper name ellipsis were resolved. The recall increase was 17.6% sentence searches, and 19.8% in paragraph searches when proper name anaphora were resolved. Some simple and computationally justifiable resolution method might be developed only for proper name phrases to support keyword based full text information retrieval. Discusses elements of such a method
    Source
    Information processing and management. 32(1996) no.2, S.199-216
    Type
    a
  3. Ellis, D.; Furner, J.; Willett, P.: On the creation of hypertext links in full-text documents : measurement of retrieval effectiveness (1996) 0.01
    0.0064903568 = product of:
      0.016225891 = sum of:
        0.012278981 = weight(_text_:a in 4214) [ClassicSimilarity], result of:
          0.012278981 = score(doc=4214,freq=26.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.22966442 = fieldWeight in 4214, product of:
              5.0990195 = tf(freq=26.0), with freq of:
                26.0 = termFreq=26.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4214)
        0.003946911 = product of:
          0.007893822 = sum of:
            0.007893822 = weight(_text_:information in 4214) [ClassicSimilarity], result of:
              0.007893822 = score(doc=4214,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.09697737 = fieldWeight in 4214, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4214)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    An important stage in the process or retrieval of objects from a hypertext database is the creation of a set of internodal links that are intended to represent the relationships existing between objects; this operation is often undertaken manually, just as index terms are often manually assigned to documents in a conventional retrieval system. In an earlier article (1994), the results were published of a study in which several different sets of links were inserted, each by a different person, between the paragraphs of each of a number of full-text documents. These results showed little similarity between the link-sets, a finding that was comparable with those of studies of inter-indexer consistency, which suggest that there is generally only a low level of agreement between the sets of index terms assigned to a document by different indexers. In this article, a description is provided of an investigation into the nature of the relationship existing between (i) the levels of inter-linker consistency obtaining among the group of hypertext databases used in our earlier experiments, and (ii) the levels of effectiveness of a number of searches carried out in those databases. An account is given of the implementation of the searches and of the methods used in the calculation of numerical values expressing their effectiveness. Analysis of the results of a comparison between recorded levels of consistency and those of effectiveness does not allow us to draw conclusions about the consistency - effectiveness relationship that are equivalent to those drawn in comparable studies of inter-indexer consistency
    Source
    Journal of the American Society for Information Science. 47(1996) no.4, S.287-300
    Type
    a
  4. Huang, Y.-L.: ¬A theoretic and empirical research of cluster indexing for Mandarine Chinese full text document (1998) 0.01
    0.006474727 = product of:
      0.016186817 = sum of:
        0.010661141 = weight(_text_:a in 513) [ClassicSimilarity], result of:
          0.010661141 = score(doc=513,freq=10.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.19940455 = fieldWeight in 513, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=513)
        0.005525676 = product of:
          0.011051352 = sum of:
            0.011051352 = weight(_text_:information in 513) [ClassicSimilarity], result of:
              0.011051352 = score(doc=513,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.13576832 = fieldWeight in 513, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=513)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Since most popular commercialized systems for full text retrieval are designed with full text scaning and Boolean logic query mode, these systems use an oversimplified relationship between the indexing form and the content of document. Reports the use of Singular Value Decomposition (SVD) to develop a Cluster Indexing Model (CIM) based on a Vector Space Model (VSM) in orer to explore the index theory of cluster indexing for chinese full text documents. From a series of experiments, it was found that the indexing performance of CIM is better than traditional VSM, and has almost equivalent effectiveness of the authority control of index terms
    Source
    Bulletin of library and information science. 1998, no.24, S.44-68
    Type
    a
  5. Albus, W.; Smulders, H.: Doeltreffend zoeken in volledige teksten : 2. full-text retrieval bij de HavenInformatieBank (1998) 0.01
    0.0064290287 = product of:
      0.016072571 = sum of:
        0.008258085 = weight(_text_:a in 2368) [ClassicSimilarity], result of:
          0.008258085 = score(doc=2368,freq=6.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.1544581 = fieldWeight in 2368, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2368)
        0.007814486 = product of:
          0.015628971 = sum of:
            0.015628971 = weight(_text_:information in 2368) [ClassicSimilarity], result of:
              0.015628971 = score(doc=2368,freq=4.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.1920054 = fieldWeight in 2368, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2368)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    At Rotterdam Port Authority an information database has been created with approx. 100.000 full text documents online. Topic software has been used to identify word groups and refine search strategies to optimize precision and recall. The software guides users from selected terms to other relevant word combinations. Although the system would benefit from further refinement, users are generally satisfied. The database includes a number of foreign language documents but lacks a thesaurus of foreign terms
    Footnote
    Übers. d. Titels: Effective searching on full texts: 1. full-text-retrieval on the Harbour information database
    Type
    a
  6. Shewale, N.G.: Information delivery of full-text databases in academic and research libraries using hypermedia (1998) 0.01
    0.0063276635 = product of:
      0.015819158 = sum of:
        0.004767807 = weight(_text_:a in 3566) [ClassicSimilarity], result of:
          0.004767807 = score(doc=3566,freq=2.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.089176424 = fieldWeight in 3566, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3566)
        0.011051352 = product of:
          0.022102704 = sum of:
            0.022102704 = weight(_text_:information in 3566) [ClassicSimilarity], result of:
              0.022102704 = score(doc=3566,freq=8.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.27153665 = fieldWeight in 3566, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3566)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Discusses the introduction of hypermedia in information delivery and its advantages. Describes the special features in the design of full text databases and steps in developing in-house hypermedia databases. It may not be the job of the library professionals to design and develop the hypermedia system for information retrieval, but their ideas and experience will play an important role
    Source
    Information management in academic and research libraries: Proceedings of the 5th National convention for Automation of Libraries in Education and Research (CALIBER-98), Bhubaneswar, India, 4-5 March 1998. Ed. by M. Mahapatra et al
    Type
    a
  7. Molto, M.: Improving full text search performance through textual analysis (1993) 0.01
    0.0063011474 = product of:
      0.015752869 = sum of:
        0.009437811 = weight(_text_:a in 5099) [ClassicSimilarity], result of:
          0.009437811 = score(doc=5099,freq=6.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.17652355 = fieldWeight in 5099, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=5099)
        0.006315058 = product of:
          0.012630116 = sum of:
            0.012630116 = weight(_text_:information in 5099) [ClassicSimilarity], result of:
              0.012630116 = score(doc=5099,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.1551638 = fieldWeight in 5099, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5099)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Explores the potential of text analysis as a tool in full text search and design improvement. Reports on a trial analysis performed in the domain of family history. The findings offered insights into possible gains and losses in using one search or design strategy versus another and strong evidence was provided to the potential of text analysis. Makes search and design recommendation
    Source
    Information processing and management. 29(1993) no.5, S.614-632
    Type
    a
  8. Ashford, J.H.: Full text retrieval in document management : a review (1995) 0.01
    0.0063011474 = product of:
      0.015752869 = sum of:
        0.009437811 = weight(_text_:a in 2054) [ClassicSimilarity], result of:
          0.009437811 = score(doc=2054,freq=6.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.17652355 = fieldWeight in 2054, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=2054)
        0.006315058 = product of:
          0.012630116 = sum of:
            0.012630116 = weight(_text_:information in 2054) [ClassicSimilarity], result of:
              0.012630116 = score(doc=2054,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.1551638 = fieldWeight in 2054, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2054)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Full text management which applied to document management tends to be centred on text storage and retrieval. Recent developments are concerned with integration with relational database management system products to deliver document management services offering both the flexibility of text retrieval and the ability to support process based funnctions. There has been a move towards client server architectures, more user friendly user interfaces and more flexible and easier to understand retrieval. Advocates caution in choosing tasks for full text methods. Identifies document management functions for which the combined use of database management systems or special purpose tools should be considered
    Source
    Information management and technology. 28(1995) no.1, S.28-32
    Type
    a
  9. Couvreur, T.R.; Benzel, R.N.; Miller, S.F.; Zeitler, D.N.; Lee, D.L.; Singhal, M.; Shivaratri, N.; Wong, W.Y.P.: ¬An analysis of performance and cost factors in searching large text databases using parallel search systems (1994) 0.01
    0.0060245167 = product of:
      0.015061291 = sum of:
        0.009535614 = weight(_text_:a in 7657) [ClassicSimilarity], result of:
          0.009535614 = score(doc=7657,freq=8.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.17835285 = fieldWeight in 7657, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7657)
        0.005525676 = product of:
          0.011051352 = sum of:
            0.011051352 = weight(_text_:information in 7657) [ClassicSimilarity], result of:
              0.011051352 = score(doc=7657,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.13576832 = fieldWeight in 7657, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=7657)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    The results of modelling the performance of searching large text databases (>10 GBytes) via various parallel hardware architectures and search algorithms are discussed. The performance under load and the cost of each configuration are compared. Strengths, weaknesses, performance sensitivities, and search features supported for each configuration are also addressed. In addition, a common search workload used in the modelling is described. The search workload is derived from a set of searches run against the Chemical Abstracts file of bibliographic and abstract text available on STN International. This common workload is applied to all configurations modelled to provide a common basis of comparison
    Source
    Journal of the American Society for Information Science. 45(1994) no.7, S.443-464
    Type
    a
  10. Preston, L.A.; Ebbs, C.M.; Luther, J.: 'Full text' access evaluation : are we getting the real thing? (1998) 0.01
    0.0060245167 = product of:
      0.015061291 = sum of:
        0.009535614 = weight(_text_:a in 2695) [ClassicSimilarity], result of:
          0.009535614 = score(doc=2695,freq=8.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.17835285 = fieldWeight in 2695, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2695)
        0.005525676 = product of:
          0.011051352 = sum of:
            0.011051352 = weight(_text_:information in 2695) [ClassicSimilarity], result of:
              0.011051352 = score(doc=2695,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.13576832 = fieldWeight in 2695, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2695)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Examines the comparative value of full text access and journal subscription through a sampling of journal issues available online from the Information Access Corporation and in print at James Madison University in Virginia. Presents a model for assessing the value of full text online subscriptions which can be used to evaluate journal subscription, back file retention, document delivery options, index selection, and online access
    Footnote
    Part of an issue devoted to 'Experimentation and collaboration: creating series for a new millenium', part 2, Proceedings of the North American Serials Interest Group, Inc.'s 12th annual conference, 29 May - 1 June 1997, University of Michigan Ann Arbor, Michigan
    Type
    a
  11. Sieverts, E.G.; Hofstede, M.; Groeniger, B.O.: Software for information storage and retrieval tested, evaluated and compared : pt.4: indexing and full-text retrieval programs (1992) 0.01
    0.005948606 = product of:
      0.014871514 = sum of:
        0.008173384 = weight(_text_:a in 2051) [ClassicSimilarity], result of:
          0.008173384 = score(doc=2051,freq=8.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.15287387 = fieldWeight in 2051, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2051)
        0.0066981306 = product of:
          0.013396261 = sum of:
            0.013396261 = weight(_text_:information in 2051) [ClassicSimilarity], result of:
              0.013396261 = score(doc=2051,freq=4.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.16457605 = fieldWeight in 2051, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2051)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    In this article, the fourth in a series on microcomputer software for information storage and retrieval, test results of six indexing and full-text retrieval programs are presented and various properties and qualities of these programs are discussed. The common faeture of programs in these categories is that they are primarily meant to retrieve words (or combination of them) in large text files. To do this they either simply index existing text files in one or more formats (indexing programs), or they store and index them in their own database format (full-text retrieval programs). The programs reviewed in this issue are the indexing programs Ask-it, Texplore and ZYindex and the full-text retrieval programs KAware, textMaster and WordCruncher. All programs run under MS-DOS. In addition ZYindex has a Windows and a Unix version and TextMaster is also available for for Unix. For each of the six programs almost 100 facts and test results are tabulated. The programs are also discussed individually
    Type
    a
  12. McKinin, E.J.; Sievert, M.E.; Johnson, D.; Mitchell, J.A.: ¬The Medline/full-text research project (1991) 0.01
    0.005948606 = product of:
      0.014871514 = sum of:
        0.008173384 = weight(_text_:a in 5385) [ClassicSimilarity], result of:
          0.008173384 = score(doc=5385,freq=8.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.15287387 = fieldWeight in 5385, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=5385)
        0.0066981306 = product of:
          0.013396261 = sum of:
            0.013396261 = weight(_text_:information in 5385) [ClassicSimilarity], result of:
              0.013396261 = score(doc=5385,freq=4.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.16457605 = fieldWeight in 5385, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5385)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    This project was designed to test the relative efficacy of index terms and full-text for the retrieval of documents in those MEDLINE journals for which full-text searching was also available. The full-text files used were MEDIS from Mead Data Central and CCML from BRS Information Technologies. One hundred clinical medical topics were searches in these two files as well as the MEDLINE file to accumulate the necessary data. It was found that full-text identified significantly more relevant articles than did the indexed file. Most relevant items missed in the full-text files, but identified in MEDLINE, were missed because the searcher failed to account for some aspect of natural language, used a logical or positional operator that was too restrictive, or included a concept which was implied, but not expressed in the natural language. Very few of the unique relevant full-text citations would have been retrievaed by title or abstract alone. Finally, as of July, 1990 the more current issue of a journal was just as likely to appear in MEDLINE as in one of the full-text files.
    Source
    Journal of the American Society for Information Science. 42(1991), S.297-307
    Type
    a
  13. Poler, E.M.: Overcoming limitations in database searches (1992) 0.01
    0.00588199 = product of:
      0.014704974 = sum of:
        0.0068111527 = weight(_text_:a in 2329) [ClassicSimilarity], result of:
          0.0068111527 = score(doc=2329,freq=2.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.12739488 = fieldWeight in 2329, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=2329)
        0.007893822 = product of:
          0.015787644 = sum of:
            0.015787644 = weight(_text_:information in 2329) [ClassicSimilarity], result of:
              0.015787644 = score(doc=2329,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.19395474 = fieldWeight in 2329, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2329)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Imprint
    Medford, NJ : Learned Information Inc.
    Type
    a
  14. Marcus, J.: ¬The full text ideal (1995) 0.01
    0.00588199 = product of:
      0.014704974 = sum of:
        0.0068111527 = weight(_text_:a in 4086) [ClassicSimilarity], result of:
          0.0068111527 = score(doc=4086,freq=2.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.12739488 = fieldWeight in 4086, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=4086)
        0.007893822 = product of:
          0.015787644 = sum of:
            0.015787644 = weight(_text_:information in 4086) [ClassicSimilarity], result of:
              0.015787644 = score(doc=4086,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.19395474 = fieldWeight in 4086, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4086)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Reviews developments in full text databases in 1995. Numbers of full text databases have increased by 12%. Consumer services are offering more full text databases, as is the Internet. News feeds are increasingly being offered to corporate users. Contemplates the ideal information service and how full text databases could evolve in this environment
    Type
    a
  15. Nahl, D.; Tenopir, C.: Affective and cognitive searching behavior of novice end-users of a full-text database (1996) 0.01
    0.00588199 = product of:
      0.014704974 = sum of:
        0.0068111527 = weight(_text_:a in 4213) [ClassicSimilarity], result of:
          0.0068111527 = score(doc=4213,freq=8.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.12739488 = fieldWeight in 4213, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4213)
        0.007893822 = product of:
          0.015787644 = sum of:
            0.015787644 = weight(_text_:information in 4213) [ClassicSimilarity], result of:
              0.015787644 = score(doc=4213,freq=8.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.19395474 = fieldWeight in 4213, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4213)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Footnote
    Novice end users were given 2 hours of training in searching a full-text magazine database (Magazine ASAP(TM)) on DIALOG. Subjects searched during 3 to 4 sessions in the presence of a trained monitor who prompted them to think aloud throughout the sessions. qualitative analysis of the transcripts and transaction logs yielded empirical information on user variables (purpose, motivation, satisfaction), uses of the database, move types, and every question users asked during the searches. The spontaneous, naturalistic questions were categorized according to affective, cognitive, and sensorimotor speech acts. Results show that most of the searches were performed for the self and were work related. The most common use of the database was to retrieve full-text articles online and to download and print them out rather than read them on screen. The majority of searches were judged satisfactory. Innovative uses included browsing for background information and obtaining contextualized sentences for language teaching. Searchers made twice as many moves to limit sets as moves to expand sets. Affective questions outnumbered cognitive and sensorimotor questions by two to one. This preponderance of affective micro-information needs during searching might be addressed by new system functions
    Source
    Journal of the American Society for Information Science. 47(1996) no.4, S.276-286
    Type
    a
  16. Basch, R.: Searching full-text : tips and techniques (1992) 0.01
    0.0056083994 = product of:
      0.014020998 = sum of:
        0.00770594 = weight(_text_:a in 5050) [ClassicSimilarity], result of:
          0.00770594 = score(doc=5050,freq=4.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.14413087 = fieldWeight in 5050, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=5050)
        0.006315058 = product of:
          0.012630116 = sum of:
            0.012630116 = weight(_text_:information in 5050) [ClassicSimilarity], result of:
              0.012630116 = score(doc=5050,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.1551638 = fieldWeight in 5050, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5050)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Gives tips for effective searching of full-text databases: think creatively about vocabulary and use of the OR connector to link synonyms; use truncation to pick up plurals and alternative work endings; anticipate international differences in spellimg, likie 'labor' and 'labour', and in vocabulary, like 'truck' and 'lorry'; avoid the AND operator, preferring proximity connectors like NEAR, WITHIN or PRE; use jargon instead of trying to anticipate all the abstract terminology that a writer might have used; and exploit any indexing available
    Source
    Information today. 9(1992) no.10, S.10-12
    Type
    a
  17. Turtle, H.; Flood, J.: Query evaluation : strategies and optimizations (1995) 0.01
    0.0056083994 = product of:
      0.014020998 = sum of:
        0.00770594 = weight(_text_:a in 4087) [ClassicSimilarity], result of:
          0.00770594 = score(doc=4087,freq=4.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.14413087 = fieldWeight in 4087, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=4087)
        0.006315058 = product of:
          0.012630116 = sum of:
            0.012630116 = weight(_text_:information in 4087) [ClassicSimilarity], result of:
              0.012630116 = score(doc=4087,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.1551638 = fieldWeight in 4087, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4087)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Discusses the 2 major query evaluation strategies used in large text retrieval systems and analyzes the performance of these strategies. Discusses several optimization techniques that can be used to reduce evaluation costs and present simulation results to compare the performance of these optimization techniques when evaluating natural language queries with a collection of full text legal materials
    Source
    Information processing and management. 31(1995) no.6, S.831-850
    Type
    a
  18. Witt, M.: Au sujet des mots-clés (1997) 0.01
    0.0056083994 = product of:
      0.014020998 = sum of:
        0.00770594 = weight(_text_:a in 1666) [ClassicSimilarity], result of:
          0.00770594 = score(doc=1666,freq=4.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.14413087 = fieldWeight in 1666, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=1666)
        0.006315058 = product of:
          0.012630116 = sum of:
            0.012630116 = weight(_text_:information in 1666) [ClassicSimilarity], result of:
              0.012630116 = score(doc=1666,freq=2.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.1551638 = fieldWeight in 1666, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1666)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Information research using keywords is largely ignored by readers but in medium sized libraries such as that of the City of Science and Industry (Paris), with a multimedia collection of some 200.000 documents, it is the most effective tool. The reader, however, needs guidance in using this technique, and in refining searches to reduce noise. Examples are given demonstrating how keyword searches can identify non-indexed material, documents in other languages, and collective authors
    Type
    a
  19. Blake, P.: Leading edge : Verity keeps it in the family (1997) 0.00
    0.0049910345 = product of:
      0.012477586 = sum of:
        0.005779455 = weight(_text_:a in 7398) [ClassicSimilarity], result of:
          0.005779455 = score(doc=7398,freq=4.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.10809815 = fieldWeight in 7398, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=7398)
        0.0066981306 = product of:
          0.013396261 = sum of:
            0.013396261 = weight(_text_:information in 7398) [ClassicSimilarity], result of:
              0.013396261 = score(doc=7398,freq=4.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.16457605 = fieldWeight in 7398, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046875 = fieldNorm(doc=7398)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Verity Search 97 software will index and search e-mail, attachments, folders and documents on local and network disk drives. The Internet may be searched via the same front end and changes to particular documents or pages may be monitored. Documents may be viewed in their native formats including ASCII, HTML, PDF and popular word processors, with highlighted search terms. Agents may be launched into the Internet to retrieve information according to a user-specified profile. The software can index about 700 MB an hour. Describes the search technology which includes fuzzy logic and natural language. The Web version of Personal Search 97 works with Netscape Navigator or Microsoft Internet Explorer, while the Exchange version will work regardless of any attachment to an Exchange server. Search 97 Personal improves online time and access time and allows searches to be refined offline
    Source
    Information world review. 1997, no.122, S.15-16
    Type
    a
  20. Wacholder, N.; Byrd, R.J.: Retrieving information from full text using linguistic knowledge (1994) 0.00
    0.0049160775 = product of:
      0.012290194 = sum of:
        0.004086692 = weight(_text_:a in 8524) [ClassicSimilarity], result of:
          0.004086692 = score(doc=8524,freq=2.0), product of:
            0.053464882 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046368346 = queryNorm
            0.07643694 = fieldWeight in 8524, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=8524)
        0.008203502 = product of:
          0.016407004 = sum of:
            0.016407004 = weight(_text_:information in 8524) [ClassicSimilarity], result of:
              0.016407004 = score(doc=8524,freq=6.0), product of:
                0.08139861 = queryWeight, product of:
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046368346 = queryNorm
                0.20156369 = fieldWeight in 8524, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.7554779 = idf(docFreq=20772, maxDocs=44218)
                  0.046875 = fieldNorm(doc=8524)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    Examines how techniques in the field of natural language processing can be applied to the analysis of text in information retrieval. State of the art text searching programs cannot distinguish, for example, between occurrences of the sickness, AIDS and aids as tool or between library school and school nor equate such terms as online or on-line which are variants of the same form. To make these distinction, systems must incorporate knowledge about the meaning of words in context. Research in natural language processing has concentrated on the automatic 'understanding' of language; how to analyze the grammatical structure and meaning of text. Although many asoects of this research remain experimental, describes how these techniques to recognize spelling variants, names, acronyms, and abbreviations
    Imprint
    Medford, NJ : Learned Information
    Type
    a

Languages

Types

  • a 67
  • m 1
  • r 1
  • s 1
  • x 1
  • More… Less…