Search (48 results, page 1 of 3)

  • × author_ss:"Salton, G."
  1. Salton, G.: Automatic text processing : the transformation, analysis, and retrieval of information by computer (1989) 0.30
    0.29714388 = product of:
      0.4754302 = sum of:
        0.07230785 = weight(_text_:retrieval in 1307) [ClassicSimilarity], result of:
          0.07230785 = score(doc=1307,freq=6.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.5788671 = fieldWeight in 1307, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.078125 = fieldNorm(doc=1307)
        0.06049911 = weight(_text_:use in 1307) [ClassicSimilarity], result of:
          0.06049911 = score(doc=1307,freq=4.0), product of:
            0.12644777 = queryWeight, product of:
              3.0620887 = idf(docFreq=5623, maxDocs=44218)
              0.041294612 = queryNorm
            0.47845137 = fieldWeight in 1307, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.0620887 = idf(docFreq=5623, maxDocs=44218)
              0.078125 = fieldNorm(doc=1307)
        0.019324033 = weight(_text_:of in 1307) [ClassicSimilarity], result of:
          0.019324033 = score(doc=1307,freq=6.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.2992506 = fieldWeight in 1307, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.078125 = fieldNorm(doc=1307)
        0.23412317 = sum of:
          0.031212443 = weight(_text_:on in 1307) [ClassicSimilarity], result of:
            0.031212443 = score(doc=1307,freq=4.0), product of:
              0.090823986 = queryWeight, product of:
                2.199415 = idf(docFreq=13325, maxDocs=44218)
                0.041294612 = queryNorm
              0.3436586 = fieldWeight in 1307, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.199415 = idf(docFreq=13325, maxDocs=44218)
                0.078125 = fieldNorm(doc=1307)
          0.20291072 = weight(_text_:line in 1307) [ClassicSimilarity], result of:
            0.20291072 = score(doc=1307,freq=4.0), product of:
              0.23157367 = queryWeight, product of:
                5.6078424 = idf(docFreq=440, maxDocs=44218)
                0.041294612 = queryNorm
              0.87622535 = fieldWeight in 1307, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6078424 = idf(docFreq=440, maxDocs=44218)
                0.078125 = fieldNorm(doc=1307)
        0.08917602 = product of:
          0.17835204 = sum of:
            0.17835204 = weight(_text_:computers in 1307) [ClassicSimilarity], result of:
              0.17835204 = score(doc=1307,freq=4.0), product of:
                0.21710795 = queryWeight, product of:
                  5.257537 = idf(docFreq=625, maxDocs=44218)
                  0.041294612 = queryNorm
                0.82149017 = fieldWeight in 1307, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  5.257537 = idf(docFreq=625, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1307)
          0.5 = coord(1/2)
      0.625 = coord(5/8)
    
    COMPASS
    Information retrieval / Use of / On-line computers
    Subject
    Information retrieval / Use of / On-line computers
  2. Salton, G.; Waldstein, R.H.: Term relevance weights in on-line information retrieval (1978) 0.08
    0.08291881 = product of:
      0.33167523 = sum of:
        0.066795126 = weight(_text_:retrieval in 5484) [ClassicSimilarity], result of:
          0.066795126 = score(doc=5484,freq=2.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.5347345 = fieldWeight in 5484, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.125 = fieldNorm(doc=5484)
        0.26488012 = sum of:
          0.035312846 = weight(_text_:on in 5484) [ClassicSimilarity], result of:
            0.035312846 = score(doc=5484,freq=2.0), product of:
              0.090823986 = queryWeight, product of:
                2.199415 = idf(docFreq=13325, maxDocs=44218)
                0.041294612 = queryNorm
              0.3888053 = fieldWeight in 5484, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.199415 = idf(docFreq=13325, maxDocs=44218)
                0.125 = fieldNorm(doc=5484)
          0.22956727 = weight(_text_:line in 5484) [ClassicSimilarity], result of:
            0.22956727 = score(doc=5484,freq=2.0), product of:
              0.23157367 = queryWeight, product of:
                5.6078424 = idf(docFreq=440, maxDocs=44218)
                0.041294612 = queryNorm
              0.9913358 = fieldWeight in 5484, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6078424 = idf(docFreq=440, maxDocs=44218)
                0.125 = fieldNorm(doc=5484)
      0.25 = coord(2/8)
    
  3. Salton, G.; Araya, J.: On the use of clustered file organizations in information search and retrieval (1990) 0.07
    0.071673915 = product of:
      0.14334783 = sum of:
        0.050096344 = weight(_text_:retrieval in 2409) [ClassicSimilarity], result of:
          0.050096344 = score(doc=2409,freq=2.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.40105087 = fieldWeight in 2409, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.09375 = fieldNorm(doc=2409)
        0.051335193 = weight(_text_:use in 2409) [ClassicSimilarity], result of:
          0.051335193 = score(doc=2409,freq=2.0), product of:
            0.12644777 = queryWeight, product of:
              3.0620887 = idf(docFreq=5623, maxDocs=44218)
              0.041294612 = queryNorm
            0.40597942 = fieldWeight in 2409, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0620887 = idf(docFreq=5623, maxDocs=44218)
              0.09375 = fieldNorm(doc=2409)
        0.023188837 = weight(_text_:of in 2409) [ClassicSimilarity], result of:
          0.023188837 = score(doc=2409,freq=6.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.3591007 = fieldWeight in 2409, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.09375 = fieldNorm(doc=2409)
        0.018727465 = product of:
          0.03745493 = sum of:
            0.03745493 = weight(_text_:on in 2409) [ClassicSimilarity], result of:
              0.03745493 = score(doc=2409,freq=4.0), product of:
                0.090823986 = queryWeight, product of:
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.041294612 = queryNorm
                0.4123903 = fieldWeight in 2409, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.09375 = fieldNorm(doc=2409)
          0.5 = coord(1/2)
      0.5 = coord(4/8)
    
    Imprint
    Edmonton, Alberta : Univ. of Alberta, Faculty of Extension
    Source
    Library classification and its functions. Int. Conf. on ..., 20.-21.6.1989, Edmonton, Alberta. Ed.: A. Nitecki u. T. Fell
  4. Salton, G.: Thoughts about modern retrieval technologies (1988) 0.05
    0.05069307 = product of:
      0.10138614 = sum of:
        0.029222867 = weight(_text_:retrieval in 1522) [ClassicSimilarity], result of:
          0.029222867 = score(doc=1522,freq=2.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.23394634 = fieldWeight in 1522, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1522)
        0.042349376 = weight(_text_:use in 1522) [ClassicSimilarity], result of:
          0.042349376 = score(doc=1522,freq=4.0), product of:
            0.12644777 = queryWeight, product of:
              3.0620887 = idf(docFreq=5623, maxDocs=44218)
              0.041294612 = queryNorm
            0.33491597 = fieldWeight in 1522, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.0620887 = idf(docFreq=5623, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1522)
        0.022089208 = weight(_text_:of in 1522) [ClassicSimilarity], result of:
          0.022089208 = score(doc=1522,freq=16.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.34207192 = fieldWeight in 1522, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1522)
        0.007724685 = product of:
          0.01544937 = sum of:
            0.01544937 = weight(_text_:on in 1522) [ClassicSimilarity], result of:
              0.01544937 = score(doc=1522,freq=2.0), product of:
                0.090823986 = queryWeight, product of:
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.041294612 = queryNorm
                0.17010231 = fieldWeight in 1522, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1522)
          0.5 = coord(1/2)
      0.5 = coord(4/8)
    
    Abstract
    Paper presented at the 30th Annual Conference of the National Federation of Astracting and Information Services, Philadelphia, 28 Feb-2 Mar 88. In recent years, the amount and the variety of available machine-readable data, new technologies have been introduced, such as high density storage devices, and fancy graphic displays useful for information transformation and access. New approaches have also been considered for processing the stored data based on the construction of knowledge bases representing the contents and structure of the information, and the use of expert system techniques to control the user-system interactions. Provides a brief evaluation of the new information processing technologies, and of the software methods proposed for information manipulation.
    Source
    Information services and use. 8(1988) no.2/3/4, S.107-113
  5. Salton, G.; Rijsbergen, C.J. van; Maron, M.E.: Panel on key issues in information retrieval (1983) 0.04
    0.042208925 = product of:
      0.11255713 = sum of:
        0.08180699 = weight(_text_:retrieval in 7410) [ClassicSimilarity], result of:
          0.08180699 = score(doc=7410,freq=12.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.6549133 = fieldWeight in 7410, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=7410)
        0.0154592255 = weight(_text_:of in 7410) [ClassicSimilarity], result of:
          0.0154592255 = score(doc=7410,freq=6.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.23940048 = fieldWeight in 7410, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=7410)
        0.015290912 = product of:
          0.030581824 = sum of:
            0.030581824 = weight(_text_:on in 7410) [ClassicSimilarity], result of:
              0.030581824 = score(doc=7410,freq=6.0), product of:
                0.090823986 = queryWeight, product of:
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.041294612 = queryNorm
                0.33671528 = fieldWeight in 7410, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7410)
          0.5 = coord(1/2)
      0.375 = coord(3/8)
    
    Abstract
    Contribution to an issue devoted to the 6th Annual International Conference of the Special Interest Group on Information Retrieval of the Association for Computing Machinery (USA) held at the National Library of Medicine, Bethesda, Maryland, from 6-8 June 83. The following papers were presented in session 12 which was a panel on key issues in information retrieval: SALTON, G.: Research problems in automatic information retrieval; RIJSBERGEN, C.J. van: Information retrieval: new directions, old solutions; MARON, M.E.: Open problems in information retrieval
  6. Salton, G.: Mathematics and information retrieval (1979) 0.04
    0.037114087 = product of:
      0.0989709 = sum of:
        0.06534432 = weight(_text_:retrieval in 5467) [ClassicSimilarity], result of:
          0.06534432 = score(doc=5467,freq=10.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.5231199 = fieldWeight in 5467, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5467)
        0.025901893 = weight(_text_:of in 5467) [ClassicSimilarity], result of:
          0.025901893 = score(doc=5467,freq=22.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.40111488 = fieldWeight in 5467, product of:
              4.690416 = tf(freq=22.0), with freq of:
                22.0 = termFreq=22.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5467)
        0.007724685 = product of:
          0.01544937 = sum of:
            0.01544937 = weight(_text_:on in 5467) [ClassicSimilarity], result of:
              0.01544937 = score(doc=5467,freq=2.0), product of:
                0.090823986 = queryWeight, product of:
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.041294612 = queryNorm
                0.17010231 = fieldWeight in 5467, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5467)
          0.5 = coord(1/2)
      0.375 = coord(3/8)
    
    Abstract
    The development of a given discipline in science and technology often depends on the availability of theorie capable of describing the processes which control the field and of modelling the interactions between the processes. The absence of an accepted theory of information retrieval has benn blamed for the relative disorder and the lack of technical advances in the area. The main mathematical approaches to information retrieval are examined in this study, including both algebraic and probabilistic models, and the difficulties which impede the formalization of information retrieval processes are described. A number of developments are covered where new theoretical understandings have directly led to the improvemenet of retrieval techniques and operations
    Source
    Journal of documentation. 35(1979) no.1, S.1-29
  7. Buckley, C.; Allan, J.; Salton, G.: Automatic routing and retrieval using Smart : TREC-2 (1995) 0.04
    0.03634963 = product of:
      0.09693235 = sum of:
        0.050096344 = weight(_text_:retrieval in 5699) [ClassicSimilarity], result of:
          0.050096344 = score(doc=5699,freq=8.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.40105087 = fieldWeight in 5699, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.046875 = fieldNorm(doc=5699)
        0.025667597 = weight(_text_:use in 5699) [ClassicSimilarity], result of:
          0.025667597 = score(doc=5699,freq=2.0), product of:
            0.12644777 = queryWeight, product of:
              3.0620887 = idf(docFreq=5623, maxDocs=44218)
              0.041294612 = queryNorm
            0.20298971 = fieldWeight in 5699, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0620887 = idf(docFreq=5623, maxDocs=44218)
              0.046875 = fieldNorm(doc=5699)
        0.021168415 = weight(_text_:of in 5699) [ClassicSimilarity], result of:
          0.021168415 = score(doc=5699,freq=20.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.32781258 = fieldWeight in 5699, product of:
              4.472136 = tf(freq=20.0), with freq of:
                20.0 = termFreq=20.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.046875 = fieldNorm(doc=5699)
      0.375 = coord(3/8)
    
    Abstract
    The Smart information retrieval project emphazises completely automatic approaches to the understanding and retrieval of large quantities of text. The work in the TREC-2 environment continues, performing both routing and ad hoc experiments. The ad hoc work extends investigations into combining global similarities, giving an overall indication of how a document matches a query, with local similarities identifying a smaller part of the document that matches the query. The performance of ad hoc runs is good, but it is clear that full advantage of the available local information is not been taken advantage of. The routing experiments use conventional relevance feedback approaches to routing, but with a much greater degree of query expansion than was previously done. The length of a query vector is increased by a factor of 5 to 10 by adding terms found in previously seen relevant documents. This approach improves effectiveness by 30-40% over the original query
    Theme
    Semantisches Umfeld in Indexierung u. Retrieval
  8. Salton, G.; Buckley, C.; Allan, J.: Automatic structuring of text files (1992) 0.03
    0.031155093 = product of:
      0.08308025 = sum of:
        0.033397563 = weight(_text_:retrieval in 6507) [ClassicSimilarity], result of:
          0.033397563 = score(doc=6507,freq=2.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.26736724 = fieldWeight in 6507, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=6507)
        0.03422346 = weight(_text_:use in 6507) [ClassicSimilarity], result of:
          0.03422346 = score(doc=6507,freq=2.0), product of:
            0.12644777 = queryWeight, product of:
              3.0620887 = idf(docFreq=5623, maxDocs=44218)
              0.041294612 = queryNorm
            0.27065295 = fieldWeight in 6507, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0620887 = idf(docFreq=5623, maxDocs=44218)
              0.0625 = fieldNorm(doc=6507)
        0.0154592255 = weight(_text_:of in 6507) [ClassicSimilarity], result of:
          0.0154592255 = score(doc=6507,freq=6.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.23940048 = fieldWeight in 6507, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=6507)
      0.375 = coord(3/8)
    
    Abstract
    In many practical information retrieval situations, it is necessary to process heterogeneous text databases that vary greatly in scope and coverage and deal with many different subjects. In such an environment it is important to provide flexible access to individual text pieces and to structure the collection so that related text elements are identified and properly linked. Describes methods for the automatic structuring of heterogeneous text collections and the construction of browsing tools and access procedures that facilitate collection use. Illustrates these emthods with searches using a large automated encyclopedia
  9. Lesk, M.E.; Salton, G.: Relevance assements and retrieval system evaluation (1969) 0.03
    0.028150002 = product of:
      0.07506667 = sum of:
        0.050615493 = weight(_text_:retrieval in 4151) [ClassicSimilarity], result of:
          0.050615493 = score(doc=4151,freq=6.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.40520695 = fieldWeight in 4151, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4151)
        0.013526822 = weight(_text_:of in 4151) [ClassicSimilarity], result of:
          0.013526822 = score(doc=4151,freq=6.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.20947541 = fieldWeight in 4151, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4151)
        0.010924355 = product of:
          0.02184871 = sum of:
            0.02184871 = weight(_text_:on in 4151) [ClassicSimilarity], result of:
              0.02184871 = score(doc=4151,freq=4.0), product of:
                0.090823986 = queryWeight, product of:
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.041294612 = queryNorm
                0.24056101 = fieldWeight in 4151, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=4151)
          0.5 = coord(1/2)
      0.375 = coord(3/8)
    
    Abstract
    Two widerly used criteria for evaluating the effectiveness of information retrieval systems are, respectively, the recall and the precision. Since the determiniation of these measures is dependent on a distinction between documents which are relevant to a given query and documents which are not relevant to that query, it has sometimes been claimed that an accurate, generally valid evaluation cannot be based on recall and precision measure. A study was made to determine the effect of variations in relevance assesments do not produce significant variations in average recall and precision. It thus appears that properly computed recall and precision data may represent effectiveness indicators which are gemerally valid for many distinct user classes.
    Source
    Information storage and retrieval. 4(1969), S.343-359
  10. Salton, G.: Another look at automatic text-retrieval systems (1986) 0.03
    0.027281757 = product of:
      0.10912703 = sum of:
        0.093349025 = weight(_text_:retrieval in 1356) [ClassicSimilarity], result of:
          0.093349025 = score(doc=1356,freq=10.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.74731416 = fieldWeight in 1356, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.078125 = fieldNorm(doc=1356)
        0.015778005 = weight(_text_:of in 1356) [ClassicSimilarity], result of:
          0.015778005 = score(doc=1356,freq=4.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.24433708 = fieldWeight in 1356, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.078125 = fieldNorm(doc=1356)
      0.25 = coord(2/8)
    
    Footnote
    Bezugnahme auf: Blair, D.C.: An evaluation of retrieval effectiveness for a full-text document-retrieval system. Comm. ACM 28(1985) S.280-299. - Vgl. auch: Blair, D.C.: Full text retrieval ... Int. Class. 13(1986) S.18-23; Blair, D.C., M.E. Maron: full-text information retrieval ... Inf. Proc. Man. 26(1990) S.437-447.
    Source
    Communications of the Association for Computing Machinery. 29(1986), S.648-656
  11. Salton, G.: Automatic processing of foreign language documents (1985) 0.03
    0.026421588 = product of:
      0.07045757 = sum of:
        0.04418082 = weight(_text_:retrieval in 3650) [ClassicSimilarity], result of:
          0.04418082 = score(doc=3650,freq=14.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.3536936 = fieldWeight in 3650, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.03125 = fieldNorm(doc=3650)
        0.021862645 = weight(_text_:of in 3650) [ClassicSimilarity], result of:
          0.021862645 = score(doc=3650,freq=48.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.33856338 = fieldWeight in 3650, product of:
              6.928203 = tf(freq=48.0), with freq of:
                48.0 = termFreq=48.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03125 = fieldNorm(doc=3650)
        0.004414106 = product of:
          0.008828212 = sum of:
            0.008828212 = weight(_text_:on in 3650) [ClassicSimilarity], result of:
              0.008828212 = score(doc=3650,freq=2.0), product of:
                0.090823986 = queryWeight, product of:
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.041294612 = queryNorm
                0.097201325 = fieldWeight in 3650, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3650)
          0.5 = coord(1/2)
      0.375 = coord(3/8)
    
    Abstract
    The attempt to computerize a process, such as indexing, abstracting, classifying, or retrieving information, begins with an analysis of the process into its intellectual and nonintellectual components. That part of the process which is amenable to computerization is mechanical or algorithmic. What is not is intellectual or creative and requires human intervention. Gerard Salton has been an innovator, experimenter, and promoter in the area of mechanized information systems since the early 1960s. He has been particularly ingenious at analyzing the process of information retrieval into its algorithmic components. He received a doctorate in applied mathematics from Harvard University before moving to the computer science department at Cornell, where he developed a prototype automatic retrieval system called SMART. Working with this system he and his students contributed for over a decade to our theoretical understanding of the retrieval process. On a more practical level, they have contributed design criteria for operating retrieval systems. The following selection presents one of the early descriptions of the SMART system; it is valuable as it shows the direction automatic retrieval methods were to take beyond simple word-matching techniques. These include various word normalization techniques to improve recall, for instance, the separation of words into stems and affixes; the correlation and clustering, using statistical association measures, of related terms; and the identification, using a concept thesaurus, of synonymous, broader, narrower, and sibling terms. They include, as weIl, techniques, both linguistic and statistical, to deal with the thorny problem of how to automatically extract from texts index terms that consist of more than one word. They include weighting techniques and various documentrequest matching algorithms. Significant among the latter are those which produce a retrieval output of citations ranked in relevante order. During the 1970s, Salton and his students went an to further refine these various techniques, particularly the weighting and statistical association measures. Many of their early innovations seem commonplace today. Some of their later techniques are still ahead of their time and await technological developments for implementation. The particular focus of the selection that follows is an the evaluation of a particular component of the SMART system, a multilingual thesaurus. By mapping English language expressions and their German equivalents to a common concept number, the thesaurus permitted the automatic processing of German language documents against English language queries and vice versa. The results of the evaluation, as it turned out, were somewhat inconclusive. However, this SMART experiment suggested in a bold and optimistic way how one might proceed to answer such complex questions as What is meant by retrieval language compatability? How it is to be achieved, and how evaluated?
    Footnote
    Original in: Journal of the American Society for Information Science 21(1970) no.3, S.187-194.
    Source
    Theory of subject analysis: a sourcebook. Ed.: L.M. Chan, et al
  12. Salton, G.: SMART System: 1961-1976 (2009) 0.03
    0.025755715 = product of:
      0.0686819 = sum of:
        0.047231287 = weight(_text_:retrieval in 3879) [ClassicSimilarity], result of:
          0.047231287 = score(doc=3879,freq=4.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.37811437 = fieldWeight in 3879, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=3879)
        0.012622404 = weight(_text_:of in 3879) [ClassicSimilarity], result of:
          0.012622404 = score(doc=3879,freq=4.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.19546966 = fieldWeight in 3879, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=3879)
        0.008828212 = product of:
          0.017656423 = sum of:
            0.017656423 = weight(_text_:on in 3879) [ClassicSimilarity], result of:
              0.017656423 = score(doc=3879,freq=2.0), product of:
                0.090823986 = queryWeight, product of:
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.041294612 = queryNorm
                0.19440265 = fieldWeight in 3879, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.199415 = idf(docFreq=13325, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3879)
          0.5 = coord(1/2)
      0.375 = coord(3/8)
    
    Abstract
    While a number of researchers had experimented during the 1950's on automatic indexing and retrieval in various forms, it was Gerard Salton who brought the information retrieval experimental paradigm to full fruition, with his "SMART" system. His work has been enormously influential.
    Source
    Encyclopedia of library and information sciences. 3rd ed. Ed.: M.J. Bates
  13. Salton, G.; Fox, E.: Extended Boolean information retrieval (1983) 0.02
    0.021161474 = product of:
      0.0846459 = sum of:
        0.066795126 = weight(_text_:retrieval in 1137) [ClassicSimilarity], result of:
          0.066795126 = score(doc=1137,freq=2.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.5347345 = fieldWeight in 1137, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.125 = fieldNorm(doc=1137)
        0.017850775 = weight(_text_:of in 1137) [ClassicSimilarity], result of:
          0.017850775 = score(doc=1137,freq=2.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.27643585 = fieldWeight in 1137, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.125 = fieldNorm(doc=1137)
      0.25 = coord(2/8)
    
    Source
    Communications of the Association for Computing Machinery. 26(1983) no.11, S.1022-1036
  14. Salton, G.: Historical note: the past thirty years in information retrieval (1987) 0.02
    0.021161474 = product of:
      0.0846459 = sum of:
        0.066795126 = weight(_text_:retrieval in 3910) [ClassicSimilarity], result of:
          0.066795126 = score(doc=3910,freq=2.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.5347345 = fieldWeight in 3910, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.125 = fieldNorm(doc=3910)
        0.017850775 = weight(_text_:of in 3910) [ClassicSimilarity], result of:
          0.017850775 = score(doc=3910,freq=2.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.27643585 = fieldWeight in 3910, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.125 = fieldNorm(doc=3910)
      0.25 = coord(2/8)
    
    Source
    Journal of the American Society for Information Science. 38(1987) no.5, S.375-380
  15. Salton, G.; Fox, E.A.; Voorhees, E.: Advanced feedback methods in information retrieval (1985) 0.02
    0.021161474 = product of:
      0.0846459 = sum of:
        0.066795126 = weight(_text_:retrieval in 5445) [ClassicSimilarity], result of:
          0.066795126 = score(doc=5445,freq=2.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.5347345 = fieldWeight in 5445, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.125 = fieldNorm(doc=5445)
        0.017850775 = weight(_text_:of in 5445) [ClassicSimilarity], result of:
          0.017850775 = score(doc=5445,freq=2.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.27643585 = fieldWeight in 5445, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.125 = fieldNorm(doc=5445)
      0.25 = coord(2/8)
    
    Source
    Journal of the American Society for Information Science. 36(1985), S.200-210
  16. Yu, C.T.; Salton, G.: Effective information retrieval using term accuracy (1971) 0.02
    0.021161474 = product of:
      0.0846459 = sum of:
        0.066795126 = weight(_text_:retrieval in 5485) [ClassicSimilarity], result of:
          0.066795126 = score(doc=5485,freq=2.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.5347345 = fieldWeight in 5485, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.125 = fieldNorm(doc=5485)
        0.017850775 = weight(_text_:of in 5485) [ClassicSimilarity], result of:
          0.017850775 = score(doc=5485,freq=2.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.27643585 = fieldWeight in 5485, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.125 = fieldNorm(doc=5485)
      0.25 = coord(2/8)
    
    Source
    Communications of the Association for Computing Machinery. 20(1971), S.135-142
  17. Buckley, C.; Singhal, A.; Mitra, M.; Salton, G.: New retrieval approaches using SMART : TREC 4 (1996) 0.02
    0.021058753 = product of:
      0.08423501 = sum of:
        0.07084693 = weight(_text_:retrieval in 7528) [ClassicSimilarity], result of:
          0.07084693 = score(doc=7528,freq=4.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.5671716 = fieldWeight in 7528, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.09375 = fieldNorm(doc=7528)
        0.013388081 = weight(_text_:of in 7528) [ClassicSimilarity], result of:
          0.013388081 = score(doc=7528,freq=2.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.20732689 = fieldWeight in 7528, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.09375 = fieldNorm(doc=7528)
      0.25 = coord(2/8)
    
    Imprint
    Gaithersburgh, MD : National Institute of Standards and Technology
    Source
    The Fourth Text Retrieval Conference (TREC-4). Ed.: K. Harman
  18. Salton, G.: ¬The state of retrieval system evaluation (1992) 0.02
    0.019451013 = product of:
      0.07780405 = sum of:
        0.057846278 = weight(_text_:retrieval in 5250) [ClassicSimilarity], result of:
          0.057846278 = score(doc=5250,freq=6.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.46309367 = fieldWeight in 5250, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=5250)
        0.019957775 = weight(_text_:of in 5250) [ClassicSimilarity], result of:
          0.019957775 = score(doc=5250,freq=10.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.3090647 = fieldWeight in 5250, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=5250)
      0.25 = coord(2/8)
    
    Abstract
    Substatioal misgivings have been voiced over the years about the methodologies used to evaluate IR procedures and about the credibility of many of the available test results. In this note, an attempt is made to review the state of retrieval evaluation and to separate certain misgivings about the design of retrieval tests from conclusions that can legitimately be drawn from the evaluation results
  19. Salton, G.; Buckley, C.: Improving retrieval performance by relevance feedback (1990) 0.02
    0.01761717 = product of:
      0.07046868 = sum of:
        0.057846278 = weight(_text_:retrieval in 5442) [ClassicSimilarity], result of:
          0.057846278 = score(doc=5442,freq=6.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.46309367 = fieldWeight in 5442, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=5442)
        0.012622404 = weight(_text_:of in 5442) [ClassicSimilarity], result of:
          0.012622404 = score(doc=5442,freq=4.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.19546966 = fieldWeight in 5442, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=5442)
      0.25 = coord(2/8)
    
    Abstract
    Relevance feedback is an automatic process, introduced over 20 years ago, designed to produce improved query formulations following an initial retrieval operation. The principal relevance feedback methods described over the years are examined briefly, and evaluation data are included to demonstrate the effectiveness of the various methods. Prescriptions are given for conducting text retrieval operations iteratively using relevance feedback
    Source
    Journal of the American Society for Information Science. 41(1990) no.4, S.288-297
  20. Salton, G.; Lesk, M.E.: Computer evaluation of indexing and text processing (1968) 0.02
    0.017257487 = product of:
      0.06902995 = sum of:
        0.050096344 = weight(_text_:retrieval in 77) [ClassicSimilarity], result of:
          0.050096344 = score(doc=77,freq=2.0), product of:
            0.124912694 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.041294612 = queryNorm
            0.40105087 = fieldWeight in 77, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.09375 = fieldNorm(doc=77)
        0.018933605 = weight(_text_:of in 77) [ClassicSimilarity], result of:
          0.018933605 = score(doc=77,freq=4.0), product of:
            0.06457475 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041294612 = queryNorm
            0.2932045 = fieldWeight in 77, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.09375 = fieldNorm(doc=77)
      0.25 = coord(2/8)
    
    Footnote
    Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.60-84.
    Source
    Journal of the Association for Computing Machinery. 15(1968), S.8-36