Search (14 results, page 1 of 1)

  • × author_ss:"Harman, D."
  1. Harman, D.: ¬The Text REtrieval Conferences (TRECs) : providing a test-bed for information retrieval systems (1998) 0.03
    0.03405343 = product of:
      0.10216028 = sum of:
        0.05872617 = weight(_text_:applications in 1314) [ClassicSimilarity], result of:
          0.05872617 = score(doc=1314,freq=2.0), product of:
            0.17247584 = queryWeight, product of:
              4.4025097 = idf(docFreq=1471, maxDocs=44218)
              0.03917671 = queryNorm
            0.34048924 = fieldWeight in 1314, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4025097 = idf(docFreq=1471, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1314)
        0.014818345 = weight(_text_:of in 1314) [ClassicSimilarity], result of:
          0.014818345 = score(doc=1314,freq=8.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.24188137 = fieldWeight in 1314, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1314)
        0.028615767 = weight(_text_:systems in 1314) [ClassicSimilarity], result of:
          0.028615767 = score(doc=1314,freq=2.0), product of:
            0.12039685 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.03917671 = queryNorm
            0.23767869 = fieldWeight in 1314, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1314)
      0.33333334 = coord(3/9)
    
    Abstract
    The Text REtrieval Conference (TREC) workshop series encourages research in information retrieval from large text applications by providing a large test collection, uniform scoring procedures and a forum for organizations interested in comparing their results. Now in its seventh year, the conference has become the major experimental effort in the field. Participants in the TREC conferences have examined a wide variety of retrieval techniques, including methods using automatic thesauri, sophisticated term weighting, natural language techniques, relevance feedback and advanced pattern matching. The TREC conference series is co-sponsored by the National Institute of Standards and Technology (NIST) and the Information Technology Office of the Defense Advanced Research Projects Agency (DARPA)
    Source
    Bulletin of the American Society for Information Science. 24(1998), April/May, S.11-13
  2. Harman, D.: User-friendly systems instead of user-friendly front-ends (1992) 0.02
    0.020857532 = product of:
      0.09385889 = sum of:
        0.018148692 = weight(_text_:of in 3323) [ClassicSimilarity], result of:
          0.018148692 = score(doc=3323,freq=12.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.29624295 = fieldWeight in 3323, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3323)
        0.0757102 = weight(_text_:systems in 3323) [ClassicSimilarity], result of:
          0.0757102 = score(doc=3323,freq=14.0), product of:
            0.12039685 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.03917671 = queryNorm
            0.6288387 = fieldWeight in 3323, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3323)
      0.22222222 = coord(2/9)
    
    Abstract
    Most commercial on-line information retrieval systems are not designed to service end users and, therefore, have often built front-ends to their systems specifically to serve the end user market. These front ends have not been well accepted, mostly because the underlying systems are still difficult for end users to use successfully in searching. New techniques, based on statistical methods, that allow natural language input and return lists of records in order of likely relevance, have long been available from research laboratories. Presents 4 prototype implementations of these statistical retrieval systems that demonstrate their potential as powerful and easily used retrieval systems able to service all users. The systems consist of: the PRISE system; the CITE system; the Muscat system; and the News Retrieval Tool
    Source
    Journal of the American Society for Information Science. 43(1992), S.164-174
  3. Harman, D.; McCoy, W.; Toense, R.: Prototyping a distributed information retrieval system that uses statistical ranking (1991) 0.02
    0.017071081 = product of:
      0.07681987 = sum of:
        0.0128330635 = weight(_text_:of in 3844) [ClassicSimilarity], result of:
          0.0128330635 = score(doc=3844,freq=6.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.20947541 = fieldWeight in 3844, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3844)
        0.0639868 = weight(_text_:systems in 3844) [ClassicSimilarity], result of:
          0.0639868 = score(doc=3844,freq=10.0), product of:
            0.12039685 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.03917671 = queryNorm
            0.5314657 = fieldWeight in 3844, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3844)
      0.22222222 = coord(2/9)
    
    Abstract
    Centralised systems continue to dominate the information retrieval market, with increased competition from CD-ROM based systems. As more large organisations begin to implement office automation systems, however, many will find that neither of these types of retrieval systems will satisfy their requirements, especially those requirements involving easy integration into other systems and heavy usage by casual end users. A prototype distributed information retrieval system was designed and built using a distributed architecture and using statistical ranking techniques to help provide better service for the end user. The distributed architecture was shown to be a feasible alternative to centralised or CD-ROM information retrieval, and user testing of the ranking methodology showed both widespread user enthusiasm for this retrieval technique and very fast response times
  4. Harman, D.: Ranking algorithms (1992) 0.01
    0.011876687 = product of:
      0.053445093 = sum of:
        0.020741362 = weight(_text_:of in 3511) [ClassicSimilarity], result of:
          0.020741362 = score(doc=3511,freq=12.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.33856338 = fieldWeight in 3511, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=3511)
        0.03270373 = weight(_text_:systems in 3511) [ClassicSimilarity], result of:
          0.03270373 = score(doc=3511,freq=2.0), product of:
            0.12039685 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.03917671 = queryNorm
            0.2716328 = fieldWeight in 3511, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0625 = fieldNorm(doc=3511)
      0.22222222 = coord(2/9)
    
    Abstract
    Presents both a summary of past research done in the development of ranking algorithms and detailed instructions on implementing a ranking type of retrieval system. This type of retrieval system takes as input a natural language query without Boolean syntax and produces a list of records that 'answer' the query, with the records ranked in order of likely relevance. Ranking retrieval systems are particularly appropriate for end-users
  5. Voorhees, E.M.; Harman, D.: Overview of the Sixth Text REtrieval Conference (TREC-6) (2000) 0.01
    0.011549704 = product of:
      0.051973667 = sum of:
        0.014818345 = weight(_text_:of in 6438) [ClassicSimilarity], result of:
          0.014818345 = score(doc=6438,freq=2.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.24188137 = fieldWeight in 6438, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.109375 = fieldNorm(doc=6438)
        0.037155323 = product of:
          0.074310645 = sum of:
            0.074310645 = weight(_text_:22 in 6438) [ClassicSimilarity], result of:
              0.074310645 = score(doc=6438,freq=2.0), product of:
                0.13719016 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03917671 = queryNorm
                0.5416616 = fieldWeight in 6438, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6438)
          0.5 = coord(1/2)
      0.22222222 = coord(2/9)
    
    Date
    11. 8.2001 16:22:19
  6. Harman, D.: Overview of the Second Text Retrieval Conference : TREC-2 (1995) 0.01
    0.010526687 = product of:
      0.04737009 = sum of:
        0.014666359 = weight(_text_:of in 1915) [ClassicSimilarity], result of:
          0.014666359 = score(doc=1915,freq=6.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.23940048 = fieldWeight in 1915, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=1915)
        0.03270373 = weight(_text_:systems in 1915) [ClassicSimilarity], result of:
          0.03270373 = score(doc=1915,freq=2.0), product of:
            0.12039685 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.03917671 = queryNorm
            0.2716328 = fieldWeight in 1915, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0625 = fieldNorm(doc=1915)
      0.22222222 = coord(2/9)
    
    Abstract
    The conference was attended by about 150 people involved in 31 participating groups. Its goal was to bring research groups together to discuss their work on a new large test collection. There was a large variation of retrieval techniques reported on, including methods using automatic thesauri, sophisticated term weighting, natural language techniques, relevance feedback, and advanced pattern matching. As results had been run through a common evaluation package, groups were able to compare the effectiveness of different techniques, and discuss how differences between the systems affected performance
  7. Harman, D.: Relevance feedback and other query modification techniques (1992) 0.01
    0.010526687 = product of:
      0.04737009 = sum of:
        0.014666359 = weight(_text_:of in 3508) [ClassicSimilarity], result of:
          0.014666359 = score(doc=3508,freq=6.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.23940048 = fieldWeight in 3508, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=3508)
        0.03270373 = weight(_text_:systems in 3508) [ClassicSimilarity], result of:
          0.03270373 = score(doc=3508,freq=2.0), product of:
            0.12039685 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.03917671 = queryNorm
            0.2716328 = fieldWeight in 3508, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0625 = fieldNorm(doc=3508)
      0.22222222 = coord(2/9)
    
    Abstract
    Presents a survey of relevance feedback techniques that have been used in past research, recommends various query modification approaches for use in different retrieval systems, and gives some guidelines for the efficient design of the relevance feedback component of a retrieval system
  8. Harman, D.: Overview of the first Text Retrieval Conference (1993) 0.01
    0.010040707 = product of:
      0.04518318 = sum of:
        0.016567415 = weight(_text_:of in 548) [ClassicSimilarity], result of:
          0.016567415 = score(doc=548,freq=10.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.2704316 = fieldWeight in 548, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0546875 = fieldNorm(doc=548)
        0.028615767 = weight(_text_:systems in 548) [ClassicSimilarity], result of:
          0.028615767 = score(doc=548,freq=2.0), product of:
            0.12039685 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.03917671 = queryNorm
            0.23767869 = fieldWeight in 548, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0546875 = fieldNorm(doc=548)
      0.22222222 = coord(2/9)
    
    Abstract
    The first Text Retrieval Conference (TREC-1) was held in early November and was attended by about 100 people working in the 25 participating groups. The goal of the conference was to bring research gropus together to discuss their work on a new large test collection. There was a large variety of retrieval techniques reported on, including methods using automatic thesauri, sophisticated term weighting, natural language techniques, relevance feedback, and advanced pattern matching. As results had been run through a common evaluation package, groups were able to compare the effectiveness of different techniques, and discuss how differences among the systems affected performance
    Source
    Proceedings of the 14th National Online Meeting 1993, New York, 4-6 May 1993. Ed.: M.E. Williams
  9. Over, P.; Dang, H.; Harman, D.: DUC in context (2007) 0.01
    0.009928614 = product of:
      0.044678763 = sum of:
        0.011975031 = weight(_text_:of in 934) [ClassicSimilarity], result of:
          0.011975031 = score(doc=934,freq=4.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.19546966 = fieldWeight in 934, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=934)
        0.03270373 = weight(_text_:systems in 934) [ClassicSimilarity], result of:
          0.03270373 = score(doc=934,freq=2.0), product of:
            0.12039685 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.03917671 = queryNorm
            0.2716328 = fieldWeight in 934, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0625 = fieldNorm(doc=934)
      0.22222222 = coord(2/9)
    
    Abstract
    Recent years have seen increased interest in text summarization with emphasis on evaluation of prototype systems. Many factors can affect the design of such evaluations, requiring choices among competing alternatives. This paper examines several major themes running through three evaluations: SUMMAC, NTCIR, and DUC, with a concentration on DUC. The themes are extrinsic and intrinsic evaluation, evaluation procedures and methods, generic versus focused summaries, single- and multi-document summaries, length and compression issues, extracts versus abstracts, and issues with genre.
  10. Harman, D.; Voorhees, E.: Social TREC : an overview (2006) 0.00
    0.0018816947 = product of:
      0.016935252 = sum of:
        0.016935252 = weight(_text_:of in 4458) [ClassicSimilarity], result of:
          0.016935252 = score(doc=4458,freq=2.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.27643585 = fieldWeight in 4458, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.125 = fieldNorm(doc=4458)
      0.11111111 = coord(1/9)
    
    Source
    Annual review of information science and technology. 40(2006), S.xxx-xxx
  11. Smeaton, A.F.; Harman, D.: ¬The TREC experiments and their impact on Europe (1997) 0.00
    0.0018816947 = product of:
      0.016935252 = sum of:
        0.016935252 = weight(_text_:of in 7702) [ClassicSimilarity], result of:
          0.016935252 = score(doc=7702,freq=8.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.27643585 = fieldWeight in 7702, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=7702)
      0.11111111 = coord(1/9)
    
    Abstract
    Reviews the overall results of the TREC experiments in information retrieval, which differed from other information retrieval research projects in that the document collections used in the research were massive, and the groups participating in the collaborative evaluation are among the main organizations in the field. Reviews the findings of TREC, the way in which it operates and the specialist 'tracks' it supports and concentrates on european involvement in TREC, examining the participants and the emergence of European TREC like exercises
    Source
    Journal of information science. 23(1997) no.2, S.169-174
  12. Harman, D.: How effective is suffixing? (1991) 0.00
    0.0018816947 = product of:
      0.016935252 = sum of:
        0.016935252 = weight(_text_:of in 2106) [ClassicSimilarity], result of:
          0.016935252 = score(doc=2106,freq=2.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.27643585 = fieldWeight in 2106, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.125 = fieldNorm(doc=2106)
      0.11111111 = coord(1/9)
    
    Source
    Journal of the American Society for Information Science. 42(1991) no.1, S.7-15
  13. Harman, D.: Automatic indexing (1994) 0.00
    0.0016295954 = product of:
      0.014666359 = sum of:
        0.014666359 = weight(_text_:of in 7729) [ClassicSimilarity], result of:
          0.014666359 = score(doc=7729,freq=6.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.23940048 = fieldWeight in 7729, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=7729)
      0.11111111 = coord(1/9)
    
    Content
    Enthält die Abschnitte: What constitutes a record; What constitutes a word and what 'words' to index; Use of stop lists; Use of suffixing or stemming; Advanced automatic indexing techniques (term weighting, query expansion, the use of multiple-word phrases for indexing)
  14. Harman, D.; Fox, E.; Baeza-Yates, R.; Lee, W.: Inverted files (1992) 0.00
    9.408473E-4 = product of:
      0.008467626 = sum of:
        0.008467626 = weight(_text_:of in 3497) [ClassicSimilarity], result of:
          0.008467626 = score(doc=3497,freq=2.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.13821793 = fieldWeight in 3497, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=3497)
      0.11111111 = coord(1/9)
    
    Abstract
    This chaper presents a survey of the various structures (techniques) that can be used in building inverted files, and gives the details for producing an inverted file using sorted arrays. The chapter ends with 2 modifications to this basic method that are affective for large data collections