Search (36 results, page 1 of 2)

  • author_ss:"Croft, W.B."
  1. Allan, J.; Callan, J.P.; Croft, W.B.; Ballesteros, L.; Broglio, J.; Xu, J.; Shu, H.: INQUERY at TREC-5 (1997) 0.10
    0.09687523 = product of:
      0.12109403 = sum of:
        0.0045134346 = weight(_text_:a in 3103) [ClassicSimilarity], result of:
          0.0045134346 = score(doc=3103,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.12739488 = fieldWeight in 3103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=3103)
        0.036398914 = weight(_text_:u in 3103) [ClassicSimilarity], result of:
          0.036398914 = score(doc=3103,freq=2.0), product of:
            0.10061107 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03072615 = queryNorm
            0.3617784 = fieldWeight in 3103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.078125 = fieldNorm(doc=3103)
        0.05936684 = weight(_text_:j in 3103) [ClassicSimilarity], result of:
          0.05936684 = score(doc=3103,freq=6.0), product of:
            0.09763223 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.03072615 = queryNorm
            0.608066 = fieldWeight in 3103, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.078125 = fieldNorm(doc=3103)
        0.020814845 = product of:
          0.04162969 = sum of:
            0.04162969 = weight(_text_:22 in 3103) [ClassicSimilarity], result of:
              0.04162969 = score(doc=3103,freq=2.0), product of:
                0.10759774 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03072615 = queryNorm
                0.38690117 = fieldWeight in 3103, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3103)
          0.5 = coord(1/2)
      0.8 = coord(4/5)
    
    Date
    27. 2.1999 20:55:22
    Source
    The Fifth Text Retrieval Conference (TREC-5). Ed.: E.M. Voorhees and D.K. Harman
    Type
    a
  2. Callan, J.; Croft, W.B.; Broglio, J.: TREC and TIPSTER experiments with INQUERY (1995) 0.08
    0.0754092 = product of:
      0.09426149 = sum of:
        0.0048763216 = product of:
          0.043886892 = sum of:
            0.043886892 = weight(_text_:p in 1944) [ClassicSimilarity], result of:
              0.043886892 = score(doc=1944,freq=2.0), product of:
                0.11047626 = queryWeight, product of:
                  3.5955126 = idf(docFreq=3298, maxDocs=44218)
                  0.03072615 = queryNorm
                0.39725178 = fieldWeight in 1944, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5955126 = idf(docFreq=3298, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1944)
          0.11111111 = coord(1/9)
        0.0045134346 = weight(_text_:a in 1944) [ClassicSimilarity], result of:
          0.0045134346 = score(doc=1944,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.12739488 = fieldWeight in 1944, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=1944)
        0.036398914 = weight(_text_:u in 1944) [ClassicSimilarity], result of:
          0.036398914 = score(doc=1944,freq=2.0), product of:
            0.10061107 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03072615 = queryNorm
            0.3617784 = fieldWeight in 1944, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.078125 = fieldNorm(doc=1944)
        0.04847282 = weight(_text_:j in 1944) [ClassicSimilarity], result of:
          0.04847282 = score(doc=1944,freq=4.0), product of:
            0.09763223 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.03072615 = queryNorm
            0.4964838 = fieldWeight in 1944, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.078125 = fieldNorm(doc=1944)
      0.8 = coord(4/5)
    
    Footnote
    Reprinted in: Readings in information retrieval. Ed.: K. Sparck Jones and P. Willett. San Francisco: Morgan Kaufmann 1997. pp. 436-439.
    Type
    a
  3. Allan, J.; Croft, W.B.; Callan, J.: ¬The University of Massachusetts and a dozen TRECs (2005) 0.07
    0.065703385 = product of:
      0.10950564 = sum of:
        0.0076595526 = weight(_text_:a in 5086) [ClassicSimilarity], result of:
          0.0076595526 = score(doc=5086,freq=4.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.2161963 = fieldWeight in 5086, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=5086)
        0.043678693 = weight(_text_:u in 5086) [ClassicSimilarity], result of:
          0.043678693 = score(doc=5086,freq=2.0), product of:
            0.10061107 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03072615 = queryNorm
            0.43413407 = fieldWeight in 5086, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.09375 = fieldNorm(doc=5086)
        0.05816739 = weight(_text_:j in 5086) [ClassicSimilarity], result of:
          0.05816739 = score(doc=5086,freq=4.0), product of:
            0.09763223 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.03072615 = queryNorm
            0.5957806 = fieldWeight in 5086, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.09375 = fieldNorm(doc=5086)
      0.6 = coord(3/5)
    
    Source
    TREC: experiment and evaluation in information retrieval. Ed.: E.M. Voorhees and D.K. Harman
    Type
    a
  4. Croft, W.B.: Advances in information retrieval : Recent research from the Center for Intelligent Information Retrieval (2000) 0.05
    0.05229746 = product of:
      0.087162435 = sum of:
        0.002708061 = weight(_text_:a in 6860) [ClassicSimilarity], result of:
          0.002708061 = score(doc=6860,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.07643694 = fieldWeight in 6860, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=6860)
        0.048834264 = weight(_text_:u in 6860) [ClassicSimilarity], result of:
          0.048834264 = score(doc=6860,freq=10.0), product of:
            0.10061107 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03072615 = queryNorm
            0.48537666 = fieldWeight in 6860, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.046875 = fieldNorm(doc=6860)
        0.035620105 = weight(_text_:j in 6860) [ClassicSimilarity], result of:
          0.035620105 = score(doc=6860,freq=6.0), product of:
            0.09763223 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.03072615 = queryNorm
            0.3648396 = fieldWeight in 6860, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.046875 = fieldNorm(doc=6860)
      0.6 = coord(3/5)
    
    Content
    Contains the contributions: CROFT, W.B.: Combining approaches to information retrieval; GREIFF, W.R.: The use of exploratory data analysis in information retrieval research; PONTE, J.M.: Language models for relevance feedback; PAPKA, R. and J. ALLAN: Topic detection and tracking: event clustering as a basis for first story detection; CALLAN, J.: Distributed information retrieval; XU, J. and W.B. CROFT: Topic-based language models for distributed retrieval; LU, Z. and K.S. McKINLEY: The effect of collection organization and query locality on information retrieval system performance; BALLESTEROS, L.A.: Cross-language retrieval via transitive translation; SANDERSON, M. and D. LAWRIE: Building, testing, and applying concept hierarchies; RAVELA, S. and C. LUO: Appearance-based global similarity retrieval of images
  5. Croft, W.B.; Lucia, T.J.; Cringean, J.: Retrieving documents by plausible inference : an experimental study (1989) 0.04
    0.04191861 = product of:
      0.06986435 = sum of:
        0.007802114 = product of:
          0.070219025 = sum of:
            0.070219025 = weight(_text_:p in 3915) [ClassicSimilarity], result of:
              0.070219025 = score(doc=3915,freq=2.0), product of:
                0.11047626 = queryWeight, product of:
                  3.5955126 = idf(docFreq=3298, maxDocs=44218)
                  0.03072615 = queryNorm
                0.63560283 = fieldWeight in 3915, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5955126 = idf(docFreq=3298, maxDocs=44218)
                  0.125 = fieldNorm(doc=3915)
          0.11111111 = coord(1/9)
        0.0072214957 = weight(_text_:a in 3915) [ClassicSimilarity], result of:
          0.0072214957 = score(doc=3915,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.20383182 = fieldWeight in 3915, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.125 = fieldNorm(doc=3915)
        0.054840736 = weight(_text_:j in 3915) [ClassicSimilarity], result of:
          0.054840736 = score(doc=3915,freq=2.0), product of:
            0.09763223 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.03072615 = queryNorm
            0.5617073 = fieldWeight in 3915, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.125 = fieldNorm(doc=3915)
      0.6 = coord(3/5)
    
    Editor
    Willett, P.
    Type
    a
  6. Turtle, H.; Croft, W.B.: Inference networks for document retrieval (1990) 0.03
    0.027473202 = product of:
      0.045788668 = sum of:
        0.0048763216 = product of:
          0.043886892 = sum of:
            0.043886892 = weight(_text_:p in 1936) [ClassicSimilarity], result of:
              0.043886892 = score(doc=1936,freq=2.0), product of:
                0.11047626 = queryWeight, product of:
                  3.5955126 = idf(docFreq=3298, maxDocs=44218)
                  0.03072615 = queryNorm
                0.39725178 = fieldWeight in 1936, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5955126 = idf(docFreq=3298, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1936)
          0.11111111 = coord(1/9)
        0.0045134346 = weight(_text_:a in 1936) [ClassicSimilarity], result of:
          0.0045134346 = score(doc=1936,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.12739488 = fieldWeight in 1936, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=1936)
        0.036398914 = weight(_text_:u in 1936) [ClassicSimilarity], result of:
          0.036398914 = score(doc=1936,freq=2.0), product of:
            0.10061107 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.03072615 = queryNorm
            0.3617784 = fieldWeight in 1936, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.078125 = fieldNorm(doc=1936)
      0.6 = coord(3/5)
    
    Footnote
    Reprinted in: Readings in information retrieval. Ed.: K. Sparck Jones and P. Willett. San Francisco: Morgan Kaufmann 1997. pp. 287-298.
    Type
    a
  7. Allan, J.; Ballesteros, L.; Callan, J.P.; Croft, W.B.; Lu, Z.: Recent experiment with INQUERY (1996) 0.02
    0.018618671 = product of:
      0.046546675 = sum of:
        0.005416122 = weight(_text_:a in 7568) [ClassicSimilarity], result of:
          0.005416122 = score(doc=7568,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.15287387 = fieldWeight in 7568, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=7568)
        0.041130554 = weight(_text_:j in 7568) [ClassicSimilarity], result of:
          0.041130554 = score(doc=7568,freq=2.0), product of:
            0.09763223 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.03072615 = queryNorm
            0.4212805 = fieldWeight in 7568, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.09375 = fieldNorm(doc=7568)
      0.4 = coord(2/5)
    
    Type
    a
  8. Belkin, N.J.; Croft, W.B.: Retrieval techniques (1987) 0.02
    0.0162101 = product of:
      0.04052525 = sum of:
        0.0072214957 = weight(_text_:a in 334) [ClassicSimilarity], result of:
          0.0072214957 = score(doc=334,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.20383182 = fieldWeight in 334, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.125 = fieldNorm(doc=334)
        0.033303753 = product of:
          0.066607505 = sum of:
            0.066607505 = weight(_text_:22 in 334) [ClassicSimilarity], result of:
              0.066607505 = score(doc=334,freq=2.0), product of:
                0.10759774 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03072615 = queryNorm
                0.61904186 = fieldWeight in 334, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=334)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Source
    Annual review of information science and technology. 22(1987), pp. 109-145
    Type
    a
  9. Xu, J.; Croft, W.B.: Topic-based language models for distributed retrieval (2000) 0.01
    0.011475784 = product of:
      0.028689459 = sum of:
        0.008124183 = weight(_text_:a in 38) [ClassicSimilarity], result of:
          0.008124183 = score(doc=38,freq=18.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.22931081 = fieldWeight in 38, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=38)
        0.020565277 = weight(_text_:j in 38) [ClassicSimilarity], result of:
          0.020565277 = score(doc=38,freq=2.0), product of:
            0.09763223 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.03072615 = queryNorm
            0.21064025 = fieldWeight in 38, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.046875 = fieldNorm(doc=38)
      0.4 = coord(2/5)
    
    Abstract
    Effective retrieval in a distributed environment is an important but difficult problem. Lack of effectiveness appears to have two major causes. First, existing collection selection algorithms do not work well on heterogeneous collections. Second, relevant documents are scattered over many collections and searching a few collections misses many relevant documents. We propose a topic-oriented approach to distributed retrieval. With this approach, we structure the document set of a distributed retrieval environment around a set of topics. Retrieval for a query involves first selecting the right topics for the query and then dispatching the search process to collections that contain such topics. The content of a topic is characterized by a language model. In environments where the labeling of documents by topics is unavailable, document clustering is employed for topic identification. Based on these ideas, three methods are proposed to suit different environments. We show that all three methods improve effectiveness of distributed retrieval.
    Type
    a
  10. Luk, R.W.P.; Leong, H.V.; Dillon, T.S.; Chan, A.T.S.; Croft, W.B.; Allen, J.: ¬A survey in indexing and searching XML documents (2002) 0.01
    0.010648275 = product of:
      0.026620686 = sum of:
        0.0060554086 = weight(_text_:a in 460) [ClassicSimilarity], result of:
          0.0060554086 = score(doc=460,freq=10.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.1709182 = fieldWeight in 460, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=460)
        0.020565277 = weight(_text_:j in 460) [ClassicSimilarity], result of:
          0.020565277 = score(doc=460,freq=2.0), product of:
            0.09763223 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.03072615 = queryNorm
            0.21064025 = fieldWeight in 460, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.046875 = fieldNorm(doc=460)
      0.4 = coord(2/5)
    
    Abstract
    XML holds the promise to yield (1) a more precise search by providing additional information in the elements, (2) a better integrated search of documents from heterogeneous sources, (3) a powerful search paradigm using structural as well as content specifications, and (4) data and information exchange to share resources and to support cooperative search. We survey several indexing techniques for XML documents, grouping them into flatfile, semistructured, and structured indexing paradigms. Searching techniques and supporting techniques for searching are reviewed, including full text search and multistage search. Because searching XML documents can be very flexible, various search result presentations are discussed, as well as database and information retrieval system integration and XML query languages. We also survey various retrieval models, examining how they would be used or extended for retrieving XML documents. To conclude the article, we discuss various open issues that XML poses with respect to information retrieval and database research.
    Type
    a
  11. Kim, Y.; Seo, J.; Croft, W.B.; Smith, D.A.: Automatic suggestion of phrasal-concept queries for literature search (2014) 0.01
    0.008660466 = product of:
      0.021651166 = sum of:
        0.0045134346 = weight(_text_:a in 2692) [ClassicSimilarity], result of:
          0.0045134346 = score(doc=2692,freq=8.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.12739488 = fieldWeight in 2692, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2692)
        0.01713773 = weight(_text_:j in 2692) [ClassicSimilarity], result of:
          0.01713773 = score(doc=2692,freq=2.0), product of:
            0.09763223 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.03072615 = queryNorm
            0.17553353 = fieldWeight in 2692, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2692)
      0.4 = coord(2/5)
    
    Abstract
    Both general and domain-specific search engines have adopted query suggestion techniques to help users formulate effective queries. In the specific domain of literature search (e.g., finding academic papers), the initial queries are usually based on a draft paper or abstract, rather than short lists of keywords. In this paper, we investigate phrasal-concept query suggestions for literature search. These suggestions explicitly specify important phrasal concepts related to an initial detailed query. The merits of phrasal-concept query suggestions for this domain are their readability and retrieval effectiveness: (1) phrasal concepts are natural for academic authors because of their frequent use of terminology and subject-specific phrases and (2) academic papers describe their key ideas via these subject-specific phrases, and thus phrasal concepts can be used effectively to find those papers. We propose a novel phrasal-concept query suggestion technique that generates queries by identifying key phrasal-concepts from pseudo-labeled documents and combines them with related phrases. Our proposed technique is evaluated in terms of both user preference and retrieval effectiveness. We conduct user experiments to verify a preference for our approach, in comparison to baseline query suggestion methods, and demonstrate the effectiveness of the technique with retrieval experiments.
    Type
    a
  12. Croft, W.B.; Thompson, R.H.: I3R: a new approach to the design of document retrieval systems (1987) 0.00
    0.0017872291 = product of:
      0.008936145 = sum of:
        0.008936145 = weight(_text_:a in 3898) [ClassicSimilarity], result of:
          0.008936145 = score(doc=3898,freq=4.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.25222903 = fieldWeight in 3898, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.109375 = fieldNorm(doc=3898)
      0.2 = coord(1/5)
    
    Type
    a
  13. Croft, W.B.; Harper, D.J.: Using probabilistic models of document retrieval without relevance information (1979) 0.00
    0.0016147757 = product of:
      0.0080738785 = sum of:
        0.0080738785 = weight(_text_:a in 4520) [ClassicSimilarity], result of:
          0.0080738785 = score(doc=4520,freq=10.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.22789092 = fieldWeight in 4520, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=4520)
      0.2 = coord(1/5)
    
    Abstract
    Based on a probabilistic model, proposes strategies for the initial search and an intermediate search. Retrieval experiences with the Cranfield collection of 1,400 documents show that this initial search strategy is better than conventional search strategies both in terms of retrieval effectiveness and in terms of the number of queries that retrieve relevant documents. The intermediate search is a useful substitute for a relevance feedback search. A cluster search would be an effective alternative strategy.
    Type
    a
  14. Murdock, V.; Kelly, D.; Croft, W.B.; Belkin, N.J.; Yuan, X.: Identifying and improving retrieval for procedural questions (2007) 0.00
    0.0015319105 = product of:
      0.0076595526 = sum of:
        0.0076595526 = weight(_text_:a in 902) [ClassicSimilarity], result of:
          0.0076595526 = score(doc=902,freq=16.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.2161963 = fieldWeight in 902, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=902)
      0.2 = coord(1/5)
    
    Abstract
    People use questions to elicit information from other people in their everyday lives, and yet the most common method of obtaining information from a search engine is by posing keywords. There has been research that suggests users are better at expressing their information needs in natural language; however, the vast majority of work to improve document retrieval has focused on queries posed as sets of keywords or Boolean queries. This paper focuses on improving document retrieval for the subset of natural language questions asking about how something is done. We classify questions as asking either for a description of a process or for a statement of fact, with better than 90% accuracy. Further, we identify non-content features of documents relevant to questions asking about a process. Finally, we demonstrate that we can use these features to significantly improve the precision of document retrieval results for questions asking about a process. Our approach, based on exploiting the structure of documents, shows a significant improvement in precision at rank one for questions asking about how something is done.
    Type
    a
  15. Croft, W.B.: Approaches to intelligent information retrieval (1987) 0.00
    0.0014442991 = product of:
      0.0072214957 = sum of:
        0.0072214957 = weight(_text_:a in 1094) [ClassicSimilarity], result of:
          0.0072214957 = score(doc=1094,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.20383182 = fieldWeight in 1094, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.125 = fieldNorm(doc=1094)
      0.2 = coord(1/5)
    
    Type
    a
  16. Croft, W.B.; Turtle, H.R.: Retrieval strategies for hypertext (1993) 0.00
    0.0014442991 = product of:
      0.0072214957 = sum of:
        0.0072214957 = weight(_text_:a in 4711) [ClassicSimilarity], result of:
          0.0072214957 = score(doc=4711,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.20383182 = fieldWeight in 4711, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.125 = fieldNorm(doc=4711)
      0.2 = coord(1/5)
    
    Type
    a
  17. Croft, W.B.: Clustering large files of documents using the single link method (1977) 0.00
    0.0014442991 = product of:
      0.0072214957 = sum of:
        0.0072214957 = weight(_text_:a in 5489) [ClassicSimilarity], result of:
          0.0072214957 = score(doc=5489,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.20383182 = fieldWeight in 5489, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.125 = fieldNorm(doc=5489)
      0.2 = coord(1/5)
    
    Type
    a
  18. Croft, W.B.: Knowledge-based and statistical approaches to text retrieval (1993) 0.00
    0.0014442991 = product of:
      0.0072214957 = sum of:
        0.0072214957 = weight(_text_:a in 7863) [ClassicSimilarity], result of:
          0.0072214957 = score(doc=7863,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.20383182 = fieldWeight in 7863, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.125 = fieldNorm(doc=7863)
      0.2 = coord(1/5)
    
    Type
    a
  19. Shneiderman, B.; Byrd, D.; Croft, W.B.: Clarifying search : a user-interface framework for text searches (1997) 0.00
    0.0014442991 = product of:
      0.0072214957 = sum of:
        0.0072214957 = weight(_text_:a in 1471) [ClassicSimilarity], result of:
          0.0072214957 = score(doc=1471,freq=2.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.20383182 = fieldWeight in 1471, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.125 = fieldNorm(doc=1471)
      0.2 = coord(1/5)
    
  20. Shneiderman, B.; Byrd, D.; Croft, W.B.: Clarifying search : a user-interface framework for text searches (1997) 0.00
    0.0014442991 = product of:
      0.0072214957 = sum of:
        0.0072214957 = weight(_text_:a in 1258) [ClassicSimilarity], result of:
          0.0072214957 = score(doc=1258,freq=8.0), product of:
            0.035428695 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03072615 = queryNorm
            0.20383182 = fieldWeight in 1258, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=1258)
      0.2 = coord(1/5)
    
    Abstract
    Current user interfaces for textual database searching leave much to be desired: individually, they are often confusing, and as a group, they are seriously inconsistent. We propose a four-phase framework for user-interface design: the framework provides common structure and terminology for searching while preserving the distinct features of individual collections and search mechanisms. Users will benefit from faster learning, increased comprehension, and better control, leading to more effective searches and higher satisfaction.
    Type
    a
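
The score breakdowns listed under each result follow the tf-idf arithmetic of Lucene's ClassicSimilarity. The sketch below recomputes two of the displayed scores (results 12 and 7) from the numbers printed in their explain trees. It assumes the usual ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and copies queryNorm and fieldNorm from the output rather than deriving them. The function names are illustrative and are not part of any library. Nested sub-queries with their own coord factor (such as the `22` and `p` clauses) are not handled.

```python
import math

MAX_DOCS = 44218          # maxDocs, as printed in the explain output
QUERY_NORM = 0.03072615   # queryNorm, copied from the explain output (not derived here)


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    """Score of one term clause: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm                    # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


def document_score(clauses, total_clauses):
    """Sum the matching clauses, then scale by coord(matched/total)."""
    matched_sum = sum(term_score(*clause) for clause in clauses)
    coord = len(clauses) / total_clauses
    return matched_sum * coord


# Each clause is (freq, docFreq, fieldNorm), read off the explain trees.
# Result 12 (doc 3898): only the term "a" matches, coord(1/5).
result_12 = document_score([(4.0, 37942, 0.109375)], total_clauses=5)

# Result 7 (doc 7568): the terms "a" and "j" match, coord(2/5).
result_7 = document_score(
    [(2.0, 37942, 0.09375), (2.0, 5010, 0.09375)],
    total_clauses=5,
)

print(result_12, result_7)
```

Lucene performs these calculations in 32-bit floats, so the last digits of a double-precision recomputation can differ slightly from the listed values (0.0017872291 and 0.018618671).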