Search (4 results, page 1 of 1)

  • × author_ss:"Dorr, B.J."
  1. Zajic, D.; Dorr, B.J.; Lin, J.; Schwartz, R.: Multi-candidate reduction : sentence compression as a tool for document summarization tasks (2007) 0.09
    0.08835813 = product of:
      0.17671625 = sum of:
        0.17671625 = product of:
          0.3534325 = sum of:
            0.3534325 = weight(_text_:compression in 944) [ClassicSimilarity], result of:
              0.3534325 = score(doc=944,freq=6.0), product of:
                0.36069217 = queryWeight, product of:
                  7.314861 = idf(docFreq=79, maxDocs=44218)
                  0.049309507 = queryNorm
                0.97987294 = fieldWeight in 944, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  7.314861 = idf(docFreq=79, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=944)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This article examines the application of two single-document sentence compression techniques to the problem of multi-document summarization-a "parse-and-trim" approach and a statistical noisy-channel approach. We introduce the multi-candidate reduction (MCR) framework for multi-document summarization, in which many compressed candidates are generated for each source sentence. These candidates are then selected for inclusion in the final summary based on a combination of static and dynamic features. Evaluations demonstrate that sentence compression is a valuable component of a larger multi-document summarization framework.
  2. Zajic, D.M.; Dorr, B.J.; Lin, J.: Single-document and multi-document summarization techniques for email threads using sentence compression (2008) 0.09
    0.08835813 = product of:
      0.17671625 = sum of:
        0.17671625 = product of:
          0.3534325 = sum of:
            0.3534325 = weight(_text_:compression in 2105) [ClassicSimilarity], result of:
              0.3534325 = score(doc=2105,freq=6.0), product of:
                0.36069217 = queryWeight, product of:
                  7.314861 = idf(docFreq=79, maxDocs=44218)
                  0.049309507 = queryNorm
                0.97987294 = fieldWeight in 2105, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  7.314861 = idf(docFreq=79, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2105)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    We present two approaches to email thread summarization: collective message summarization (CMS) applies a multi-document summarization approach, while individual message summarization (IMS) treats the problem as a sequence of single-document summarization tasks. Both approaches are implemented in our general framework driven by sentence compression. Instead of a purely extractive approach, we employ linguistic and statistical methods to generate multiple compressions, and then select from those candidates to produce a final summary. We demonstrate these ideas on the Enron email collection - a very challenging corpus because of the highly technical language. Experimental results point to two findings: that CMS represents a better approach to email thread summarization, and that current sentence compression techniques do not improve summarization performance in this genre.
  3. Dorr, B.J.; Olsen, M.B.: Multilingual generation : the role of telicity in lexical choice and syntactic realization (1996) 0.02
    0.02338265 = product of:
      0.0467653 = sum of:
        0.0467653 = product of:
          0.0935306 = sum of:
            0.0935306 = weight(_text_:22 in 536) [ClassicSimilarity], result of:
              0.0935306 = score(doc=536,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.5416616 = fieldWeight in 536, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=536)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    31. 7.1996 9:22:19
  4. Dorr, B.J.: Large-scale dictionary construction for foreign language tutoring and interlingual machine translation (1997) 0.01
    0.010021136 = product of:
      0.020042272 = sum of:
        0.020042272 = product of:
          0.040084545 = sum of:
            0.040084545 = weight(_text_:22 in 3244) [ClassicSimilarity], result of:
              0.040084545 = score(doc=3244,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.23214069 = fieldWeight in 3244, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3244)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    31. 7.1996 9:22:19