Search (6 results, page 1 of 1)

  • author_ss:"Grossman, D."
  1. Lundquist, C.; Frieder, O.; Holmes, D.O.; Grossman, D.: ¬A parallel relational database management system approach to relevance feedback in information retrieval (1999) 0.05
    0.050087467 = product of:
      0.1502624 = sum of:
        0.12083158 = weight(_text_:o in 4303) [ClassicSimilarity], result of:
          0.12083158 = score(doc=4303,freq=2.0), product of:
            0.1816457 = queryWeight, product of:
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.03620396 = queryNorm
            0.6652047 = fieldWeight in 4303, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.09375 = fieldNorm(doc=4303)
        0.029430822 = product of:
          0.058861643 = sum of:
            0.058861643 = weight(_text_:22 in 4303) [ClassicSimilarity], result of:
              0.058861643 = score(doc=4303,freq=2.0), product of:
                0.12678011 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03620396 = queryNorm
                0.46428138 = fieldWeight in 4303, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4303)
          0.5 = coord(1/2)
      0.33333334 = coord(2/6)
    
    Date
    17. 1.2000 12:22:18
  2. Aljlayl, M.; Frieder, O.; Grossman, D.: On bidirectional English-Arabic search (2002) 0.03
    0.026683219 = product of:
      0.08004966 = sum of:
        0.05034649 = weight(_text_:o in 5227) [ClassicSimilarity], result of:
          0.05034649 = score(doc=5227,freq=2.0), product of:
            0.1816457 = queryWeight, product of:
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.03620396 = queryNorm
            0.27716863 = fieldWeight in 5227, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5227)
        0.029703168 = product of:
          0.059406336 = sum of:
            0.059406336 = weight(_text_:al in 5227) [ClassicSimilarity], result of:
              0.059406336 = score(doc=5227,freq=4.0), product of:
                0.16592026 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.03620396 = queryNorm
                0.3580415 = fieldWeight in 5227, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5227)
          0.5 = coord(1/2)
      0.33333334 = coord(2/6)
    
    Abstract
    Aljlayl, Frieder, and Grossman review methodologies for machine translation of queries and apply them to English-Arabic/Arabic-English Cross-Language Information Retrieval. In the dictionary method, replacing each term with all possible equivalents in the target language results in considerable ambiguity, while taking the first term in the dictionary list reduces the ambiguity but may fail to capture the meaning. A Two-Phase method takes all possible equivalents and translates them back, retaining only those that generate the original term. It results in an average query length of six terms in TREC7 and 12 in TREC9. Arabic to English translations consistently performed below the original English queries, and the Two-Phase method consistently performed at the highest level and significantly better than the Every-Match method. Machine translation using other techniques is economical for queries but not likely so for documents. Using ALKAFI, a commercial translation system from Arabic to English and the Al-Mutarjim Al-Arabey system for English to Arabic, nearly 60% of monolingual retrievals were generated going from Arabic to English. Smaller numbers of terms in the source query improve performance, and these systems require syntactically well-formed queries for good performance.
  3. Aqeel, S.U.; Beitzel, S.M.; Jensen, E.C.; Grossman, D.; Frieder, O.: On the development of name search techniques for Arabic (2006) 0.03
    0.025043733 = product of:
      0.0751312 = sum of:
        0.06041579 = weight(_text_:o in 5289) [ClassicSimilarity], result of:
          0.06041579 = score(doc=5289,freq=2.0), product of:
            0.1816457 = queryWeight, product of:
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.03620396 = queryNorm
            0.33260235 = fieldWeight in 5289, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.046875 = fieldNorm(doc=5289)
        0.014715411 = product of:
          0.029430822 = sum of:
            0.029430822 = weight(_text_:22 in 5289) [ClassicSimilarity], result of:
              0.029430822 = score(doc=5289,freq=2.0), product of:
                0.12678011 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03620396 = queryNorm
                0.23214069 = fieldWeight in 5289, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5289)
          0.5 = coord(1/2)
      0.33333334 = coord(2/6)
    
    Date
    22. 7.2006 17:20:20
  4. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.02
    0.016782165 = product of:
      0.10069298 = sum of:
        0.10069298 = weight(_text_:o in 448) [ClassicSimilarity], result of:
          0.10069298 = score(doc=448,freq=8.0), product of:
            0.1816457 = queryWeight, product of:
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.03620396 = queryNorm
            0.55433726 = fieldWeight in 448, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.0390625 = fieldNorm(doc=448)
      0.16666667 = coord(1/6)
    
    Abstract
    A distributed memory parallel version of the group average hierarchical agglomerative clustering algorithm is proposed to enable scaling the document clustering problem to large collections. Using standard message passing operations reduces interprocess communication while maintaining efficient load balancing. In a series of experiments using a subset of a standard Text REtrieval Conference (TREC) test collection, our parallel hierarchical clustering algorithm is shown to be scalable in terms of processors efficiently used and the collection size. Results show that our algorithm performs close to the expected O(n**2/p) time on p processors rather than the worst-case O(n**3/p) time. Furthermore, the O(n**2/p) memory complexity per node allows larger collections to be clustered as the number of nodes increases. While partitioning algorithms such as k-means are trivially parallelizable, our results confirm those of other studies which showed that hierarchical algorithms produce significantly tighter clusters in the document clustering task. Finally, we show how our parallel hierarchical agglomerative clustering algorithm can be used as the clustering subroutine for a parallel version of the buckshot algorithm to cluster the complete TREC collection at near theoretical runtime expectations.
  5. Beitzel, S.M.; Jensen, E.C.; Chowdhury, A.; Grossman, D.; Frieder, O.; Goharian, N.: Fusion of effective retrieval strategies in the same information retrieval system (2004) 0.01
    0.010069299 = product of:
      0.06041579 = sum of:
        0.06041579 = weight(_text_:o in 2502) [ClassicSimilarity], result of:
          0.06041579 = score(doc=2502,freq=2.0), product of:
            0.1816457 = queryWeight, product of:
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.03620396 = queryNorm
            0.33260235 = fieldWeight in 2502, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.046875 = fieldNorm(doc=2502)
      0.16666667 = coord(1/6)
    
  6. Beitzel, S.M.; Jensen, E.C.; Chowdhury, A.; Frieder, O.; Grossman, D.: Temporal analysis of a very large topically categorized Web query log (2007) 0.01
    0.008391082 = product of:
      0.05034649 = sum of:
        0.05034649 = weight(_text_:o in 60) [ClassicSimilarity], result of:
          0.05034649 = score(doc=60,freq=2.0), product of:
            0.1816457 = queryWeight, product of:
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.03620396 = queryNorm
            0.27716863 = fieldWeight in 60, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.017288 = idf(docFreq=795, maxDocs=44218)
              0.0390625 = fieldNorm(doc=60)
      0.16666667 = coord(1/6)
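
The score breakdowns above can be checked by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names `idf` and `term_score` are illustrative and are not part of Lucene. The queryNorm, fieldNorm, and coord values are copied from the listing rather than recomputed, and because Lucene works in 32-bit floats the results agree only to about six or seven significant digits.

```python
import math

# Values copied from the explanation trees above.
MAX_DOCS = 44218
QUERY_NORM = 0.03620396


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as ClassicSimilarity defines it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, field_norm: float) -> float:
    """Score of one term clause: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, MAX_DOCS)
    query_weight = term_idf * QUERY_NORM                    # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Result 6 (doc 60): a single matching clause, scaled by coord(1/6).
score_6 = term_score(freq=2.0, doc_freq=795, field_norm=0.0390625) * (1 / 6)

# Result 1 (doc 4303): the "o" clause plus a nested "22" clause with
# coord(1/2), and the sum scaled by the outer coord(2/6).
o_clause = term_score(freq=2.0, doc_freq=795, field_norm=0.09375)
clause_22 = term_score(freq=2.0, doc_freq=3622, field_norm=0.09375) * (1 / 2)
score_1 = (o_clause + clause_22) * (2 / 6)

print(f"result 6: {score_6:.9f}")
print(f"result 1: {score_1:.9f}")
```

Under these assumptions the two printed values should round to the 0.01 and 0.05 shown next to results 6 and 1. The same two functions cover the other hits by substituting their freq, docFreq, fieldNorm, and coord values.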