Search (4 results, page 1 of 1)

  • × author_ss:"Zhang, Y."
  • × year_i:[2000 TO 2010}
  1. Zhang, Y.; Xu, W.: Fast exact maximum likelihood estimation for mixture of language model (2008) 0.02
    0.021422965 = product of:
      0.04284593 = sum of:
        0.04284593 = product of:
          0.08569186 = sum of:
            0.08569186 = weight(_text_:o in 2082) [ClassicSimilarity], result of:
              0.08569186 = score(doc=2082,freq=2.0), product of:
                0.25764057 = queryWeight, product of:
                  5.017288 = idf(docFreq=795, maxDocs=44218)
                  0.05135056 = queryNorm
                0.33260235 = fieldWeight in 2082, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.017288 = idf(docFreq=795, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2082)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Language modeling is an effective and theoretically attractive probabilistic framework for text information retrieval. The basic idea of this approach is to estimate a language model of a given document (or document set), and then do retrieval or classification based on this model. A common language modeling approach assumes the data D is generated from a mixture of several language models. The core problem is to find the maximum likelihood estimation of one language model mixture, given the fixed mixture weights and the other language model mixture. The EM algorithm is usually used to find the solution. In this paper, we proof that an exact maximum likelihood estimation of the unknown mixture component exists and can be calculated using the new algorithm we proposed. We further improve the algorithm and provide an efficient algorithm of O(k) complexity to find the exact solution, where k is the number of words occurring at least once in data D. Furthermore, we proof the probabilities of many words are exactly zeros, and the MLE estimation is implemented as a feature selection technique explicitly.
  2. Zhang, Y.: Scholarly use of Internet-based electronic resources (2001) 0.01
    0.012506341 = product of:
      0.025012681 = sum of:
        0.025012681 = product of:
          0.100050725 = sum of:
            0.100050725 = weight(_text_:authors in 5212) [ClassicSimilarity], result of:
              0.100050725 = score(doc=5212,freq=4.0), product of:
                0.23409766 = queryWeight, product of:
                  4.558814 = idf(docFreq=1258, maxDocs=44218)
                  0.05135056 = queryNorm
                0.42738882 = fieldWeight in 5212, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.558814 = idf(docFreq=1258, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5212)
          0.25 = coord(1/4)
      0.5 = coord(1/2)
    
    Abstract
    By Internet resources Zhang means any electronic file accessible by any Internet protocol. Their usage is determined by an examination of the citations to such sources in a nine-year sample of four print and four electronic LIS journals, by a survey of editors of these journals, and by a survey of scholars with "in press" papers in these journals. Citations were gathered from Social Science Citation Index and manually classed as e-sources by the format used. All authors with "in press" papers were asked about their use and opinion of Internet sources and for any suggestions for improvement. Use of electronic sources is heavy and access is very high. Access and ability explain most usage while satisfaction was not significant. Citation of e-journals increases over the eight years. Authors report under citation of e-journals in favor of print equivalents. Traditional reasons are given for citing and not citing, but additional reasons are also present for e-journals.
  3. Zhang, Y.; Jansen, B.J.; Spink, A.: Identification of factors predicting clickthrough in Web searching using neural network analysis (2009) 0.01
    0.010435938 = product of:
      0.020871876 = sum of:
        0.020871876 = product of:
          0.04174375 = sum of:
            0.04174375 = weight(_text_:22 in 2742) [ClassicSimilarity], result of:
              0.04174375 = score(doc=2742,freq=2.0), product of:
                0.17982091 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05135056 = queryNorm
                0.23214069 = fieldWeight in 2742, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2742)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2009 17:49:11
  4. Zhang, Y.; Li, Y.: ¬A user-centered functional metadata evaluation of moving image collections (2008) 0.01
    0.0073694317 = product of:
      0.014738863 = sum of:
        0.014738863 = product of:
          0.058955453 = sum of:
            0.058955453 = weight(_text_:authors in 1884) [ClassicSimilarity], result of:
              0.058955453 = score(doc=1884,freq=2.0), product of:
                0.23409766 = queryWeight, product of:
                  4.558814 = idf(docFreq=1258, maxDocs=44218)
                  0.05135056 = queryNorm
                0.25184128 = fieldWeight in 1884, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.558814 = idf(docFreq=1258, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1884)
          0.25 = coord(1/4)
      0.5 = coord(1/2)
    
    Abstract
    In this article, the authors report a series of evaluations of two metadata schemes developed for Moving Image Collections (MIC), an integrated online catalog of moving images. Through two online surveys and one experiment spanning various stages of metadata implementation, the MIC evaluation team explored a user-centered approach in which the four generic user tasks suggested by IFLA FRBR (International Association of Library Associations Functional Requirement for Bibliographic Records) were embedded in data collection and analyses. Diverse groups of users rated usefulness of individual metadata fields for finding, identifying, selecting, and obtaining moving images. The results demonstrate a consistency across these evaluations with respect to (a) identification of a set of useful metadata fields highly rated by target users for each of the FRBR generic tasks, and (b) indication of a significant interaction between MIC metadata fields and the FRBR generic tasks. The findings provide timely feedback for the MIC implementation specifically, and valuable suggestions to other similar metadata application settings in general. They also suggest the feasibility of using the four IFLA FRBR generic tasks as a framework for user-centered functional metadata evaluations.