Search (5 results, page 1 of 1)

  • author_ss:"Arsenault, C."
  1. Arsenault, C.; Ménard, E.: Searching titles with initial articles in library catalogs : a case study and search behavior analysis (2007) 0.08
    0.080885045 = product of:
      0.121327564 = sum of:
        0.10130662 = weight(_text_:title in 2264) [ClassicSimilarity], result of:
          0.10130662 = score(doc=2264,freq=2.0), product of:
            0.27436262 = queryWeight, product of:
              5.570018 = idf(docFreq=457, maxDocs=44218)
              0.049257044 = queryNorm
            0.3692435 = fieldWeight in 2264, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.570018 = idf(docFreq=457, maxDocs=44218)
              0.046875 = fieldNorm(doc=2264)
        0.020020949 = product of:
          0.040041897 = sum of:
            0.040041897 = weight(_text_:22 in 2264) [ClassicSimilarity], result of:
              0.040041897 = score(doc=2264,freq=2.0), product of:
                0.17248978 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049257044 = queryNorm
                0.23214069 = fieldWeight in 2264, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2264)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    This study examines problems caused by initial articles in library catalogs. The problematic records observed are those whose titles begin with a word erroneously considered to be an article at the retrieval stage. Many retrieval algorithms edit queries by removing initial words that match articles in an exclusion list, regardless of whether the initial word is actually an article. Consequently, a certain number of documents remain more difficult to find. The study also examines user behavior during known-item retrieval using the title index in library catalogs, concentrating on the problems caused by the presence of an initial article or of a word that is a homograph of an article. Measures of success and effectiveness are taken to determine whether retrieval is affected in such cases.
    Date
    10. 9.2000 17:38:22
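
The query-editing problem described in the abstract above can be sketched as follows. This is a minimal illustration, not the algorithm of any particular catalog: the exclusion list, the function name `strip_initial_article`, and the sample titles are all assumptions made up for the example.

```python
# Hypothetical exclusion list; real catalogs maintain their own,
# usually multilingual, lists of initial articles.
EXCLUSION_LIST = {"a", "an", "the", "le", "la", "les", "un", "une", "der", "die", "das"}


def strip_initial_article(title: str) -> str:
    """Drop the first word if it appears in the exclusion list.

    The check is purely lexical: it cannot tell a real article from a
    homograph, which is the failure mode the abstract describes.
    """
    words = title.split()
    if len(words) > 1 and words[0].lower() in EXCLUSION_LIST:
        return " ".join(words[1:])
    return title


# Intended behavior: a genuine English article is removed.
normal_case = strip_initial_article("The history of cataloguing")

# Failure cases: the first word only looks like an article.
letter_case = strip_initial_article("A to Z of indexing")  # "A" is a letter here
homograph_case = strip_initial_article("Die casting handbook")  # English noun, German article
```

With this list, `letter_case` becomes "to Z of indexing" and `homograph_case` becomes "casting handbook", so a title-index search built from the edited query no longer lines up with the title as the user knows it.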
  2. Arsenault, C.: Word division in the transcription of Chinese script in the title fields of bibliographic records (2001) 0.04
    0.039397016 = product of:
      0.11819105 = sum of:
        0.11819105 = weight(_text_:title in 5434) [ClassicSimilarity], result of:
          0.11819105 = score(doc=5434,freq=2.0), product of:
            0.27436262 = queryWeight, product of:
              5.570018 = idf(docFreq=457, maxDocs=44218)
              0.049257044 = queryNorm
            0.43078408 = fieldWeight in 5434, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.570018 = idf(docFreq=457, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5434)
      0.33333334 = coord(1/3)
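
The score breakdowns in this listing appear to follow the tf-idf arithmetic of Lucene's ClassicSimilarity. The sketch below reproduces the figures for docs 2264 and 5434 from the values printed in their trees. The function names are illustrative, `queryNorm`, `fieldNorm`, and the `coord` fractions are copied from the listing rather than derived, and results may differ in the last digits because Lucene works in 32-bit floats.

```python
import math

MAX_DOCS = 44218          # maxDocs as printed in the listing
QUERY_NORM = 0.049257044  # queryNorm as printed in the listing


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """idf = 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for one matching term."""
    idf = classic_idf(doc_freq, MAX_DOCS)
    query_weight = idf * QUERY_NORM                  # idf * queryNorm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Doc 2264: two of three top-level clauses match (coord 2/3); the "22"
# term sits in a nested clause with its own coord(1/2).
title_2264 = term_score(freq=2.0, doc_freq=457, field_norm=0.046875)
term22_2264 = term_score(freq=2.0, doc_freq=3622, field_norm=0.046875)
score_2264 = (title_2264 + term22_2264 * 0.5) * (2 / 3)

# Doc 5434: only the "title" clause matches (coord 1/3).
title_5434 = term_score(freq=2.0, doc_freq=457, field_norm=0.0546875)
score_5434 = title_5434 * (1 / 3)

print(f"{score_2264:.6f} {score_5434:.6f}")
```

Because `freq`, `docFreq`, and `queryNorm` are identical for the "title" term in every record where it matches, the differences among those records come from `fieldNorm` (a length normalization stored per document) and from the `coord` factor.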
    
  3. Arsenault, C.: Testing the impact of syllable aggregation in romanized fields of Chinese language bibliographic records (2000) 0.03
    0.028140724 = product of:
      0.08442217 = sum of:
        0.08442217 = weight(_text_:title in 87) [ClassicSimilarity], result of:
          0.08442217 = score(doc=87,freq=2.0), product of:
            0.27436262 = queryWeight, product of:
              5.570018 = idf(docFreq=457, maxDocs=44218)
              0.049257044 = queryNorm
            0.3077029 = fieldWeight in 87, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.570018 = idf(docFreq=457, maxDocs=44218)
              0.0390625 = fieldNorm(doc=87)
      0.33333334 = coord(1/3)
    
    Abstract
    Today, two Romanization systems for Chinese data are in use in most libraries in the Western world: 1) Wade-Giles, and 2) Hanyu pinyin (simply referred to as pinyin). In 1997, the Library of Congress finally officially announced the adoption of pinyin for Romanizing Chinese data in its bibliographic records. One of the main problems in implementing the pinyin standard for library use is that pinyin, as opposed to Wade-Giles, aggregates Chinese "words" into single linguistic units. Chinese characters represent monosyllabic morphemes rather than words and are equally spaced from one another, and the Chinese text, in its original form, does not provide visual cues as to where a word starts or ends. When the script is romanized, however, it is essential that syllables or words be separated from one another, since, in most information retrieval techniques, the identification of "visual words" is required. In this respect, the Romanized strings could be divided either into monosyllables or into polysyllabic words. This study aims to explore the impact of using either unaggregated pinyin (monosyllabic) or aggregated pinyin (polysyllabic) Romanization in Chinese-language bibliographic records. An experiment, using transaction log analysis, was carried out to observe variations in the retrieval performance of title searches (both phrase and keyword) in a large OPAC of Chinese language records. General results are presented and a summary of the pros and cons of using either method is given.
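
The dependence of keyword retrieval on "visual words" can be illustrated with a small sketch. It assumes a simple whitespace tokenizer and made-up romanized title strings (tone marks omitted); it is not the OPAC or the experimental setup used in the study.

```python
def keyword_index(title: str) -> set[str]:
    """Index a title as the set of its whitespace-delimited tokens."""
    return set(title.lower().split())


def keyword_match(query: str, title: str) -> bool:
    """A keyword search succeeds if every query token is an indexed token."""
    return keyword_index(query) <= keyword_index(title)


# The same hypothetical title romanized two ways.
aggregated = "Zhongguo tushuguan"        # polysyllabic words
unaggregated = "Zhong guo tu shu guan"   # one token per syllable

# A query typed in aggregated form only finds the aggregated record ...
hit_agg_on_agg = keyword_match("tushuguan", aggregated)
hit_agg_on_unagg = keyword_match("tushuguan", unaggregated)

# ... and a query typed syllable by syllable only finds the unaggregated one.
hit_unagg_on_unagg = keyword_match("tu shu guan", unaggregated)
hit_unagg_on_agg = keyword_match("tu shu guan", aggregated)
```

Under these assumptions a searcher has to segment the query the same way the cataloguer segmented the record, which is why the choice between the two forms can affect title-search performance.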
  4. Arsenault, C.: Aggregation consistency and frequency of Chinese words and characters (2006) 0.03
    0.028140724 = product of:
      0.08442217 = sum of:
        0.08442217 = weight(_text_:title in 609) [ClassicSimilarity], result of:
          0.08442217 = score(doc=609,freq=2.0), product of:
            0.27436262 = queryWeight, product of:
              5.570018 = idf(docFreq=457, maxDocs=44218)
              0.049257044 = queryNorm
            0.3077029 = fieldWeight in 609, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.570018 = idf(docFreq=457, maxDocs=44218)
              0.0390625 = fieldNorm(doc=609)
      0.33333334 = coord(1/3)
    
    Abstract
    Purpose - Aims to measure syllable aggregation consistency of Romanized Chinese data in the title fields of bibliographic records. Also aims to verify if the term frequency distributions satisfy conventional bibliometric laws. Design/methodology/approach - Uses Cooper's interindexer formula to evaluate aggregation consistency within and between two sets of Chinese bibliographic data. Compares the term frequency distributions of polysyllabic words and monosyllabic characters (for vernacular and Romanized data) with the Lotka and the generalised Zipf theoretical distributions. The fits are tested with the Kolmogorov-Smirnov test. Findings - Finds high internal aggregation consistency within each data set but some aggregation discrepancy between sets. Shows that word (polysyllabic) distributions satisfy Lotka's law but that character (monosyllabic) distributions do not abide by the law. Research limitations/implications - The findings are limited to only two sets of bibliographic data (for aggregation consistency analysis) and to one set of data for the frequency distribution analysis. Only two bibliometric distributions are tested. Internal consistency within each database remains fairly high. Therefore the main argument against syllable aggregation does not appear to hold true. The analysis revealed that Chinese words and characters behave differently in terms of frequency distribution but that there is no noticeable difference between vernacular and Romanized data. The distribution of Romanized characters exhibits the worst case in terms of fit to either Lotka's or Zipf's laws, which indicates that Romanized data in aggregated form appear to be a preferable option. Originality/value - Provides empirical data on consistency and distribution of Romanized Chinese titles in bibliographic records.
  5. Arsenault, C.; Noruzi, A.: Analysis of work-to-work bibliographic relationships through FRBR : a Canadian perspective (2012) 0.02
    0.020974122 = product of:
      0.062922366 = sum of:
        0.062922366 = product of:
          0.12584473 = sum of:
            0.12584473 = weight(_text_:catalogue in 1923) [ClassicSimilarity], result of:
              0.12584473 = score(doc=1923,freq=4.0), product of:
                0.23806341 = queryWeight, product of:
                  4.8330836 = idf(docFreq=956, maxDocs=44218)
                  0.049257044 = queryNorm
                0.5286185 = fieldWeight in 1923, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.8330836 = idf(docFreq=956, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1923)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    The purpose of this study is to investigate the characteristics of Canadian publications by analyzing their bibliographic relationships based on the Functional Requirements for Bibliographic Records (FRBR) model. The study indicates frequencies of occurrence of work-to-work bibliographic relationships for manifestations published in 2009 and catalogued in the AMICUS online catalogue. The results show that approximately 4.4 percent of the 2009 bibliographic records in the AMICUS catalogue exhibit a work-to-work bibliographic relationship.