Document (#30436)

Author
Arsenault, C.
Title
Word division in the transcription of Chinese script in the title fields of bibliographic Records
Source
Cataloging and classification quarterly. 32(2001) no.3, S.109-137
Year
2001
Abstract
Recently, the Library of Congress adopted the pinyin Romanization system for transcribing Chinese data in its bibliographic records. In its canonical form, pinyin aggregates Chinese "words" into single linguistic units, but pinyin entries could be constructed following either a monosyllabic or a polysyllabic pattern. Although the former is easier and less costly to implement, the latter method is potentially more beneficial for end-users, as it reduces ambiguity, and generates a much larger variety of indexable terms. The current study investigates if following the polysyllabic method improves retrieval efficiency and effectiveness in item-specific searching within online bibliographic databases. Analysis of the results revealed that aggregation of monosyllables does improve efficiency significantly (p < .05), especially during keyword searches, while effectiveness remains mainly unaffected.
Theme
Formalerschließung

Similar documents (author)

  1. Arsenault, C.: Testing the impact of syllable aggregation in romanized fields of Chinese language bibliographic records (2000) 5.78
    5.7842436 = sum of:
      5.7842436 = weight(author_txt:arsenault in 1088) [ClassicSimilarity], result of:
        5.7842436 = fieldWeight in 1088, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.254789 = idf(docFreq=10, maxDocs=42306)
          0.625 = fieldNorm(doc=1088)
    
  2. Arsenault, C.: Aggregation consistency and frequency of Chinese words and characters (2006) 5.78
    5.7842436 = sum of:
      5.7842436 = weight(author_txt:arsenault in 1735) [ClassicSimilarity], result of:
        5.7842436 = fieldWeight in 1735, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.254789 = idf(docFreq=10, maxDocs=42306)
          0.625 = fieldNorm(doc=1735)
    
  3. Jacobs, C.; Arsenault, C.: Words can't describe it : streamlining PRECIS just for laughs! (1994) 4.63
    4.6273947 = sum of:
      4.6273947 = weight(author_txt:arsenault in 2267) [ClassicSimilarity], result of:
        4.6273947 = fieldWeight in 2267, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.254789 = idf(docFreq=10, maxDocs=42306)
          0.5 = fieldNorm(doc=2267)
    
  4. Arsenault, C.; Leide, J.E.: Format integration and the design of cataloging and classification curricula (2002) 4.63
    4.6273947 = sum of:
      4.6273947 = weight(author_txt:arsenault in 457) [ClassicSimilarity], result of:
        4.6273947 = fieldWeight in 457, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.254789 = idf(docFreq=10, maxDocs=42306)
          0.5 = fieldNorm(doc=457)
    
  5. Arsenault, C.; Ménard, E.: Searching titles with initial articles in library catalogs : a case study and search behavior analysis (2007) 4.63
    4.6273947 = sum of:
      4.6273947 = weight(author_txt:arsenault in 84) [ClassicSimilarity], result of:
        4.6273947 = fieldWeight in 84, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.254789 = idf(docFreq=10, maxDocs=42306)
          0.5 = fieldNorm(doc=84)
    

Similar documents (content)

  1. Arsenault, C.: Testing the impact of syllable aggregation in romanized fields of Chinese language bibliographic records (2000) 0.53
    0.5344351 = sum of:
      0.5344351 = product of:
        1.6701097 = sum of:
          0.07946387 = weight(abstract_txt:script in 1088) [ClassicSimilarity], result of:
            0.07946387 = score(doc=1088,freq=1.0), product of:
              0.15914413 = queryWeight, product of:
                1.2134553 = boost
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.016416017 = queryNorm
              0.49932015 = fieldWeight in 1088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.0625 = fieldNorm(doc=1088)
          0.09940398 = weight(abstract_txt:aggregates in 1088) [ClassicSimilarity], result of:
            0.09940398 = score(doc=1088,freq=1.0), product of:
              0.18476228 = queryWeight, product of:
                1.3074802 = boost
                8.608162 = idf(docFreq=20, maxDocs=42306)
                0.016416017 = queryNorm
              0.5380101 = fieldWeight in 1088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.608162 = idf(docFreq=20, maxDocs=42306)
                0.0625 = fieldNorm(doc=1088)
          0.14553903 = weight(abstract_txt:romanization in 1088) [ClassicSimilarity], result of:
            0.14553903 = score(doc=1088,freq=2.0), product of:
              0.18908358 = queryWeight, product of:
                1.3226818 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.016416017 = queryNorm
              0.7697074 = fieldWeight in 1088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.0625 = fieldNorm(doc=1088)
          0.04661391 = weight(abstract_txt:records in 1088) [ClassicSimilarity], result of:
            0.04661391 = score(doc=1088,freq=3.0), product of:
              0.097422086 = queryWeight, product of:
                1.3426789 = boost
                4.419951 = idf(docFreq=1383, maxDocs=42306)
                0.016416017 = queryNorm
              0.47847372 = fieldWeight in 1088, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.419951 = idf(docFreq=1383, maxDocs=42306)
                0.0625 = fieldNorm(doc=1088)
          0.029062903 = weight(abstract_txt:method in 1088) [ClassicSimilarity], result of:
            0.029062903 = score(doc=1088,freq=1.0), product of:
              0.10254476 = queryWeight, product of:
                1.3775272 = boost
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.016416017 = queryNorm
              0.28341675 = fieldWeight in 1088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.0625 = fieldNorm(doc=1088)
          0.049799193 = weight(abstract_txt:bibliographic in 1088) [ClassicSimilarity], result of:
            0.049799193 = score(doc=1088,freq=2.0), product of:
              0.13341032 = queryWeight, product of:
                1.9243488 = boost
                4.223163 = idf(docFreq=1684, maxDocs=42306)
                0.016416017 = queryNorm
              0.3732784 = fieldWeight in 1088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.223163 = idf(docFreq=1684, maxDocs=42306)
                0.0625 = fieldNorm(doc=1088)
          0.31440756 = weight(abstract_txt:chinese in 1088) [ClassicSimilarity], result of:
            0.31440756 = score(doc=1088,freq=7.0), product of:
              0.30015612 = queryWeight, product of:
                2.886441 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.016416017 = queryNorm
              1.0474801 = fieldWeight in 1088, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.0625 = fieldNorm(doc=1088)
          0.9058193 = weight(abstract_txt:pinyin in 1088) [ClassicSimilarity], result of:
            0.9058193 = score(doc=1088,freq=7.0), product of:
              0.6077332 = queryWeight, product of:
                4.1071973 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.016416017 = queryNorm
              1.4904884 = fieldWeight in 1088, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.0625 = fieldNorm(doc=1088)
        0.32 = coord(8/25)
    
  2. Groom, L.: Converting Wade-Giles cataloging to Pinyin : the development and implementation of a conversion program for the Australian National CJK Service (1997) 0.31
    0.3074604 = sum of:
      0.3074604 = product of:
        1.5373019 = sum of:
          0.07326964 = weight(abstract_txt:division in 1598) [ClassicSimilarity], result of:
            0.07326964 = score(doc=1598,freq=1.0), product of:
              0.11505337 = queryWeight, product of:
                1.0317587 = boost
                6.792872 = idf(docFreq=128, maxDocs=42306)
                0.016416017 = queryNorm
              0.63683176 = fieldWeight in 1598, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.792872 = idf(docFreq=128, maxDocs=42306)
                0.09375 = fieldNorm(doc=1598)
          0.21830854 = weight(abstract_txt:romanization in 1598) [ClassicSimilarity], result of:
            0.21830854 = score(doc=1598,freq=2.0), product of:
              0.18908358 = queryWeight, product of:
                1.3226818 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.016416017 = queryNorm
              1.154561 = fieldWeight in 1598, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.09375 = fieldNorm(doc=1598)
          0.04036883 = weight(abstract_txt:records in 1598) [ClassicSimilarity], result of:
            0.04036883 = score(doc=1598,freq=1.0), product of:
              0.097422086 = queryWeight, product of:
                1.3426789 = boost
                4.419951 = idf(docFreq=1383, maxDocs=42306)
                0.016416017 = queryNorm
              0.41437042 = fieldWeight in 1598, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.419951 = idf(docFreq=1383, maxDocs=42306)
                0.09375 = fieldNorm(doc=1598)
          0.17825232 = weight(abstract_txt:chinese in 1598) [ClassicSimilarity], result of:
            0.17825232 = score(doc=1598,freq=1.0), product of:
              0.30015612 = queryWeight, product of:
                2.886441 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.016416017 = queryNorm
              0.5938654 = fieldWeight in 1598, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.09375 = fieldNorm(doc=1598)
          1.0271026 = weight(abstract_txt:pinyin in 1598) [ClassicSimilarity], result of:
            1.0271026 = score(doc=1598,freq=4.0), product of:
              0.6077332 = queryWeight, product of:
                4.1071973 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.016416017 = queryNorm
              1.6900551 = fieldWeight in 1598, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.09375 = fieldNorm(doc=1598)
        0.2 = coord(5/25)
    
  3. LC to convert to Pinyin for romanization of Chinese (1997) 0.24
    0.23641063 = sum of:
      0.23641063 = product of:
        1.4775665 = sum of:
          0.25727907 = weight(abstract_txt:romanization in 2096) [ClassicSimilarity], result of:
            0.25727907 = score(doc=2096,freq=1.0), product of:
              0.18908358 = queryWeight, product of:
                1.3226818 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.016416017 = queryNorm
              1.3606633 = fieldWeight in 2096, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.15625 = fieldNorm(doc=2096)
          0.06728138 = weight(abstract_txt:records in 2096) [ClassicSimilarity], result of:
            0.06728138 = score(doc=2096,freq=1.0), product of:
              0.097422086 = queryWeight, product of:
                1.3426789 = boost
                4.419951 = idf(docFreq=1383, maxDocs=42306)
                0.016416017 = queryNorm
              0.6906173 = fieldWeight in 2096, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.419951 = idf(docFreq=1383, maxDocs=42306)
                0.15625 = fieldNorm(doc=2096)
          0.29708722 = weight(abstract_txt:chinese in 2096) [ClassicSimilarity], result of:
            0.29708722 = score(doc=2096,freq=1.0), product of:
              0.30015612 = queryWeight, product of:
                2.886441 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.016416017 = queryNorm
              0.98977566 = fieldWeight in 2096, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.15625 = fieldNorm(doc=2096)
          0.85591877 = weight(abstract_txt:pinyin in 2096) [ClassicSimilarity], result of:
            0.85591877 = score(doc=2096,freq=1.0), product of:
              0.6077332 = queryWeight, product of:
                4.1071973 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.016416017 = queryNorm
              1.4083792 = fieldWeight in 2096, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.15625 = fieldNorm(doc=2096)
        0.16 = coord(4/25)
    
  4. Li, Y.: Consistency versus inconsistency : issues in Chinese cataloging in OCLC (2004) 0.20
    0.20062385 = sum of:
      0.20062385 = product of:
        1.2538991 = sum of:
          0.18192378 = weight(abstract_txt:romanization in 658) [ClassicSimilarity], result of:
            0.18192378 = score(doc=658,freq=2.0), product of:
              0.18908358 = queryWeight, product of:
                1.3226818 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.016416017 = queryNorm
              0.96213424 = fieldWeight in 658, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.078125 = fieldNorm(doc=658)
          0.03364069 = weight(abstract_txt:records in 658) [ClassicSimilarity], result of:
            0.03364069 = score(doc=658,freq=1.0), product of:
              0.097422086 = queryWeight, product of:
                1.3426789 = boost
                4.419951 = idf(docFreq=1383, maxDocs=42306)
                0.016416017 = queryNorm
              0.34530866 = fieldWeight in 658, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.419951 = idf(docFreq=1383, maxDocs=42306)
                0.078125 = fieldNorm(doc=658)
          0.29708722 = weight(abstract_txt:chinese in 658) [ClassicSimilarity], result of:
            0.29708722 = score(doc=658,freq=4.0), product of:
              0.30015612 = queryWeight, product of:
                2.886441 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.016416017 = queryNorm
              0.98977566 = fieldWeight in 658, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.078125 = fieldNorm(doc=658)
          0.74124736 = weight(abstract_txt:pinyin in 658) [ClassicSimilarity], result of:
            0.74124736 = score(doc=658,freq=3.0), product of:
              0.6077332 = queryWeight, product of:
                4.1071973 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.016416017 = queryNorm
              1.2196921 = fieldWeight in 658, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.078125 = fieldNorm(doc=658)
        0.16 = coord(4/25)
    
  5. Studwell, W.E.; Wang, R.; Wu, H.: ¬A tale of two decades : the controversy over the choice of a Chinese language romanization system in American cataloging practice (1993) 0.20
    0.19553867 = sum of:
      0.19553867 = product of:
        1.629489 = sum of:
          0.20582327 = weight(abstract_txt:romanization in 7954) [ClassicSimilarity], result of:
            0.20582327 = score(doc=7954,freq=1.0), product of:
              0.18908358 = queryWeight, product of:
                1.3226818 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.016416017 = queryNorm
              1.0885307 = fieldWeight in 7954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.125 = fieldNorm(doc=7954)
          0.23766978 = weight(abstract_txt:chinese in 7954) [ClassicSimilarity], result of:
            0.23766978 = score(doc=7954,freq=1.0), product of:
              0.30015612 = queryWeight, product of:
                2.886441 = boost
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.016416017 = queryNorm
              0.7918205 = fieldWeight in 7954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.334564 = idf(docFreq=203, maxDocs=42306)
                0.125 = fieldNorm(doc=7954)
          1.1859958 = weight(abstract_txt:pinyin in 7954) [ClassicSimilarity], result of:
            1.1859958 = score(doc=7954,freq=3.0), product of:
              0.6077332 = queryWeight, product of:
                4.1071973 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.016416017 = queryNorm
              1.9515074 = fieldWeight in 7954, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.125 = fieldNorm(doc=7954)
        0.12 = coord(3/25)