Document (#30435)

Author
Arsenault, C.
Title
Word division in the transcription of Chinese script in the title fields of bibliographic Records
Source
Cataloging and classification quarterly. 32(2001) no.3, S.109-137
Year
2001
Abstract
Recently, the Library of Congress adopted the pinyin Romanization system for transcribing Chinese data in its bibliographic records. In its canonical form, pinyin aggregates Chinese "words" into single linguistic units, but pinyin entries could be constructed following either a monosyllabic or a polysyllabic pattern. Although the former is easier and less costly to implement, the latter method is potentially more beneficial for end-users, as it reduces ambiguity, and generates a much larger variety of indexable terms. The current study investigates if following the polysyllabic method improves retrieval efficiency and effectiveness in item-specific searching within online bibliographic databases. Analysis of the results revealed that aggregation of monosyllables does improve efficiency significantly (p < .05), especially during keyword searches, while effectiveness remains mainly unaffected.
Theme
Formalerschließung

Similar documents (author)

  1. Arsenault, C.: Testing the impact of syllable aggregation in romanized fields of Chinese language bibliographic records (2000) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:arsenault in 87) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 87, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=87)
    
  2. Arsenault, C.: Aggregation consistency and frequency of Chinese words and characters (2006) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:arsenault in 609) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 609, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=609)
    
  3. Jacobs, C.; Arsenault, C.: Words can't describe it : streamlining PRECIS just for laughs! (1994) 4.65
    4.649496 = sum of:
      4.649496 = weight(author_txt:arsenault in 2198) [ClassicSimilarity], result of:
        4.649496 = fieldWeight in 2198, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.5 = fieldNorm(doc=2198)
    
  4. Arsenault, C.; Leide, J.E.: Format integration and the design of cataloging and classification curricula (2002) 4.65
    4.649496 = sum of:
      4.649496 = weight(author_txt:arsenault in 5456) [ClassicSimilarity], result of:
        4.649496 = fieldWeight in 5456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.5 = fieldNorm(doc=5456)
    
  5. Arsenault, C.; Ménard, E.: Searching titles with initial articles in library catalogs : a case study and search behavior analysis (2007) 4.65
    4.649496 = sum of:
      4.649496 = weight(author_txt:arsenault in 2264) [ClassicSimilarity], result of:
        4.649496 = fieldWeight in 2264, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.5 = fieldNorm(doc=2264)
    

Similar documents (content)

  1. Arsenault, C.: Testing the impact of syllable aggregation in romanized fields of Chinese language bibliographic records (2000) 0.54
    0.5363188 = sum of:
      0.5363188 = product of:
        1.6759962 = sum of:
          0.07852265 = weight(abstract_txt:script in 87) [ClassicSimilarity], result of:
            0.07852265 = score(doc=87,freq=1.0), product of:
              0.15785 = queryWeight, product of:
                1.2120042 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.01636327 = queryNorm
              0.4974511 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.0625 = fieldNorm(doc=87)
          0.09925773 = weight(abstract_txt:aggregates in 87) [ClassicSimilarity], result of:
            0.09925773 = score(doc=87,freq=1.0), product of:
              0.18454014 = queryWeight, product of:
                1.3104705 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.01636327 = queryNorm
              0.5378653 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0625 = fieldNorm(doc=87)
          0.14508735 = weight(abstract_txt:romanization in 87) [ClassicSimilarity], result of:
            0.14508735 = score(doc=87,freq=2.0), product of:
              0.18865035 = queryWeight, product of:
                1.324984 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.01636327 = queryNorm
              0.7690807 = fieldWeight in 87, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0625 = fieldNorm(doc=87)
          0.04657309 = weight(abstract_txt:records in 87) [ClassicSimilarity], result of:
            0.04657309 = score(doc=87,freq=3.0), product of:
              0.09734364 = queryWeight, product of:
                1.3460171 = boost
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.01636327 = queryNorm
              0.47844002 = fieldWeight in 87, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.0625 = fieldNorm(doc=87)
          0.028400527 = weight(abstract_txt:method in 87) [ClassicSimilarity], result of:
            0.028400527 = score(doc=87,freq=1.0), product of:
              0.10095834 = queryWeight, product of:
                1.3707805 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.01636327 = queryNorm
              0.28130937 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=87)
          0.049991038 = weight(abstract_txt:bibliographic in 87) [ClassicSimilarity], result of:
            0.049991038 = score(doc=87,freq=2.0), product of:
              0.13372311 = queryWeight, product of:
                1.9321715 = boost
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.01636327 = queryNorm
              0.3738399 = fieldWeight in 87, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.0625 = fieldNorm(doc=87)
          0.3095635 = weight(abstract_txt:chinese in 87) [ClassicSimilarity], result of:
            0.3095635 = score(doc=87,freq=7.0), product of:
              0.29699937 = queryWeight, product of:
                2.8795207 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.01636327 = queryNorm
              1.0423036 = fieldWeight in 87, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0625 = fieldNorm(doc=87)
          0.91860026 = weight(abstract_txt:pinyin in 87) [ClassicSimilarity], result of:
            0.91860026 = score(doc=87,freq=7.0), product of:
              0.6133006 = queryWeight, product of:
                4.137892 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.01636327 = queryNorm
              1.4977977 = fieldWeight in 87, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=87)
        0.32 = coord(8/25)
    
  2. Groom, L.: Converting Wade-Giles cataloging to Pinyin : the development and implementation of a conversion program for the Australian National CJK Service (1997) 0.31
    0.30960146 = sum of:
      0.30960146 = product of:
        1.5480072 = sum of:
          0.07294192 = weight(abstract_txt:division in 597) [ClassicSimilarity], result of:
            0.07294192 = score(doc=597,freq=1.0), product of:
              0.11468464 = queryWeight, product of:
                1.0330812 = boost
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.01636327 = queryNorm
              0.63602173 = fieldWeight in 597, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.09375 = fieldNorm(doc=597)
          0.21763103 = weight(abstract_txt:romanization in 597) [ClassicSimilarity], result of:
            0.21763103 = score(doc=597,freq=2.0), product of:
              0.18865035 = queryWeight, product of:
                1.324984 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.01636327 = queryNorm
              1.1536211 = fieldWeight in 597, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.09375 = fieldNorm(doc=597)
          0.04033348 = weight(abstract_txt:records in 597) [ClassicSimilarity], result of:
            0.04033348 = score(doc=597,freq=1.0), product of:
              0.09734364 = queryWeight, product of:
                1.3460171 = boost
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.01636327 = queryNorm
              0.4143412 = fieldWeight in 597, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.09375 = fieldNorm(doc=597)
          0.17550601 = weight(abstract_txt:chinese in 597) [ClassicSimilarity], result of:
            0.17550601 = score(doc=597,freq=1.0), product of:
              0.29699937 = queryWeight, product of:
                2.8795207 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.01636327 = queryNorm
              0.5909306 = fieldWeight in 597, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.09375 = fieldNorm(doc=597)
          1.0415949 = weight(abstract_txt:pinyin in 597) [ClassicSimilarity], result of:
            1.0415949 = score(doc=597,freq=4.0), product of:
              0.6133006 = queryWeight, product of:
                4.137892 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.01636327 = queryNorm
              1.698343 = fieldWeight in 597, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.09375 = fieldNorm(doc=597)
        0.2 = coord(5/25)
    
  3. LC to convert to Pinyin for romanization of Chinese (1997) 0.24
    0.23747341 = sum of:
      0.23747341 = product of:
        1.4842088 = sum of:
          0.25648063 = weight(abstract_txt:romanization in 1095) [ClassicSimilarity], result of:
            0.25648063 = score(doc=1095,freq=1.0), product of:
              0.18865035 = queryWeight, product of:
                1.324984 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.01636327 = queryNorm
              1.3595555 = fieldWeight in 1095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.15625 = fieldNorm(doc=1095)
          0.06722247 = weight(abstract_txt:records in 1095) [ClassicSimilarity], result of:
            0.06722247 = score(doc=1095,freq=1.0), product of:
              0.09734364 = queryWeight, product of:
                1.3460171 = boost
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.01636327 = queryNorm
              0.6905687 = fieldWeight in 1095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.15625 = fieldNorm(doc=1095)
          0.29251003 = weight(abstract_txt:chinese in 1095) [ClassicSimilarity], result of:
            0.29251003 = score(doc=1095,freq=1.0), product of:
              0.29699937 = queryWeight, product of:
                2.8795207 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.01636327 = queryNorm
              0.9848844 = fieldWeight in 1095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.15625 = fieldNorm(doc=1095)
          0.86799574 = weight(abstract_txt:pinyin in 1095) [ClassicSimilarity], result of:
            0.86799574 = score(doc=1095,freq=1.0), product of:
              0.6133006 = queryWeight, product of:
                4.137892 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.01636327 = queryNorm
              1.415286 = fieldWeight in 1095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.15625 = fieldNorm(doc=1095)
        0.16 = coord(4/25)
    
  4. Li, Y.: Consistency versus inconsistency : issues in Chinese cataloging in OCLC (2004) 0.20
    0.20146987 = sum of:
      0.20146987 = product of:
        1.2591867 = sum of:
          0.18135919 = weight(abstract_txt:romanization in 5657) [ClassicSimilarity], result of:
            0.18135919 = score(doc=5657,freq=2.0), product of:
              0.18865035 = queryWeight, product of:
                1.324984 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.01636327 = queryNorm
              0.96135086 = fieldWeight in 5657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.078125 = fieldNorm(doc=5657)
          0.033611234 = weight(abstract_txt:records in 5657) [ClassicSimilarity], result of:
            0.033611234 = score(doc=5657,freq=1.0), product of:
              0.09734364 = queryWeight, product of:
                1.3460171 = boost
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.01636327 = queryNorm
              0.34528434 = fieldWeight in 5657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.078125 = fieldNorm(doc=5657)
          0.29251003 = weight(abstract_txt:chinese in 5657) [ClassicSimilarity], result of:
            0.29251003 = score(doc=5657,freq=4.0), product of:
              0.29699937 = queryWeight, product of:
                2.8795207 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.01636327 = queryNorm
              0.9848844 = fieldWeight in 5657, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.078125 = fieldNorm(doc=5657)
          0.75170636 = weight(abstract_txt:pinyin in 5657) [ClassicSimilarity], result of:
            0.75170636 = score(doc=5657,freq=3.0), product of:
              0.6133006 = queryWeight, product of:
                4.137892 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.01636327 = queryNorm
              1.2256736 = fieldWeight in 5657, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.078125 = fieldNorm(doc=5657)
        0.16 = coord(4/25)
    
  5. Studwell, W.E.; Wang, R.; Wu, H.: ¬A tale of two decades : the controversy over the choice of a Chinese language romanization system in American cataloging practice (1993) 0.20
    0.19703072 = sum of:
      0.19703072 = product of:
        1.6419227 = sum of:
          0.20518449 = weight(abstract_txt:romanization in 7954) [ClassicSimilarity], result of:
            0.20518449 = score(doc=7954,freq=1.0), product of:
              0.18865035 = queryWeight, product of:
                1.324984 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.01636327 = queryNorm
              1.0876443 = fieldWeight in 7954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.125 = fieldNorm(doc=7954)
          0.23400803 = weight(abstract_txt:chinese in 7954) [ClassicSimilarity], result of:
            0.23400803 = score(doc=7954,freq=1.0), product of:
              0.29699937 = queryWeight, product of:
                2.8795207 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.01636327 = queryNorm
              0.7879075 = fieldWeight in 7954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.125 = fieldNorm(doc=7954)
          1.2027302 = weight(abstract_txt:pinyin in 7954) [ClassicSimilarity], result of:
            1.2027302 = score(doc=7954,freq=3.0), product of:
              0.6133006 = queryWeight, product of:
                4.137892 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.01636327 = queryNorm
              1.9610777 = fieldWeight in 7954, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.125 = fieldNorm(doc=7954)
        0.12 = coord(3/25)