Zeng, L.: Quality control of Chinese-language records using a rule-based data validation system : Part 2: a study of a rule-based data validation system for online Chinese cataloging (1993)
0.04
0.03890858 = product of:
0.20751242 = sum of:
0.026531162 = weight(_text_:26 in 579) [ClassicSimilarity], result of:
0.026531162 = score(doc=579,freq=2.0), product of:
0.113328174 = queryWeight, product of:
3.5315237 = idf(docFreq=3516, maxDocs=44218)
0.032090448 = queryNorm
0.23410915 = fieldWeight in 579, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5315237 = idf(docFreq=3516, maxDocs=44218)
0.046875 = fieldNorm(doc=579)
0.060327087 = product of:
0.12065417 = sum of:
0.12065417 = weight(_text_:rules in 579) [ClassicSimilarity], result of:
0.12065417 = score(doc=579,freq=10.0), product of:
0.16161752 = queryWeight, product of:
5.036312 = idf(docFreq=780, maxDocs=44218)
0.032090448 = queryNorm
0.74654144 = fieldWeight in 579, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
5.036312 = idf(docFreq=780, maxDocs=44218)
0.046875 = fieldNorm(doc=579)
0.5 = coord(1/2)
0.12065417 = weight(_text_:rules in 579) [ClassicSimilarity], result of:
0.12065417 = score(doc=579,freq=10.0), product of:
0.16161752 = queryWeight, product of:
5.036312 = idf(docFreq=780, maxDocs=44218)
0.032090448 = queryNorm
0.74654144 = fieldWeight in 579, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
5.036312 = idf(docFreq=780, maxDocs=44218)
0.046875 = fieldNorm(doc=579)
0.1875 = coord(3/16)
- Abstract
- The problem addressed by this two-part study is to evaluate the quality of Chinese records in the OCLC database and to determine the potential of a set of production rules for a rule-based data validation system lo support quality control of the Chinese records. The second part of the study emphasizes establishing production rules for such a system. Based on the results of error analysis, a set of production rules were developed and tested, focusing on improving completeness, consistency, and correctness of a record. The rules covered 11 of the total 19 types of errors. At least 65% , of the errors occurring in the investigated sample records could be detected automatically by applying the production rules.
- Source
- Cataloging and classification quarterly. 18(1993) no.1, S.3-26
Zeng, L.: Quality control of Chinese-language records using a rule-based data validation system : Part 1: an evaluation of the quality of Chinese-language records in the OCLC OLUC database (1993)
0.02
0.020150334 = product of:
0.10746844 = sum of:
0.026531162 = weight(_text_:26 in 580) [ClassicSimilarity], result of:
0.026531162 = score(doc=580,freq=2.0), product of:
0.113328174 = queryWeight, product of:
3.5315237 = idf(docFreq=3516, maxDocs=44218)
0.032090448 = queryNorm
0.23410915 = fieldWeight in 580, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5315237 = idf(docFreq=3516, maxDocs=44218)
0.046875 = fieldNorm(doc=580)
0.026979093 = product of:
0.053958185 = sum of:
0.053958185 = weight(_text_:rules in 580) [ClassicSimilarity], result of:
0.053958185 = score(doc=580,freq=2.0), product of:
0.16161752 = queryWeight, product of:
5.036312 = idf(docFreq=780, maxDocs=44218)
0.032090448 = queryNorm
0.33386347 = fieldWeight in 580, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
5.036312 = idf(docFreq=780, maxDocs=44218)
0.046875 = fieldNorm(doc=580)
0.5 = coord(1/2)
0.053958185 = weight(_text_:rules in 580) [ClassicSimilarity], result of:
0.053958185 = score(doc=580,freq=2.0), product of:
0.16161752 = queryWeight, product of:
5.036312 = idf(docFreq=780, maxDocs=44218)
0.032090448 = queryNorm
0.33386347 = fieldWeight in 580, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
5.036312 = idf(docFreq=780, maxDocs=44218)
0.046875 = fieldNorm(doc=580)
0.1875 = coord(3/16)
- Abstract
- The study consisted of two interrelated parts: (1) a quality analysis of the Chinese-language records in the OCLC database, with emphasis on identifying errors in member-contributed records; and (2) the development of a rule-based data validation system for quality control of Chinese-language records in the OCLC database, with emphasis on establishing a set of production rules for such a system. One thousand three hundred six member-contributed Chinese records were randomly selected from the OCLC database and were examined by the researcher. Commonly occurring errors were identified and were categorized into three classes: format errors, content deficiency and inconsistency errors, and typographical errors of editing and inputting. The relationship between the number of times a record had been enhanced and errors still occurring in it was also studied.
- Footnote
- Part 2 in: Cataloging and classification quarterly. 18(1993) no.1, S.3-26.