Search (2 results, page 1 of 1)
Zeng, L.: Quality control of Chinese-language records using a rule-based data validation system : Part 2: a study of a rule-based data validation system for online Chinese cataloging (1993)
0.00
0.0035313342 = product of:
  0.014125337 = sum of:
    0.014125337 = product of:
      0.056501348 = sum of:
        0.056501348 = weight(_text_:based in 579) [ClassicSimilarity], result of:
          0.056501348 = score(doc=579,freq=8.0), product of:
            0.14144066 = queryWeight, product of:
              3.0129938 = idf(docFreq=5906, maxDocs=44218)
              0.04694356 = queryNorm
            0.39947033 = fieldWeight in 579, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.0129938 = idf(docFreq=5906, maxDocs=44218)
              0.046875 = fieldNorm(doc=579)
      0.25 = coord(1/4)
  0.25 = coord(1/4)
- Abstract
- The problem addressed by this two-part study is to evaluate the quality of Chinese records in the OCLC database and to determine the potential of a set of production rules for a rule-based data validation system to support quality control of the Chinese records. The second part of the study emphasizes establishing production rules for such a system. Based on the results of error analysis, a set of production rules was developed and tested, focusing on improving completeness, consistency, and correctness of a record. The rules covered 11 of the total 19 types of errors. At least 65% of the errors occurring in the investigated sample records could be detected automatically by applying the production rules.
-
Zeng, L.: Quality control of Chinese-language records using a rule-based data validation system : Part 1: an evaluation of the quality of Chinese-language records in the OCLC OLUC database (1993)
0.00
0.0024970302 = product of:
  0.009988121 = sum of:
    0.009988121 = product of:
      0.039952483 = sum of:
        0.039952483 = weight(_text_:based in 580) [ClassicSimilarity], result of:
          0.039952483 = score(doc=580,freq=4.0), product of:
            0.14144066 = queryWeight, product of:
              3.0129938 = idf(docFreq=5906, maxDocs=44218)
              0.04694356 = queryNorm
            0.28246817 = fieldWeight in 580, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.0129938 = idf(docFreq=5906, maxDocs=44218)
              0.046875 = fieldNorm(doc=580)
      0.25 = coord(1/4)
  0.25 = coord(1/4)
- Abstract
- The study consisted of two interrelated parts: (1) a quality analysis of the Chinese-language records in the OCLC database, with emphasis on identifying errors in member-contributed records; and (2) the development of a rule-based data validation system for quality control of Chinese-language records in the OCLC database, with emphasis on establishing a set of production rules for such a system. One thousand three hundred six member-contributed Chinese records were randomly selected from the OCLC database and were examined by the researcher. Commonly occurring errors were identified and were categorized into three classes: format errors, content deficiency and inconsistency errors, and typographical errors of editing and inputting. The relationship between the number of times a record had been enhanced and errors still occurring in it was also studied.
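
The relevance scores listed for the two records can be reproduced from the figures in their score explanations. The sketch below is illustrative only: it assumes the standard ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))) and takes queryNorm, fieldNorm, and the two coord factors directly from the listing instead of deriving them. The function name classic_similarity_score is hypothetical and not part of any library.

```python
import math


def classic_similarity_score(freq, doc_freq, max_docs, query_norm, field_norm, coords=()):
    """Recompute a single-term ClassicSimilarity score from explain-output values.

    query_norm, field_norm and coords are copied from the explain output
    (assumption: they are not re-derived here).
    """
    tf = math.sqrt(freq)                              # tf(freq) = sqrt(termFreq)
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))   # idf(docFreq, maxDocs)
    query_weight = idf * query_norm                   # queryWeight
    field_weight = tf * idf * field_norm              # fieldWeight
    score = query_weight * field_weight               # weight(_text_:based)
    for c in coords:                                  # apply each coord(1/4) factor
        score *= c
    return score


# Values for the term "based" taken from the two explanations above.
shared = dict(doc_freq=5906, max_docs=44218, query_norm=0.04694356,
              field_norm=0.046875, coords=(0.25, 0.25))

score_579 = classic_similarity_score(freq=8.0, **shared)  # Part 2 record
score_580 = classic_similarity_score(freq=4.0, **shared)  # Part 1 record

print(f"doc 579: {score_579:.7f}")
print(f"doc 580: {score_580:.7f}")
```

The results should agree with the listed 0.0035313342 and 0.0024970302 up to floating-point rounding, since Lucene computes in single precision. Both records share the same idf, queryNorm, and fieldNorm, so the ranking difference comes entirely from the term frequency: 8 occurrences of "based" in document 579 against 4 in document 580.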