-
Cui, H.: Competency evaluation of plant character ontologies against domain literature (2010)
0.01
0.009210425 = product of:
0.023026062 = sum of:
0.0077095623 = product of:
0.015419125 = sum of:
0.015419125 = weight(_text_:h in 3466) [ClassicSimilarity], result of:
0.015419125 = score(doc=3466,freq=2.0), product of:
0.11234521 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045219366 = queryNorm
0.13724773 = fieldWeight in 3466, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0390625 = fieldNorm(doc=3466)
0.5 = coord(1/2)
0.015316499 = product of:
0.030632999 = sum of:
0.030632999 = weight(_text_:22 in 3466) [ClassicSimilarity], result of:
0.030632999 = score(doc=3466,freq=2.0), product of:
0.15835051 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045219366 = queryNorm
0.19345059 = fieldWeight in 3466, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=3466)
0.5 = coord(1/2)
0.4 = coord(2/5)
- Date
- 1. 6.2010 9:55:22
-
Cui, H.; Heidorn, P.B.; Zhang, H.: ¬An approach to automatic classification of text for information retrieval (2002)
0.00
0.003052831 = product of:
0.015264154 = sum of:
0.015264154 = product of:
0.030528309 = sum of:
0.030528309 = weight(_text_:h in 174) [ClassicSimilarity], result of:
0.030528309 = score(doc=174,freq=4.0), product of:
0.11234521 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045219366 = queryNorm
0.27173662 = fieldWeight in 174, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0546875 = fieldNorm(doc=174)
0.5 = coord(1/2)
0.2 = coord(1/5)
-
Cui, H.: CharaParser for fine-grained semantic annotation of organism morphological descriptions (2012)
0.00
0.0021805936 = product of:
0.010902967 = sum of:
0.010902967 = product of:
0.021805935 = sum of:
0.021805935 = weight(_text_:h in 45) [ClassicSimilarity], result of:
0.021805935 = score(doc=45,freq=4.0), product of:
0.11234521 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045219366 = queryNorm
0.1940976 = fieldWeight in 45, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0390625 = fieldNorm(doc=45)
0.5 = coord(1/2)
0.2 = coord(1/5)
- Abstract
- Biodiversity information organization is looking beyond the traditional document-level metadata approach and has started to look into factual content in textual documents to support more intelligent and semantic-based access. This article reports the development and evaluation of CharaParser, a software application for semantic annotation of morphological descriptions. CharaParser annotates semistructured morphological descriptions in such a detailed manner that all stated morphological characters of an organ are marked up in Extensible Markup Language format. Using an unsupervised machine learning algorithm and a general purpose syntactic parser as its key annotation tools, CharaParser requires minimal additional knowledge engineering work and seems to perform well across different description collections and/or taxon groups. The system has been formally evaluated on over 1,000 sentences randomly selected from Volume 19 of Flora of North American and Part H of Treatise on Invertebrate Paleontology. CharaParser reaches and exceeds 90% in sentence-wise recall and precision, exceeding other similar systems reported in the literature. It also significantly outperforms a heuristic rule-based system we developed earlier. Early evidence that enriching the lexicon of a syntactic parser with domain terms alone may be sufficient to adapt the parser for the biodiversity domain is also observed and may have significant implications.
-
Cui, H.; Boufford, D.; Selden, P.: Semantic annotation of biosystematics literature without training examples (2010)
0.00
0.0018502949 = product of:
0.009251474 = sum of:
0.009251474 = product of:
0.018502949 = sum of:
0.018502949 = weight(_text_:h in 3422) [ClassicSimilarity], result of:
0.018502949 = score(doc=3422,freq=2.0), product of:
0.11234521 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045219366 = queryNorm
0.16469726 = fieldWeight in 3422, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.046875 = fieldNorm(doc=3422)
0.5 = coord(1/2)
0.2 = coord(1/5)
-
Cui, H.; Stacy, S.: Welcome to LAC/Bienvenue à BAC : a new bilingual NACO partner (2020)
0.00
0.0018502949 = product of:
0.009251474 = sum of:
0.009251474 = product of:
0.018502949 = sum of:
0.018502949 = weight(_text_:h in 5803) [ClassicSimilarity], result of:
0.018502949 = score(doc=5803,freq=2.0), product of:
0.11234521 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045219366 = queryNorm
0.16469726 = fieldWeight in 5803, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.046875 = fieldNorm(doc=5803)
0.5 = coord(1/2)
0.2 = coord(1/5)
-
Cui, H.; Heidorn, P.B.: ¬The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions (2007)
0.00
0.0015419124 = product of:
0.0077095623 = sum of:
0.0077095623 = product of:
0.015419125 = sum of:
0.015419125 = weight(_text_:h in 84) [ClassicSimilarity], result of:
0.015419125 = score(doc=84,freq=2.0), product of:
0.11234521 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045219366 = queryNorm
0.13724773 = fieldWeight in 84, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0390625 = fieldNorm(doc=84)
0.5 = coord(1/2)
0.2 = coord(1/5)
-
Mao, J.; Cui, H.: Identifying bacterial biotope entities using sequence labeling : performance and feature analysis (2018)
0.00
0.0015419124 = product of:
0.0077095623 = sum of:
0.0077095623 = product of:
0.015419125 = sum of:
0.015419125 = weight(_text_:h in 4462) [ClassicSimilarity], result of:
0.015419125 = score(doc=4462,freq=2.0), product of:
0.11234521 = queryWeight, product of:
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.045219366 = queryNorm
0.13724773 = fieldWeight in 4462, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4844491 = idf(docFreq=10020, maxDocs=44218)
0.0390625 = fieldNorm(doc=4462)
0.5 = coord(1/2)
0.2 = coord(1/5)