-
Wahlster, W.: Verbmobil : Erkennung, Analyse, Transfer, Generierung und Synthese von Spontansprache (2001)
0.00
0.003798382 = product of:
0.011395145 = sum of:
0.011395145 = product of:
0.034185436 = sum of:
0.034185436 = weight(_text_:29 in 5629) [ClassicSimilarity], result of:
0.034185436 = score(doc=5629,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.23319192 = fieldWeight in 5629, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=5629)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 29. 1.1997 18:49:05
-
Ferret, O.; Grau, B.; Hurault-Plantet, M.; Illouz, G.; Jacquemin, C.; Monceaux, L.; Robba, I.; Vilnat, A.: How NLP can improve question answering (2002)
0.00
0.003798382 = product of:
0.011395145 = sum of:
0.011395145 = product of:
0.034185436 = sum of:
0.034185436 = weight(_text_:29 in 1850) [ClassicSimilarity], result of:
0.034185436 = score(doc=1850,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.23319192 = fieldWeight in 1850, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=1850)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Source
- Knowledge organization. 29(2002) nos.3/4, S.135-155
-
Sidhom, S.; Hassoun, M.: Morpho-syntactic parsing for a text mining environment : An NP recognition model for knowledge visualization and information retrieval (2002)
0.00
0.003798382 = product of:
0.011395145 = sum of:
0.011395145 = product of:
0.034185436 = sum of:
0.034185436 = weight(_text_:29 in 1852) [ClassicSimilarity], result of:
0.034185436 = score(doc=1852,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.23319192 = fieldWeight in 1852, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=1852)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Source
- Knowledge organization. 29(2002) nos.3/4, S.171-180
-
L'Homme, D.; L'Homme, M.-C.; Lemay, C.: Benchmarking the performance of two Part-of-Speech (POS) taggers for terminological purposes (2002)
0.00
0.003798382 = product of:
0.011395145 = sum of:
0.011395145 = product of:
0.034185436 = sum of:
0.034185436 = weight(_text_:29 in 1855) [ClassicSimilarity], result of:
0.034185436 = score(doc=1855,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.23319192 = fieldWeight in 1855, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=1855)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Source
- Knowledge organization. 29(2002) nos.3/4, S.204-216
-
Kostoff, R.N.; Block, J.A.: Factor matrix text filtering and clustering (2005)
0.00
0.003798382 = product of:
0.011395145 = sum of:
0.011395145 = product of:
0.034185436 = sum of:
0.034185436 = weight(_text_:29 in 3683) [ClassicSimilarity], result of:
0.034185436 = score(doc=3683,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.23319192 = fieldWeight in 3683, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=3683)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 21. 7.2005 16:29:47
-
Navarretta, C.; Pedersen, B.S.; Hansen, D.H.: Language technology in knowledge-organization systems (2006)
0.00
0.003798382 = product of:
0.011395145 = sum of:
0.011395145 = product of:
0.034185436 = sum of:
0.034185436 = weight(_text_:29 in 5706) [ClassicSimilarity], result of:
0.034185436 = score(doc=5706,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.23319192 = fieldWeight in 5706, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=5706)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Source
- New review of hypermedia and multimedia. 12(2006) no.1, S.29-49
-
Zhang, C.; Zeng, D.; Li, J.; Wang, F.-Y.; Zuo, W.: Sentiment analysis of Chinese documents : from sentence to document level (2009)
0.00
0.003798382 = product of:
0.011395145 = sum of:
0.011395145 = product of:
0.034185436 = sum of:
0.034185436 = weight(_text_:29 in 3296) [ClassicSimilarity], result of:
0.034185436 = score(doc=3296,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.23319192 = fieldWeight in 3296, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=3296)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 2. 2.2010 19:29:56
-
Bian, G.-W.; Chen, H.-H.: Cross-language information access to multilingual collections on the Internet (2000)
0.00
0.003764213 = product of:
0.011292639 = sum of:
0.011292639 = product of:
0.033877917 = sum of:
0.033877917 = weight(_text_:22 in 4436) [ClassicSimilarity], result of:
0.033877917 = score(doc=4436,freq=2.0), product of:
0.145937 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0416745 = queryNorm
0.23214069 = fieldWeight in 4436, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=4436)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 16. 2.2000 14:22:39
-
Lorenz, S.: Konzeption und prototypische Realisierung einer begriffsbasierten Texterschließung (2006)
0.00
0.003764213 = product of:
0.011292639 = sum of:
0.011292639 = product of:
0.033877917 = sum of:
0.033877917 = weight(_text_:22 in 1746) [ClassicSimilarity], result of:
0.033877917 = score(doc=1746,freq=2.0), product of:
0.145937 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0416745 = queryNorm
0.23214069 = fieldWeight in 1746, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=1746)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 22. 3.2015 9:17:30
-
Herrera-Viedma, E.: Modeling the retrieval process for an information retrieval system using an ordinal fuzzy linguistic approach (2001)
0.00
0.0031653184 = product of:
0.009495955 = sum of:
0.009495955 = product of:
0.028487865 = sum of:
0.028487865 = weight(_text_:29 in 5752) [ClassicSimilarity], result of:
0.028487865 = score(doc=5752,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.19432661 = fieldWeight in 5752, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=5752)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 29. 9.2001 14:00:25
-
Li, W.; Wong, K.-F.; Yuan, C.: Toward automatic Chinese temporal information extraction (2001)
0.00
0.0031653184 = product of:
0.009495955 = sum of:
0.009495955 = product of:
0.028487865 = sum of:
0.028487865 = weight(_text_:29 in 6029) [ClassicSimilarity], result of:
0.028487865 = score(doc=6029,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.19432661 = fieldWeight in 6029, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=6029)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 29. 9.2001 14:02:50
-
Ibekwe-SanJuan, F.; SanJuan, E.: From term variants to research topics (2002)
0.00
0.0031653184 = product of:
0.009495955 = sum of:
0.009495955 = product of:
0.028487865 = sum of:
0.028487865 = weight(_text_:29 in 1853) [ClassicSimilarity], result of:
0.028487865 = score(doc=1853,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.19432661 = fieldWeight in 1853, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=1853)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Source
- Knowledge organization. 29(2002) nos.3/4, S.181-197
-
Rosemblat, G.; Tse, T.; Gemoets, D.: Adapting a monolingual consumer health system for Spanish cross-language information retrieval (2004)
0.00
0.0031653184 = product of:
0.009495955 = sum of:
0.009495955 = product of:
0.028487865 = sum of:
0.028487865 = weight(_text_:29 in 2673) [ClassicSimilarity], result of:
0.028487865 = score(doc=2673,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.19432661 = fieldWeight in 2673, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=2673)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 29. 8.2004 19:12:06
-
Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002)
0.00
0.0031653184 = product of:
0.009495955 = sum of:
0.009495955 = product of:
0.028487865 = sum of:
0.028487865 = weight(_text_:29 in 5226) [ClassicSimilarity], result of:
0.028487865 = score(doc=5226,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.19432661 = fieldWeight in 5226, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=5226)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Abstract
- Tseng constructs a word co-occurrence based thesaurus by means of the automatic analysis of Chinese text. Words are identified by a longest dictionary match, supplemented by a keyword extraction algorithm that merges back nearby tokens and accepts shorter strings of characters if they occur more often than the longest string. Single-character auxiliary words are a major source of error, but this error can be greatly reduced with the use of a 70-character, 2680-word stop list. Extracted terms with their associated document weights are sorted by decreasing frequency, and the terms at the top of this list are associated using a Dice coefficient, computed on the weights of term pairs and modified to account for longer documents. Co-occurrence is counted not in the document as a whole but in paragraph- or sentence-sized sections, in order to reduce computation time. A window of 29 characters or 11 words was found to be sufficient. A thesaurus was produced from 25,230 Chinese news articles, and judges were asked to review the top 50 terms associated with each of 30 single-word query terms. They determined 69% to be relevant.
-
Sienel, J.; Weiss, M.; Laube, M.: Sprachtechnologien für die Informationsgesellschaft des 21. Jahrhunderts (2000)
0.00
0.0031368441 = product of:
0.009410532 = sum of:
0.009410532 = product of:
0.028231597 = sum of:
0.028231597 = weight(_text_:22 in 5557) [ClassicSimilarity], result of:
0.028231597 = score(doc=5557,freq=2.0), product of:
0.145937 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0416745 = queryNorm
0.19345059 = fieldWeight in 5557, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=5557)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 26.12.2000 13:22:17
-
Pinker, S.: Wörter und Regeln : Die Natur der Sprache (2000)
0.00
0.0031368441 = product of:
0.009410532 = sum of:
0.009410532 = product of:
0.028231597 = sum of:
0.028231597 = weight(_text_:22 in 734) [ClassicSimilarity], result of:
0.028231597 = score(doc=734,freq=2.0), product of:
0.145937 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0416745 = queryNorm
0.19345059 = fieldWeight in 734, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=734)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 19. 7.2002 14:22:31
-
Computational linguistics for the new millennium : divergence or synergy? Proceedings of the International Symposium held at the Ruprecht-Karls Universität Heidelberg, 21-22 July 2000. Festschrift in honour of Peter Hellwig on the occasion of his 60th birthday (2002)
0.00
0.0031368441 = product of:
0.009410532 = sum of:
0.009410532 = product of:
0.028231597 = sum of:
0.028231597 = weight(_text_:22 in 4900) [ClassicSimilarity], result of:
0.028231597 = score(doc=4900,freq=2.0), product of:
0.145937 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0416745 = queryNorm
0.19345059 = fieldWeight in 4900, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=4900)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
-
Jones, I.; Cunliffe, D.; Tudhope, D.: Natural language processing and knowledge organization systems as an aid to retrieval (2004)
0.00
0.0031335056 = product of:
0.009400517 = sum of:
0.009400517 = product of:
0.028201548 = sum of:
0.028201548 = weight(_text_:29 in 2677) [ClassicSimilarity], result of:
0.028201548 = score(doc=2677,freq=4.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.19237353 = fieldWeight in 2677, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.02734375 = fieldNorm(doc=2677)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 29. 8.2004 19:29:56
-
Rösener, C.: Die Stecknadel im Heuhaufen : Natürlichsprachlicher Zugang zu Volltextdatenbanken (2005)
0.00
0.002532255 = product of:
0.0075967642 = sum of:
0.0075967642 = product of:
0.022790292 = sum of:
0.022790292 = weight(_text_:29 in 548) [ClassicSimilarity], result of:
0.022790292 = score(doc=548,freq=2.0), product of:
0.14659786 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0416745 = queryNorm
0.15546128 = fieldWeight in 548, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03125 = fieldNorm(doc=548)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Date
- 29. 3.2009 11:11:45
-
Schürmann, H.: Software scannt Radio- und Fernsehsendungen : Recherche in Nachrichtenarchiven erleichtert (2001)
0.00
0.0021957909 = product of:
0.0065873726 = sum of:
0.0065873726 = product of:
0.019762117 = sum of:
0.019762117 = weight(_text_:22 in 5759) [ClassicSimilarity], result of:
0.019762117 = score(doc=5759,freq=2.0), product of:
0.145937 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0416745 = queryNorm
0.1354154 = fieldWeight in 5759, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.02734375 = fieldNorm(doc=5759)
0.33333334 = coord(1/3)
0.33333334 = coord(1/3)
- Source
- Handelsblatt. Nr.79 vom 24.4.2001, S.22
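
The score explanations in this listing all follow the same arithmetic, which can be reproduced as a minimal sketch. It assumes the classic Lucene TF-IDF formulas (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))) and takes queryNorm, fieldNorm and the two coord factors directly from the listing rather than deriving them. The function names `classic_idf` and `classic_score` are illustrative and are not part of any library.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency, assuming the classic TF-IDF formula."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_score(freq, doc_freq, max_docs, query_norm, field_norm, coords):
    """Rebuild one score tree from the listing.

    query_norm, field_norm and coords are copied from the explanation
    output; they are inputs here, not values this sketch derives.
    """
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                  # tf(freq=2.0) = 1.4142135
    query_weight = idf * query_norm       # "queryWeight, product of"
    field_weight = tf * idf * field_norm  # "fieldWeight in <doc>, product of"
    score = query_weight * field_weight   # "score(doc=..., freq=...)"
    for coord in coords:                  # each coord(1/3) scales the sum
        score *= coord
    return score


# First record in the listing: term "29", doc 5629.
score_5629 = classic_score(
    freq=2.0,
    doc_freq=3565,
    max_docs=44218,
    query_norm=0.0416745,
    field_norm=0.046875,
    coords=(1 / 3, 1 / 3),
)
print(f"{score_5629:.9f}")
```

Under these assumptions the result should agree with the top-level value of the first record (0.003798382) up to rounding. The same call with a different fieldNorm or freq reproduces the lower-ranked records, which shows that the ranking differences here come only from field length normalization and term frequency.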