-
Ekmekcioglu, F.C.; Lynch, M.F.; Willet, P.: Development and evaluation of conflation techniques for the implementation of a document retrieval system for Turkish text databases (1995)
0.03
0.03099484 = product of:
0.13947678 = sum of:
0.0387704 = weight(_text_:retrieval in 5797) [ClassicSimilarity], result of:
0.0387704 = score(doc=5797,freq=4.0), product of:
0.1171842 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.038739666 = queryNorm
0.33085006 = fieldWeight in 5797, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=5797)
0.100706376 = weight(_text_:techniques in 5797) [ClassicSimilarity], result of:
0.100706376 = score(doc=5797,freq=6.0), product of:
0.17065717 = queryWeight, product of:
4.405231 = idf(docFreq=1467, maxDocs=44218)
0.038739666 = queryNorm
0.5901092 = fieldWeight in 5797, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
4.405231 = idf(docFreq=1467, maxDocs=44218)
0.0546875 = fieldNorm(doc=5797)
0.22222222 = coord(2/9)
- Abstract
- Considers language processing techniques necessary for the implementation of a document retrieval system for Turkish text databases. Introduces the main characteristics of the Turkish language. Discusses the development of a stopword list and the evaluation of a stemming algorithm that takes account of the language's morphological structure. A 2 level description of Turkish morphology developed in Bilkent University, Ankara, is incorporated into a morphological parser, PC-KIMMO, to carry out stemming in Turkish databases. Describes the evaluation of string similarity measures - n-gram matching techniques - for Turkish. Reports experiments on 6 different Turkish text corpora
-
Ekmekcioglu, F.C.; Robertson, A.M.; Willett, P.: Effectiveness of query expansion in ranked-output document retrieval systems (1992)
0.02
0.016269911 = product of:
0.0732146 = sum of:
0.010552166 = weight(_text_:information in 5689) [ClassicSimilarity], result of:
0.010552166 = score(doc=5689,freq=2.0), product of:
0.06800663 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.038739666 = queryNorm
0.1551638 = fieldWeight in 5689, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0625 = fieldNorm(doc=5689)
0.06266243 = weight(_text_:retrieval in 5689) [ClassicSimilarity], result of:
0.06266243 = score(doc=5689,freq=8.0), product of:
0.1171842 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.038739666 = queryNorm
0.5347345 = fieldWeight in 5689, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0625 = fieldNorm(doc=5689)
0.22222222 = coord(2/9)
- Abstract
- Reports an evaluation of 3 methods for the expansion of natural language queries in ranked output retrieval systems. The methods are based on term co-occurrence data, on Soundex codes, and on a string similarity measure. Searches for 110 queries in a data base of 26.280 titles and abstracts suggest that there is no significant difference in retrieval effectiveness between any of these methods and unexpanded searches
- Source
- Journal of information science. 18(1992) no.2, S.139-147
- Theme
- Semantisches Umfeld in Indexierung u. Retrieval
-
Ekmekcioglu, F.C.; Willett, P.: Effectiveness of stemming for Turkish text retrieval (2000)
0.01
0.006092181 = product of:
0.054829627 = sum of:
0.054829627 = weight(_text_:retrieval in 5423) [ClassicSimilarity], result of:
0.054829627 = score(doc=5423,freq=2.0), product of:
0.1171842 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.038739666 = queryNorm
0.46789268 = fieldWeight in 5423, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.109375 = fieldNorm(doc=5423)
0.11111111 = coord(1/9)