-
Ekmekcioglu, F.C.; Willett, P.: Effectiveness of stemming for Turkish text retrieval (2000)
0.01
0.012363703 = product of:
0.024727406 = sum of:
0.024727406 = product of:
0.049454812 = sum of:
0.049454812 = weight(_text_:2 in 5423) [ClassicSimilarity], result of:
0.049454812 = score(doc=5423,freq=2.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.38199544 = fieldWeight in 5423, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.109375 = fieldNorm(doc=5423)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Program. 34(2000) no.2, S.195-200
-
Ekmekcioglu, F.C.; Robertson, A.M.; Willett, P.: Effectiveness of query expansion in ranked-output document retrieval systems (1992)
0.01
0.007064973 = product of:
0.014129946 = sum of:
0.014129946 = product of:
0.028259892 = sum of:
0.028259892 = weight(_text_:2 in 5689) [ClassicSimilarity], result of:
0.028259892 = score(doc=5689,freq=2.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.2182831 = fieldWeight in 5689, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.0625 = fieldNorm(doc=5689)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Journal of information science. 18(1992) no.2, S.139-147
-
Ekmekcioglu, F.C.; Lynch, M.F.; Willet, P.: Development and evaluation of conflation techniques for the implementation of a document retrieval system for Turkish text databases (1995)
0.01
0.0061818515 = product of:
0.012363703 = sum of:
0.012363703 = product of:
0.024727406 = sum of:
0.024727406 = weight(_text_:2 in 5797) [ClassicSimilarity], result of:
0.024727406 = score(doc=5797,freq=2.0), product of:
0.1294644 = queryWeight, product of:
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.05242341 = queryNorm
0.19099772 = fieldWeight in 5797, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.4695914 = idf(docFreq=10170, maxDocs=44218)
0.0546875 = fieldNorm(doc=5797)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Considers language processing techniques necessary for the implementation of a document retrieval system for Turkish text databases. Introduces the main characteristics of the Turkish language. Discusses the development of a stopword list and the evaluation of a stemming algorithm that takes account of the language's morphological structure. A 2 level description of Turkish morphology developed in Bilkent University, Ankara, is incorporated into a morphological parser, PC-KIMMO, to carry out stemming in Turkish databases. Describes the evaluation of string similarity measures - n-gram matching techniques - for Turkish. Reports experiments on 6 different Turkish text corpora