-
Leppanen, E.: Homografiongelma tekstihaussa ja homografien disambiguoinnin vaikutukset (1996)
0.01
0.008966145 = product of:
0.03586458 = sum of:
0.027436804 = product of:
0.08231041 = sum of:
0.08231041 = weight(_text_:problem in 27) [ClassicSimilarity], result of:
0.08231041 = score(doc=27,freq=10.0), product of:
0.13082431 = queryWeight, product of:
4.244485 = idf(docFreq=1723, maxDocs=44218)
0.030822188 = queryNorm
0.6291675 = fieldWeight in 27, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
4.244485 = idf(docFreq=1723, maxDocs=44218)
0.046875 = fieldNorm(doc=27)
0.33333334 = coord(1/3)
0.008427775 = product of:
0.025283325 = sum of:
0.025283325 = weight(_text_:29 in 27) [ClassicSimilarity], result of:
0.025283325 = score(doc=27,freq=2.0), product of:
0.108422816 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.030822188 = queryNorm
0.23319192 = fieldWeight in 27, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=27)
0.33333334 = coord(1/3)
0.25 = coord(2/8)
- Abstract
- Homonymy is known to often cause false drops in free text searching in a full text database. The problem is quite common and difficult to avoid in Finnish, but nobody has examined it before. Reports on a study that examined the frequency of, and solutions to, the homonymy problem, based on searches made in a Finnish full text database containing about 55.000 newspaper articles. The results indicate that homonymy is not a very serious problem in full text searching, with only about 1 search result set out of 4 containing false drops caused by homonymy. Several other reasons for nonrelevance were much more common. However, in some set results there were a considerable number of homonymy errors, so the number seems to be very random. A study was also made into whether homonyms can be disambiguated by syntactic analysis. The result was that 75,2% of homonyms were disambiguated by this method. Verb homonyms were considerably easier to disambiguate than substantives. Although homonymy is not a very big problem it could perhaps easily be eliminated if there was a suitable syntactic analyzer in the IR system
- Date
- 9.12.1997 18:33:29
- Footnote
- Übers. d. Titels: The homonymy problem in free text searching and the results of homonymy disambiguation
-
Witt, M.: Au sujet des mots-clés (1997)
0.00
0.0019864459 = product of:
0.015891567 = sum of:
0.015891567 = product of:
0.047674697 = sum of:
0.047674697 = weight(_text_:29 in 1666) [ClassicSimilarity], result of:
0.047674697 = score(doc=1666,freq=4.0), product of:
0.108422816 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.030822188 = queryNorm
0.43971092 = fieldWeight in 1666, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0625 = fieldNorm(doc=1666)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Date
- 29. 1.1996 16:50:24
29. 7.1998 18:19:41
-
Molto, M.: Improving full text search performance through textual analysis (1993)
0.00
0.0014046291 = product of:
0.011237033 = sum of:
0.011237033 = product of:
0.033711098 = sum of:
0.033711098 = weight(_text_:29 in 5099) [ClassicSimilarity], result of:
0.033711098 = score(doc=5099,freq=2.0), product of:
0.108422816 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.030822188 = queryNorm
0.31092256 = fieldWeight in 5099, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0625 = fieldNorm(doc=5099)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Source
- Information processing and management. 29(1993) no.5, S.614-632
-
Kristensen, J.: Expanding end-users' query statements for free text searching with a search-aid thesaurus (1993)
0.00
0.0014046291 = product of:
0.011237033 = sum of:
0.011237033 = product of:
0.033711098 = sum of:
0.033711098 = weight(_text_:29 in 6621) [ClassicSimilarity], result of:
0.033711098 = score(doc=6621,freq=2.0), product of:
0.108422816 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.030822188 = queryNorm
0.31092256 = fieldWeight in 6621, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0625 = fieldNorm(doc=6621)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Source
- Information processing and management. 29(1993) no.6, S.733-744
-
Laegreid, J.A.: SIFT: a Norwegian information retrieval system (1993)
0.00
0.0013919937 = product of:
0.01113595 = sum of:
0.01113595 = product of:
0.03340785 = sum of:
0.03340785 = weight(_text_:22 in 7701) [ClassicSimilarity], result of:
0.03340785 = score(doc=7701,freq=2.0), product of:
0.10793405 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.030822188 = queryNorm
0.30952093 = fieldWeight in 7701, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=7701)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Date
- 23. 1.1999 19:22:09
-
Albus, W.; Smulders, H.: Doeltreffend zoeken in volledige teksten : 1. full-text retrieval bij de HavenInformatieBank (1998)
0.00
0.0012290506 = product of:
0.009832405 = sum of:
0.009832405 = product of:
0.029497212 = sum of:
0.029497212 = weight(_text_:29 in 1682) [ClassicSimilarity], result of:
0.029497212 = score(doc=1682,freq=2.0), product of:
0.108422816 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.030822188 = queryNorm
0.27205724 = fieldWeight in 1682, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0546875 = fieldNorm(doc=1682)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Date
- 29. 7.1998 19:54:49
-
Preston, L.A.; Ebbs, C.M.; Luther, J.: 'Full text' access evaluation : are we getting the real thing? (1998)
0.00
0.0012290506 = product of:
0.009832405 = sum of:
0.009832405 = product of:
0.029497212 = sum of:
0.029497212 = weight(_text_:29 in 2695) [ClassicSimilarity], result of:
0.029497212 = score(doc=2695,freq=2.0), product of:
0.108422816 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.030822188 = queryNorm
0.27205724 = fieldWeight in 2695, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0546875 = fieldNorm(doc=2695)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Footnote
- Part of an issue devoted to 'Experimentation and collaboration: creating series for a new millenium', part 2, Proceedings of the North American Serials Interest Group, Inc.'s 12th annual conference, 29 May - 1 June 1997, University of Michigan Ann Arbor, Michigan