Search (1 results, page 1 of 1)

  • × language_ss:"fi"
  • × theme_ss:"Volltextretrieval"
  1. Leppanen, E.: Homografiongelma tekstihaussa ja homografien disambiguoinnin vaikutukset (1996) 0.01
    0.014799917 = product of:
      0.03699979 = sum of:
        0.022041133 = product of:
          0.11020566 = sum of:
            0.11020566 = weight(_text_:problem in 27) [ClassicSimilarity], result of:
              0.11020566 = score(doc=27,freq=10.0), product of:
                0.17516108 = queryWeight, product of:
                  4.244485 = idf(docFreq=1723, maxDocs=44218)
                  0.041267924 = queryNorm
                0.6291675 = fieldWeight in 27, product of:
                  3.1622777 = tf(freq=10.0), with freq of:
                    10.0 = termFreq=10.0
                  4.244485 = idf(docFreq=1723, maxDocs=44218)
                  0.046875 = fieldNorm(doc=27)
          0.2 = coord(1/5)
        0.014958657 = weight(_text_:of in 27) [ClassicSimilarity], result of:
          0.014958657 = score(doc=27,freq=10.0), product of:
            0.06453302 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.041267924 = queryNorm
            0.23179851 = fieldWeight in 27, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.046875 = fieldNorm(doc=27)
      0.4 = coord(2/5)
    
    Abstract
    Homonymy is known to often cause false drops in free text searching in a full text database. The problem is quite common and difficult to avoid in Finnish, but nobody has examined it before. Reports on a study that examined the frequency of, and solutions to, the homonymy problem, based on searches made in a Finnish full text database containing about 55.000 newspaper articles. The results indicate that homonymy is not a very serious problem in full text searching, with only about 1 search result set out of 4 containing false drops caused by homonymy. Several other reasons for nonrelevance were much more common. However, in some set results there were a considerable number of homonymy errors, so the number seems to be very random. A study was also made into whether homonyms can be disambiguated by syntactic analysis. The result was that 75,2% of homonyms were disambiguated by this method. Verb homonyms were considerably easier to disambiguate than substantives. Although homonymy is not a very big problem it could perhaps easily be eliminated if there was a suitable syntactic analyzer in the IR system
    Footnote
    Übers. d. Titels: The homonymy problem in free text searching and the results of homonymy disambiguation