Search (4 results, page 1 of 1)

  • × author_ss:"Kettunen, K."
  • × theme_ss:"Computerlinguistik"
  1. Kettunen, K.: Reductive and generative approaches to management of morphological variation of keywords in monolingual information retrieval : an overview (2009) 0.04
    0.040037964 = sum of:
      0.017983811 = product of:
        0.071935244 = sum of:
          0.071935244 = weight(_text_:authors in 2835) [ClassicSimilarity], result of:
            0.071935244 = score(doc=2835,freq=2.0), product of:
              0.23803101 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.052213363 = queryNorm
              0.30220953 = fieldWeight in 2835, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.046875 = fieldNorm(doc=2835)
        0.25 = coord(1/4)
      0.022054153 = product of:
        0.044108305 = sum of:
          0.044108305 = weight(_text_:k in 2835) [ClassicSimilarity], result of:
            0.044108305 = score(doc=2835,freq=2.0), product of:
              0.18639012 = queryWeight, product of:
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.052213363 = queryNorm
              0.23664509 = fieldWeight in 2835, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.046875 = fieldNorm(doc=2835)
        0.5 = coord(1/2)
    
    Abstract
    Purpose - The purpose of this article is to discuss advantages and disadvantages of various means to manage morphological variation of keywords in monolingual information retrieval. Design/methodology/approach - The authors present a compilation of query results from 11 mostly European languages and a new general classification of the language dependent techniques for management of morphological variation. Variants of the different techniques are compared in some detail in terms of retrieval effectiveness and other criteria. The paper consists mainly of an overview of different management methods for keyword variation in information retrieval. Typical IR retrieval results of 11 languages and a new classification for keyword management methods are also presented. Findings - The main results of the paper are an overall comparison of reductive and generative keyword management methods in terms of retrieval effectiveness and other broader criteria. Originality/value - The paper is of value to anyone who wants to get an overall picture of keyword management techniques used in IR.
  2. Kettunen, K.; Kunttu, T.; Järvelin, K.: To stem or lemmatize a highly inflectional language in a probabilistic IR environment? (2005) 0.01
    0.012995535 = product of:
      0.02599107 = sum of:
        0.02599107 = product of:
          0.05198214 = sum of:
            0.05198214 = weight(_text_:k in 4395) [ClassicSimilarity], result of:
              0.05198214 = score(doc=4395,freq=4.0), product of:
                0.18639012 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.052213363 = queryNorm
                0.2788889 = fieldWeight in 4395, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4395)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  3. Airio, E.; Kettunen, K.: Does dictionary based bilingual retrieval work in a non-normalized index? (2009) 0.01
    0.011027076 = product of:
      0.022054153 = sum of:
        0.022054153 = product of:
          0.044108305 = sum of:
            0.044108305 = weight(_text_:k in 4224) [ClassicSimilarity], result of:
              0.044108305 = score(doc=4224,freq=2.0), product of:
                0.18639012 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.052213363 = queryNorm
                0.23664509 = fieldWeight in 4224, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4224)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  4. Järvelin, A.; Keskustalo, H.; Sormunen, E.; Saastamoinen, M.; Kettunen, K.: Information retrieval from historical newspaper collections in highly inflectional languages : a query expansion approach (2016) 0.01
    0.00918923 = product of:
      0.01837846 = sum of:
        0.01837846 = product of:
          0.03675692 = sum of:
            0.03675692 = weight(_text_:k in 3223) [ClassicSimilarity], result of:
              0.03675692 = score(doc=3223,freq=2.0), product of:
                0.18639012 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.052213363 = queryNorm
                0.19720423 = fieldWeight in 3223, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3223)
          0.5 = coord(1/2)
      0.5 = coord(1/2)