Document (#24502)

Author
Figuerola, C.G.
Gomez, R.
Lopez de San Roman, E.
Title
Stemming and n-grams in Spanish : an evaluation of their impact in information retrieval
Source
Journal of information science. 26(2000) no.6, S.461-467
Year
2000
Theme
Computerlinguistik

Similar documents (author)

  1. Gomez, J.: ¬A cataloger's workstation : using a NeXT computer and Digital Librarian software to access the Anglo-American Cataloguing Rules (1993) 1.98
    1.9806229 = sum of:
      1.9806229 = product of:
        3.9612458 = sum of:
          3.9612458 = weight(author_txt:gomez in 3708) [ClassicSimilarity], result of:
            3.9612458 = score(doc=3708,freq=1.0), product of:
              0.7284083 = queryWeight, product of:
                1.0310903 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.08118977 = queryNorm
              5.438222 = fieldWeight in 3708, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.625 = fieldNorm(doc=3708)
        0.5 = coord(1/2)
    
  2. Gomez, F.: Combining factual and heuristic knowledge in knowledge acquisition (1992) 1.98
    1.9806229 = sum of:
      1.9806229 = product of:
        3.9612458 = sum of:
          3.9612458 = weight(author_txt:gomez in 3758) [ClassicSimilarity], result of:
            3.9612458 = score(doc=3758,freq=1.0), product of:
              0.7284083 = queryWeight, product of:
                1.0310903 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.08118977 = queryNorm
              5.438222 = fieldWeight in 3758, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.625 = fieldNorm(doc=3758)
        0.5 = coord(1/2)
    
  3. Gomez, F.: Learning word syntactic subcategorizations interactively (1995) 1.98
    1.9806229 = sum of:
      1.9806229 = product of:
        3.9612458 = sum of:
          3.9612458 = weight(author_txt:gomez in 3130) [ClassicSimilarity], result of:
            3.9612458 = score(doc=3130,freq=1.0), product of:
              0.7284083 = queryWeight, product of:
                1.0310903 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.08118977 = queryNorm
              5.438222 = fieldWeight in 3130, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.625 = fieldNorm(doc=3130)
        0.5 = coord(1/2)
    
  4. Gomez, I.: Coping with the problem of subject classification diversity (1996) 1.98
    1.9806229 = sum of:
      1.9806229 = product of:
        3.9612458 = sum of:
          3.9612458 = weight(author_txt:gomez in 5074) [ClassicSimilarity], result of:
            3.9612458 = score(doc=5074,freq=1.0), product of:
              0.7284083 = queryWeight, product of:
                1.0310903 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.08118977 = queryNorm
              5.438222 = fieldWeight in 5074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.625 = fieldNorm(doc=5074)
        0.5 = coord(1/2)
    
  5. Gomez, F.: ¬A representation of complex events and processes for the acquisition of knowledge from texts (1998) 1.98
    1.9806229 = sum of:
      1.9806229 = product of:
        3.9612458 = sum of:
          3.9612458 = weight(author_txt:gomez in 3245) [ClassicSimilarity], result of:
            3.9612458 = score(doc=3245,freq=1.0), product of:
              0.7284083 = queryWeight, product of:
                1.0310903 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.08118977 = queryNorm
              5.438222 = fieldWeight in 3245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.625 = fieldNorm(doc=3245)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Flores, F.N.; Moreira, V.P.: Assessing the impact of stemming accuracy on information retrieval : a multilingual perspective (2016) 0.50
    0.49895757 = sum of:
      0.49895757 = product of:
        0.7983321 = sum of:
          0.021066746 = weight(abstract_txt:information in 3187) [ClassicSimilarity], result of:
            0.021066746 = score(doc=3187,freq=4.0), product of:
              0.055691928 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.023004197 = queryNorm
              0.37827286 = fieldWeight in 3187, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=3187)
          0.023413522 = weight(abstract_txt:their in 3187) [ClassicSimilarity], result of:
            0.023413522 = score(doc=3187,freq=1.0), product of:
              0.09485461 = queryWeight, product of:
                1.3050679 = boost
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.023004197 = queryNorm
              0.24683589 = fieldWeight in 3187, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.078125 = fieldNorm(doc=3187)
          0.069664836 = weight(abstract_txt:retrieval in 3187) [ClassicSimilarity], result of:
            0.069664836 = score(doc=3187,freq=5.0), product of:
              0.114753604 = queryWeight, product of:
                1.4354466 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.023004197 = queryNorm
              0.6070819 = fieldWeight in 3187, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=3187)
          0.25036207 = weight(abstract_txt:spanish in 3187) [ClassicSimilarity], result of:
            0.25036207 = score(doc=3187,freq=1.0), product of:
              0.46039045 = queryWeight, product of:
                2.875193 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.023004197 = queryNorm
              0.5438038 = fieldWeight in 3187, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.078125 = fieldNorm(doc=3187)
          0.43382493 = weight(abstract_txt:stemming in 3187) [ClassicSimilarity], result of:
            0.43382493 = score(doc=3187,freq=2.0), product of:
              0.5271655 = queryWeight, product of:
                3.0766447 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.023004197 = queryNorm
              0.8229388 = fieldWeight in 3187, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.078125 = fieldNorm(doc=3187)
        0.625 = coord(5/8)
    
  2. Brychcín, T.; Konopík, M.: HPS: High precision stemmer (2015) 0.44
    0.4432103 = sum of:
      0.4432103 = product of:
        0.8864206 = sum of:
          0.011917151 = weight(abstract_txt:information in 2686) [ClassicSimilarity], result of:
            0.011917151 = score(doc=2686,freq=2.0), product of:
              0.055691928 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.023004197 = queryNorm
              0.21398345 = fieldWeight in 2686, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.024924047 = weight(abstract_txt:retrieval in 2686) [ClassicSimilarity], result of:
            0.024924047 = score(doc=2686,freq=1.0), product of:
              0.114753604 = queryWeight, product of:
                1.4354466 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.023004197 = queryNorm
              0.21719621 = fieldWeight in 2686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.20028965 = weight(abstract_txt:spanish in 2686) [ClassicSimilarity], result of:
            0.20028965 = score(doc=2686,freq=1.0), product of:
              0.46039045 = queryWeight, product of:
                2.875193 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.023004197 = queryNorm
              0.43504304 = fieldWeight in 2686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.6492897 = weight(abstract_txt:stemming in 2686) [ClassicSimilarity], result of:
            0.6492897 = score(doc=2686,freq=7.0), product of:
              0.5271655 = queryWeight, product of:
                3.0766447 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.023004197 = queryNorm
              1.231662 = fieldWeight in 2686, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
        0.5 = coord(4/8)
    
  3. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.36
    0.36293772 = sum of:
      0.36293772 = product of:
        0.72587544 = sum of:
          0.010533373 = weight(abstract_txt:information in 1020) [ClassicSimilarity], result of:
            0.010533373 = score(doc=1020,freq=1.0), product of:
              0.055691928 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.023004197 = queryNorm
              0.18913643 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.03115506 = weight(abstract_txt:retrieval in 1020) [ClassicSimilarity], result of:
            0.03115506 = score(doc=1020,freq=1.0), product of:
              0.114753604 = queryWeight, product of:
                1.4354466 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.023004197 = queryNorm
              0.27149525 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.25036207 = weight(abstract_txt:spanish in 1020) [ClassicSimilarity], result of:
            0.25036207 = score(doc=1020,freq=1.0), product of:
              0.46039045 = queryWeight, product of:
                2.875193 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.023004197 = queryNorm
              0.5438038 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.43382493 = weight(abstract_txt:stemming in 1020) [ClassicSimilarity], result of:
            0.43382493 = score(doc=1020,freq=2.0), product of:
              0.5271655 = queryWeight, product of:
                3.0766447 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.023004197 = queryNorm
              0.8229388 = fieldWeight in 1020, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
        0.5 = coord(4/8)
    
  4. Hull, D.A.: Stemming algorithms : a case study for detailed evaluation (1996) 0.34
    0.34211344 = sum of:
      0.34211344 = product of:
        0.5473815 = sum of:
          0.017875727 = weight(abstract_txt:information in 2999) [ClassicSimilarity], result of:
            0.017875727 = score(doc=2999,freq=2.0), product of:
              0.055691928 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.023004197 = queryNorm
              0.32097518 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=2999)
          0.028096227 = weight(abstract_txt:their in 2999) [ClassicSimilarity], result of:
            0.028096227 = score(doc=2999,freq=1.0), product of:
              0.09485461 = queryWeight, product of:
                1.3050679 = boost
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.023004197 = queryNorm
              0.29620308 = fieldWeight in 2999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.09375 = fieldNorm(doc=2999)
          0.052871887 = weight(abstract_txt:retrieval in 2999) [ClassicSimilarity], result of:
            0.052871887 = score(doc=2999,freq=2.0), product of:
              0.114753604 = queryWeight, product of:
                1.4354466 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.023004197 = queryNorm
              0.4607427 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=2999)
          0.08042499 = weight(abstract_txt:evaluation in 2999) [ClassicSimilarity], result of:
            0.08042499 = score(doc=2999,freq=1.0), product of:
              0.19122902 = queryWeight, product of:
                1.8530228 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.023004197 = queryNorm
              0.42056894 = fieldWeight in 2999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.09375 = fieldNorm(doc=2999)
          0.36811268 = weight(abstract_txt:stemming in 2999) [ClassicSimilarity], result of:
            0.36811268 = score(doc=2999,freq=1.0), product of:
              0.5271655 = queryWeight, product of:
                3.0766447 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.023004197 = queryNorm
              0.6982868 = fieldWeight in 2999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.09375 = fieldNorm(doc=2999)
        0.625 = coord(5/8)
    
  5. Robertson, A.M.; Willett, P.: Applications of n-grams in textual information systems (1998) 0.33
    0.3281172 = sum of:
      0.3281172 = product of:
        0.87497914 = sum of:
          0.020855013 = weight(abstract_txt:information in 4715) [ClassicSimilarity], result of:
            0.020855013 = score(doc=4715,freq=2.0), product of:
              0.055691928 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.023004197 = queryNorm
              0.37447104 = fieldWeight in 4715, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.109375 = fieldNorm(doc=4715)
          0.043617085 = weight(abstract_txt:retrieval in 4715) [ClassicSimilarity], result of:
            0.043617085 = score(doc=4715,freq=1.0), product of:
              0.114753604 = queryWeight, product of:
                1.4354466 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.023004197 = queryNorm
              0.38009337 = fieldWeight in 4715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=4715)
          0.81050706 = weight(abstract_txt:grams in 4715) [ClassicSimilarity], result of:
            0.81050706 = score(doc=4715,freq=2.0), product of:
              0.6389837 = queryWeight, product of:
                3.3872619 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.023004197 = queryNorm
              1.2684314 = fieldWeight in 4715, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.109375 = fieldNorm(doc=4715)
        0.375 = coord(3/8)