Document (#29476)

Author
Figuerola, C.G.
Rodríguez, A.F.Z.
Berrocal, J.L.A.
Title
Automatic vs manual categorisation of documents in Spanish
Source
Journal of documentation. 57(2001) no.6, pp.763-773
Year
2001
Abstract
Automatic categorisation can be understood as a learning process during which a program recognises the characteristics that distinguish each category or class from others, i.e. those characteristics which the documents should have in order to belong to that category. As yet few experiments have been carried out with documents in Spanish. Here we show the possibilities of elaborating pattern vectors that include the characteristics of different classes or categories of documents, using techniques based on those applied to the expansion of queries by relevance; likewise, the results of applying these techniques to a collection of documents in Spanish are given. The same collection of documents was categorised manually and the results of both procedures were compared.
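The abstract does not give the exact formula used to build the pattern vectors. As an illustration only, the following minimal sketch assumes a Rocchio-style relevance-feedback scheme: each category's pattern vector is the weighted mean of its own documents minus the weighted mean of the other documents, and a new document goes to the category whose pattern is closest by cosine similarity. The function names, the `beta` and `gamma` weights, the whitespace tokenisation, and the toy Spanish sentences are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def vectorise(text):
    """Raw term-frequency vector using naive whitespace tokenisation (an assumption)."""
    return Counter(text.lower().split())


def pattern_vector(positive_docs, negative_docs, beta=0.75, gamma=0.25):
    """Rocchio-style pattern: mean of in-class vectors minus mean of out-of-class vectors."""
    pattern = Counter()
    for doc in positive_docs:
        for term, weight in vectorise(doc).items():
            pattern[term] += beta * weight / len(positive_docs)
    for doc in negative_docs:
        for term, weight in vectorise(doc).items():
            pattern[term] -= gamma * weight / len(negative_docs)
    # Keep only terms that remain positively associated with the class.
    return {term: weight for term, weight in pattern.items() if weight > 0}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def categorise(text, patterns):
    """Return the category whose pattern vector is most similar to the document."""
    doc = vectorise(text)
    return max(patterns, key=lambda category: cosine(patterns[category], doc))


# Toy training data (hypothetical, for illustration only).
sports = ["el equipo gana el partido", "gol en el partido de liga"]
economy = ["el banco sube los tipos", "la bolsa y el banco caen"]

patterns = {
    "deportes": pattern_vector(sports, economy),
    "economia": pattern_vector(economy, sports),
}
label = categorise("el partido de liga", patterns)
```

A real system would add stop-word removal, stemming suited to Spanish, and tf-idf weighting. The sketch only shows the general shape of a pattern-vector categoriser.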
Footnote
See also: http://www.emeraldinsight.com/10.1108/EUM0000000007099
Theme
Automatic classification

Similar documents (author)

  1. Rodríguez, E.E.: Consolidated edition of ISBD, International Standard Bibliographic Description : a standard to trust, a quality brand (2014) 4.84
    4.841027 = sum of:
      4.841027 = weight(author_txt:rodríguez in 1996) [ClassicSimilarity], result of:
        4.841027 = score(doc=1996,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7456436 = idf(docFreq=51, maxDocs=44218)
            0.12910482 = queryNorm
          4.8410273 = fieldWeight in 1996, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7456436 = idf(docFreq=51, maxDocs=44218)
            0.625 = fieldNorm(doc=1996)
    
  2. Zazo Rodríguez, A.F. -> Rodríguez, A.F.Z.: 4.79
    4.7923717 = sum of:
      4.7923717 = weight(author_txt:rodríguez in 1935) [ClassicSimilarity], result of:
        4.7923717 = score(doc=1935,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            7.7456436 = idf(docFreq=51, maxDocs=44218)
            0.12910482 = queryNorm
          4.792372 = fieldWeight in 1935, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            7.7456436 = idf(docFreq=51, maxDocs=44218)
            0.4375 = fieldNorm(doc=1935)
    
  3. Rodríguez, E.M.M. -> Méndez Rodríguez, E.M.: 4.79
    4.7923717 = sum of:
      4.7923717 = weight(author_txt:rodríguez in 2856) [ClassicSimilarity], result of:
        4.7923717 = score(doc=2856,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            7.7456436 = idf(docFreq=51, maxDocs=44218)
            0.12910482 = queryNorm
          4.792372 = fieldWeight in 2856, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            7.7456436 = idf(docFreq=51, maxDocs=44218)
            0.4375 = fieldNorm(doc=2856)
    
  4. Rodríguez, Z.Chinchilla -> Chinchilla Rodríguez, Z.: 4.79
    4.7923717 = sum of:
      4.7923717 = weight(author_txt:rodríguez in 67) [ClassicSimilarity], result of:
        4.7923717 = score(doc=67,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            7.7456436 = idf(docFreq=51, maxDocs=44218)
            0.12910482 = queryNorm
          4.792372 = fieldWeight in 67, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            7.7456436 = idf(docFreq=51, maxDocs=44218)
            0.4375 = fieldNorm(doc=67)
    
  5. Rodríguez Z. Chinchilla- -> Chinchilla-Rodríguez, Z.: 4.11
    4.107747 = sum of:
      4.107747 = weight(author_txt:rodríguez in 795) [ClassicSimilarity], result of:
        4.107747 = score(doc=795,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            7.7456436 = idf(docFreq=51, maxDocs=44218)
            0.12910482 = queryNorm
          4.1077476 = fieldWeight in 795, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            7.7456436 = idf(docFreq=51, maxDocs=44218)
            0.375 = fieldNorm(doc=795)
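
The author scores above can be reproduced from the primitives shown in each tree. The sketch below assumes the standard ClassicSimilarity formulas, idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), which agree with the printed values. The helper names are illustrative and not part of any library, and the `field_norm` and `query_norm` inputs are copied from the listing rather than derived.

```python
import math


def idf(doc_freq, max_docs):
    """Inverse document frequency as used by Lucene's ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    """Term-frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    """score = queryWeight * fieldWeight, mirroring the explain tree."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm       # "queryWeight, product of"
    field_weight = tf(freq) * term_idf * field_norm    # "fieldWeight ..., product of"
    return query_weight * field_weight


# Hit 1 (doc=1996): freq=1.0, fieldNorm=0.625
score_1 = term_score(1.0, 51, 44218, 0.625, 0.12910482)
# Hit 2 (doc=1935): freq=2.0, fieldNorm=0.4375
score_2 = term_score(2.0, 51, 44218, 0.4375, 0.12910482)
```

Lucene computes in 32-bit floats, so the last digits of a double-precision recomputation may differ slightly from the listing. Hits 1 and 2 differ only in term frequency and field norm: the second occurrence of the surname raises tf, but the longer author field lowers fieldNorm by more.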
    

Similar documents (content)

  1. Sebastiani, F.: A tutorial on automated text categorisation (1999) 0.20
    0.19928597 = sum of:
      0.19928597 = product of:
        0.7117356 = sum of:
          0.011931569 = weight(abstract_txt:have in 3390) [ClassicSimilarity], result of:
            0.011931569 = score(doc=3390,freq=1.0), product of:
              0.05957218 = queryWeight, product of:
                1.0770532 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.017259661 = queryNorm
              0.20028761 = fieldWeight in 3390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=3390)
          0.05323884 = weight(abstract_txt:manually in 3390) [ClassicSimilarity], result of:
            0.05323884 = score(doc=3390,freq=1.0), product of:
              0.12815066 = queryWeight, product of:
                1.1170197 = boost
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.017259661 = queryNorm
              0.41543946 = fieldWeight in 3390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.0625 = fieldNorm(doc=3390)
          0.007234743 = weight(abstract_txt:that in 3390) [ClassicSimilarity], result of:
            0.007234743 = score(doc=3390,freq=1.0), product of:
              0.048852965 = queryWeight, product of:
                1.194556 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017259661 = queryNorm
              0.1480922 = fieldWeight in 3390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3390)
          0.07191126 = weight(abstract_txt:automatic in 3390) [ClassicSimilarity], result of:
            0.07191126 = score(doc=3390,freq=2.0), product of:
              0.15659085 = queryWeight, product of:
                1.7462186 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.017259661 = queryNorm
              0.45923027 = fieldWeight in 3390, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=3390)
          0.063207194 = weight(abstract_txt:characteristics in 3390) [ClassicSimilarity], result of:
            0.063207194 = score(doc=3390,freq=1.0), product of:
              0.20723027 = queryWeight, product of:
                2.4602976 = boost
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.017259661 = queryNorm
              0.30500945 = fieldWeight in 3390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.0625 = fieldNorm(doc=3390)
          0.37233752 = weight(abstract_txt:categorisation in 3390) [ClassicSimilarity], result of:
            0.37233752 = score(doc=3390,freq=3.0), product of:
              0.40941387 = queryWeight, product of:
                2.8235579 = boost
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.017259661 = queryNorm
              0.9094404 = fieldWeight in 3390, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.0625 = fieldNorm(doc=3390)
          0.13187449 = weight(abstract_txt:documents in 3390) [ClassicSimilarity], result of:
            0.13187449 = score(doc=3390,freq=3.0), product of:
              0.2955872 = queryWeight, product of:
                4.1554575 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.017259661 = queryNorm
              0.44614407 = fieldWeight in 3390, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=3390)
        0.28 = coord(7/25)
    
  2. Medelyan, O.; Witten, I.H.: Domain-independent automatic keyphrase indexing with small training sets (2008) 0.18
    0.17623024 = sum of:
      0.17623024 = product of:
        0.6293937 = sum of:
          0.047748096 = weight(abstract_txt:manual in 1871) [ClassicSimilarity], result of:
            0.047748096 = score(doc=1871,freq=1.0), product of:
              0.10270679 = queryWeight, product of:
                5.950684 = idf(docFreq=312, maxDocs=44218)
                0.017259661 = queryNorm
              0.4648972 = fieldWeight in 1871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.950684 = idf(docFreq=312, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
          0.06654855 = weight(abstract_txt:manually in 1871) [ClassicSimilarity], result of:
            0.06654855 = score(doc=1871,freq=1.0), product of:
              0.12815066 = queryWeight, product of:
                1.1170197 = boost
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.017259661 = queryNorm
              0.5192993 = fieldWeight in 1871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
          0.019139457 = weight(abstract_txt:results in 1871) [ClassicSimilarity], result of:
            0.019139457 = score(doc=1871,freq=1.0), product of:
              0.07034904 = queryWeight, product of:
                1.1704274 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.017259661 = queryNorm
              0.27206424 = fieldWeight in 1871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
          0.012789339 = weight(abstract_txt:that in 1871) [ClassicSimilarity], result of:
            0.012789339 = score(doc=1871,freq=2.0), product of:
              0.048852965 = queryWeight, product of:
                1.194556 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017259661 = queryNorm
              0.26179248 = fieldWeight in 1871, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
          0.06356118 = weight(abstract_txt:automatic in 1871) [ClassicSimilarity], result of:
            0.06356118 = score(doc=1871,freq=1.0), product of:
              0.15659085 = queryWeight, product of:
                1.7462186 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.017259661 = queryNorm
              0.40590608 = fieldWeight in 1871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
          0.22926262 = weight(abstract_txt:spanish in 1871) [ClassicSimilarity], result of:
            0.22926262 = score(doc=1871,freq=1.0), product of:
              0.4215907 = queryWeight, product of:
                3.5091872 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.017259661 = queryNorm
              0.5438038 = fieldWeight in 1871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
          0.19034444 = weight(abstract_txt:documents in 1871) [ClassicSimilarity], result of:
            0.19034444 = score(doc=1871,freq=4.0), product of:
              0.2955872 = queryWeight, product of:
                4.1554575 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.017259661 = queryNorm
              0.64395356 = fieldWeight in 1871, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=1871)
        0.28 = coord(7/25)
    
  3. Lee, Y.-H.; Wei, C.-P.; Hu, P.J.-H.: An ontology-based technique for preserving user preferences in document-category evolutions (2011) 0.17
    0.16916978 = sum of:
      0.16916978 = product of:
        0.5286556 = sum of:
          0.03342367 = weight(abstract_txt:manual in 4353) [ClassicSimilarity], result of:
            0.03342367 = score(doc=4353,freq=1.0), product of:
              0.10270679 = queryWeight, product of:
                5.950684 = idf(docFreq=312, maxDocs=44218)
                0.017259661 = queryNorm
              0.32542804 = fieldWeight in 4353, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.950684 = idf(docFreq=312, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4353)
          0.018082824 = weight(abstract_txt:have in 4353) [ClassicSimilarity], result of:
            0.018082824 = score(doc=4353,freq=3.0), product of:
              0.05957218 = queryWeight, product of:
                1.0770532 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.017259661 = queryNorm
              0.30354476 = fieldWeight in 4353, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4353)
          0.018947095 = weight(abstract_txt:results in 4353) [ClassicSimilarity], result of:
            0.018947095 = score(doc=4353,freq=2.0), product of:
              0.07034904 = queryWeight, product of:
                1.1704274 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.017259661 = queryNorm
              0.26932985 = fieldWeight in 4353, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4353)
          0.010964573 = weight(abstract_txt:that in 4353) [ClassicSimilarity], result of:
            0.010964573 = score(doc=4353,freq=3.0), product of:
              0.048852965 = queryWeight, product of:
                1.194556 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017259661 = queryNorm
              0.22444029 = fieldWeight in 4353, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4353)
          0.07541944 = weight(abstract_txt:vectors in 4353) [ClassicSimilarity], result of:
            0.07541944 = score(doc=4353,freq=1.0), product of:
              0.17669271 = queryWeight, product of:
                1.3116251 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.017259661 = queryNorm
              0.4268396 = fieldWeight in 4353, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4353)
          0.051073473 = weight(abstract_txt:techniques in 4353) [ClassicSimilarity], result of:
            0.051073473 = score(doc=4353,freq=3.0), product of:
              0.11903177 = queryWeight, product of:
                1.5224634 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.017259661 = queryNorm
              0.4290743 = fieldWeight in 4353, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4353)
          0.20535433 = weight(abstract_txt:category in 4353) [ClassicSimilarity], result of:
            0.20535433 = score(doc=4353,freq=7.0), product of:
              0.22692186 = queryWeight, product of:
                2.102101 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.017259661 = queryNorm
              0.90495616 = fieldWeight in 4353, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4353)
          0.115390174 = weight(abstract_txt:documents in 4353) [ClassicSimilarity], result of:
            0.115390174 = score(doc=4353,freq=3.0), product of:
              0.2955872 = queryWeight, product of:
                4.1554575 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.017259661 = queryNorm
              0.39037606 = fieldWeight in 4353, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4353)
        0.32 = coord(8/25)
    
  4. Dumais, S.T.: Latent semantic analysis (2003) 0.16
    0.15838687 = sum of:
      0.15838687 = product of:
        0.35997015 = sum of:
          0.01909924 = weight(abstract_txt:manual in 2462) [ClassicSimilarity], result of:
            0.01909924 = score(doc=2462,freq=1.0), product of:
              0.10270679 = queryWeight, product of:
                5.950684 = idf(docFreq=312, maxDocs=44218)
                0.017259661 = queryNorm
              0.18595888 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.950684 = idf(docFreq=312, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.015783982 = weight(abstract_txt:have in 2462) [ClassicSimilarity], result of:
            0.015783982 = score(doc=2462,freq=7.0), product of:
              0.05957218 = queryWeight, product of:
                1.0770532 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.017259661 = queryNorm
              0.2649556 = fieldWeight in 2462, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.02661942 = weight(abstract_txt:manually in 2462) [ClassicSimilarity], result of:
            0.02661942 = score(doc=2462,freq=1.0), product of:
              0.12815066 = queryWeight, product of:
                1.1170197 = boost
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.017259661 = queryNorm
              0.20771973 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.010852114 = weight(abstract_txt:that in 2462) [ClassicSimilarity], result of:
            0.010852114 = score(doc=2462,freq=9.0), product of:
              0.048852965 = queryWeight, product of:
                1.194556 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017259661 = queryNorm
              0.22213829 = fieldWeight in 2462, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.043096825 = weight(abstract_txt:belong in 2462) [ClassicSimilarity], result of:
            0.043096825 = score(doc=2462,freq=1.0), product of:
              0.17669271 = queryWeight, product of:
                1.3116251 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.017259661 = queryNorm
              0.24390835 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.0145366825 = weight(abstract_txt:those in 2462) [ClassicSimilarity], result of:
            0.0145366825 = score(doc=2462,freq=1.0), product of:
              0.10787198 = queryWeight, product of:
                1.4493382 = boost
                4.312277 = idf(docFreq=1610, maxDocs=44218)
                0.017259661 = queryNorm
              0.13475865 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.312277 = idf(docFreq=1610, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.023829322 = weight(abstract_txt:techniques in 2462) [ClassicSimilarity], result of:
            0.023829322 = score(doc=2462,freq=2.0), product of:
              0.11903177 = queryWeight, product of:
                1.5224634 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.017259661 = queryNorm
              0.20019296 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.01820892 = weight(abstract_txt:collection in 2462) [ClassicSimilarity], result of:
            0.01820892 = score(doc=2462,freq=1.0), product of:
              0.1253491 = queryWeight, product of:
                1.5623417 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.017259661 = queryNorm
              0.14526565 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.03595563 = weight(abstract_txt:automatic in 2462) [ClassicSimilarity], result of:
            0.03595563 = score(doc=2462,freq=2.0), product of:
              0.15659085 = queryWeight, product of:
                1.7462186 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.017259661 = queryNorm
              0.22961514 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.031603597 = weight(abstract_txt:characteristics in 2462) [ClassicSimilarity], result of:
            0.031603597 = score(doc=2462,freq=1.0), product of:
              0.20723027 = queryWeight, product of:
                2.4602976 = boost
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.017259661 = queryNorm
              0.15250473 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.12038439 = weight(abstract_txt:documents in 2462) [ClassicSimilarity], result of:
            0.12038439 = score(doc=2462,freq=10.0), product of:
              0.2955872 = queryWeight, product of:
                4.1554575 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.017259661 = queryNorm
              0.40727198 = fieldWeight in 2462, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
        0.44 = coord(11/25)
    
  5. Olvera-Lobo, M.-D.; García-Santiago, L.: Analysis of errors in the automatic translation of questions for translingual QA systems (2010) 0.15
    0.15214211 = sum of:
      0.15214211 = product of:
        0.54336464 = sum of:
          0.01339762 = weight(abstract_txt:results in 3956) [ClassicSimilarity], result of:
            0.01339762 = score(doc=3956,freq=1.0), product of:
              0.07034904 = queryWeight, product of:
                1.1704274 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.017259661 = queryNorm
              0.19044496 = fieldWeight in 3956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3956)
          0.008952538 = weight(abstract_txt:that in 3956) [ClassicSimilarity], result of:
            0.008952538 = score(doc=3956,freq=2.0), product of:
              0.048852965 = queryWeight, product of:
                1.194556 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017259661 = queryNorm
              0.18325473 = fieldWeight in 3956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3956)
          0.025439193 = weight(abstract_txt:those in 3956) [ClassicSimilarity], result of:
            0.025439193 = score(doc=3956,freq=1.0), product of:
              0.10787198 = queryWeight, product of:
                1.4493382 = boost
                4.312277 = idf(docFreq=1610, maxDocs=44218)
                0.017259661 = queryNorm
              0.23582764 = fieldWeight in 3956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.312277 = idf(docFreq=1610, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3956)
          0.045064773 = weight(abstract_txt:collection in 3956) [ClassicSimilarity], result of:
            0.045064773 = score(doc=3956,freq=2.0), product of:
              0.1253491 = queryWeight, product of:
                1.5623417 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.017259661 = queryNorm
              0.35951412 = fieldWeight in 3956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3956)
          0.06292235 = weight(abstract_txt:automatic in 3956) [ClassicSimilarity], result of:
            0.06292235 = score(doc=3956,freq=2.0), product of:
              0.15659085 = queryWeight, product of:
                1.7462186 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.017259661 = queryNorm
              0.4018265 = fieldWeight in 3956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3956)
          0.32096764 = weight(abstract_txt:spanish in 3956) [ClassicSimilarity], result of:
            0.32096764 = score(doc=3956,freq=4.0), product of:
              0.4215907 = queryWeight, product of:
                3.5091872 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.017259661 = queryNorm
              0.7613253 = fieldWeight in 3956, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3956)
          0.06662055 = weight(abstract_txt:documents in 3956) [ClassicSimilarity], result of:
            0.06662055 = score(doc=3956,freq=1.0), product of:
              0.2955872 = queryWeight, product of:
                4.1554575 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.017259661 = queryNorm
              0.22538373 = fieldWeight in 3956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3956)
        0.28 = coord(7/25)
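
For the content-based hits, the final score is the sum of the per-term scores multiplied by a coordination factor, coord(matched / total query terms). The sketch below recomputes hit 5 (doc=3956) from the per-term scores printed in its tree. The variable and function names are illustrative, and the figure of 25 query terms is read off the `coord(7/25)` line.

```python
def coord(overlap, max_overlap):
    """Fraction of query terms that matched the document."""
    return overlap / max_overlap


# Per-term scores for doc=3956, copied from the explain tree above.
term_scores = [
    0.01339762,   # results
    0.008952538,  # that
    0.025439193,  # those
    0.045064773,  # collection
    0.06292235,   # automatic
    0.32096764,   # spanish
    0.06662055,   # documents
]

raw_sum = sum(term_scores)                               # "sum of" line
final_score = raw_sum * coord(len(term_scores), 25)      # "product of" line
```

The coordination factor explains why hit 4 (Dumais) ranks above hit 5 despite a much smaller raw sum (0.36 against 0.54): it matches 11 of the 25 query terms rather than 7.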