Document (#30053)

Author
Li, K.W.
Yang, C.C.
Title
Conceptual analysis of parallel corpus collected from the Web
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.5, S.632-644
Year
2006
Abstract
As illustrated by the World Wide Web, the volume of information in languages other than English has grown significantly in recent years. This highlights the importance of multilingual corpora. Much effort has been devoted to the compilation of multilingual corpora for the purpose of cross-lingual information retrieval and machine translation. Existing parallel corpora mostly involve European languages, such as English-French and English-Spanish. There is still a lack of parallel corpora between European languages and Asian. languages. In the authors' previous work, an alignment method to identify one-to-one Chinese and English title pairs was developed to construct an English-Chinese parallel corpus that works automatically from the World Wide Web, and a 100% precision and 87% recall were obtained. Careful analysis of these results has helped the authors to understand how the alignment method can be improved. A conceptual analysis was conducted, which includes the analysis of conceptual equivalent and conceptual information alternation in the aligned and nonaligned English-Chinese title pairs that are obtained by the alignment method. The result of the analysis not only reflects the characteristics of parallel corpora, but also gives insight into the strengths and weaknesses of the alignment method. In particular, conceptual alternation, such as omission and addition, is found to have a significant impact on the performance of the alignment method.
Footnote
Beitrag einer special topic section on multilingual information systems
Theme
Multilinguale Probleme

Similar documents (author)

  1. Yang, S.C.: ¬An interpretive and situated approach to an evaluation of Perseus digital libraries (2001) 4.54
    4.535107 = sum of:
      4.535107 = weight(author_txt:yang in 934) [ClassicSimilarity], result of:
        4.535107 = fieldWeight in 934, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.256171 = idf(docFreq=81, maxDocs=42740)
          0.625 = fieldNorm(doc=934)
    
  2. Yang, K.: Information retrieval on the Web (2004) 4.54
    4.535107 = sum of:
      4.535107 = weight(author_txt:yang in 279) [ClassicSimilarity], result of:
        4.535107 = fieldWeight in 279, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.256171 = idf(docFreq=81, maxDocs=42740)
          0.625 = fieldNorm(doc=279)
    
  3. Yang, C.C.: Content-based image retrievaI : a comparison between query by example and image browsing map approaches (2005) 4.54
    4.535107 = sum of:
      4.535107 = weight(author_txt:yang in 650) [ClassicSimilarity], result of:
        4.535107 = fieldWeight in 650, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.256171 = idf(docFreq=81, maxDocs=42740)
          0.625 = fieldNorm(doc=650)
    
  4. Salton, G.; Yang, C.S.: On the specification of term values in automatic indexing (1973) 3.63
    3.6280856 = sum of:
      3.6280856 = weight(author_txt:yang in 5476) [ClassicSimilarity], result of:
        3.6280856 = fieldWeight in 5476, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.256171 = idf(docFreq=81, maxDocs=42740)
          0.5 = fieldNorm(doc=5476)
    
  5. Yang, Y.; Chute, C.G.A.: ¬A schematic analysis of the Unified Medical Language System (1992) 3.63
    3.6280856 = sum of:
      3.6280856 = weight(author_txt:yang in 6445) [ClassicSimilarity], result of:
        3.6280856 = fieldWeight in 6445, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.256171 = idf(docFreq=81, maxDocs=42740)
          0.5 = fieldNorm(doc=6445)
    

Similar documents (content)

  1. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 1.13
    1.1309756 = sum of:
      1.1309756 = product of:
        1.7671494 = sum of:
          0.04397218 = weight(abstract_txt:asian in 2684) [ClassicSimilarity], result of:
            0.04397218 = score(doc=2684,freq=1.0), product of:
              0.103194915 = queryWeight, product of:
                1.0248922 = boost
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.012922557 = queryNorm
              0.426108 = fieldWeight in 2684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.070059285 = weight(abstract_txt:lingual in 2684) [ClassicSimilarity], result of:
            0.070059285 = score(doc=2684,freq=2.0), product of:
              0.11173094 = queryWeight, product of:
                1.0664384 = boost
                8.107542 = idf(docFreq=34, maxDocs=42740)
                0.012922557 = queryNorm
              0.6270357 = fieldWeight in 2684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.107542 = idf(docFreq=34, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.016276354 = weight(abstract_txt:world in 2684) [ClassicSimilarity], result of:
            0.016276354 = score(doc=2684,freq=1.0), product of:
              0.06702771 = queryWeight, product of:
                1.1681302 = boost
                4.4403243 = idf(docFreq=1369, maxDocs=42740)
                0.012922557 = queryNorm
              0.24283023 = fieldWeight in 2684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4403243 = idf(docFreq=1369, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.021510325 = weight(abstract_txt:wide in 2684) [ClassicSimilarity], result of:
            0.021510325 = score(doc=2684,freq=1.0), product of:
              0.08071996 = queryWeight, product of:
                1.2819011 = boost
                4.872793 = idf(docFreq=888, maxDocs=42740)
                0.012922557 = queryNorm
              0.26648086 = fieldWeight in 2684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.872793 = idf(docFreq=888, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.048347313 = weight(abstract_txt:european in 2684) [ClassicSimilarity], result of:
            0.048347313 = score(doc=2684,freq=2.0), product of:
              0.10993125 = queryWeight, product of:
                1.4959761 = boost
                5.6865396 = idf(docFreq=393, maxDocs=42740)
                0.012922557 = queryNorm
              0.43979588 = fieldWeight in 2684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6865396 = idf(docFreq=393, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.059612982 = weight(abstract_txt:title in 2684) [ClassicSimilarity], result of:
            0.059612982 = score(doc=2684,freq=3.0), product of:
              0.11042561 = queryWeight, product of:
                1.4993359 = boost
                5.6993113 = idf(docFreq=388, maxDocs=42740)
                0.012922557 = queryNorm
              0.53984743 = fieldWeight in 2684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6993113 = idf(docFreq=388, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.07430031 = weight(abstract_txt:corpus in 2684) [ClassicSimilarity], result of:
            0.07430031 = score(doc=2684,freq=3.0), product of:
              0.12788992 = queryWeight, product of:
                1.6135492 = boost
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.012922557 = queryNorm
              0.5809708 = fieldWeight in 2684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.06535598 = weight(abstract_txt:multilingual in 2684) [ClassicSimilarity], result of:
            0.06535598 = score(doc=2684,freq=2.0), product of:
              0.13439915 = queryWeight, product of:
                1.6541021 = boost
                6.287612 = idf(docFreq=215, maxDocs=42740)
                0.012922557 = queryNorm
              0.48628268 = fieldWeight in 2684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.287612 = idf(docFreq=215, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.08423513 = weight(abstract_txt:pairs in 2684) [ClassicSimilarity], result of:
            0.08423513 = score(doc=2684,freq=2.0), product of:
              0.15917265 = queryWeight, product of:
                1.8001069 = boost
                6.842609 = idf(docFreq=123, maxDocs=42740)
                0.012922557 = queryNorm
              0.52920604 = fieldWeight in 2684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.842609 = idf(docFreq=123, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.14212732 = weight(abstract_txt:chinese in 2684) [ClassicSimilarity], result of:
            0.14212732 = score(doc=2684,freq=4.0), product of:
              0.20496441 = queryWeight, product of:
                2.5017788 = boost
                6.3398805 = idf(docFreq=204, maxDocs=42740)
                0.012922557 = queryNorm
              0.6934244 = fieldWeight in 2684, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.3398805 = idf(docFreq=204, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.07360121 = weight(abstract_txt:languages in 2684) [ClassicSimilarity], result of:
            0.07360121 = score(doc=2684,freq=2.0), product of:
              0.18329035 = queryWeight, product of:
                2.7317996 = boost
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.012922557 = queryNorm
              0.4015553 = fieldWeight in 2684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.0859355 = weight(abstract_txt:method in 2684) [ClassicSimilarity], result of:
            0.0859355 = score(doc=2684,freq=4.0), product of:
              0.17376329 = queryWeight, product of:
                2.973809 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.012922557 = queryNorm
              0.494555 = fieldWeight in 2684, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.17274514 = weight(abstract_txt:parallel in 2684) [ClassicSimilarity], result of:
            0.17274514 = score(doc=2684,freq=2.0), product of:
              0.34870395 = queryWeight, product of:
                4.212719 = boost
                6.405395 = idf(docFreq=191, maxDocs=42740)
                0.012922557 = queryNorm
              0.495392 = fieldWeight in 2684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.405395 = idf(docFreq=191, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.2416193 = weight(abstract_txt:english in 2684) [ClassicSimilarity], result of:
            0.2416193 = score(doc=2684,freq=6.0), product of:
              0.32133698 = queryWeight, product of:
                4.430013 = boost
                5.6131573 = idf(docFreq=423, maxDocs=42740)
                0.012922557 = queryNorm
              0.75191873 = fieldWeight in 2684, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.6131573 = idf(docFreq=423, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.28795272 = weight(abstract_txt:corpora in 2684) [ClassicSimilarity], result of:
            0.28795272 = score(doc=2684,freq=3.0), product of:
              0.4282559 = queryWeight, product of:
                4.66859 = boost
                7.098542 = idf(docFreq=95, maxDocs=42740)
                0.012922557 = queryNorm
              0.67238474 = fieldWeight in 2684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.098542 = idf(docFreq=95, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
          0.2794984 = weight(abstract_txt:alignment in 2684) [ClassicSimilarity], result of:
            0.2794984 = score(doc=2684,freq=2.0), product of:
              0.4805875 = queryWeight, product of:
                4.945615 = boost
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.012922557 = queryNorm
              0.58157647 = fieldWeight in 2684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.0546875 = fieldNorm(doc=2684)
        0.64 = coord(16/25)
    
  2. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.42
    0.42385483 = sum of:
      0.42385483 = product of:
        1.3245463 = sum of:
          0.09316212 = weight(abstract_txt:aligned in 602) [ClassicSimilarity], result of:
            0.09316212 = score(doc=602,freq=2.0), product of:
              0.12360269 = queryWeight, product of:
                1.1216646 = boost
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.012922557 = queryNorm
              0.7537224 = fieldWeight in 602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.117904626 = weight(abstract_txt:pairs in 602) [ClassicSimilarity], result of:
            0.117904626 = score(doc=602,freq=3.0), product of:
              0.15917265 = queryWeight, product of:
                1.8001069 = boost
                6.842609 = idf(docFreq=123, maxDocs=42740)
                0.012922557 = queryNorm
              0.74073416 = fieldWeight in 602, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.842609 = idf(docFreq=123, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.103020236 = weight(abstract_txt:languages in 602) [ClassicSimilarity], result of:
            0.103020236 = score(doc=602,freq=3.0), product of:
              0.18329035 = queryWeight, product of:
                2.7317996 = boost
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.012922557 = queryNorm
              0.56206036 = fieldWeight in 602, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.098212 = weight(abstract_txt:method in 602) [ClassicSimilarity], result of:
            0.098212 = score(doc=602,freq=4.0), product of:
              0.17376329 = queryWeight, product of:
                2.973809 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.012922557 = queryNorm
              0.5652057 = fieldWeight in 602, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.13959916 = weight(abstract_txt:parallel in 602) [ClassicSimilarity], result of:
            0.13959916 = score(doc=602,freq=1.0), product of:
              0.34870395 = queryWeight, product of:
                4.212719 = boost
                6.405395 = idf(docFreq=191, maxDocs=42740)
                0.012922557 = queryNorm
              0.4003372 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.405395 = idf(docFreq=191, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.11273219 = weight(abstract_txt:english in 602) [ClassicSimilarity], result of:
            0.11273219 = score(doc=602,freq=1.0), product of:
              0.32133698 = queryWeight, product of:
                4.430013 = boost
                5.6131573 = idf(docFreq=423, maxDocs=42740)
                0.012922557 = queryNorm
              0.35082233 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6131573 = idf(docFreq=423, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.2686999 = weight(abstract_txt:corpora in 602) [ClassicSimilarity], result of:
            0.2686999 = score(doc=602,freq=2.0), product of:
              0.4282559 = queryWeight, product of:
                4.66859 = boost
                7.098542 = idf(docFreq=95, maxDocs=42740)
                0.012922557 = queryNorm
              0.6274284 = fieldWeight in 602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.098542 = idf(docFreq=95, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.39121622 = weight(abstract_txt:alignment in 602) [ClassicSimilarity], result of:
            0.39121622 = score(doc=602,freq=3.0), product of:
              0.4805875 = queryWeight, product of:
                4.945615 = boost
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.012922557 = queryNorm
              0.81403744 = fieldWeight in 602, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.519756 = idf(docFreq=62, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
        0.32 = coord(8/25)
    
  3. Yang, C.C.; Luk, J.: Automatic generation of English/Chinese thesaurus based on a parallel corpus in laws (2003) 0.36
    0.36393625 = sum of:
      0.36393625 = product of:
        0.7582005 = sum of:
          0.0314087 = weight(abstract_txt:asian in 2617) [ClassicSimilarity], result of:
            0.0314087 = score(doc=2617,freq=1.0), product of:
              0.103194915 = queryWeight, product of:
                1.0248922 = boost
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.012922557 = queryNorm
              0.30436286 = fieldWeight in 2617, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.0791239 = weight(abstract_txt:lingual in 2617) [ClassicSimilarity], result of:
            0.0791239 = score(doc=2617,freq=5.0), product of:
              0.11173094 = queryWeight, product of:
                1.0664384 = boost
                8.107542 = idf(docFreq=34, maxDocs=42740)
                0.012922557 = queryNorm
              0.70816463 = fieldWeight in 2617, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.107542 = idf(docFreq=34, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.011625968 = weight(abstract_txt:world in 2617) [ClassicSimilarity], result of:
            0.011625968 = score(doc=2617,freq=1.0), product of:
              0.06702771 = queryWeight, product of:
                1.1681302 = boost
                4.4403243 = idf(docFreq=1369, maxDocs=42740)
                0.012922557 = queryNorm
              0.17345017 = fieldWeight in 2617, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4403243 = idf(docFreq=1369, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.015364519 = weight(abstract_txt:wide in 2617) [ClassicSimilarity], result of:
            0.015364519 = score(doc=2617,freq=1.0), product of:
              0.08071996 = queryWeight, product of:
                1.2819011 = boost
                4.872793 = idf(docFreq=888, maxDocs=42740)
                0.012922557 = queryNorm
              0.19034348 = fieldWeight in 2617, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.872793 = idf(docFreq=888, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.024419079 = weight(abstract_txt:european in 2617) [ClassicSimilarity], result of:
            0.024419079 = score(doc=2617,freq=1.0), product of:
              0.10993125 = queryWeight, product of:
                1.4959761 = boost
                5.6865396 = idf(docFreq=393, maxDocs=42740)
                0.012922557 = queryNorm
              0.22213045 = fieldWeight in 2617, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6865396 = idf(docFreq=393, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.02563724 = weight(abstract_txt:obtained in 2617) [ClassicSimilarity], result of:
            0.02563724 = score(doc=2617,freq=1.0), product of:
              0.1135575 = queryWeight, product of:
                1.5204494 = boost
                5.779568 = idf(docFreq=358, maxDocs=42740)
                0.012922557 = queryNorm
              0.22576438 = fieldWeight in 2617, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.779568 = idf(docFreq=358, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.043332826 = weight(abstract_txt:corpus in 2617) [ClassicSimilarity], result of:
            0.043332826 = score(doc=2617,freq=2.0), product of:
              0.12788992 = queryWeight, product of:
                1.6135492 = boost
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.012922557 = queryNorm
              0.3388291 = fieldWeight in 2617, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.01665805 = weight(abstract_txt:analysis in 2617) [ClassicSimilarity], result of:
            0.01665805 = score(doc=2617,freq=1.0), product of:
              0.115619496 = queryWeight, product of:
                2.4257698 = boost
                3.6883576 = idf(docFreq=2905, maxDocs=42740)
                0.012922557 = queryNorm
              0.14407647 = fieldWeight in 2617, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6883576 = idf(docFreq=2905, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.10151952 = weight(abstract_txt:chinese in 2617) [ClassicSimilarity], result of:
            0.10151952 = score(doc=2617,freq=4.0), product of:
              0.20496441 = queryWeight, product of:
                2.5017788 = boost
                6.3398805 = idf(docFreq=204, maxDocs=42740)
                0.012922557 = queryNorm
              0.49530315 = fieldWeight in 2617, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.3398805 = idf(docFreq=204, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.07434845 = weight(abstract_txt:languages in 2617) [ClassicSimilarity], result of:
            0.07434845 = score(doc=2617,freq=4.0), product of:
              0.18329035 = queryWeight, product of:
                2.7317996 = boost
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.012922557 = queryNorm
              0.4056321 = fieldWeight in 2617, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.12338939 = weight(abstract_txt:parallel in 2617) [ClassicSimilarity], result of:
            0.12338939 = score(doc=2617,freq=2.0), product of:
              0.34870395 = queryWeight, product of:
                4.212719 = boost
                6.405395 = idf(docFreq=191, maxDocs=42740)
                0.012922557 = queryNorm
              0.35385144 = fieldWeight in 2617, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.405395 = idf(docFreq=191, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
          0.21137285 = weight(abstract_txt:english in 2617) [ClassicSimilarity], result of:
            0.21137285 = score(doc=2617,freq=9.0), product of:
              0.32133698 = queryWeight, product of:
                4.430013 = boost
                5.6131573 = idf(docFreq=423, maxDocs=42740)
                0.012922557 = queryNorm
              0.65779185 = fieldWeight in 2617, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.6131573 = idf(docFreq=423, maxDocs=42740)
                0.0390625 = fieldNorm(doc=2617)
        0.48 = coord(12/25)
    
  4. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.32
    0.32264492 = sum of:
      0.32264492 = product of:
        1.3443539 = sum of:
          0.07077058 = weight(abstract_txt:lingual in 3021) [ClassicSimilarity], result of:
            0.07077058 = score(doc=3021,freq=1.0), product of:
              0.11173094 = queryWeight, product of:
                1.0664384 = boost
                8.107542 = idf(docFreq=34, maxDocs=42740)
                0.012922557 = queryNorm
              0.63340175 = fieldWeight in 3021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.107542 = idf(docFreq=34, maxDocs=42740)
                0.078125 = fieldNorm(doc=3021)
          0.08666565 = weight(abstract_txt:corpus in 3021) [ClassicSimilarity], result of:
            0.08666565 = score(doc=3021,freq=2.0), product of:
              0.12788992 = queryWeight, product of:
                1.6135492 = boost
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.012922557 = queryNorm
              0.6776582 = fieldWeight in 3021, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.078125 = fieldNorm(doc=3021)
          0.14357029 = weight(abstract_txt:chinese in 3021) [ClassicSimilarity], result of:
            0.14357029 = score(doc=3021,freq=2.0), product of:
              0.20496441 = queryWeight, product of:
                2.5017788 = boost
                6.3398805 = idf(docFreq=204, maxDocs=42740)
                0.012922557 = queryNorm
              0.7004645 = fieldWeight in 3021, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3398805 = idf(docFreq=204, maxDocs=42740)
                0.078125 = fieldNorm(doc=3021)
          0.42743343 = weight(abstract_txt:parallel in 3021) [ClassicSimilarity], result of:
            0.42743343 = score(doc=3021,freq=6.0), product of:
              0.34870395 = queryWeight, product of:
                4.212719 = boost
                6.405395 = idf(docFreq=191, maxDocs=42740)
                0.012922557 = queryNorm
              1.2257774 = fieldWeight in 3021, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.405395 = idf(docFreq=191, maxDocs=42740)
                0.078125 = fieldNorm(doc=3021)
          0.14091523 = weight(abstract_txt:english in 3021) [ClassicSimilarity], result of:
            0.14091523 = score(doc=3021,freq=1.0), product of:
              0.32133698 = queryWeight, product of:
                4.430013 = boost
                5.6131573 = idf(docFreq=423, maxDocs=42740)
                0.012922557 = queryNorm
              0.4385279 = fieldWeight in 3021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6131573 = idf(docFreq=423, maxDocs=42740)
                0.078125 = fieldNorm(doc=3021)
          0.4749988 = weight(abstract_txt:corpora in 3021) [ClassicSimilarity], result of:
            0.4749988 = score(doc=3021,freq=4.0), product of:
              0.4282559 = queryWeight, product of:
                4.66859 = boost
                7.098542 = idf(docFreq=95, maxDocs=42740)
                0.012922557 = queryNorm
              1.1091472 = fieldWeight in 3021, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.098542 = idf(docFreq=95, maxDocs=42740)
                0.078125 = fieldNorm(doc=3021)
        0.24 = coord(6/25)
    
  5. Li, K.W.; Yang, C.C.: Automatic crosslingual thesaurus generated from the Hong Kong SAR Police Department Web Corpus for Crime Analysis (2005) 0.27
    0.27207965 = sum of:
      0.27207965 = product of:
        0.85024893 = sum of:
          0.053302333 = weight(abstract_txt:asian in 4392) [ClassicSimilarity], result of:
            0.053302333 = score(doc=4392,freq=2.0), product of:
              0.103194915 = queryWeight, product of:
                1.0248922 = boost
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.012922557 = queryNorm
              0.5165209 = fieldWeight in 4392, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.046875 = fieldNorm(doc=4392)
          0.051999386 = weight(abstract_txt:corpus in 4392) [ClassicSimilarity], result of:
            0.051999386 = score(doc=4392,freq=2.0), product of:
              0.12788992 = queryWeight, product of:
                1.6135492 = boost
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.012922557 = queryNorm
              0.4065949 = fieldWeight in 4392, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1334615 = idf(docFreq=251, maxDocs=42740)
                0.046875 = fieldNorm(doc=4392)
          0.028269643 = weight(abstract_txt:analysis in 4392) [ClassicSimilarity], result of:
            0.028269643 = score(doc=4392,freq=2.0), product of:
              0.115619496 = queryWeight, product of:
                2.4257698 = boost
                3.6883576 = idf(docFreq=2905, maxDocs=42740)
                0.012922557 = queryNorm
              0.24450585 = fieldWeight in 4392, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6883576 = idf(docFreq=2905, maxDocs=42740)
                0.046875 = fieldNorm(doc=4392)
          0.13620274 = weight(abstract_txt:chinese in 4392) [ClassicSimilarity], result of:
            0.13620274 = score(doc=4392,freq=5.0), product of:
              0.20496441 = queryWeight, product of:
                2.5017788 = boost
                6.3398805 = idf(docFreq=204, maxDocs=42740)
                0.012922557 = queryNorm
              0.66451895 = fieldWeight in 4392, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.3398805 = idf(docFreq=204, maxDocs=42740)
                0.046875 = fieldNorm(doc=4392)
          0.1261735 = weight(abstract_txt:languages in 4392) [ClassicSimilarity], result of:
            0.1261735 = score(doc=4392,freq=8.0), product of:
              0.18329035 = queryWeight, product of:
                2.7317996 = boost
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.012922557 = queryNorm
              0.6883805 = fieldWeight in 4392, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.046875 = fieldNorm(doc=4392)
          0.104699366 = weight(abstract_txt:parallel in 4392) [ClassicSimilarity], result of:
            0.104699366 = score(doc=4392,freq=1.0), product of:
              0.34870395 = queryWeight, product of:
                4.212719 = boost
                6.405395 = idf(docFreq=191, maxDocs=42740)
                0.012922557 = queryNorm
              0.30025288 = fieldWeight in 4392, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.405395 = idf(docFreq=191, maxDocs=42740)
                0.046875 = fieldNorm(doc=4392)
          0.20710227 = weight(abstract_txt:english in 4392) [ClassicSimilarity], result of:
            0.20710227 = score(doc=4392,freq=6.0), product of:
              0.32133698 = queryWeight, product of:
                4.430013 = boost
                5.6131573 = idf(docFreq=423, maxDocs=42740)
                0.012922557 = queryNorm
              0.6445018 = fieldWeight in 4392, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.6131573 = idf(docFreq=423, maxDocs=42740)
                0.046875 = fieldNorm(doc=4392)
          0.14249966 = weight(abstract_txt:corpora in 4392) [ClassicSimilarity], result of:
            0.14249966 = score(doc=4392,freq=1.0), product of:
              0.4282559 = queryWeight, product of:
                4.66859 = boost
                7.098542 = idf(docFreq=95, maxDocs=42740)
                0.012922557 = queryNorm
              0.33274418 = fieldWeight in 4392, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.098542 = idf(docFreq=95, maxDocs=42740)
                0.046875 = fieldNorm(doc=4392)
        0.32 = coord(8/25)