Document (#30052)

Author
Li, K.W.
Yang, C.C.
Title
Conceptual analysis of parallel corpus collected from the Web
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.5, pp. 632-644
Year
2006
Abstract
As illustrated by the World Wide Web, the volume of information in languages other than English has grown significantly in recent years. This highlights the importance of multilingual corpora. Much effort has been devoted to the compilation of multilingual corpora for the purpose of cross-lingual information retrieval and machine translation. Existing parallel corpora mostly involve European languages, such as English-French and English-Spanish. There is still a lack of parallel corpora between European languages and Asian languages. In the authors' previous work, an alignment method to identify one-to-one Chinese and English title pairs was developed to construct an English-Chinese parallel corpus that works automatically from the World Wide Web, and a 100% precision and 87% recall were obtained. Careful analysis of these results has helped the authors to understand how the alignment method can be improved. A conceptual analysis was conducted, which includes the analysis of conceptual equivalent and conceptual information alternation in the aligned and nonaligned English-Chinese title pairs that are obtained by the alignment method. The result of the analysis not only reflects the characteristics of parallel corpora, but also gives insight into the strengths and weaknesses of the alignment method. In particular, conceptual alternation, such as omission and addition, is found to have a significant impact on the performance of the alignment method.
Footnote
Contribution to a special topic section on multilingual information systems
Theme
Multilingual problems

Similar documents (author)

  1. Yang, S.C.: An interpretive and situated approach to an evaluation of Perseus digital libraries (2001) 4.50
    4.4981737 = sum of:
      4.4981737 = weight(author_txt:yang in 6933) [ClassicSimilarity], result of:
        4.4981737 = fieldWeight in 6933, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.625 = fieldNorm(doc=6933)
    
  2. Yang, K.: Information retrieval on the Web (2004) 4.50
    4.4981737 = sum of:
      4.4981737 = weight(author_txt:yang in 4278) [ClassicSimilarity], result of:
        4.4981737 = fieldWeight in 4278, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.625 = fieldNorm(doc=4278)
    
  3. Yang, C.C.: Content-based image retrieval : a comparison between query by example and image browsing map approaches (2005) 4.50
    4.4981737 = sum of:
      4.4981737 = weight(author_txt:yang in 4649) [ClassicSimilarity], result of:
        4.4981737 = fieldWeight in 4649, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.625 = fieldNorm(doc=4649)
    
  4. Salton, G.; Yang, C.S.: On the specification of term values in automatic indexing (1973) 3.60
    3.5985389 = sum of:
      3.5985389 = weight(author_txt:yang in 5476) [ClassicSimilarity], result of:
        3.5985389 = fieldWeight in 5476, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.5 = fieldNorm(doc=5476)
    
  5. Yang, Y.; Chute, C.G.A.: A schematic analysis of the Unified Medical Language System (1992) 3.60
    3.5985389 = sum of:
      3.5985389 = weight(author_txt:yang in 6445) [ClassicSimilarity], result of:
        3.5985389 = fieldWeight in 6445, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.5 = fieldNorm(doc=6445)
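
The author scores listed above can be checked by hand. The following is a minimal sketch, assuming the classic TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the listed values; the function names are hypothetical and are not taken from the record or from any library API.

```python
import math

def classic_tf(freq):
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # Inverse document frequency (assumed form, consistent with the
    # idf values printed in the explanations above).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in each explanation tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Entries 1-3 use fieldNorm 0.625; entries 4-5 use fieldNorm 0.5.
score_single_author = field_weight(1.0, 89, 44218, 0.625)
score_two_authors = field_weight(1.0, 89, 44218, 0.5)
print(round(score_single_author, 4), round(score_two_authors, 4))
```

Under these assumptions, the only factor that separates the 4.50 entries from the 3.60 entries is fieldNorm, which is smaller for records whose author field holds more than one name.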
    

Similar documents (content)

  1. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 1.13
    1.1273054 = sum of:
      1.1273054 = product of:
        1.7614148 = sum of:
          0.04457501 = weight(abstract_txt:asian in 1683) [ClassicSimilarity], result of:
            0.04457501 = score(doc=1683,freq=1.0), product of:
              0.10470136 = queryWeight, product of:
                1.0305957 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.013050074 = queryNorm
              0.42573476 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.07136032 = weight(abstract_txt:lingual in 1683) [ClassicSimilarity], result of:
            0.07136032 = score(doc=1683,freq=2.0), product of:
              0.11372411 = queryWeight, product of:
                1.0740844 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.013050074 = queryNorm
              0.6274863 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.016444024 = weight(abstract_txt:world in 1683) [ClassicSimilarity], result of:
            0.016444024 = score(doc=1683,freq=1.0), product of:
              0.06785368 = queryWeight, product of:
                1.1733129 = boost
                4.4314575 = idf(docFreq=1429, maxDocs=44218)
                0.013050074 = queryNorm
              0.24234533 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4314575 = idf(docFreq=1429, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.021829642 = weight(abstract_txt:wide in 1683) [ClassicSimilarity], result of:
            0.021829642 = score(doc=1683,freq=1.0), product of:
              0.08195946 = queryWeight, product of:
                1.2895159 = boost
                4.870342 = idf(docFreq=921, maxDocs=44218)
                0.013050074 = queryNorm
              0.2663468 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.870342 = idf(docFreq=921, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.049306806 = weight(abstract_txt:european in 1683) [ClassicSimilarity], result of:
            0.049306806 = score(doc=1683,freq=2.0), product of:
              0.11198571 = queryWeight, product of:
                1.5073304 = boost
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.013050074 = queryNorm
              0.44029552 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.061350465 = weight(abstract_txt:title in 1683) [ClassicSimilarity], result of:
            0.061350465 = score(doc=1683,freq=3.0), product of:
              0.11317213 = queryWeight, product of:
                1.515294 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.013050074 = queryNorm
              0.5420987 = fieldWeight in 1683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.07423191 = weight(abstract_txt:corpus in 1683) [ClassicSimilarity], result of:
            0.07423191 = score(doc=1683,freq=3.0), product of:
              0.1285054 = queryWeight, product of:
                1.614685 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.013050074 = queryNorm
              0.577656 = fieldWeight in 1683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.06677919 = weight(abstract_txt:multilingual in 1683) [ClassicSimilarity], result of:
            0.06677919 = score(doc=1683,freq=2.0), product of:
              0.13708359 = queryWeight, product of:
                1.6677074 = boost
                6.2987247 = idf(docFreq=220, maxDocs=44218)
                0.013050074 = queryNorm
              0.48714212 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2987247 = idf(docFreq=220, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.083989985 = weight(abstract_txt:pairs in 1683) [ClassicSimilarity], result of:
            0.083989985 = score(doc=1683,freq=2.0), product of:
              0.15972628 = queryWeight, product of:
                1.8001775 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.013050074 = queryNorm
              0.525837 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.14196627 = weight(abstract_txt:chinese in 1683) [ClassicSimilarity], result of:
            0.14196627 = score(doc=1683,freq=4.0), product of:
              0.20592159 = queryWeight, product of:
                2.5033622 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.013050074 = queryNorm
              0.68941903 = fieldWeight in 1683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.07463505 = weight(abstract_txt:languages in 1683) [ClassicSimilarity], result of:
            0.07463505 = score(doc=1683,freq=2.0), product of:
              0.18600726 = queryWeight, product of:
                2.7473063 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.013050074 = queryNorm
              0.40124804 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.086149104 = weight(abstract_txt:method in 1683) [ClassicSimilarity], result of:
            0.086149104 = score(doc=1683,freq=4.0), product of:
              0.17499617 = queryWeight, product of:
                2.9792807 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.013050074 = queryNorm
              0.4922914 = fieldWeight in 1683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.17625676 = weight(abstract_txt:parallel in 1683) [ClassicSimilarity], result of:
            0.17625676 = score(doc=1683,freq=2.0), product of:
              0.35533273 = queryWeight, product of:
                4.2453623 = boost
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.013050074 = queryNorm
              0.496033 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.24052389 = weight(abstract_txt:english in 1683) [ClassicSimilarity], result of:
            0.24052389 = score(doc=1683,freq=6.0), product of:
              0.3221045 = queryWeight, product of:
                4.4277816 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.013050074 = queryNorm
              0.7467262 = fieldWeight in 1683, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.28355235 = weight(abstract_txt:corpora in 1683) [ClassicSimilarity], result of:
            0.28355235 = score(doc=1683,freq=3.0), product of:
              0.42618328 = queryWeight, product of:
                4.6493835 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.013050074 = queryNorm
              0.6653296 = fieldWeight in 1683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.26846415 = weight(abstract_txt:alignment in 1683) [ClassicSimilarity], result of:
            0.26846415 = score(doc=1683,freq=2.0), product of:
              0.47039452 = queryWeight, product of:
                4.8845916 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.013050074 = queryNorm
              0.57072127 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
        0.64 = coord(16/25)
    
  2. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.42
    0.41832802 = sum of:
      0.41832802 = product of:
        1.307275 = sum of:
          0.09176685 = weight(abstract_txt:aligned in 5601) [ClassicSimilarity], result of:
            0.09176685 = score(doc=5601,freq=2.0), product of:
              0.12302988 = queryWeight, product of:
                1.1171653 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.013050074 = queryNorm
              0.74589074 = fieldWeight in 5601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.11756149 = weight(abstract_txt:pairs in 5601) [ClassicSimilarity], result of:
            0.11756149 = score(doc=5601,freq=3.0), product of:
              0.15972628 = queryWeight, product of:
                1.8001775 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.013050074 = queryNorm
              0.7360185 = fieldWeight in 5601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.1044673 = weight(abstract_txt:languages in 5601) [ClassicSimilarity], result of:
            0.1044673 = score(doc=5601,freq=3.0), product of:
              0.18600726 = queryWeight, product of:
                2.7473063 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.013050074 = queryNorm
              0.56163025 = fieldWeight in 5601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.09845612 = weight(abstract_txt:method in 5601) [ClassicSimilarity], result of:
            0.09845612 = score(doc=5601,freq=4.0), product of:
              0.17499617 = queryWeight, product of:
                2.9792807 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.013050074 = queryNorm
              0.56261873 = fieldWeight in 5601, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.14243698 = weight(abstract_txt:parallel in 5601) [ClassicSimilarity], result of:
            0.14243698 = score(doc=5601,freq=1.0), product of:
              0.35533273 = queryWeight, product of:
                4.2453623 = boost
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.013050074 = queryNorm
              0.4008552 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.1122211 = weight(abstract_txt:english in 5601) [ClassicSimilarity], result of:
            0.1122211 = score(doc=5601,freq=1.0), product of:
              0.3221045 = queryWeight, product of:
                4.4277816 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.013050074 = queryNorm
              0.34839964 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.26459372 = weight(abstract_txt:corpora in 5601) [ClassicSimilarity], result of:
            0.26459372 = score(doc=5601,freq=2.0), product of:
              0.42618328 = queryWeight, product of:
                4.6493835 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.013050074 = queryNorm
              0.6208449 = fieldWeight in 5601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.37577155 = weight(abstract_txt:alignment in 5601) [ClassicSimilarity], result of:
            0.37577155 = score(doc=5601,freq=3.0), product of:
              0.47039452 = queryWeight, product of:
                4.8845916 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.013050074 = queryNorm
              0.7988434 = fieldWeight in 5601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
        0.32 = coord(8/25)
    
  3. Yang, C.C.; Luk, J.: Automatic generation of English/Chinese thesaurus based on a parallel corpus in laws (2003) 0.37
    0.3663755 = sum of:
      0.3663755 = product of:
        0.7632823 = sum of:
          0.031839296 = weight(abstract_txt:asian in 1616) [ClassicSimilarity], result of:
            0.031839296 = score(doc=1616,freq=1.0), product of:
              0.10470136 = queryWeight, product of:
                1.0305957 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.013050074 = queryNorm
              0.30409628 = fieldWeight in 1616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.080593266 = weight(abstract_txt:lingual in 1616) [ClassicSimilarity], result of:
            0.080593266 = score(doc=1616,freq=5.0), product of:
              0.11372411 = queryWeight, product of:
                1.0740844 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.013050074 = queryNorm
              0.70867354 = fieldWeight in 1616, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.01174573 = weight(abstract_txt:world in 1616) [ClassicSimilarity], result of:
            0.01174573 = score(doc=1616,freq=1.0), product of:
              0.06785368 = queryWeight, product of:
                1.1733129 = boost
                4.4314575 = idf(docFreq=1429, maxDocs=44218)
                0.013050074 = queryNorm
              0.17310381 = fieldWeight in 1616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4314575 = idf(docFreq=1429, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.015592602 = weight(abstract_txt:wide in 1616) [ClassicSimilarity], result of:
            0.015592602 = score(doc=1616,freq=1.0), product of:
              0.08195946 = queryWeight, product of:
                1.2895159 = boost
                4.870342 = idf(docFreq=921, maxDocs=44218)
                0.013050074 = queryNorm
              0.19024773 = fieldWeight in 1616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.870342 = idf(docFreq=921, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.0249037 = weight(abstract_txt:european in 1616) [ClassicSimilarity], result of:
            0.0249037 = score(doc=1616,freq=1.0), product of:
              0.11198571 = queryWeight, product of:
                1.5073304 = boost
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.013050074 = queryNorm
              0.22238283 = fieldWeight in 1616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.025749253 = weight(abstract_txt:obtained in 1616) [ClassicSimilarity], result of:
            0.025749253 = score(doc=1616,freq=1.0), product of:
              0.114506416 = queryWeight, product of:
                1.5242003 = boost
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.013050074 = queryNorm
              0.22487171 = fieldWeight in 1616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.043292932 = weight(abstract_txt:corpus in 1616) [ClassicSimilarity], result of:
            0.043292932 = score(doc=1616,freq=2.0), product of:
              0.1285054 = queryWeight, product of:
                1.614685 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.013050074 = queryNorm
              0.33689582 = fieldWeight in 1616, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.016455976 = weight(abstract_txt:analysis in 1616) [ClassicSimilarity], result of:
            0.016455976 = score(doc=1616,freq=1.0), product of:
              0.11530527 = queryWeight, product of:
                2.4183643 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.013050074 = queryNorm
              0.1427166 = fieldWeight in 1616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.10140448 = weight(abstract_txt:chinese in 1616) [ClassicSimilarity], result of:
            0.10140448 = score(doc=1616,freq=4.0), product of:
              0.20592159 = queryWeight, product of:
                2.5033622 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.013050074 = queryNorm
              0.4924422 = fieldWeight in 1616, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.07539278 = weight(abstract_txt:languages in 1616) [ClassicSimilarity], result of:
            0.07539278 = score(doc=1616,freq=4.0), product of:
              0.18600726 = queryWeight, product of:
                2.7473063 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.013050074 = queryNorm
              0.40532172 = fieldWeight in 1616, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.1258977 = weight(abstract_txt:parallel in 1616) [ClassicSimilarity], result of:
            0.1258977 = score(doc=1616,freq=2.0), product of:
              0.35533273 = queryWeight, product of:
                4.2453623 = boost
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.013050074 = queryNorm
              0.35430932 = fieldWeight in 1616, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
          0.21041456 = weight(abstract_txt:english in 1616) [ClassicSimilarity], result of:
            0.21041456 = score(doc=1616,freq=9.0), product of:
              0.3221045 = queryWeight, product of:
                4.4277816 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.013050074 = queryNorm
              0.6532493 = fieldWeight in 1616, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1616)
        0.48 = coord(12/25)
    
  4. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.32
    0.3230921 = sum of:
      0.3230921 = product of:
        1.3462172 = sum of:
          0.072084814 = weight(abstract_txt:lingual in 1020) [ClassicSimilarity], result of:
            0.072084814 = score(doc=1020,freq=1.0), product of:
              0.11372411 = queryWeight, product of:
                1.0740844 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.013050074 = queryNorm
              0.6338569 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.086585864 = weight(abstract_txt:corpus in 1020) [ClassicSimilarity], result of:
            0.086585864 = score(doc=1020,freq=2.0), product of:
              0.1285054 = queryWeight, product of:
                1.614685 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.013050074 = queryNorm
              0.67379165 = fieldWeight in 1020, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.14340757 = weight(abstract_txt:chinese in 1020) [ClassicSimilarity], result of:
            0.14340757 = score(doc=1020,freq=2.0), product of:
              0.20592159 = queryWeight, product of:
                2.5033622 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.013050074 = queryNorm
              0.69641834 = fieldWeight in 1020, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.43612242 = weight(abstract_txt:parallel in 1020) [ClassicSimilarity], result of:
            0.43612242 = score(doc=1020,freq=6.0), product of:
              0.35533273 = queryWeight, product of:
                4.2453623 = boost
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.013050074 = queryNorm
              1.2273635 = fieldWeight in 1020, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.14027637 = weight(abstract_txt:english in 1020) [ClassicSimilarity], result of:
            0.14027637 = score(doc=1020,freq=1.0), product of:
              0.3221045 = queryWeight, product of:
                4.4277816 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.013050074 = queryNorm
              0.43549955 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.46774006 = weight(abstract_txt:corpora in 1020) [ClassicSimilarity], result of:
            0.46774006 = score(doc=1020,freq=4.0), product of:
              0.42618328 = queryWeight, product of:
                4.6493835 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.013050074 = queryNorm
              1.0975091 = fieldWeight in 1020, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
        0.24 = coord(6/25)
    
  5. Li, K.W.; Yang, C.C.: Automatic crosslingual thesaurus generated from the Hong Kong SAR Police Department Web Corpus for Crime Analysis (2005) 0.27
    0.2723899 = sum of:
      0.2723899 = product of:
        0.85121846 = sum of:
          0.05403307 = weight(abstract_txt:asian in 3391) [ClassicSimilarity], result of:
            0.05403307 = score(doc=3391,freq=2.0), product of:
              0.10470136 = queryWeight, product of:
                1.0305957 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.013050074 = queryNorm
              0.51606846 = fieldWeight in 3391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.046875 = fieldNorm(doc=3391)
          0.051951513 = weight(abstract_txt:corpus in 3391) [ClassicSimilarity], result of:
            0.051951513 = score(doc=3391,freq=2.0), product of:
              0.1285054 = queryWeight, product of:
                1.614685 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.013050074 = queryNorm
              0.40427497 = fieldWeight in 3391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.046875 = fieldNorm(doc=3391)
          0.027926717 = weight(abstract_txt:analysis in 3391) [ClassicSimilarity], result of:
            0.027926717 = score(doc=3391,freq=2.0), product of:
              0.11530527 = queryWeight, product of:
                2.4183643 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.013050074 = queryNorm
              0.24219811 = fieldWeight in 3391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.046875 = fieldNorm(doc=3391)
          0.13604839 = weight(abstract_txt:chinese in 3391) [ClassicSimilarity], result of:
            0.13604839 = score(doc=3391,freq=5.0), product of:
              0.20592159 = queryWeight, product of:
                2.5033622 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.013050074 = queryNorm
              0.66068053 = fieldWeight in 3391, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.046875 = fieldNorm(doc=3391)
          0.1279458 = weight(abstract_txt:languages in 3391) [ClassicSimilarity], result of:
            0.1279458 = score(doc=3391,freq=8.0), product of:
              0.18600726 = queryWeight, product of:
                2.7473063 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.013050074 = queryNorm
              0.68785375 = fieldWeight in 3391, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.046875 = fieldNorm(doc=3391)
          0.106827736 = weight(abstract_txt:parallel in 3391) [ClassicSimilarity], result of:
            0.106827736 = score(doc=3391,freq=1.0), product of:
              0.35533273 = queryWeight, product of:
                4.2453623 = boost
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.013050074 = queryNorm
              0.30064142 = fieldWeight in 3391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4136834 = idf(docFreq=196, maxDocs=44218)
                0.046875 = fieldNorm(doc=3391)
          0.20616332 = weight(abstract_txt:english in 3391) [ClassicSimilarity], result of:
            0.20616332 = score(doc=3391,freq=6.0), product of:
              0.3221045 = queryWeight, product of:
                4.4277816 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.013050074 = queryNorm
              0.640051 = fieldWeight in 3391, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.046875 = fieldNorm(doc=3391)
          0.140322 = weight(abstract_txt:corpora in 3391) [ClassicSimilarity], result of:
            0.140322 = score(doc=3391,freq=1.0), product of:
              0.42618328 = queryWeight, product of:
                4.6493835 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.013050074 = queryNorm
              0.32925272 = fieldWeight in 3391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.046875 = fieldNorm(doc=3391)
        0.32 = coord(8/25)
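
The content scores above combine a per-term score with a coordination factor. The sketch below recomputes one term ("alignment" in document 1683) and the final score of the first entry, assuming score = queryWeight * fieldWeight with queryWeight = boost * idf * queryNorm and fieldWeight = sqrt(freq) * idf * fieldNorm, as the explanation trees suggest. The function names are hypothetical; the boost, queryNorm, fieldNorm, and term-sum values are copied from the listing rather than derived.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed idf form, consistent with the values printed in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm            # queryWeight
    field_weight = math.sqrt(freq) * term_idf * field_norm  # fieldWeight
    return query_weight * field_weight

# Term "alignment" in document 1683 (values copied from the listing).
alignment_score = term_score(
    freq=2.0, doc_freq=74, max_docs=44218,
    boost=4.8845916, query_norm=0.013050074, field_norm=0.0546875,
)

# Final score = sum of matched term scores * coord(matched / total query terms).
term_sum = 1.7614148  # sum of the 16 term scores listed for entry 1
final_score = term_sum * (16 / 25)
print(round(alignment_score, 4), round(final_score, 4))
```

The coord factor explains why entry 4 ranks below entry 2 despite a larger term sum: it matches only 6 of the 25 query terms, so its sum is scaled by 0.24 rather than 0.32.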