Document (#38166)

Author
Oard, D.W.
Title
Alternative approaches for cross-language text retrieval
Source
http://www.aaai.org/Papers/Symposia/Spring/1997/SS-97-05/SS97-05-021.pdf]
Year
1997
Series
AAAI Technical Report SS-97-05
Abstract
The explosive growth of the Internet and other sources of networked information have made automatic mediation of access to networked information sources an increasingly important problem. Much of this information is expressed as electronic text, and it is becoming practical to automatically convert some printed documents and recorded speech to electronic text as well. Thus, automated systems capable of detecting useful documents are finding widespread application. With even a small number of languages it can be inconvenient to issue the same query repeatedly in every language, so users who are able to read more than one language will likely prefer a multilingual text retrieval system over a collection of monolingual systems. And since reading ability in a language does not always imply fluent writing ability in that language, such users will likely find cross-language text retrieval particularly useful for languages in which they are less confident of their ability to express their information needs effectively. The use of such systems can be also be beneficial if the user is able to read only a single language. For example, when only a small portion of the document collection will ever be examined by the user, performing retrieval before translation can be significantly more economical than performing translation before retrieval. So when the application is sufficiently important to justify the time and effort required for translation, those costs can be minimized if an effective cross-language text retrieval system is available. Even when translation is not available, there are circumstances in which cross-language text retrieval could be useful to a monolingual user. For example, a researcher might find a paper published in an unfamiliar language useful if that paper contains references to works by the same author that are in the researcher's native language.
Multilingual text retrieval can be defined as selection of useful documents from collections that may contain several languages (English, French, Chinese, etc.). This formulation allows for the possibility that individual documents might contain more than one language, a common occurrence in some applications. Both cross-language and within-language retrieval are included in this formulation, but it is the cross-language aspect of the problem which distinguishes multilingual text retrieval from its well studied monolingual counterpart. At the SIGIR 96 workshop on "Cross-Linguistic Information Retrieval" the participants discussed the proliferation of terminology being used to describe the field and settled on "Cross-Language" as the best single description of the salient aspect of the problem. "Multilingual" was felt to be too broad, since that term has also been used to describe systems able to perform within-language retrieval in more than one language but that lack any cross-language capability. "Cross-lingual" and "cross-linguistic" were felt to be equally good descriptions of the field, but "crosslanguage" was selected as the preferred term in the interest of standardization. Unfortunately, at about the same time the U.S. Defense Advanced Research Projects Agency (DARPA) introduced "translingual" as their preferred term, so we are still some distance from reaching consensus on this matter.
I will not attempt to draw a sharp distinction between retrieval and filtering in this survey. Although my own work on adaptive cross-language text filtering has led me to make this distinction fairly carefully in other presentations (c.f., (Oard 1997b)), such an proach does little to help understand the fundamental techniques which have been applied or the results that have been obtained in this case. Since it is still common to view filtering (detection of useful documents in dynamic document streams) as a kind of retrieval, will simply adopt that perspective here.
Theme
Multilinguale Probleme
Semantisches Umfeld in Indexierung u. Retrieval

Similar documents (author)

  1. Oard, D.W.: Serving users in many languages : cross-language information retrieval for digital libraries (1997) 5.47
    5.4713416 = sum of:
      5.4713416 = weight(author_txt:oard in 3262) [ClassicSimilarity], result of:
        5.4713416 = score(doc=3262,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.754148 = idf(docFreq=17, maxDocs=41962)
            0.114231564 = queryNorm
          5.471342 = fieldWeight in 3262, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.754148 = idf(docFreq=17, maxDocs=41962)
            0.625 = fieldNorm(doc=3262)
    
  2. Oard, D.W.: Multilingual information access (2009) 5.47
    5.4713416 = sum of:
      5.4713416 = weight(author_txt:oard in 851) [ClassicSimilarity], result of:
        5.4713416 = score(doc=851,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.754148 = idf(docFreq=17, maxDocs=41962)
            0.114231564 = queryNorm
          5.471342 = fieldWeight in 851, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.754148 = idf(docFreq=17, maxDocs=41962)
            0.625 = fieldNorm(doc=851)
    
  3. Wang, J.; Oard, D.W.: Matching meaning for cross-language information retrieval (2012) 4.38
    4.3770733 = sum of:
      4.3770733 = weight(author_txt:oard in 7430) [ClassicSimilarity], result of:
        4.3770733 = score(doc=7430,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.754148 = idf(docFreq=17, maxDocs=41962)
            0.114231564 = queryNorm
          4.377074 = fieldWeight in 7430, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.754148 = idf(docFreq=17, maxDocs=41962)
            0.5 = fieldNorm(doc=7430)
    
  4. Oard, D.W.; Resnik, P.: Support for interactive document selection in cross-language information retrieval (1999) 4.38
    4.3770733 = sum of:
      4.3770733 = weight(author_txt:oard in 6007) [ClassicSimilarity], result of:
        4.3770733 = score(doc=6007,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.754148 = idf(docFreq=17, maxDocs=41962)
            0.114231564 = queryNorm
          4.377074 = fieldWeight in 6007, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.754148 = idf(docFreq=17, maxDocs=41962)
            0.5 = fieldNorm(doc=6007)
    
  5. Oard, D.W.; Diekema, A.R.: Cross-language information retrieval (1999) 4.38
    4.3770733 = sum of:
      4.3770733 = weight(author_txt:oard in 6104) [ClassicSimilarity], result of:
        4.3770733 = score(doc=6104,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.754148 = idf(docFreq=17, maxDocs=41962)
            0.114231564 = queryNorm
          4.377074 = fieldWeight in 6104, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.754148 = idf(docFreq=17, maxDocs=41962)
            0.5 = fieldNorm(doc=6104)
    

Similar documents (content)

  1. Ballesteros, L.A.: Cross-language retrieval via transitive relation (2000) 0.70
    0.6982683 = sum of:
      0.6982683 = product of:
        1.1637805 = sum of:
          0.052128643 = weight(abstract_txt:performing in 1031) [ClassicSimilarity], result of:
            0.052128643 = score(doc=1031,freq=1.0), product of:
              0.123624995 = queryWeight, product of:
                6.74668 = idf(docFreq=133, maxDocs=41962)
                0.018323828 = queryNorm
              0.4216675 = fieldWeight in 1031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.74668 = idf(docFreq=133, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.013584702 = weight(abstract_txt:systems in 1031) [ClassicSimilarity], result of:
            0.013584702 = score(doc=1031,freq=1.0), product of:
              0.063547544 = queryWeight, product of:
                1.0139376 = boost
                3.4203563 = idf(docFreq=3729, maxDocs=41962)
                0.018323828 = queryNorm
              0.21377227 = fieldWeight in 1031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4203563 = idf(docFreq=3729, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.013937808 = weight(abstract_txt:more in 1031) [ClassicSimilarity], result of:
            0.013937808 = score(doc=1031,freq=1.0), product of:
              0.06464402 = queryWeight, product of:
                1.0226476 = boost
                3.449738 = idf(docFreq=3621, maxDocs=41962)
                0.018323828 = queryNorm
              0.21560863 = fieldWeight in 1031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.449738 = idf(docFreq=3621, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.030270621 = weight(abstract_txt:since in 1031) [ClassicSimilarity], result of:
            0.030270621 = score(doc=1031,freq=1.0), product of:
              0.09849934 = queryWeight, product of:
                1.093224 = boost
                4.917088 = idf(docFreq=834, maxDocs=41962)
                0.018323828 = queryNorm
              0.307318 = fieldWeight in 1031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.917088 = idf(docFreq=834, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.07963592 = weight(abstract_txt:languages in 1031) [ClassicSimilarity], result of:
            0.07963592 = score(doc=1031,freq=5.0), product of:
              0.109774575 = queryWeight, product of:
                1.1540998 = boost
                5.1908946 = idf(docFreq=634, maxDocs=41962)
                0.018323828 = queryNorm
              0.72544956 = fieldWeight in 1031, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.1908946 = idf(docFreq=634, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.02056943 = weight(abstract_txt:than in 1031) [ClassicSimilarity], result of:
            0.02056943 = score(doc=1031,freq=1.0), product of:
              0.08379412 = queryWeight, product of:
                1.16431 = boost
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.018323828 = queryNorm
              0.24547584 = fieldWeight in 1031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.012598041 = weight(abstract_txt:this in 1031) [ClassicSimilarity], result of:
            0.012598041 = score(doc=1031,freq=2.0), product of:
              0.057801183 = queryWeight, product of:
                1.2792318 = boost
                2.4658763 = idf(docFreq=9687, maxDocs=41962)
                0.018323828 = queryNorm
              0.21795473 = fieldWeight in 1031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4658763 = idf(docFreq=9687, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.024614468 = weight(abstract_txt:will in 1031) [ClassicSimilarity], result of:
            0.024614468 = score(doc=1031,freq=1.0), product of:
              0.10174092 = queryWeight, product of:
                1.4343815 = boost
                3.8709252 = idf(docFreq=2376, maxDocs=41962)
                0.018323828 = queryNorm
              0.24193282 = fieldWeight in 1031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8709252 = idf(docFreq=2376, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.02626311 = weight(abstract_txt:that in 1031) [ClassicSimilarity], result of:
            0.02626311 = score(doc=1031,freq=6.0), product of:
              0.07111696 = queryWeight, product of:
                1.6089395 = boost
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.018323828 = queryNorm
              0.3692946 = fieldWeight in 1031, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.08048952 = weight(abstract_txt:translation in 1031) [ClassicSimilarity], result of:
            0.08048952 = score(doc=1031,freq=1.0), product of:
              0.2080774 = queryWeight, product of:
                1.8347391 = boost
                6.1891985 = idf(docFreq=233, maxDocs=41962)
                0.018323828 = queryNorm
              0.3868249 = fieldWeight in 1031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1891985 = idf(docFreq=233, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.084025845 = weight(abstract_txt:multilingual in 1031) [ClassicSimilarity], result of:
            0.084025845 = score(doc=1031,freq=1.0), product of:
              0.21412824 = queryWeight, product of:
                1.8612248 = boost
                6.278544 = idf(docFreq=213, maxDocs=41962)
                0.018323828 = queryNorm
              0.392409 = fieldWeight in 1031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.278544 = idf(docFreq=213, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.056620058 = weight(abstract_txt:text in 1031) [ClassicSimilarity], result of:
            0.056620058 = score(doc=1031,freq=1.0), product of:
              0.22337037 = queryWeight, product of:
                3.0056932 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.018323828 = queryNorm
              0.2534806 = fieldWeight in 1031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.06969076 = weight(abstract_txt:retrieval in 1031) [ClassicSimilarity], result of:
            0.06969076 = score(doc=1031,freq=2.0), product of:
              0.22778659 = queryWeight, product of:
                3.5913684 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.018323828 = queryNorm
              0.30594757 = fieldWeight in 1031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.31784666 = weight(abstract_txt:cross in 1031) [ClassicSimilarity], result of:
            0.31784666 = score(doc=1031,freq=3.0), product of:
              0.5198489 = queryWeight, product of:
                5.022975 = boost
                5.6480675 = idf(docFreq=401, maxDocs=41962)
                0.018323828 = queryNorm
              0.6114212 = fieldWeight in 1031, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6480675 = idf(docFreq=401, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
          0.28150493 = weight(abstract_txt:language in 1031) [ClassicSimilarity], result of:
            0.28150493 = score(doc=1031,freq=5.0), product of:
              0.4794272 = queryWeight, product of:
                6.2274203 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.018323828 = queryNorm
              0.5871693 = fieldWeight in 1031, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.0625 = fieldNorm(doc=1031)
        0.6 = coord(15/25)
    
  2. Oard, D.W.: Serving users in many languages : cross-language information retrieval for digital libraries (1997) 0.57
    0.5703311 = sum of:
      0.5703311 = product of:
        1.0184484 = sum of:
          0.013937808 = weight(abstract_txt:more in 3262) [ClassicSimilarity], result of:
            0.013937808 = score(doc=3262,freq=1.0), product of:
              0.06464402 = queryWeight, product of:
                1.0226476 = boost
                3.449738 = idf(docFreq=3621, maxDocs=41962)
                0.018323828 = queryNorm
              0.21560863 = fieldWeight in 3262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.449738 = idf(docFreq=3621, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.06168572 = weight(abstract_txt:languages in 3262) [ClassicSimilarity], result of:
            0.06168572 = score(doc=3262,freq=3.0), product of:
              0.109774575 = queryWeight, product of:
                1.1540998 = boost
                5.1908946 = idf(docFreq=634, maxDocs=41962)
                0.018323828 = queryNorm
              0.56193084 = fieldWeight in 3262, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1908946 = idf(docFreq=634, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.02056943 = weight(abstract_txt:than in 3262) [ClassicSimilarity], result of:
            0.02056943 = score(doc=3262,freq=1.0), product of:
              0.08379412 = queryWeight, product of:
                1.16431 = boost
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.018323828 = queryNorm
              0.24547584 = fieldWeight in 3262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.044761058 = weight(abstract_txt:ability in 3262) [ClassicSimilarity], result of:
            0.044761058 = score(doc=3262,freq=1.0), product of:
              0.12784566 = queryWeight, product of:
                1.2454764 = boost
                5.6018867 = idf(docFreq=420, maxDocs=41962)
                0.018323828 = queryNorm
              0.35011792 = fieldWeight in 3262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6018867 = idf(docFreq=420, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.008908161 = weight(abstract_txt:this in 3262) [ClassicSimilarity], result of:
            0.008908161 = score(doc=3262,freq=1.0), product of:
              0.057801183 = queryWeight, product of:
                1.2792318 = boost
                2.4658763 = idf(docFreq=9687, maxDocs=41962)
                0.018323828 = queryNorm
              0.15411727 = fieldWeight in 3262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4658763 = idf(docFreq=9687, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.024614468 = weight(abstract_txt:will in 3262) [ClassicSimilarity], result of:
            0.024614468 = score(doc=3262,freq=1.0), product of:
              0.10174092 = queryWeight, product of:
                1.4343815 = boost
                3.8709252 = idf(docFreq=2376, maxDocs=41962)
                0.018323828 = queryNorm
              0.24193282 = fieldWeight in 3262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8709252 = idf(docFreq=2376, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.029677318 = weight(abstract_txt:documents in 3262) [ClassicSimilarity], result of:
            0.029677318 = score(doc=3262,freq=1.0), product of:
              0.11525288 = queryWeight, product of:
                1.5266615 = boost
                4.1199584 = idf(docFreq=1852, maxDocs=41962)
                0.018323828 = queryNorm
              0.2574974 = fieldWeight in 3262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1199584 = idf(docFreq=1852, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.01072187 = weight(abstract_txt:that in 3262) [ClassicSimilarity], result of:
            0.01072187 = score(doc=3262,freq=1.0), product of:
              0.07111696 = queryWeight, product of:
                1.6089395 = boost
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.018323828 = queryNorm
              0.15076388 = fieldWeight in 3262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.113829374 = weight(abstract_txt:translation in 3262) [ClassicSimilarity], result of:
            0.113829374 = score(doc=3262,freq=2.0), product of:
              0.2080774 = queryWeight, product of:
                1.8347391 = boost
                6.1891985 = idf(docFreq=233, maxDocs=41962)
                0.018323828 = queryNorm
              0.54705304 = fieldWeight in 3262, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1891985 = idf(docFreq=233, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.118830495 = weight(abstract_txt:multilingual in 3262) [ClassicSimilarity], result of:
            0.118830495 = score(doc=3262,freq=2.0), product of:
              0.21412824 = queryWeight, product of:
                1.8612248 = boost
                6.278544 = idf(docFreq=213, maxDocs=41962)
                0.018323828 = queryNorm
              0.5549501 = fieldWeight in 3262, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.278544 = idf(docFreq=213, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.056620058 = weight(abstract_txt:text in 3262) [ClassicSimilarity], result of:
            0.056620058 = score(doc=3262,freq=1.0), product of:
              0.22337037 = queryWeight, product of:
                3.0056932 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.018323828 = queryNorm
              0.2534806 = fieldWeight in 3262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.049278803 = weight(abstract_txt:retrieval in 3262) [ClassicSimilarity], result of:
            0.049278803 = score(doc=3262,freq=1.0), product of:
              0.22778659 = queryWeight, product of:
                3.5913684 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.018323828 = queryNorm
              0.2163376 = fieldWeight in 3262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.18350884 = weight(abstract_txt:cross in 3262) [ClassicSimilarity], result of:
            0.18350884 = score(doc=3262,freq=1.0), product of:
              0.5198489 = queryWeight, product of:
                5.022975 = boost
                5.6480675 = idf(docFreq=401, maxDocs=41962)
                0.018323828 = queryNorm
              0.35300422 = fieldWeight in 3262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6480675 = idf(docFreq=401, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
          0.28150493 = weight(abstract_txt:language in 3262) [ClassicSimilarity], result of:
            0.28150493 = score(doc=3262,freq=5.0), product of:
              0.4794272 = queryWeight, product of:
                6.2274203 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.018323828 = queryNorm
              0.5871693 = fieldWeight in 3262, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.0625 = fieldNorm(doc=3262)
        0.56 = coord(14/25)
    
  3. López-Ostenero, F.; Peinado, V.; Gonzalo, J.; Verdejo, F.: Interactive question answering : Is Cross-Language harder than monolingual searching? (2008) 0.55
    0.55061674 = sum of:
      0.55061674 = product of:
        1.2514017 = sum of:
          0.024014588 = weight(abstract_txt:systems in 4024) [ClassicSimilarity], result of:
            0.024014588 = score(doc=4024,freq=2.0), product of:
              0.063547544 = queryWeight, product of:
                1.0139376 = boost
                3.4203563 = idf(docFreq=3729, maxDocs=41962)
                0.018323828 = queryNorm
              0.37789956 = fieldWeight in 4024, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4203563 = idf(docFreq=3729, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
          0.024638796 = weight(abstract_txt:more in 4024) [ClassicSimilarity], result of:
            0.024638796 = score(doc=4024,freq=2.0), product of:
              0.06464402 = queryWeight, product of:
                1.0226476 = boost
                3.449738 = idf(docFreq=3621, maxDocs=41962)
                0.018323828 = queryNorm
              0.3811458 = fieldWeight in 4024, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.449738 = idf(docFreq=3621, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
          0.044534124 = weight(abstract_txt:than in 4024) [ClassicSimilarity], result of:
            0.044534124 = score(doc=4024,freq=3.0), product of:
              0.08379412 = queryWeight, product of:
                1.16431 = boost
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.018323828 = queryNorm
              0.5314708 = fieldWeight in 4024, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
          0.01574755 = weight(abstract_txt:this in 4024) [ClassicSimilarity], result of:
            0.01574755 = score(doc=4024,freq=2.0), product of:
              0.057801183 = queryWeight, product of:
                1.2792318 = boost
                2.4658763 = idf(docFreq=9687, maxDocs=41962)
                0.018323828 = queryNorm
              0.2724434 = fieldWeight in 4024, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4658763 = idf(docFreq=9687, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
          0.03709665 = weight(abstract_txt:documents in 4024) [ClassicSimilarity], result of:
            0.03709665 = score(doc=4024,freq=1.0), product of:
              0.11525288 = queryWeight, product of:
                1.5266615 = boost
                4.1199584 = idf(docFreq=1852, maxDocs=41962)
                0.018323828 = queryNorm
              0.32187176 = fieldWeight in 4024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1199584 = idf(docFreq=1852, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
          0.018953765 = weight(abstract_txt:that in 4024) [ClassicSimilarity], result of:
            0.018953765 = score(doc=4024,freq=2.0), product of:
              0.07111696 = queryWeight, product of:
                1.6089395 = boost
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.018323828 = queryNorm
              0.2665154 = fieldWeight in 4024, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
          0.22881655 = weight(abstract_txt:monolingual in 4024) [ClassicSimilarity], result of:
            0.22881655 = score(doc=4024,freq=2.0), product of:
              0.25949407 = queryWeight, product of:
                1.7744191 = boost
                7.980958 = idf(docFreq=38, maxDocs=41962)
                0.018323828 = queryNorm
              0.88177955 = fieldWeight in 4024, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.980958 = idf(docFreq=38, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
          0.100611895 = weight(abstract_txt:translation in 4024) [ClassicSimilarity], result of:
            0.100611895 = score(doc=4024,freq=1.0), product of:
              0.2080774 = queryWeight, product of:
                1.8347391 = boost
                6.1891985 = idf(docFreq=233, maxDocs=41962)
                0.018323828 = queryNorm
              0.48353112 = fieldWeight in 4024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1891985 = idf(docFreq=233, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
          0.08711344 = weight(abstract_txt:retrieval in 4024) [ClassicSimilarity], result of:
            0.08711344 = score(doc=4024,freq=2.0), product of:
              0.22778659 = queryWeight, product of:
                3.5913684 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.018323828 = queryNorm
              0.38243446 = fieldWeight in 4024, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
          0.3973083 = weight(abstract_txt:cross in 4024) [ClassicSimilarity], result of:
            0.3973083 = score(doc=4024,freq=3.0), product of:
              0.5198489 = queryWeight, product of:
                5.022975 = boost
                5.6480675 = idf(docFreq=401, maxDocs=41962)
                0.018323828 = queryNorm
              0.7642765 = fieldWeight in 4024, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6480675 = idf(docFreq=401, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
          0.27256596 = weight(abstract_txt:language in 4024) [ClassicSimilarity], result of:
            0.27256596 = score(doc=4024,freq=3.0), product of:
              0.4794272 = queryWeight, product of:
                6.2274203 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.018323828 = queryNorm
              0.5685242 = fieldWeight in 4024, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.078125 = fieldNorm(doc=4024)
        0.44 = coord(11/25)
    
  4. Francu, V.: Language-independent structures and multilingual information access (2003) 0.53
    0.5326227 = sum of:
      0.5326227 = product of:
        1.0242745 = sum of:
          0.02058822 = weight(abstract_txt:systems in 3754) [ClassicSimilarity], result of:
            0.02058822 = score(doc=3754,freq=3.0), product of:
              0.063547544 = queryWeight, product of:
                1.0139376 = boost
                3.4203563 = idf(docFreq=3729, maxDocs=41962)
                0.018323828 = queryNorm
              0.32398134 = fieldWeight in 3754, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4203563 = idf(docFreq=3729, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.012195582 = weight(abstract_txt:more in 3754) [ClassicSimilarity], result of:
            0.012195582 = score(doc=3754,freq=1.0), product of:
              0.06464402 = queryWeight, product of:
                1.0226476 = boost
                3.449738 = idf(docFreq=3621, maxDocs=41962)
                0.018323828 = queryNorm
              0.18865755 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.449738 = idf(docFreq=3621, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.031162482 = weight(abstract_txt:languages in 3754) [ClassicSimilarity], result of:
            0.031162482 = score(doc=3754,freq=1.0), product of:
              0.109774575 = queryWeight, product of:
                1.1540998 = boost
                5.1908946 = idf(docFreq=634, maxDocs=41962)
                0.018323828 = queryNorm
              0.28387704 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1908946 = idf(docFreq=634, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.025453372 = weight(abstract_txt:than in 3754) [ClassicSimilarity], result of:
            0.025453372 = score(doc=3754,freq=2.0), product of:
              0.08379412 = queryWeight, product of:
                1.16431 = boost
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.018323828 = queryNorm
              0.30376086 = fieldWeight in 3754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.054156378 = weight(abstract_txt:able in 3754) [ClassicSimilarity], result of:
            0.054156378 = score(doc=3754,freq=2.0), product of:
              0.12594187 = queryWeight, product of:
                1.2361681 = boost
                5.5600204 = idf(docFreq=438, maxDocs=41962)
                0.018323828 = queryNorm
              0.43001089 = fieldWeight in 3754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5600204 = idf(docFreq=438, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.011023286 = weight(abstract_txt:this in 3754) [ClassicSimilarity], result of:
            0.011023286 = score(doc=3754,freq=2.0), product of:
              0.057801183 = queryWeight, product of:
                1.2792318 = boost
                2.4658763 = idf(docFreq=9687, maxDocs=41962)
                0.018323828 = queryNorm
              0.19071038 = fieldWeight in 3754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4658763 = idf(docFreq=9687, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.009381636 = weight(abstract_txt:that in 3754) [ClassicSimilarity], result of:
            0.009381636 = score(doc=3754,freq=1.0), product of:
              0.07111696 = queryWeight, product of:
                1.6089395 = boost
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.018323828 = queryNorm
              0.1319184 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.070428334 = weight(abstract_txt:translation in 3754) [ClassicSimilarity], result of:
            0.070428334 = score(doc=3754,freq=1.0), product of:
              0.2080774 = queryWeight, product of:
                1.8347391 = boost
                6.1891985 = idf(docFreq=233, maxDocs=41962)
                0.018323828 = queryNorm
              0.3384718 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1891985 = idf(docFreq=233, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.1273449 = weight(abstract_txt:multilingual in 3754) [ClassicSimilarity], result of:
            0.1273449 = score(doc=3754,freq=3.0), product of:
              0.21412824 = queryWeight, product of:
                1.8612248 = boost
                6.278544 = idf(docFreq=213, maxDocs=41962)
                0.018323828 = queryNorm
              0.5947133 = fieldWeight in 3754, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.278544 = idf(docFreq=213, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.04954255 = weight(abstract_txt:text in 3754) [ClassicSimilarity], result of:
            0.04954255 = score(doc=3754,freq=1.0), product of:
              0.22337037 = queryWeight, product of:
                3.0056932 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.018323828 = queryNorm
              0.22179553 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.12195882 = weight(abstract_txt:retrieval in 3754) [ClassicSimilarity], result of:
            0.12195882 = score(doc=3754,freq=8.0), product of:
              0.22778659 = queryWeight, product of:
                3.5913684 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.018323828 = queryNorm
              0.53540826 = fieldWeight in 3754, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.16057025 = weight(abstract_txt:cross in 3754) [ClassicSimilarity], result of:
            0.16057025 = score(doc=3754,freq=1.0), product of:
              0.5198489 = queryWeight, product of:
                5.022975 = boost
                5.6480675 = idf(docFreq=401, maxDocs=41962)
                0.018323828 = queryNorm
              0.3088787 = fieldWeight in 3754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6480675 = idf(docFreq=401, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
          0.33046868 = weight(abstract_txt:language in 3754) [ClassicSimilarity], result of:
            0.33046868 = score(doc=3754,freq=9.0), product of:
              0.4794272 = queryWeight, product of:
                6.2274203 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.018323828 = queryNorm
              0.689299 = fieldWeight in 3754, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.0546875 = fieldNorm(doc=3754)
        0.52 = coord(13/25)
    
  5. Chowdhury, G.G.: Natural language processing (2002) 0.45
    0.445186 = sum of:
      0.445186 = product of:
        1.112965 = sum of:
          0.024014588 = weight(abstract_txt:systems in 285) [ClassicSimilarity], result of:
            0.024014588 = score(doc=285,freq=2.0), product of:
              0.063547544 = queryWeight, product of:
                1.0139376 = boost
                3.4203563 = idf(docFreq=3729, maxDocs=41962)
                0.018323828 = queryNorm
              0.37789956 = fieldWeight in 285, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4203563 = idf(docFreq=3729, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.044517834 = weight(abstract_txt:languages in 285) [ClassicSimilarity], result of:
            0.044517834 = score(doc=285,freq=1.0), product of:
              0.109774575 = queryWeight, product of:
                1.1540998 = boost
                5.1908946 = idf(docFreq=634, maxDocs=41962)
                0.018323828 = queryNorm
              0.40553865 = fieldWeight in 285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1908946 = idf(docFreq=634, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.023213526 = weight(abstract_txt:that in 285) [ClassicSimilarity], result of:
            0.023213526 = score(doc=285,freq=3.0), product of:
              0.07111696 = queryWeight, product of:
                1.6089395 = boost
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.018323828 = queryNorm
              0.32641336 = fieldWeight in 285, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.100611895 = weight(abstract_txt:translation in 285) [ClassicSimilarity], result of:
            0.100611895 = score(doc=285,freq=1.0), product of:
              0.2080774 = queryWeight, product of:
                1.8347391 = boost
                6.1891985 = idf(docFreq=233, maxDocs=41962)
                0.018323828 = queryNorm
              0.48353112 = fieldWeight in 285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1891985 = idf(docFreq=233, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.10503231 = weight(abstract_txt:multilingual in 285) [ClassicSimilarity], result of:
            0.10503231 = score(doc=285,freq=1.0), product of:
              0.21412824 = queryWeight, product of:
                1.8612248 = boost
                6.278544 = idf(docFreq=213, maxDocs=41962)
                0.018323828 = queryNorm
              0.49051124 = fieldWeight in 285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.278544 = idf(docFreq=213, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.07261806 = weight(abstract_txt:useful in 285) [ClassicSimilarity], result of:
            0.07261806 = score(doc=285,freq=1.0), product of:
              0.19165443 = queryWeight, product of:
                2.1565866 = boost
                4.849933 = idf(docFreq=892, maxDocs=41962)
                0.018323828 = queryNorm
              0.37890103 = fieldWeight in 285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.849933 = idf(docFreq=892, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.10009106 = weight(abstract_txt:text in 285) [ClassicSimilarity], result of:
            0.10009106 = score(doc=285,freq=2.0), product of:
              0.22337037 = queryWeight, product of:
                3.0056932 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.018323828 = queryNorm
              0.44809464 = fieldWeight in 285, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.061598506 = weight(abstract_txt:retrieval in 285) [ClassicSimilarity], result of:
            0.061598506 = score(doc=285,freq=1.0), product of:
              0.22778659 = queryWeight, product of:
                3.5913684 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.018323828 = queryNorm
              0.270422 = fieldWeight in 285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.22938606 = weight(abstract_txt:cross in 285) [ClassicSimilarity], result of:
            0.22938606 = score(doc=285,freq=1.0), product of:
              0.5198489 = queryWeight, product of:
                5.022975 = boost
                5.6480675 = idf(docFreq=401, maxDocs=41962)
                0.018323828 = queryNorm
              0.44125527 = fieldWeight in 285, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6480675 = idf(docFreq=401, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
          0.35188115 = weight(abstract_txt:language in 285) [ClassicSimilarity], result of:
            0.35188115 = score(doc=285,freq=5.0), product of:
              0.4794272 = queryWeight, product of:
                6.2274203 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.018323828 = queryNorm
              0.7339616 = fieldWeight in 285, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.078125 = fieldNorm(doc=285)
        0.4 = coord(10/25)