Document (#24299)

Author
Sheridan, P.
Ballerini, J.P.
Schäuble, P.
Title
Building a large multilingual test collection from comparable news documents
Source
Cross-language information retrieval. Ed.: G. Grefenstette
Imprint
Boston, MA : Kluwer Academic Publ.
Year
1998
Pages
S.137-150
Series
The Kluwer International series on information retrieval
Theme
Multilinguale Probleme
Retrievalstudien

Similar documents (author)

  1. Knaus, D.; Mittendorf, E.; Schäuble, P.; Sheridan, P.: Is recall relevant? : An analysis how user interface conditions affect strategies and performance in large scale text retrieval (1996) 4.13
    4.1349206 = sum of:
      4.1349206 = sum of:
        1.7066147 = weight(author_txt:schäuble in 7570) [ClassicSimilarity], result of:
          1.7066147 = score(doc=7570,freq=1.0), product of:
            0.62012804 = queryWeight, product of:
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.07041696 = queryNorm
            2.752036 = fieldWeight in 7570, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.3125 = fieldNorm(doc=7570)
        2.4283059 = weight(author_txt:sheridan in 7570) [ClassicSimilarity], result of:
          2.4283059 = score(doc=7570,freq=1.0), product of:
            0.7845006 = queryWeight, product of:
              1.1247499 = boost
              9.905128 = idf(docFreq=5, maxDocs=44218)
              0.07041696 = queryNorm
            3.0953524 = fieldWeight in 7570, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.905128 = idf(docFreq=5, maxDocs=44218)
              0.3125 = fieldNorm(doc=7570)
    
  2. Ballerini, J.-P.; Büchel, M.; Domenig, R.; Knaus, D.; Mateev, B.; Mittendorf, E.; Schäuble, P.; Wechsler, M.; Sheridan, P.: SPIDER retrieval system at TREC-5 (1997) 2.89
    2.8944445 = sum of:
      2.8944445 = sum of:
        1.1946304 = weight(author_txt:schäuble in 3104) [ClassicSimilarity], result of:
          1.1946304 = score(doc=3104,freq=1.0), product of:
            0.62012804 = queryWeight, product of:
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.07041696 = queryNorm
            1.9264253 = fieldWeight in 3104, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.806516 = idf(docFreq=17, maxDocs=44218)
              0.21875 = fieldNorm(doc=3104)
        1.699814 = weight(author_txt:sheridan in 3104) [ClassicSimilarity], result of:
          1.699814 = score(doc=3104,freq=1.0), product of:
            0.7845006 = queryWeight, product of:
              1.1247499 = boost
              9.905128 = idf(docFreq=5, maxDocs=44218)
              0.07041696 = queryNorm
            2.1667466 = fieldWeight in 3104, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.905128 = idf(docFreq=5, maxDocs=44218)
              0.21875 = fieldNorm(doc=3104)
    
  3. Sheridan, P.; Smeaton, A.F.: ¬The application of morpho-syntactic language processing to effective phrase matching (1992) 1.94
    1.9426446 = sum of:
      1.9426446 = product of:
        3.8852892 = sum of:
          3.8852892 = weight(author_txt:sheridan in 6575) [ClassicSimilarity], result of:
            3.8852892 = score(doc=6575,freq=1.0), product of:
              0.7845006 = queryWeight, product of:
                1.1247499 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.07041696 = queryNorm
              4.952564 = fieldWeight in 6575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.5 = fieldNorm(doc=6575)
        0.5 = coord(1/2)
    
  4. Stein, M.J.; Sheridan, C.R.: Hypertext and the identity link (1990) 1.94
    1.9426446 = sum of:
      1.9426446 = product of:
        3.8852892 = sum of:
          3.8852892 = weight(author_txt:sheridan in 5707) [ClassicSimilarity], result of:
            3.8852892 = score(doc=5707,freq=1.0), product of:
              0.7845006 = queryWeight, product of:
                1.1247499 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.07041696 = queryNorm
              4.952564 = fieldWeight in 5707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.5 = fieldNorm(doc=5707)
        0.5 = coord(1/2)
    
  5. Schäuble, P.: Information retrieval based on information structures (1989) 1.71
    1.7066147 = sum of:
      1.7066147 = product of:
        3.4132295 = sum of:
          3.4132295 = weight(author_txt:schäuble in 8814) [ClassicSimilarity], result of:
            3.4132295 = score(doc=8814,freq=1.0), product of:
              0.62012804 = queryWeight, product of:
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.07041696 = queryNorm
              5.504072 = fieldWeight in 8814, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.625 = fieldNorm(doc=8814)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Talvensaari, T.; Juhola, M.; Laurikkala, J.; Järvelin, K.: Corpus-based cross-language information retrieval in retrieval of highly relevant documents (2007) 0.48
    0.4776199 = sum of:
      0.4776199 = product of:
        0.7164298 = sum of:
          0.014931852 = weight(abstract_txt:from in 139) [ClassicSimilarity], result of:
            0.014931852 = score(doc=139,freq=1.0), product of:
              0.08643986 = queryWeight, product of:
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.031274796 = queryNorm
              0.17274266 = fieldWeight in 139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
          0.11069894 = weight(abstract_txt:documents in 139) [ClassicSimilarity], result of:
            0.11069894 = score(doc=139,freq=5.0), product of:
              0.1921958 = queryWeight, product of:
                1.491128 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.031274796 = queryNorm
              0.5759696 = fieldWeight in 139, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
          0.062493037 = weight(abstract_txt:large in 139) [ClassicSimilarity], result of:
            0.062493037 = score(doc=139,freq=1.0), product of:
              0.22448778 = queryWeight, product of:
                1.6115334 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.031274796 = queryNorm
              0.27838057 = fieldWeight in 139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
          0.07103848 = weight(abstract_txt:collection in 139) [ClassicSimilarity], result of:
            0.07103848 = score(doc=139,freq=1.0), product of:
              0.24451229 = queryWeight, product of:
                1.6818734 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.031274796 = queryNorm
              0.2905313 = fieldWeight in 139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
          0.12854898 = weight(abstract_txt:test in 139) [ClassicSimilarity], result of:
            0.12854898 = score(doc=139,freq=2.0), product of:
              0.2881868 = queryWeight, product of:
                1.8259126 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.031274796 = queryNorm
              0.44606134 = fieldWeight in 139, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
          0.32871854 = weight(abstract_txt:comparable in 139) [ClassicSimilarity], result of:
            0.32871854 = score(doc=139,freq=2.0), product of:
              0.5389036 = queryWeight, product of:
                2.4968848 = boost
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.031274796 = queryNorm
              0.60997653 = fieldWeight in 139, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
        0.6666667 = coord(6/9)
    
  2. Chen, H.-H.; Kuo, J.-J.; Huang, S.-J.; Lin, C.-J.; Wung, H.-C.: ¬A summarization system for Chinese news from multiple sources (2003) 0.41
    0.40923944 = sum of:
      0.40923944 = product of:
        0.736631 = sum of:
          0.026396036 = weight(abstract_txt:from in 2115) [ClassicSimilarity], result of:
            0.026396036 = score(doc=2115,freq=2.0), product of:
              0.08643986 = queryWeight, product of:
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.031274796 = queryNorm
              0.30536878 = fieldWeight in 2115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=2115)
          0.08751519 = weight(abstract_txt:documents in 2115) [ClassicSimilarity], result of:
            0.08751519 = score(doc=2115,freq=2.0), product of:
              0.1921958 = queryWeight, product of:
                1.491128 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.031274796 = queryNorm
              0.4553439 = fieldWeight in 2115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=2115)
          0.0781163 = weight(abstract_txt:large in 2115) [ClassicSimilarity], result of:
            0.0781163 = score(doc=2115,freq=1.0), product of:
              0.22448778 = queryWeight, product of:
                1.6115334 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.031274796 = queryNorm
              0.34797573 = fieldWeight in 2115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=2115)
          0.3236897 = weight(abstract_txt:news in 2115) [ClassicSimilarity], result of:
            0.3236897 = score(doc=2115,freq=3.0), product of:
              0.40155384 = queryWeight, product of:
                2.1553354 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.031274796 = queryNorm
              0.8060929 = fieldWeight in 2115, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.078125 = fieldNorm(doc=2115)
          0.22091377 = weight(abstract_txt:multilingual in 2115) [ClassicSimilarity], result of:
            0.22091377 = score(doc=2115,freq=1.0), product of:
              0.44893155 = queryWeight, product of:
                2.2789407 = boost
                6.2987247 = idf(docFreq=220, maxDocs=44218)
                0.031274796 = queryNorm
              0.49208787 = fieldWeight in 2115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2987247 = idf(docFreq=220, maxDocs=44218)
                0.078125 = fieldNorm(doc=2115)
        0.5555556 = coord(5/9)
    
  3. Chen, H.; Lally, A.M.; Zhu, B.; Chau, M.: HelpfulMed : Intelligent searching for medical information over the Internet (2003) 0.36
    0.36120832 = sum of:
      0.36120832 = product of:
        0.5418125 = sum of:
          0.014931852 = weight(abstract_txt:from in 1615) [ClassicSimilarity], result of:
            0.014931852 = score(doc=1615,freq=1.0), product of:
              0.08643986 = queryWeight, product of:
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.031274796 = queryNorm
              0.17274266 = fieldWeight in 1615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=1615)
          0.07001215 = weight(abstract_txt:documents in 1615) [ClassicSimilarity], result of:
            0.07001215 = score(doc=1615,freq=2.0), product of:
              0.1921958 = queryWeight, product of:
                1.491128 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.031274796 = queryNorm
              0.36427513 = fieldWeight in 1615, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1615)
          0.062493037 = weight(abstract_txt:large in 1615) [ClassicSimilarity], result of:
            0.062493037 = score(doc=1615,freq=1.0), product of:
              0.22448778 = queryWeight, product of:
                1.6115334 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.031274796 = queryNorm
              0.27838057 = fieldWeight in 1615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=1615)
          0.07103848 = weight(abstract_txt:collection in 1615) [ClassicSimilarity], result of:
            0.07103848 = score(doc=1615,freq=1.0), product of:
              0.24451229 = queryWeight, product of:
                1.6818734 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.031274796 = queryNorm
              0.2905313 = fieldWeight in 1615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=1615)
          0.09089786 = weight(abstract_txt:test in 1615) [ClassicSimilarity], result of:
            0.09089786 = score(doc=1615,freq=1.0), product of:
              0.2881868 = queryWeight, product of:
                1.8259126 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.031274796 = queryNorm
              0.315413 = fieldWeight in 1615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.0625 = fieldNorm(doc=1615)
          0.23243912 = weight(abstract_txt:comparable in 1615) [ClassicSimilarity], result of:
            0.23243912 = score(doc=1615,freq=1.0), product of:
              0.5389036 = queryWeight, product of:
                2.4968848 = boost
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.031274796 = queryNorm
              0.43131855 = fieldWeight in 1615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.0625 = fieldNorm(doc=1615)
        0.6666667 = coord(6/9)
    
  4. Spina, D.; Trippas, J.R.; Cavedon, L.; Sanderson, M.: Extracting audio summaries to support effective spoken document search (2017) 0.35
    0.34942386 = sum of:
      0.34942386 = product of:
        0.62896293 = sum of:
          0.03732963 = weight(abstract_txt:from in 3788) [ClassicSimilarity], result of:
            0.03732963 = score(doc=3788,freq=4.0), product of:
              0.08643986 = queryWeight, product of:
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.031274796 = queryNorm
              0.43185666 = fieldWeight in 3788, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=3788)
          0.061882585 = weight(abstract_txt:documents in 3788) [ClassicSimilarity], result of:
            0.061882585 = score(doc=3788,freq=1.0), product of:
              0.1921958 = queryWeight, product of:
                1.491128 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.031274796 = queryNorm
              0.32197678 = fieldWeight in 3788, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=3788)
          0.12557948 = weight(abstract_txt:collection in 3788) [ClassicSimilarity], result of:
            0.12557948 = score(doc=3788,freq=2.0), product of:
              0.24451229 = queryWeight, product of:
                1.6818734 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.031274796 = queryNorm
              0.51359165 = fieldWeight in 3788, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.078125 = fieldNorm(doc=3788)
          0.11362232 = weight(abstract_txt:test in 3788) [ClassicSimilarity], result of:
            0.11362232 = score(doc=3788,freq=1.0), product of:
              0.2881868 = queryWeight, product of:
                1.8259126 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.031274796 = queryNorm
              0.39426625 = fieldWeight in 3788, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.078125 = fieldNorm(doc=3788)
          0.29054892 = weight(abstract_txt:comparable in 3788) [ClassicSimilarity], result of:
            0.29054892 = score(doc=3788,freq=1.0), product of:
              0.5389036 = queryWeight, product of:
                2.4968848 = boost
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.031274796 = queryNorm
              0.5391482 = fieldWeight in 3788, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.078125 = fieldNorm(doc=3788)
        0.5555556 = coord(5/9)
    
  5. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 0.33
    0.3346138 = sum of:
      0.3346138 = product of:
        0.6023048 = sum of:
          0.013065372 = weight(abstract_txt:from in 1683) [ClassicSimilarity], result of:
            0.013065372 = score(doc=1683,freq=1.0), product of:
              0.08643986 = queryWeight, product of:
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.031274796 = queryNorm
              0.15114984 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.054681405 = weight(abstract_txt:large in 1683) [ClassicSimilarity], result of:
            0.054681405 = score(doc=1683,freq=1.0), product of:
              0.22448778 = queryWeight, product of:
                1.6115334 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.031274796 = queryNorm
              0.243583 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.112480365 = weight(abstract_txt:test in 1683) [ClassicSimilarity], result of:
            0.112480365 = score(doc=1683,freq=2.0), product of:
              0.2881868 = queryWeight, product of:
                1.8259126 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.031274796 = queryNorm
              0.39030367 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.21869346 = weight(abstract_txt:multilingual in 1683) [ClassicSimilarity], result of:
            0.21869346 = score(doc=1683,freq=2.0), product of:
              0.44893155 = queryWeight, product of:
                2.2789407 = boost
                6.2987247 = idf(docFreq=220, maxDocs=44218)
                0.031274796 = queryNorm
              0.48714212 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2987247 = idf(docFreq=220, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.20338424 = weight(abstract_txt:comparable in 1683) [ClassicSimilarity], result of:
            0.20338424 = score(doc=1683,freq=1.0), product of:
              0.5389036 = queryWeight, product of:
                2.4968848 = boost
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.031274796 = queryNorm
              0.37740374 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
        0.5555556 = coord(5/9)