Document (#24300)

Author
Sheridan, P.
Ballerini, J.P.
Schäuble, P.
Title
Building a large multilingual test collection from comparable news documents
Source
Cross-language information retrieval. Ed.: G. Grefenstette
Imprint
Boston, MA : Kluwer Academic Publ.
Year
1998
Pages
pp. 137-150
Series
The Kluwer International series on information retrieval
Theme
Multilingual problems
Retrieval studies

Similar documents (author)

  1. Knaus, D.; Mittendorf, E.; Schäuble, P.; Sheridan, P.: Is recall relevant? : An analysis how user interface conditions affect strategies and performance in large scale text retrieval (1996) 4.13
    4.1251807 = sum of:
      4.1251807 = sum of:
        1.7017602 = weight(author_txt:schäuble in 640) [ClassicSimilarity], result of:
          1.7017602 = score(doc=640,freq=1.0), product of:
            0.61991566 = queryWeight, product of:
              8.784473 = idf(docFreq=17, maxDocs=43254)
              0.07056947 = queryNorm
            2.745148 = fieldWeight in 640, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.784473 = idf(docFreq=17, maxDocs=43254)
              0.3125 = fieldNorm(doc=640)
        2.4234207 = weight(author_txt:sheridan in 640) [ClassicSimilarity], result of:
          2.4234207 = score(doc=640,freq=1.0), product of:
            0.7846685 = queryWeight, product of:
              1.125063 = boost
              9.883085 = idf(docFreq=5, maxDocs=43254)
              0.07056947 = queryNorm
            3.0884643 = fieldWeight in 640, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.883085 = idf(docFreq=5, maxDocs=43254)
              0.3125 = fieldNorm(doc=640)
    
  2. Ballerini, J.-P.; Büchel, M.; Domenig, R.; Knaus, D.; Mateev, B.; Mittendorf, E.; Schäuble, P.; Wechsler, M.; Sheridan, P.: SPIDER retrieval system at TREC-5 (1997) 2.89
    2.8876266 = sum of:
      2.8876266 = sum of:
        1.1912322 = weight(author_txt:schäuble in 5105) [ClassicSimilarity], result of:
          1.1912322 = score(doc=5105,freq=1.0), product of:
            0.61991566 = queryWeight, product of:
              8.784473 = idf(docFreq=17, maxDocs=43254)
              0.07056947 = queryNorm
            1.9216036 = fieldWeight in 5105, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.784473 = idf(docFreq=17, maxDocs=43254)
              0.21875 = fieldNorm(doc=5105)
        1.6963943 = weight(author_txt:sheridan in 5105) [ClassicSimilarity], result of:
          1.6963943 = score(doc=5105,freq=1.0), product of:
            0.7846685 = queryWeight, product of:
              1.125063 = boost
              9.883085 = idf(docFreq=5, maxDocs=43254)
              0.07056947 = queryNorm
            2.1619248 = fieldWeight in 5105, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.883085 = idf(docFreq=5, maxDocs=43254)
              0.21875 = fieldNorm(doc=5105)
    
  3. Sheridan, P.; Smeaton, A.F.: The application of morpho-syntactic language processing to effective phrase matching (1992) 1.94
    1.9387364 = sum of:
      1.9387364 = product of:
        3.8774729 = sum of:
          3.8774729 = weight(author_txt:sheridan in 6575) [ClassicSimilarity], result of:
            3.8774729 = score(doc=6575,freq=1.0), product of:
              0.7846685 = queryWeight, product of:
                1.125063 = boost
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.07056947 = queryNorm
              4.9415426 = fieldWeight in 6575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.5 = fieldNorm(doc=6575)
        0.5 = coord(1/2)
    
  4. Stein, M.J.; Sheridan, C.R.: Hypertext and the identity link (1990) 1.94
    1.9387364 = sum of:
      1.9387364 = product of:
        3.8774729 = sum of:
          3.8774729 = weight(author_txt:sheridan in 6776) [ClassicSimilarity], result of:
            3.8774729 = score(doc=6776,freq=1.0), product of:
              0.7846685 = queryWeight, product of:
                1.125063 = boost
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.07056947 = queryNorm
              4.9415426 = fieldWeight in 6776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.5 = fieldNorm(doc=6776)
        0.5 = coord(1/2)
    
  5. Schäuble, P.: Information retrieval based on information structures (1989) 1.70
    1.7017602 = sum of:
      1.7017602 = product of:
        3.4035203 = sum of:
          3.4035203 = weight(author_txt:schäuble in 814) [ClassicSimilarity], result of:
            3.4035203 = score(doc=814,freq=1.0), product of:
              0.61991566 = queryWeight, product of:
                8.784473 = idf(docFreq=17, maxDocs=43254)
                0.07056947 = queryNorm
              5.490296 = fieldWeight in 814, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.784473 = idf(docFreq=17, maxDocs=43254)
                0.625 = fieldNorm(doc=814)
        0.5 = coord(1/2)
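
The breakdowns above follow Lucene's ClassicSimilarity (TF-IDF) scoring. As a minimal sketch, assuming the standard ClassicSimilarity definitions idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq) and queryNorm = 1 / sqrt(sum of squared boost * idf), the Python below recomputes the figures for entries 1 and 3. The function names are illustrative and not part of Lucene; boost, fieldNorm, docFreq and maxDocs are copied from the listing.

```python
import math

MAX_DOCS = 43254  # maxDocs as shown in the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    # Inverse document frequency (assumed standard ClassicSimilarity form).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def query_norm(weighted_idfs):
    # 1 / sqrt(sum of squared (boost * idf)) over all query terms.
    return 1.0 / math.sqrt(sum(w * w for w in weighted_idfs))


def term_score(freq, doc_freq, field_norm, q_norm, boost=1.0):
    # score = queryWeight * fieldWeight, as in each "product of:" node.
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * q_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


SHERIDAN_BOOST = 1.125063  # boost listed for author_txt:sheridan

# Two query terms: schäuble (docFreq=17, no boost) and sheridan (docFreq=5).
q_norm = query_norm([idf(17), SHERIDAN_BOOST * idf(5)])

# Entry 1 (doc 640): both terms match, so the two term scores are summed.
entry1 = (term_score(1.0, 17, 0.3125, q_norm)
          + term_score(1.0, 5, 0.3125, q_norm, boost=SHERIDAN_BOOST))

# Entry 3 (doc 6575): only one of the two terms matches, so coord = 1/2.
entry3 = term_score(1.0, 5, 0.5, q_norm, boost=SHERIDAN_BOOST) * (1 / 2)

print(q_norm, entry1, entry3)
```

The printed values should agree with the listed 0.07056947, 4.1251807 and 1.9387364 up to small rounding differences, since Lucene computes these in single precision.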
    

Similar documents (content)

  1. Talvensaari, T.; Juhola, M.; Laurikkala, J.; Järvelin, K.: Corpus-based cross-language information retrieval in retrieval of highly relevant documents (2007) 0.48
    0.4756341 = sum of:
      0.4756341 = product of:
        0.71345115 = sum of:
          0.015150993 = weight(abstract_txt:from in 2140) [ClassicSimilarity], result of:
            0.015150993 = score(doc=2140,freq=1.0), product of:
              0.087181576 = queryWeight, product of:
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.031353667 = queryNorm
              0.17378664 = fieldWeight in 2140, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0625 = fieldNorm(doc=2140)
          0.10991248 = weight(abstract_txt:documents in 2140) [ClassicSimilarity], result of:
            0.10991248 = score(doc=2140,freq=5.0), product of:
              0.19106098 = queryWeight, product of:
                1.4803815 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.031353667 = queryNorm
              0.57527435 = fieldWeight in 2140, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0625 = fieldNorm(doc=2140)
          0.06281677 = weight(abstract_txt:large in 2140) [ClassicSimilarity], result of:
            0.06281677 = score(doc=2140,freq=1.0), product of:
              0.22499923 = queryWeight, product of:
                1.6064905 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.031353667 = queryNorm
              0.27918658 = fieldWeight in 2140, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.0625 = fieldNorm(doc=2140)
          0.07103039 = weight(abstract_txt:collection in 2140) [ClassicSimilarity], result of:
            0.07103039 = score(doc=2140,freq=1.0), product of:
              0.24420814 = queryWeight, product of:
                1.6736618 = boost
                4.653761 = idf(docFreq=1119, maxDocs=43254)
                0.031353667 = queryNorm
              0.29086006 = fieldWeight in 2140, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.653761 = idf(docFreq=1119, maxDocs=43254)
                0.0625 = fieldNorm(doc=2140)
          0.12892579 = weight(abstract_txt:test in 2140) [ClassicSimilarity], result of:
            0.12892579 = score(doc=2140,freq=2.0), product of:
              0.28841236 = queryWeight, product of:
                1.8188403 = boost
                5.057442 = idf(docFreq=747, maxDocs=43254)
                0.031353667 = queryNorm
              0.44701895 = fieldWeight in 2140, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.057442 = idf(docFreq=747, maxDocs=43254)
                0.0625 = fieldNorm(doc=2140)
          0.32561478 = weight(abstract_txt:comparable in 2140) [ClassicSimilarity], result of:
            0.32561478 = score(doc=2140,freq=2.0), product of:
              0.53488046 = queryWeight, product of:
                2.476943 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.031353667 = queryNorm
              0.6087618 = fieldWeight in 2140, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.0625 = fieldNorm(doc=2140)
        0.6666667 = coord(6/9)
    
  2. Chen, H.-H.; Kuo, J.-J.; Huang, S.-J.; Lin, C.-J.; Wung, H.-C.: A summarization system for Chinese news from multiple sources (2003) 0.41
    0.41283214 = sum of:
      0.41283214 = product of:
        0.74309784 = sum of:
          0.026783425 = weight(abstract_txt:from in 4116) [ClassicSimilarity], result of:
            0.026783425 = score(doc=4116,freq=2.0), product of:
              0.087181576 = queryWeight, product of:
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.031353667 = queryNorm
              0.3072143 = fieldWeight in 4116, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.078125 = fieldNorm(doc=4116)
          0.08689345 = weight(abstract_txt:documents in 4116) [ClassicSimilarity], result of:
            0.08689345 = score(doc=4116,freq=2.0), product of:
              0.19106098 = queryWeight, product of:
                1.4803815 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.031353667 = queryNorm
              0.45479432 = fieldWeight in 4116, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.078125 = fieldNorm(doc=4116)
          0.07852096 = weight(abstract_txt:large in 4116) [ClassicSimilarity], result of:
            0.07852096 = score(doc=4116,freq=1.0), product of:
              0.22499923 = queryWeight, product of:
                1.6064905 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.031353667 = queryNorm
              0.34898323 = fieldWeight in 4116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.078125 = fieldNorm(doc=4116)
          0.33211473 = weight(abstract_txt:news in 4116) [ClassicSimilarity], result of:
            0.33211473 = score(doc=4116,freq=3.0), product of:
              0.40801454 = queryWeight, product of:
                2.1633434 = boost
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.031353667 = queryNorm
              0.81397766 = fieldWeight in 4116, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0153627 = idf(docFreq=286, maxDocs=43254)
                0.078125 = fieldNorm(doc=4116)
          0.21878529 = weight(abstract_txt:multilingual in 4116) [ClassicSimilarity], result of:
            0.21878529 = score(doc=4116,freq=1.0), product of:
              0.44552222 = queryWeight, product of:
                2.2605927 = boost
                6.2857733 = idf(docFreq=218, maxDocs=43254)
                0.031353667 = queryNorm
              0.49107605 = fieldWeight in 4116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2857733 = idf(docFreq=218, maxDocs=43254)
                0.078125 = fieldNorm(doc=4116)
        0.5555556 = coord(5/9)
    
  3. Chen, H.; Lally, A.M.; Zhu, B.; Chau, M.: HelpfulMed : Intelligent searching for medical information over the Internet (2003) 0.36
    0.35994777 = sum of:
      0.35994777 = product of:
        0.53992164 = sum of:
          0.015150993 = weight(abstract_txt:from in 3616) [ClassicSimilarity], result of:
            0.015150993 = score(doc=3616,freq=1.0), product of:
              0.087181576 = queryWeight, product of:
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.031353667 = queryNorm
              0.17378664 = fieldWeight in 3616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0625 = fieldNorm(doc=3616)
          0.06951476 = weight(abstract_txt:documents in 3616) [ClassicSimilarity], result of:
            0.06951476 = score(doc=3616,freq=2.0), product of:
              0.19106098 = queryWeight, product of:
                1.4803815 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.031353667 = queryNorm
              0.36383545 = fieldWeight in 3616, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0625 = fieldNorm(doc=3616)
          0.06281677 = weight(abstract_txt:large in 3616) [ClassicSimilarity], result of:
            0.06281677 = score(doc=3616,freq=1.0), product of:
              0.22499923 = queryWeight, product of:
                1.6064905 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.031353667 = queryNorm
              0.27918658 = fieldWeight in 3616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.0625 = fieldNorm(doc=3616)
          0.07103039 = weight(abstract_txt:collection in 3616) [ClassicSimilarity], result of:
            0.07103039 = score(doc=3616,freq=1.0), product of:
              0.24420814 = queryWeight, product of:
                1.6736618 = boost
                4.653761 = idf(docFreq=1119, maxDocs=43254)
                0.031353667 = queryNorm
              0.29086006 = fieldWeight in 3616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.653761 = idf(docFreq=1119, maxDocs=43254)
                0.0625 = fieldNorm(doc=3616)
          0.091164306 = weight(abstract_txt:test in 3616) [ClassicSimilarity], result of:
            0.091164306 = score(doc=3616,freq=1.0), product of:
              0.28841236 = queryWeight, product of:
                1.8188403 = boost
                5.057442 = idf(docFreq=747, maxDocs=43254)
                0.031353667 = queryNorm
              0.31609014 = fieldWeight in 3616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.057442 = idf(docFreq=747, maxDocs=43254)
                0.0625 = fieldNorm(doc=3616)
          0.23024443 = weight(abstract_txt:comparable in 3616) [ClassicSimilarity], result of:
            0.23024443 = score(doc=3616,freq=1.0), product of:
              0.53488046 = queryWeight, product of:
                2.476943 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.031353667 = queryNorm
              0.4304596 = fieldWeight in 3616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.0625 = fieldNorm(doc=3616)
        0.6666667 = coord(6/9)
    
  4. Spina, D.; Trippas, J.R.; Cavedon, L.; Sanderson, M.: Extracting audio summaries to support effective spoken document search (2017) 0.35
    0.34813696 = sum of:
      0.34813696 = product of:
        0.6266465 = sum of:
          0.03787748 = weight(abstract_txt:from in 5253) [ClassicSimilarity], result of:
            0.03787748 = score(doc=5253,freq=4.0), product of:
              0.087181576 = queryWeight, product of:
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.031353667 = queryNorm
              0.4344666 = fieldWeight in 5253, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.078125 = fieldNorm(doc=5253)
          0.06144295 = weight(abstract_txt:documents in 5253) [ClassicSimilarity], result of:
            0.06144295 = score(doc=5253,freq=1.0), product of:
              0.19106098 = queryWeight, product of:
                1.4803815 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.031353667 = queryNorm
              0.32158816 = fieldWeight in 5253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.078125 = fieldNorm(doc=5253)
          0.12556519 = weight(abstract_txt:collection in 5253) [ClassicSimilarity], result of:
            0.12556519 = score(doc=5253,freq=2.0), product of:
              0.24420814 = queryWeight, product of:
                1.6736618 = boost
                4.653761 = idf(docFreq=1119, maxDocs=43254)
                0.031353667 = queryNorm
              0.5141728 = fieldWeight in 5253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.653761 = idf(docFreq=1119, maxDocs=43254)
                0.078125 = fieldNorm(doc=5253)
          0.11395538 = weight(abstract_txt:test in 5253) [ClassicSimilarity], result of:
            0.11395538 = score(doc=5253,freq=1.0), product of:
              0.28841236 = queryWeight, product of:
                1.8188403 = boost
                5.057442 = idf(docFreq=747, maxDocs=43254)
                0.031353667 = queryNorm
              0.39511266 = fieldWeight in 5253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.057442 = idf(docFreq=747, maxDocs=43254)
                0.078125 = fieldNorm(doc=5253)
          0.28780553 = weight(abstract_txt:comparable in 5253) [ClassicSimilarity], result of:
            0.28780553 = score(doc=5253,freq=1.0), product of:
              0.53488046 = queryWeight, product of:
                2.476943 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.031353667 = queryNorm
              0.5380745 = fieldWeight in 5253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.078125 = fieldNorm(doc=5253)
        0.5555556 = coord(5/9)
    
  5. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 0.33
    0.33282343 = sum of:
      0.33282343 = product of:
        0.5990821 = sum of:
          0.013257119 = weight(abstract_txt:from in 3684) [ClassicSimilarity], result of:
            0.013257119 = score(doc=3684,freq=1.0), product of:
              0.087181576 = queryWeight, product of:
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.031353667 = queryNorm
              0.15206331 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.05496467 = weight(abstract_txt:large in 3684) [ClassicSimilarity], result of:
            0.05496467 = score(doc=3684,freq=1.0), product of:
              0.22499923 = queryWeight, product of:
                1.6064905 = boost
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.031353667 = queryNorm
              0.24428825 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.466985 = idf(docFreq=1349, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.11281007 = weight(abstract_txt:test in 3684) [ClassicSimilarity], result of:
            0.11281007 = score(doc=3684,freq=2.0), product of:
              0.28841236 = queryWeight, product of:
                1.8188403 = boost
                5.057442 = idf(docFreq=747, maxDocs=43254)
                0.031353667 = queryNorm
              0.3911416 = fieldWeight in 3684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.057442 = idf(docFreq=747, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.21658637 = weight(abstract_txt:multilingual in 3684) [ClassicSimilarity], result of:
            0.21658637 = score(doc=3684,freq=2.0), product of:
              0.44552222 = queryWeight, product of:
                2.2605927 = boost
                6.2857733 = idf(docFreq=218, maxDocs=43254)
                0.031353667 = queryNorm
              0.48614043 = fieldWeight in 3684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2857733 = idf(docFreq=218, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.20146388 = weight(abstract_txt:comparable in 3684) [ClassicSimilarity], result of:
            0.20146388 = score(doc=3684,freq=1.0), product of:
              0.53488046 = queryWeight, product of:
                2.476943 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.031353667 = queryNorm
              0.37665215 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
        0.5555556 = coord(5/9)
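
For the content-based matches, the per-term scores are summed and then scaled by coord, the fraction of query terms found in the document. The sketch below recomputes entry 1 (doc 2140) from the values in its breakdown; it assumes tf = sqrt(freq) as in ClassicSimilarity and treats the term "from", which shows no boost line, as having boost 1.0. The helper names are illustrative, not Lucene API.

```python
import math

QUERY_NORM = 0.031353667  # queryNorm listed for the content query
QUERY_TERMS = 9           # denominator of coord(6/9)

# (term, freq, boost, idf, fieldNorm) for doc 2140, copied from the listing.
# "from" has no boost line in the listing; 1.0 is assumed here.
MATCHES = [
    ("from",       1.0, 1.0,       2.7805862, 0.0625),
    ("documents",  5.0, 1.4803815, 4.1163282, 0.0625),
    ("large",      1.0, 1.6064905, 4.466985,  0.0625),
    ("collection", 1.0, 1.6736618, 4.653761,  0.0625),
    ("test",       2.0, 1.8188403, 5.057442,  0.0625),
    ("comparable", 2.0, 2.476943,  6.8873534, 0.0625),
]


def term_score(freq, boost, idf, field_norm, query_norm=QUERY_NORM):
    # queryWeight = boost * idf * queryNorm; fieldWeight = tf * idf * fieldNorm.
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(matches, n_query_terms):
    # Sum the matched term scores, then scale by coord = matched / total.
    total = sum(term_score(f, b, i, n) for _, f, b, i, n in matches)
    coord = len(matches) / n_query_terms
    return total * coord


score = document_score(MATCHES, QUERY_TERMS)
print(score)
```

The result should land close to the listed 0.4756341. The same calculation with coord(5/9) applies to entries 2, 4 and 5, where only five of the nine query terms match.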