Search (81 results, page 1 of 5)

  • × theme_ss:"Retrievalstudien"
  1. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.03
    0.033742145 = product of:
      0.10122643 = sum of:
        0.10122643 = weight(_text_:400 in 5601) [ClassicSimilarity], result of:
          0.10122643 = score(doc=5601,freq=2.0), product of:
            0.2795319 = queryWeight, product of:
              6.5552235 = idf(docFreq=170, maxDocs=44218)
              0.04264262 = queryNorm
            0.36212835 = fieldWeight in 5601, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.5552235 = idf(docFreq=170, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5601)
      0.33333334 = coord(1/3)
    
    Abstract
    Purpose - To present a method for creating a comparable document collection from two document collections in different languages. Design/methodology/approach - The best query keys were extracted from a Finnish source collection (articles of the newspaper Aamulehti) with the relative average term frequency formula. The keys were translated into English with a dictionary-based query translation program. The resulting lists of words were used as queries that were run against the target collection (Los Angeles Times articles) with the nearest neighbor method. The documents were aligned with unrestricted and date-restricted alignment schemes, which were also combined. Findings - The combined alignment scheme was found the best, when the relatedness of the document pairs was assessed with a five-degree relevance scale. Of the 400 document pairs, roughly 40 percent were highly or fairly related and 75 percent included at least lexical similarity. Research limitations/implications - The number of alignment pairs was small due to the short common time period of the two collections, and their geographical (and thus, topical) remoteness. In future, our aim is to build larger comparable corpora in various languages and use them as source of translation knowledge for the purposes of cross-language information retrieval (CLIR). Practical implications - Readily available parallel corpora are scarce. With this method, two unrelated document collections can relatively easily be aligned to create a CLIR resource. Originality/value - The method can be applied to weakly linked collections and morphologically complex languages, such as Finnish.
  2. Rijsbergen, C.J. van: ¬A test for the separation of relevant and non-relevant documents in experimental retrieval collections (1973) 0.02
    0.020635407 = product of:
      0.06190622 = sum of:
        0.06190622 = product of:
          0.09285933 = sum of:
            0.046639442 = weight(_text_:29 in 5002) [ClassicSimilarity], result of:
              0.046639442 = score(doc=5002,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.31092256 = fieldWeight in 5002, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5002)
            0.04621989 = weight(_text_:22 in 5002) [ClassicSimilarity], result of:
              0.04621989 = score(doc=5002,freq=2.0), product of:
                0.14932719 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04264262 = queryNorm
                0.30952093 = fieldWeight in 5002, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5002)
          0.6666667 = coord(2/3)
      0.33333334 = coord(1/3)
    
    Date
    19. 3.1996 11:22:12
    Source
    Journal of documentation. 29(1973) no.3, S.251-257
  3. Chu, H.: Factors affecting relevance judgment : a report from TREC Legal track (2011) 0.01
    0.012897132 = product of:
      0.038691394 = sum of:
        0.038691394 = product of:
          0.058037087 = sum of:
            0.029149653 = weight(_text_:29 in 4540) [ClassicSimilarity], result of:
              0.029149653 = score(doc=4540,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.19432661 = fieldWeight in 4540, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4540)
            0.028887432 = weight(_text_:22 in 4540) [ClassicSimilarity], result of:
              0.028887432 = score(doc=4540,freq=2.0), product of:
                0.14932719 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04264262 = queryNorm
                0.19345059 = fieldWeight in 4540, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4540)
          0.6666667 = coord(2/3)
      0.33333334 = coord(1/3)
    
    Date
    12. 7.2011 18:29:22
  4. Hofstede, M.: Literatuur over onderwerpen zoeken in de OPC (1994) 0.01
    0.010364321 = product of:
      0.031092962 = sum of:
        0.031092962 = product of:
          0.093278885 = sum of:
            0.093278885 = weight(_text_:29 in 5400) [ClassicSimilarity], result of:
              0.093278885 = score(doc=5400,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.6218451 = fieldWeight in 5400, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.125 = fieldNorm(doc=5400)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Source
    CRI bulletin. 29(1994), Sept., S.14-15
  5. Hancock-Beaulieu, M.; McKenzie, L.; Irving, A.: Evaluative protocols for searching behaviour in online library catalogues (1991) 0.01
    0.0090687815 = product of:
      0.027206343 = sum of:
        0.027206343 = product of:
          0.081619024 = sum of:
            0.081619024 = weight(_text_:29 in 347) [ClassicSimilarity], result of:
              0.081619024 = score(doc=347,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5441145 = fieldWeight in 347, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=347)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    23. 1.1999 19:52:29
  6. Harman, D.K.: ¬The TREC test collections (2005) 0.01
    0.0090687815 = product of:
      0.027206343 = sum of:
        0.027206343 = product of:
          0.081619024 = sum of:
            0.081619024 = weight(_text_:29 in 4637) [ClassicSimilarity], result of:
              0.081619024 = score(doc=4637,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5441145 = fieldWeight in 4637, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4637)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 3.1996 18:16:49
  7. Buckley, C.; Voorhees, E.M.: Retrieval system evaluation (2005) 0.01
    0.0090687815 = product of:
      0.027206343 = sum of:
        0.027206343 = product of:
          0.081619024 = sum of:
            0.081619024 = weight(_text_:29 in 648) [ClassicSimilarity], result of:
              0.081619024 = score(doc=648,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5441145 = fieldWeight in 648, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=648)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 3.1996 18:16:49
  8. Voiskunskii, V.G.: Evaluation of search results (2000) 0.01
    0.0090687815 = product of:
      0.027206343 = sum of:
        0.027206343 = product of:
          0.081619024 = sum of:
            0.081619024 = weight(_text_:29 in 4670) [ClassicSimilarity], result of:
              0.081619024 = score(doc=4670,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5441145 = fieldWeight in 4670, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4670)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Source
    Encyclopedia of library and information science. Vol.66, [=Suppl.29]
  9. Harman, D.K.: ¬The TREC ad hoc experiments (2005) 0.01
    0.0090687815 = product of:
      0.027206343 = sum of:
        0.027206343 = product of:
          0.081619024 = sum of:
            0.081619024 = weight(_text_:29 in 5711) [ClassicSimilarity], result of:
              0.081619024 = score(doc=5711,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5441145 = fieldWeight in 5711, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=5711)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 3.1996 18:16:49
  10. Robertson, S.; Callan, J.: Routing and filtering (2005) 0.01
    0.0090687815 = product of:
      0.027206343 = sum of:
        0.027206343 = product of:
          0.081619024 = sum of:
            0.081619024 = weight(_text_:29 in 4688) [ClassicSimilarity], result of:
              0.081619024 = score(doc=4688,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5441145 = fieldWeight in 4688, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4688)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 3.1996 18:16:49
  11. Beaulieu, M.: Approaches to user-based studies in information seeking and retrieval : a Sheffield perspective (2003) 0.01
    0.0090687815 = product of:
      0.027206343 = sum of:
        0.027206343 = product of:
          0.081619024 = sum of:
            0.081619024 = weight(_text_:29 in 4692) [ClassicSimilarity], result of:
              0.081619024 = score(doc=4692,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5441145 = fieldWeight in 4692, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4692)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Source
    Journal of information science. 29(2003) no.4, S.239-248
  12. Harman, D.K.: Beyond English (2005) 0.01
    0.0090687815 = product of:
      0.027206343 = sum of:
        0.027206343 = product of:
          0.081619024 = sum of:
            0.081619024 = weight(_text_:29 in 4850) [ClassicSimilarity], result of:
              0.081619024 = score(doc=4850,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5441145 = fieldWeight in 4850, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4850)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 3.1996 18:16:49
  13. Voorhees, E.M.; Garofolo, J.S.: Retrieving noisy text (2005) 0.01
    0.0090687815 = product of:
      0.027206343 = sum of:
        0.027206343 = product of:
          0.081619024 = sum of:
            0.081619024 = weight(_text_:29 in 5084) [ClassicSimilarity], result of:
              0.081619024 = score(doc=5084,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5441145 = fieldWeight in 5084, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=5084)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 3.1996 18:16:49
  14. Fuhr, N.; Niewelt, B.: ¬Ein Retrievaltest mit automatisch indexierten Dokumenten (1984) 0.01
    0.008987201 = product of:
      0.026961602 = sum of:
        0.026961602 = product of:
          0.08088481 = sum of:
            0.08088481 = weight(_text_:22 in 262) [ClassicSimilarity], result of:
              0.08088481 = score(doc=262,freq=2.0), product of:
                0.14932719 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5416616 = fieldWeight in 262, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=262)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    20.10.2000 12:22:23
  15. Tomaiuolo, N.G.; Parker, J.: Maximizing relevant retrieval : keyword and natural language searching (1998) 0.01
    0.008987201 = product of:
      0.026961602 = sum of:
        0.026961602 = product of:
          0.08088481 = sum of:
            0.08088481 = weight(_text_:22 in 6418) [ClassicSimilarity], result of:
              0.08088481 = score(doc=6418,freq=2.0), product of:
                0.14932719 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5416616 = fieldWeight in 6418, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6418)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Source
    Online. 22(1998) no.6, S.57-58
  16. Voorhees, E.M.; Harman, D.: Overview of the Sixth Text REtrieval Conference (TREC-6) (2000) 0.01
    0.008987201 = product of:
      0.026961602 = sum of:
        0.026961602 = product of:
          0.08088481 = sum of:
            0.08088481 = weight(_text_:22 in 6438) [ClassicSimilarity], result of:
              0.08088481 = score(doc=6438,freq=2.0), product of:
                0.14932719 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5416616 = fieldWeight in 6438, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6438)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    11. 8.2001 16:22:19
  17. Dalrymple, P.W.: Retrieval by reformulation in two library catalogs : toward a cognitive model of searching behavior (1990) 0.01
    0.008987201 = product of:
      0.026961602 = sum of:
        0.026961602 = product of:
          0.08088481 = sum of:
            0.08088481 = weight(_text_:22 in 5089) [ClassicSimilarity], result of:
              0.08088481 = score(doc=5089,freq=2.0), product of:
                0.14932719 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04264262 = queryNorm
                0.5416616 = fieldWeight in 5089, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=5089)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    22. 7.2006 18:43:54
  18. Voorhees, E.M.: Question answering in TREC (2005) 0.01
    0.0077732406 = product of:
      0.023319721 = sum of:
        0.023319721 = product of:
          0.06995916 = sum of:
            0.06995916 = weight(_text_:29 in 6487) [ClassicSimilarity], result of:
              0.06995916 = score(doc=6487,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.46638384 = fieldWeight in 6487, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6487)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 3.1996 18:16:49
  19. TREC-1: The first text retrieval conference : Rockville, MD, USA, 4-6 Nov. 1993 (1993) 0.01
    0.0077732406 = product of:
      0.023319721 = sum of:
        0.023319721 = product of:
          0.06995916 = sum of:
            0.06995916 = weight(_text_:29 in 1315) [ClassicSimilarity], result of:
              0.06995916 = score(doc=1315,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.46638384 = fieldWeight in 1315, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1315)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Source
    Information processing and management. 29(1993) no.4, S.411-521
  20. Cormack, G.V.; Clarke, C.L.A.; Palmer, C.R.; Lynam, T.R.: MultiText experiments for TREC (2005) 0.01
    0.0077732406 = product of:
      0.023319721 = sum of:
        0.023319721 = product of:
          0.06995916 = sum of:
            0.06995916 = weight(_text_:29 in 4298) [ClassicSimilarity], result of:
              0.06995916 = score(doc=4298,freq=2.0), product of:
                0.1500034 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.04264262 = queryNorm
                0.46638384 = fieldWeight in 4298, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4298)
          0.33333334 = coord(1/3)
      0.33333334 = coord(1/3)
    
    Date
    29. 3.1996 18:16:49

Languages

  • e 72
  • d 5
  • f 1
  • fi 1
  • nl 1
  • More… Less…

Types

  • a 74
  • s 5
  • m 3
  • el 1
  • r 1
  • More… Less…