Search (466 results, page 1 of 24)

Salton, G.; Lesk, M.E.: Computer evaluation of indexing and text processing (1968) 0.08

0.084351756 = product of:
  0.14058626 = sum of:
    0.07581701 = weight(_text_:g in 77) [ClassicSimilarity], result of:
      0.07581701 = score(doc=77,freq=2.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.49797297 = fieldWeight in 77, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.09375 = fieldNorm(doc=77)
    0.05762393 = weight(_text_:u in 77) [ClassicSimilarity], result of:
      0.05762393 = score(doc=77,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.43413407 = fieldWeight in 77, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.09375 = fieldNorm(doc=77)
    0.0071453196 = weight(_text_:a in 77) [ClassicSimilarity], result of:
      0.0071453196 = score(doc=77,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.15287387 = fieldWeight in 77, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.09375 = fieldNorm(doc=77)
  0.6 = coord(3/5)

Footnote: Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.60-84.
Type: a

¬The Second Text Retrieval Conference : TREC-2 (1995) 0.08

0.07535346 = product of:
  0.12558909 = sum of:
    0.037908506 = weight(_text_:g in 1320) [ClassicSimilarity], result of:
      0.037908506 = score(doc=1320,freq=2.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.24898648 = fieldWeight in 1320, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.046875 = fieldNorm(doc=1320)
    0.08149254 = weight(_text_:u in 1320) [ClassicSimilarity], result of:
      0.08149254 = score(doc=1320,freq=16.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.6139583 = fieldWeight in 1320, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.046875 = fieldNorm(doc=1320)
    0.0061880285 = weight(_text_:a in 1320) [ClassicSimilarity], result of:
      0.0061880285 = score(doc=1320,freq=6.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.13239266 = fieldWeight in 1320, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=1320)
  0.6 = coord(3/5)

Abstract: A special issue devoted to papers from the 2nd Text Retrieval Conference (TREC-2) held in Aug 93
Content: Enthält die Beiträge: HARMAN, D.: Overview of the Second Text Retrieval Conference (TREC-2); SPRACK JONES, K.: Reflections on TREC; BUCKLEY, C., J. ALLAN u. G. SALTON: Automatic routing and retrieval using SMART: TREC-2; CALLAN, J.P., W.B. CROFT u. J. BROGLIO: TREC and TIPSTER experiments with INQUERY; ROBERTSON, S.R., S. WALKER u. M.M. HANCOCK-BEAULIEU: Large test collection experiments on an operational, interactive system: OKAPI at TREC; ZOBEL, J., A. MOFFAT, R. WILKINSON u. R. SACKS-DAVIS: Efficient retrieval of partial documents; METTLER, M. u. F. NORDBY: TREC routing experiments with the TRW/Paracel Fast Data Finder; EVANS, D.A. u. R.G. LEFFERTS: CLARIT-TREC experiments; STRZALKOWSKI, T.: Natural language information retrieval; CAID, W.R., S.T. DUMAIS u. S.I. GALLANT: Learned vector-space models for document retrieval; BELKIN, N.J. P. KANTOR, E.A. FOX u. J.A. SHAW: Combining the evidence of multiple query representations for information retrieval
Type: a

Hull, D.; Grefenstette, G.; Schulze, B.M.; Gaussier, E.; Schütze, H.; Pedersen, J.: Xerox TREC-5 site reports : routing, filtering, NLP, and Spanisch tracks (1997) 0.07

0.070293136 = product of:
  0.117155224 = sum of:
    0.06318085 = weight(_text_:g in 3096) [ClassicSimilarity], result of:
      0.06318085 = score(doc=3096,freq=2.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.4149775 = fieldWeight in 3096, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.078125 = fieldNorm(doc=3096)
    0.048019946 = weight(_text_:u in 3096) [ClassicSimilarity], result of:
      0.048019946 = score(doc=3096,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.3617784 = fieldWeight in 3096, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.078125 = fieldNorm(doc=3096)
    0.0059544328 = weight(_text_:a in 3096) [ClassicSimilarity], result of:
      0.0059544328 = score(doc=3096,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.12739488 = fieldWeight in 3096, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.078125 = fieldNorm(doc=3096)
  0.6 = coord(3/5)

Source: The Fifth Text Retrieval Conference (TREC-5). Ed.: E.M. Voorhees u. D.K. Harman
Type: a

Allan, J.; Callan, J.P.; Croft, W.B.; Ballesteros, L.; Broglio, J.; Xu, J.; Shu, H.: INQUERY at TREC-5 (1997) 0.05

0.048860855 = product of:
  0.08143476 = sum of:
    0.048019946 = weight(_text_:u in 3103) [ClassicSimilarity], result of:
      0.048019946 = score(doc=3103,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.3617784 = fieldWeight in 3103, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.078125 = fieldNorm(doc=3103)
    0.0059544328 = weight(_text_:a in 3103) [ClassicSimilarity], result of:
      0.0059544328 = score(doc=3103,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.12739488 = fieldWeight in 3103, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.078125 = fieldNorm(doc=3103)
    0.027460374 = product of:
      0.054920748 = sum of:
        0.054920748 = weight(_text_:22 in 3103) [ClassicSimilarity], result of:
          0.054920748 = score(doc=3103,freq=2.0), product of:
            0.14195032 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.040536046 = queryNorm
            0.38690117 = fieldWeight in 3103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=3103)
      0.5 = coord(1/2)
  0.6 = coord(3/5)

Date: 27. 2.1999 20:55:22
Source: The Fifth Text Retrieval Conference (TREC-5). Ed.: E.M. Voorhees u. D.K. Harman
Type: a

Ng, K.B.; Loewenstern, D.; Basu, C.; Hirsh, H.; Kantor, P.B.: Data fusion of machine-learning methods for the TREC5 routing tak (and other work) (1997) 0.05

0.048860855 = product of:
  0.08143476 = sum of:
    0.048019946 = weight(_text_:u in 3107) [ClassicSimilarity], result of:
      0.048019946 = score(doc=3107,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.3617784 = fieldWeight in 3107, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.078125 = fieldNorm(doc=3107)
    0.0059544328 = weight(_text_:a in 3107) [ClassicSimilarity], result of:
      0.0059544328 = score(doc=3107,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.12739488 = fieldWeight in 3107, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.078125 = fieldNorm(doc=3107)
    0.027460374 = product of:
      0.054920748 = sum of:
        0.054920748 = weight(_text_:22 in 3107) [ClassicSimilarity], result of:
          0.054920748 = score(doc=3107,freq=2.0), product of:
            0.14195032 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.040536046 = queryNorm
            0.38690117 = fieldWeight in 3107, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=3107)
      0.5 = coord(1/2)
  0.6 = coord(3/5)

Date: 27. 2.1999 20:59:22
Source: The Fifth Text Retrieval Conference (TREC-5). Ed.: E.M. Voorhees u. D.K. Harman
Type: a

Buckley, C.; Allan, J.; Salton, G.: Automatic routing and retrieval using Smart : TREC-2 (1995) 0.05

0.04570371 = product of:
  0.07617284 = sum of:
    0.037908506 = weight(_text_:g in 5699) [ClassicSimilarity], result of:
      0.037908506 = score(doc=5699,freq=2.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.24898648 = fieldWeight in 5699, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.046875 = fieldNorm(doc=5699)
    0.028811965 = weight(_text_:u in 5699) [ClassicSimilarity], result of:
      0.028811965 = score(doc=5699,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.21706703 = fieldWeight in 5699, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.046875 = fieldNorm(doc=5699)
    0.00945237 = weight(_text_:a in 5699) [ClassicSimilarity], result of:
      0.00945237 = score(doc=5699,freq=14.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.20223314 = fieldWeight in 5699, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=5699)
  0.6 = coord(3/5)

Abstract: The Smart information retrieval project emphazises completely automatic approaches to the understanding and retrieval of large quantities of text. The work in the TREC-2 environment continues, performing both routing and ad hoc experiments. The ad hoc work extends investigations into combining global similarities, giving an overall indication of how a document matches a query, with local similarities identifying a smaller part of the document that matches the query. The performance of ad hoc runs is good, but it is clear that full advantage of the available local information is not been taken advantage of. The routing experiments use conventional relevance feedback approaches to routing, but with a much greater degree of query expansion than was previously done. The length of a query vector is increased by a factor of 5 to 10 by adding terms found in previously seen relevant documents. This approach improves effectiveness by 30-40% over the original query
Theme: Semantisches Umfeld in Indexierung u. Retrieval
Type: a

Salton, G.: ¬The Smart environment for retrieval systeme valuation : advantages and problem areas (1981) 0.04

0.03871576 = product of:
  0.0967894 = sum of:
    0.08845319 = weight(_text_:g in 3159) [ClassicSimilarity], result of:
      0.08845319 = score(doc=3159,freq=2.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.5809685 = fieldWeight in 3159, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.109375 = fieldNorm(doc=3159)
    0.008336206 = weight(_text_:a in 3159) [ClassicSimilarity], result of:
      0.008336206 = score(doc=3159,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.17835285 = fieldWeight in 3159, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.109375 = fieldNorm(doc=3159)
  0.4 = coord(2/5)

Type: a

Wood, F.; Ford, N.; Miller, D.; Sobczyk, G.; Duffin, R.: Information skills, searching behaviour and cognitive styles for student-centred learning : a computer-assisted learning approach (1996) 0.04

0.036343656 = product of:
  0.06057276 = sum of:
    0.037908506 = weight(_text_:g in 4341) [ClassicSimilarity], result of:
      0.037908506 = score(doc=4341,freq=2.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.24898648 = fieldWeight in 4341, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.046875 = fieldNorm(doc=4341)
    0.0061880285 = weight(_text_:a in 4341) [ClassicSimilarity], result of:
      0.0061880285 = score(doc=4341,freq=6.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.13239266 = fieldWeight in 4341, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=4341)
    0.016476223 = product of:
      0.032952446 = sum of:
        0.032952446 = weight(_text_:22 in 4341) [ClassicSimilarity], result of:
          0.032952446 = score(doc=4341,freq=2.0), product of:
            0.14195032 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.040536046 = queryNorm
            0.23214069 = fieldWeight in 4341, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=4341)
      0.5 = coord(1/2)
  0.6 = coord(3/5)

Abstract: Undergraduates were tested to establish how they searched databases, the effectiveness of their searches and their satisfaction with them. The students' cognitive and learning styles were determined by the Lancaster Approaches to Studying Inventory and Riding's Cognitive Styles Analysis tests. There were significant differences in the searching behaviour and the effectiveness of the searches carried out by students with different learning and cognitive styles. Computer-assisted learning (CAL) packages were developed for three departments. The effectiveness of the packages were evaluated. Significant differences were found in the ways students with different learning styles used the packages. Based on the experience gained, guidelines for the teaching of information skills and the production and use of packages were prepared. About 2/3 of the searches had serious weaknesses, indicating a need for effective training. It appears that choice of searching strategies, search effectiveness and use of CAL packages are all affected by the cognitive and learning styles of the searcher. Therefore, students should be made aware of their own styles and, if appropriate, how to adopt more effective strategies
Source: Journal of information science. 22(1996) no.2, S.79-92
Type: a

Gauch, S.; Wang, J.: Corpus analysis for TREC 5 query expansion (1997) 0.04

0.03545515 = product of:
  0.088637866 = sum of:
    0.08149254 = weight(_text_:u in 5800) [ClassicSimilarity], result of:
      0.08149254 = score(doc=5800,freq=4.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.6139583 = fieldWeight in 5800, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.09375 = fieldNorm(doc=5800)
    0.0071453196 = weight(_text_:a in 5800) [ClassicSimilarity], result of:
      0.0071453196 = score(doc=5800,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.15287387 = fieldWeight in 5800, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.09375 = fieldNorm(doc=5800)
  0.4 = coord(2/5)

Source: The Fifth Text Retrieval Conference (TREC-5). Ed.: E.M. Voorhees u. D.K. Harman
Theme: Semantisches Umfeld in Indexierung u. Retrieval
Type: a

Buckley, C.; Singhal, A.; Mitra, M.; Salton, G.: New retrieval approaches using SMART : TREC 4 (1996) 0.03

0.03436881 = product of:
  0.08592202 = sum of:
    0.07581701 = weight(_text_:g in 7528) [ClassicSimilarity], result of:
      0.07581701 = score(doc=7528,freq=2.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.49797297 = fieldWeight in 7528, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.09375 = fieldNorm(doc=7528)
    0.010105007 = weight(_text_:a in 7528) [ClassicSimilarity], result of:
      0.010105007 = score(doc=7528,freq=4.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.2161963 = fieldWeight in 7528, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.09375 = fieldNorm(doc=7528)
  0.4 = coord(2/5)

Type: a

Sheridan, P.; Ballerini, J.P.; Schäuble, P.: Building a large multilingual test collection from comparable news documents (1998) 0.03

0.03436881 = product of:
  0.08592202 = sum of:
    0.07581701 = weight(_text_:g in 6298) [ClassicSimilarity], result of:
      0.07581701 = score(doc=6298,freq=2.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.49797297 = fieldWeight in 6298, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.09375 = fieldNorm(doc=6298)
    0.010105007 = weight(_text_:a in 6298) [ClassicSimilarity], result of:
      0.010105007 = score(doc=6298,freq=4.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.2161963 = fieldWeight in 6298, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.09375 = fieldNorm(doc=6298)
  0.4 = coord(2/5)

Source: Cross-language information retrieval. Ed.: G. Grefenstette
Type: a

Davis, M.W.: On the effective use of large parallel corpora in cross-language text retrieval (1998) 0.03

0.033184934 = product of:
  0.082962334 = sum of:
    0.07581701 = weight(_text_:g in 6302) [ClassicSimilarity], result of:
      0.07581701 = score(doc=6302,freq=2.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.49797297 = fieldWeight in 6302, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.09375 = fieldNorm(doc=6302)
    0.0071453196 = weight(_text_:a in 6302) [ClassicSimilarity], result of:
      0.0071453196 = score(doc=6302,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.15287387 = fieldWeight in 6302, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.09375 = fieldNorm(doc=6302)
  0.4 = coord(2/5)

Source: Cross-language information retrieval. Ed.: G. Grefenstette
Type: a

Rapke, K.: Automatische Indexierung von Volltexten für die Gruner+Jahr Pressedatenbank (2001) 0.03
```
0.03175587 = product of:
  0.07938967 = sum of:
    0.07581701 = weight(_text_:g in 6386) [ClassicSimilarity], result of:
      0.07581701 = score(doc=6386,freq=8.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.49797297 = fieldWeight in 6386, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.046875 = fieldNorm(doc=6386)
    0.0035726598 = weight(_text_:a in 6386) [ClassicSimilarity], result of:
      0.0035726598 = score(doc=6386,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.07643694 = fieldWeight in 6386, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=6386)
  0.4 = coord(2/5)
```
Abstract

Retrieval Tests sind die anerkannteste Methode, um neue Verfahren der Inhaltserschließung gegenüber traditionellen Verfahren zu rechtfertigen. Im Rahmen einer Diplomarbeit wurden zwei grundsätzlich unterschiedliche Systeme der automatischen inhaltlichen Erschließung anhand der Pressedatenbank des Verlagshauses Gruner + Jahr (G+J) getestet und evaluiert. Untersucht wurde dabei natürlichsprachliches Retrieval im Vergleich zu Booleschem Retrieval. Bei den beiden Systemen handelt es sich zum einen um Autonomy von Autonomy Inc. und DocCat, das von IBM an die Datenbankstruktur der G+J Pressedatenbank angepasst wurde. Ersteres ist ein auf natürlichsprachlichem Retrieval basierendes, probabilistisches System. DocCat demgegenüber basiert auf Booleschem Retrieval und ist ein lernendes System, das auf Grund einer intellektuell erstellten Trainingsvorlage indexiert. Methodisch geht die Evaluation vom realen Anwendungskontext der Textdokumentation von G+J aus. Die Tests werden sowohl unter statistischen wie auch qualitativen Gesichtspunkten bewertet. Ein Ergebnis der Tests ist, dass DocCat einige Mängel gegenüber der intellektuellen Inhaltserschließung aufweist, die noch behoben werden müssen, während das natürlichsprachliche Retrieval von Autonomy in diesem Rahmen und für die speziellen Anforderungen der G+J Textdokumentation so nicht einsetzbar ist

Type

a
Cross-language information retrieval (1998) 0.03
```
0.03131154 = product of:
  0.0521859 = sum of:
    0.022337802 = weight(_text_:g in 6299) [ClassicSimilarity], result of:
      0.022337802 = score(doc=6299,freq=4.0), product of:
        0.15225126 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.040536046 = queryNorm
        0.1467167 = fieldWeight in 6299, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.01953125 = fieldNorm(doc=6299)
    0.020793248 = weight(_text_:u in 6299) [ClassicSimilarity], result of:
      0.020793248 = score(doc=6299,freq=6.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.15665466 = fieldWeight in 6299, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.01953125 = fieldNorm(doc=6299)
    0.009054851 = weight(_text_:a in 6299) [ClassicSimilarity], result of:
      0.009054851 = score(doc=6299,freq=74.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.19372822 = fieldWeight in 6299, product of:
          8.602325 = tf(freq=74.0), with freq of:
            74.0 = termFreq=74.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.01953125 = fieldNorm(doc=6299)
  0.6 = coord(3/5)
```
Content

Enthält die Beiträge: GREFENSTETTE, G.: The Problem of Cross-Language Information Retrieval; DAVIS, M.W.: On the Effective Use of Large Parallel Corpora in Cross-Language Text Retrieval; BALLESTEROS, L. u. W.B. CROFT: Statistical Methods for Cross-Language Information Retrieval; Distributed Cross-Lingual Information Retrieval; Automatic Cross-Language Information Retrieval Using Latent Semantic Indexing; EVANS, D.A. u.a.: Mapping Vocabularies Using Latent Semantics; PICCHI, E. u. C. PETERS: Cross-Language Information Retrieval: A System for Comparable Corpus Querying; YAMABANA, K. u.a.: A Language Conversion Front-End for Cross-Language Information Retrieval; GACHOT, D.A. u.a.: The Systran NLP Browser: An Application of Machine Translation Technology in Cross-Language Information Retrieval; HULL, D.: A Weighted Boolean Model for Cross-Language Text Retrieval; SHERIDAN, P. u.a. Building a Large Multilingual Test Collection from Comparable News Documents; OARD; D.W. u. B.J. DORR: Evaluating Cross-Language Text Filtering Effectiveness

Editor

Grefenstette, G.

Footnote

Rez. in: Machine translation review: 1999, no.10, S.26-27 (D. Lewis): "Cross Language Information Retrieval (CLIR) addresses the growing need to access large volumes of data across language boundaries. The typical requirement is for the user to input a free form query, usually a brief description of a topic, into a search or retrieval engine which returns a list, in ranked order, of documents or web pages that are relevant to the topic. The search engine matches the terms in the query to indexed terms, usually keywords previously derived from the target documents. Unlike monolingual information retrieval, CLIR requires query terms in one language to be matched to indexed terms in another. Matching can be done by bilingual dictionary lookup, full machine translation, or by applying statistical methods. A query's success is measured in terms of recall (how many potentially relevant target documents are found) and precision (what proportion of documents found are relevant). Issues in CLIR are how to translate query terms into index terms, how to eliminate alternative translations (e.g. to decide that French 'traitement' in a query means 'treatment' and not 'salary'), and how to rank or weight translation alternatives that are retained (e.g. how to order the French terms 'aventure', 'business', 'affaire', and 'liaison' as relevant translations of English 'affair'). Grefenstette provides a lucid and useful overview of the field and the problems. The volume brings together a number of experiments and projects in CLIR. Mark Davies (New Mexico State University) describes Recuerdo, a Spanish retrieval engine which reduces translation ambiguities by scanning indexes for parallel texts; it also uses either a bilingual dictionary or direct equivalents from a parallel corpus in order to compare results for queries on parallel texts. Lisa Ballesteros and Bruce Croft (University of Massachusetts) use a 'local feedback' technique which automatically enhances a query by adding extra terms to it both before and after translation; such terms can be derived from documents known to be relevant to the query.
Christian Fluhr at al (DIST/SMTI, France) outline the EMIR (European Multilingual Information Retrieval) and ESPRIT projects. They found that using SYSTRAN to machine translate queries and to access material from various multilingual databases produced less relevant results than a method referred to as 'multilingual reformulation' (the mechanics of which are only hinted at). An interesting technique is Latent Semantic Indexing (LSI), described by Michael Littman et al (Brown University) and, most clearly, by David Evans et al (Carnegie Mellon University). LSI involves creating matrices of documents and the terms they contain and 'fitting' related documents into a reduced matrix space. This effectively allows queries to be mapped onto a common semantic representation of the documents. Eugenio Picchi and Carol Peters (Pisa) report on a procedure to create links between translation equivalents in an Italian-English parallel corpus. The links are used to construct parallel linguistic contexts in real-time for any term or combination of terms that is being searched for in either language. Their interest is primarily lexicographic but they plan to apply the same procedure to comparable corpora, i.e. to texts which are not translations of each other but which share the same domain. Kiyoshi Yamabana et al (NEC, Japan) address the issue of how to disambiguate between alternative translations of query terms. Their DMAX (double maximise) method looks at co-occurrence frequencies between both source language words and target language words in order to arrive at the most probable translation. The statistical data for the decision are derived, not from the translation texts but independently from monolingual corpora in each language. An interactive user interface allows the user to influence the selection of terms during the matching process. Denis Gachot et al (SYSTRAN) describe the SYSTRAN NLP browser, a prototype tool which collects parsing information derived from a text or corpus previously translated with SYSTRAN. The user enters queries into the browser in either a structured or free form and receives grammatical and lexical information about the source text and/or its translation.
The retrieved output from a query including the phrase 'big rockets' may be, for instance, a sentence containing 'giant rocket' which is semantically ranked above 'military ocket'. David Hull (Xerox Research Centre, Grenoble) describes an implementation of a weighted Boolean model for Spanish-English CLIR. Users construct Boolean-type queries, weighting each term in the query, which is then translated by an on-line dictionary before being applied to the database. Comparisons with the performance of unweighted free-form queries ('vector space' models) proved encouraging. Two contributions consider the evaluation of CLIR systems. In order to by-pass the time-consuming and expensive process of assembling a standard collection of documents and of user queries against which the performance of an CLIR system is manually assessed, Páriac Sheridan et al (ETH Zurich) propose a method based on retrieving 'seed documents'. This involves identifying a unique document in a database (the 'seed document') and, for a number of queries, measuring how fast it is retrieved. The authors have also assembled a large database of multilingual news documents for testing purposes. By storing the (fairly short) documents in a structured form tagged with descriptor codes (e.g. for topic, country and area), the test suite is easily expanded while remaining consistent for the purposes of testing. Douglas Ouard and Bonne Dorr (University of Maryland) describe an evaluation methodology which appears to apply LSI techniques in order to filter and rank incoming documents designed for testing CLIR systems. The volume provides the reader an excellent overview of several projects in CLIR. It is well supported with references and is intended as a secondary text for researchers and practitioners. It highlights the need for a good, general tutorial introduction to the field."

Harman, D.K.: ¬The TREC test collections (2005) 0.03

0.030225653 = product of:
  0.07556413 = sum of:
    0.06722792 = weight(_text_:u in 4637) [ClassicSimilarity], result of:
      0.06722792 = score(doc=4637,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.50648975 = fieldWeight in 4637, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.109375 = fieldNorm(doc=4637)
    0.008336206 = weight(_text_:a in 4637) [ClassicSimilarity], result of:
      0.008336206 = score(doc=4637,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.17835285 = fieldWeight in 4637, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.109375 = fieldNorm(doc=4637)
  0.4 = coord(2/5)

Source: TREC: experiment and evaluation in information retrieval. Ed.: E.M. Voorhees, u. D.K. Harman
Type: a

Buckley, C.; Voorhees, E.M.: Retrieval system evaluation (2005) 0.03

0.030225653 = product of:
  0.07556413 = sum of:
    0.06722792 = weight(_text_:u in 648) [ClassicSimilarity], result of:
      0.06722792 = score(doc=648,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.50648975 = fieldWeight in 648, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.109375 = fieldNorm(doc=648)
    0.008336206 = weight(_text_:a in 648) [ClassicSimilarity], result of:
      0.008336206 = score(doc=648,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.17835285 = fieldWeight in 648, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.109375 = fieldNorm(doc=648)
  0.4 = coord(2/5)

Source: TREC: experiment and evaluation in information retrieval. Ed.: E.M. Voorhees, u. D.K. Harman
Type: a

Harman, D.K.: ¬The TREC ad hoc experiments (2005) 0.03

0.030225653 = product of:
  0.07556413 = sum of:
    0.06722792 = weight(_text_:u in 5711) [ClassicSimilarity], result of:
      0.06722792 = score(doc=5711,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.50648975 = fieldWeight in 5711, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.109375 = fieldNorm(doc=5711)
    0.008336206 = weight(_text_:a in 5711) [ClassicSimilarity], result of:
      0.008336206 = score(doc=5711,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.17835285 = fieldWeight in 5711, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.109375 = fieldNorm(doc=5711)
  0.4 = coord(2/5)

Source: TREC: experiment and evaluation in information retrieval. Ed.: E.M. Voorhees, u. D.K. Harman
Type: a

Robertson, S.; Callan, J.: Routing and filtering (2005) 0.03

0.030225653 = product of:
  0.07556413 = sum of:
    0.06722792 = weight(_text_:u in 4688) [ClassicSimilarity], result of:
      0.06722792 = score(doc=4688,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.50648975 = fieldWeight in 4688, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.109375 = fieldNorm(doc=4688)
    0.008336206 = weight(_text_:a in 4688) [ClassicSimilarity], result of:
      0.008336206 = score(doc=4688,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.17835285 = fieldWeight in 4688, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.109375 = fieldNorm(doc=4688)
  0.4 = coord(2/5)

Source: TREC: experiment and evaluation in information retrieval. Ed.: E.M. Voorhees, u. D.K. Harman
Type: a

Harman, D.K.: Beyond English (2005) 0.03

0.030225653 = product of:
  0.07556413 = sum of:
    0.06722792 = weight(_text_:u in 4850) [ClassicSimilarity], result of:
      0.06722792 = score(doc=4850,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.50648975 = fieldWeight in 4850, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.109375 = fieldNorm(doc=4850)
    0.008336206 = weight(_text_:a in 4850) [ClassicSimilarity], result of:
      0.008336206 = score(doc=4850,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.17835285 = fieldWeight in 4850, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.109375 = fieldNorm(doc=4850)
  0.4 = coord(2/5)

Source: TREC: experiment and evaluation in information retrieval. Ed.: E.M. Voorhees, u. D.K. Harman
Type: a

Voorhees, E.M.; Garofolo, J.S.: Retrieving noisy text (2005) 0.03

0.030225653 = product of:
  0.07556413 = sum of:
    0.06722792 = weight(_text_:u in 5084) [ClassicSimilarity], result of:
      0.06722792 = score(doc=5084,freq=2.0), product of:
        0.13273303 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.040536046 = queryNorm
        0.50648975 = fieldWeight in 5084, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.109375 = fieldNorm(doc=5084)
    0.008336206 = weight(_text_:a in 5084) [ClassicSimilarity], result of:
      0.008336206 = score(doc=5084,freq=2.0), product of:
        0.046739966 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.040536046 = queryNorm
        0.17835285 = fieldWeight in 5084, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.109375 = fieldNorm(doc=5084)
  0.4 = coord(2/5)

Source: TREC: experiment and evaluation in information retrieval. Ed.: E.M. Voorhees, u. D.K. Harman
Type: a

Search (466 results, page 1 of 24)

Authors

Years

Languages

Types

Themes

Subjects

Classifications