Search (358 results, page 1 of 18)

Salton, G.; Buckley, C.: Term-weighting approaches in automatic text retrieval (1988) 0.09

0.09155957 = product of:
  0.16022924 = sum of:
    0.031747986 = product of:
      0.06349597 = sum of:
        0.06349597 = weight(_text_:p in 1938) [ClassicSimilarity], result of:
          0.06349597 = score(doc=1938,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.47670212 = fieldWeight in 1938, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.09375 = fieldNorm(doc=1938)
      0.5 = coord(1/2)
    0.06928888 = weight(_text_:g in 1938) [ClassicSimilarity], result of:
      0.06928888 = score(doc=1938,freq=2.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.49797297 = fieldWeight in 1938, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.09375 = fieldNorm(doc=1938)
    0.05266229 = weight(_text_:u in 1938) [ClassicSimilarity], result of:
      0.05266229 = score(doc=1938,freq=2.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.43413407 = fieldWeight in 1938, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.09375 = fieldNorm(doc=1938)
    0.006530081 = weight(_text_:a in 1938) [ClassicSimilarity], result of:
      0.006530081 = score(doc=1938,freq=2.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.15287387 = fieldWeight in 1938, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.09375 = fieldNorm(doc=1938)
  0.5714286 = coord(4/7)

Footnote: Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.323-328.
Type: a

Smeaton, A.F.; Rijsbergen, C.J. van: ¬The retrieval effects of query expansion on a feedback document retrieval system (1983) 0.05

0.04600617 = product of:
  0.10734773 = sum of:
    0.061439343 = weight(_text_:u in 2134) [ClassicSimilarity], result of:
      0.061439343 = score(doc=2134,freq=2.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.50648975 = fieldWeight in 2134, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.109375 = fieldNorm(doc=2134)
    0.010774084 = weight(_text_:a in 2134) [ClassicSimilarity], result of:
      0.010774084 = score(doc=2134,freq=4.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.25222903 = fieldWeight in 2134, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.109375 = fieldNorm(doc=2134)
    0.035134304 = product of:
      0.07026861 = sum of:
        0.07026861 = weight(_text_:22 in 2134) [ClassicSimilarity], result of:
          0.07026861 = score(doc=2134,freq=2.0), product of:
            0.12972787 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03704574 = queryNorm
            0.5416616 = fieldWeight in 2134, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=2134)
      0.5 = coord(1/2)
  0.42857143 = coord(3/7)

Date: 30. 3.2001 13:32:22
Theme: Semantisches Umfeld in Indexierung u. Retrieval
Type: a

Fuhr, N.: Rankingexperimente mit gewichteter Indexierung (1986) 0.05

0.04540032 = product of:
  0.10593408 = sum of:
    0.06928888 = weight(_text_:g in 2051) [ClassicSimilarity], result of:
      0.06928888 = score(doc=2051,freq=2.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.49797297 = fieldWeight in 2051, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.09375 = fieldNorm(doc=2051)
    0.006530081 = weight(_text_:a in 2051) [ClassicSimilarity], result of:
      0.006530081 = score(doc=2051,freq=2.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.15287387 = fieldWeight in 2051, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.09375 = fieldNorm(doc=2051)
    0.030115116 = product of:
      0.060230233 = sum of:
        0.060230233 = weight(_text_:22 in 2051) [ClassicSimilarity], result of:
          0.060230233 = score(doc=2051,freq=2.0), product of:
            0.12972787 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03704574 = queryNorm
            0.46428138 = fieldWeight in 2051, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=2051)
      0.5 = coord(1/2)
  0.42857143 = coord(3/7)

Date: 14. 6.2015 22:12:56
Source: Automatische Indexierung zwischen Forschung und Anwendung, Hrsg.: G. Lustig
Type: a

Information retrieval : data structures and algorithms (1992) 0.04

0.0433435 = product of:
  0.07585112 = sum of:
    0.013228328 = product of:
      0.026456656 = sum of:
        0.026456656 = weight(_text_:p in 3495) [ClassicSimilarity], result of:
          0.026456656 = score(doc=3495,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.19862589 = fieldWeight in 3495, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3495)
      0.5 = coord(1/2)
    0.028870367 = weight(_text_:g in 3495) [ClassicSimilarity], result of:
      0.028870367 = score(doc=3495,freq=2.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.20748875 = fieldWeight in 3495, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3495)
    0.031031553 = weight(_text_:u in 3495) [ClassicSimilarity], result of:
      0.031031553 = score(doc=3495,freq=4.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.25581595 = fieldWeight in 3495, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3495)
    0.0027208668 = weight(_text_:a in 3495) [ClassicSimilarity], result of:
      0.0027208668 = score(doc=3495,freq=2.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.06369744 = fieldWeight in 3495, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3495)
  0.5714286 = coord(4/7)

Content: An edited volume containing data structures and algorithms for information retrieval including a disk with examples written in C. for prgrammers and students interested in parsing text, automated indexing, its the first collection in book form of the basic data structures and algorithms that are critical to the storage and retrieval of documents. ------------------Enthält die Kapitel: FRAKES, W.B.: Introduction to information storage and retrieval systems; BAEZA-YATES, R.S.: Introduction to data structures and algorithms related to information retrieval; HARMAN, D. u.a.: Inverted files; FALOUTSOS, C.: Signature files; GONNET, G.H. u.a.: New indices for text: PAT trees and PAT arrays; FORD, D.A. u. S. CHRISTODOULAKIS: File organizations for optical disks; FOX, C.: Lexical analysis and stoplists; FRAKES, W.B.: Stemming algorithms; SRINIVASAN, P.: Thesaurus construction; BAEZA-YATES, R.A.: String searching algorithms; HARMAN, D.: Relevance feedback and other query modification techniques; WARTIK, S.: Boolean operators; WARTIK, S. u.a.: Hashing algorithms; HARMAN, D.: Ranking algorithms; FOX, E.: u.a.: Extended Boolean models; RASMUSSEN, E.: Clustering algorithms; HOLLAAR, L.: Special-purpose hardware for information retrieval; STANFILL, C.: Parallel information retrieval algorithms
Editor: Frakes, W.B. u. R. Baeza-Yates
Footnote: Rez. in: Computing reviews. July 1993, S.341-342 (G. Salton)

Klas, C.-P.; Fuhr, N.; Schaefer, A.: Evaluating strategic support for information access in the DAFFODIL system (2004) 0.04

0.04312587 = product of:
  0.07547027 = sum of:
    0.015873993 = product of:
      0.031747986 = sum of:
        0.031747986 = weight(_text_:p in 2419) [ClassicSimilarity], result of:
          0.031747986 = score(doc=2419,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.23835106 = fieldWeight in 2419, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.046875 = fieldNorm(doc=2419)
      0.5 = coord(1/2)
    0.037237864 = weight(_text_:u in 2419) [ClassicSimilarity], result of:
      0.037237864 = score(doc=2419,freq=4.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.30697915 = fieldWeight in 2419, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.046875 = fieldNorm(doc=2419)
    0.007300853 = weight(_text_:a in 2419) [ClassicSimilarity], result of:
      0.007300853 = score(doc=2419,freq=10.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.1709182 = fieldWeight in 2419, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=2419)
    0.015057558 = product of:
      0.030115116 = sum of:
        0.030115116 = weight(_text_:22 in 2419) [ClassicSimilarity], result of:
          0.030115116 = score(doc=2419,freq=2.0), product of:
            0.12972787 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03704574 = queryNorm
            0.23214069 = fieldWeight in 2419, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2419)
      0.5 = coord(1/2)
  0.5714286 = coord(4/7)

Abstract: The digital library system Daffodil is targeted at strategic support of users during the information search process. For searching, exploring and managing digital library objects it provides user-customisable information seeking patterns over a federation of heterogeneous digital libraries. In this paper evaluation results with respect to retrieval effectiveness, efficiency and user satisfaction are presented. The analysis focuses on strategic support for the scientific work-flow. Daffodil supports the whole work-flow, from data source selection over information seeking to the representation, organisation and reuse of information. By embedding high level search functionality into the scientific work-flow, the user experiences better strategic system support due to a more systematic work process. These ideas have been implemented in Daffodil followed by a qualitative evaluation. The evaluation has been conducted with 28 participants, ranging from information seeking novices to experts. The results are promising, as they support the chosen model.
Date: 16.11.2008 16:22:48
Source: Research and advanced technology for digital libraries : 8th European conference, ECDL 2004, Bath, UK, September 12-17, 2004 : proceedings. Eds.: Heery, R. u. E. Lyon
Theme: Semantisches Umfeld in Indexierung u. Retrieval
Type: a

Symonds, M.; Bruza, P.; Zuccon, G.; Koopman, B.; Sitbon, L.; Turner, I.: Automatic query expansion : a structural linguistic perspective (2014) 0.04

0.041259386 = product of:
  0.07220392 = sum of:
    0.013228328 = product of:
      0.026456656 = sum of:
        0.026456656 = weight(_text_:p in 1338) [ClassicSimilarity], result of:
          0.026456656 = score(doc=1338,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.19862589 = fieldWeight in 1338, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1338)
      0.5 = coord(1/2)
    0.028870367 = weight(_text_:g in 1338) [ClassicSimilarity], result of:
      0.028870367 = score(doc=1338,freq=2.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.20748875 = fieldWeight in 1338, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1338)
    0.021942623 = weight(_text_:u in 1338) [ClassicSimilarity], result of:
      0.021942623 = score(doc=1338,freq=2.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.1808892 = fieldWeight in 1338, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1338)
    0.008162601 = weight(_text_:a in 1338) [ClassicSimilarity], result of:
      0.008162601 = score(doc=1338,freq=18.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.19109234 = fieldWeight in 1338, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1338)
  0.5714286 = coord(4/7)

Abstract: A user's query is considered to be an imprecise description of their information need. Automatic query expansion is the process of reformulating the original query with the goal of improving retrieval effectiveness. Many successful query expansion techniques model syntagmatic associations that infer two terms co-occur more often than by chance in natural language. However, structural linguistics relies on both syntagmatic and paradigmatic associations to deduce the meaning of a word. Given the success of dependency-based approaches to query expansion and the reliance on word meanings in the query formulation process, we argue that modeling both syntagmatic and paradigmatic information in the query expansion process improves retrieval effectiveness. This article develops and evaluates a new query expansion technique that is based on a formal, corpus-based model of word meaning that models syntagmatic and paradigmatic associations. We demonstrate that when sufficient statistical information exists, as in the case of longer queries, including paradigmatic information alone provides significant improvements in retrieval effectiveness across a wide variety of data sets. More generally, when our new query expansion approach is applied to large-scale web retrieval it demonstrates significant improvements in retrieval effectiveness over a strong baseline system, based on a commercial search engine.
Theme: Semantisches Umfeld in Indexierung u. Retrieval
Type: a

Robertson, S.E.: ¬The probability ranking principle in IR (1977) 0.04

0.03897444 = product of:
  0.09094036 = sum of:
    0.031747986 = product of:
      0.06349597 = sum of:
        0.06349597 = weight(_text_:p in 1935) [ClassicSimilarity], result of:
          0.06349597 = score(doc=1935,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.47670212 = fieldWeight in 1935, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.09375 = fieldNorm(doc=1935)
      0.5 = coord(1/2)
    0.05266229 = weight(_text_:u in 1935) [ClassicSimilarity], result of:
      0.05266229 = score(doc=1935,freq=2.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.43413407 = fieldWeight in 1935, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.09375 = fieldNorm(doc=1935)
    0.006530081 = weight(_text_:a in 1935) [ClassicSimilarity], result of:
      0.006530081 = score(doc=1935,freq=2.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.15287387 = fieldWeight in 1935, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.09375 = fieldNorm(doc=1935)
  0.42857143 = coord(3/7)

Footnote: Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willet. San Francisco: Morgan Kaufmann 1997. S.281-286.
Type: a

Sparck Jones, K.: Search term relevance weighting given little relevance information (1979) 0.04

0.03897444 = product of:
  0.09094036 = sum of:
    0.031747986 = product of:
      0.06349597 = sum of:
        0.06349597 = weight(_text_:p in 1939) [ClassicSimilarity], result of:
          0.06349597 = score(doc=1939,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.47670212 = fieldWeight in 1939, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.09375 = fieldNorm(doc=1939)
      0.5 = coord(1/2)
    0.05266229 = weight(_text_:u in 1939) [ClassicSimilarity], result of:
      0.05266229 = score(doc=1939,freq=2.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.43413407 = fieldWeight in 1939, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.09375 = fieldNorm(doc=1939)
    0.006530081 = weight(_text_:a in 1939) [ClassicSimilarity], result of:
      0.006530081 = score(doc=1939,freq=2.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.15287387 = fieldWeight in 1939, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.09375 = fieldNorm(doc=1939)
  0.42857143 = coord(3/7)

Footnote: Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.329-338.
Type: a

Al-Hawamdeh, S.; Smith, G.; Willett, P.; Vere, R. de: Using nearest-neighbour searching techniques to access full-text documents (1991) 0.03

0.033039592 = product of:
  0.07709238 = sum of:
    0.021165324 = product of:
      0.04233065 = sum of:
        0.04233065 = weight(_text_:p in 2300) [ClassicSimilarity], result of:
          0.04233065 = score(doc=2300,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.31780142 = fieldWeight in 2300, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.0625 = fieldNorm(doc=2300)
      0.5 = coord(1/2)
    0.046192586 = weight(_text_:g in 2300) [ClassicSimilarity], result of:
      0.046192586 = score(doc=2300,freq=2.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.331982 = fieldWeight in 2300, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.0625 = fieldNorm(doc=2300)
    0.0097344695 = weight(_text_:a in 2300) [ClassicSimilarity], result of:
      0.0097344695 = score(doc=2300,freq=10.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.22789092 = fieldWeight in 2300, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0625 = fieldNorm(doc=2300)
  0.42857143 = coord(3/7)

Abstract: Summarises the results to date of a continuing programme of research at Sheffield Univ. to investigate the use of nearest-neighbour retrieval algorithms for full text searching. Given a natural language query statement, the research methods result in a ranking of the paragraphs comprising a full text document in order of decreasing similarity with the query, where the similarity for each paragraph is determined by the number of keyword stems that it has in common with the query
Type: a

Jones, G.; Robertson, A.M.; Willett, P.: ¬An introduction to genetic algorithms and to their use in information retrieval (1994) 0.03

0.03209923 = product of:
  0.0748982 = sum of:
    0.021165324 = product of:
      0.04233065 = sum of:
        0.04233065 = weight(_text_:p in 7415) [ClassicSimilarity], result of:
          0.04233065 = score(doc=7415,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.31780142 = fieldWeight in 7415, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.0625 = fieldNorm(doc=7415)
      0.5 = coord(1/2)
    0.046192586 = weight(_text_:g in 7415) [ClassicSimilarity], result of:
      0.046192586 = score(doc=7415,freq=2.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.331982 = fieldWeight in 7415, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.0625 = fieldNorm(doc=7415)
    0.007540288 = weight(_text_:a in 7415) [ClassicSimilarity], result of:
      0.007540288 = score(doc=7415,freq=6.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.17652355 = fieldWeight in 7415, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0625 = fieldNorm(doc=7415)
  0.42857143 = coord(3/7)

Abstract: This paper provides an introduction to genetic algorithms, a new approach to the investigation of computationally-intensive problems that may be insoluble using conventional, deterministic approaches. A genetic algorithm takes an initial set of possible starting solutions and then iteratively improves theses solutions using operators that are analogous to those involved in Darwinian evolution. The approach is illusrated by reference to several problems in information retrieval
Type: a

Cross-language information retrieval (1998) 0.03
```
0.03103238 = product of:
  0.054306664 = sum of:
    0.006614164 = product of:
      0.013228328 = sum of:
        0.013228328 = weight(_text_:p in 6299) [ClassicSimilarity], result of:
          0.013228328 = score(doc=6299,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.099312946 = fieldWeight in 6299, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.01953125 = fieldNorm(doc=6299)
      0.5 = coord(1/2)
    0.020414433 = weight(_text_:g in 6299) [ClassicSimilarity], result of:
      0.020414433 = score(doc=6299,freq=4.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.1467167 = fieldWeight in 6299, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.01953125 = fieldNorm(doc=6299)
    0.01900287 = weight(_text_:u in 6299) [ClassicSimilarity], result of:
      0.01900287 = score(doc=6299,freq=6.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.15665466 = fieldWeight in 6299, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.01953125 = fieldNorm(doc=6299)
    0.008275194 = weight(_text_:a in 6299) [ClassicSimilarity], result of:
      0.008275194 = score(doc=6299,freq=74.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.19372822 = fieldWeight in 6299, product of:
          8.602325 = tf(freq=74.0), with freq of:
            74.0 = termFreq=74.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.01953125 = fieldNorm(doc=6299)
  0.5714286 = coord(4/7)
```
Content

Enthält die Beiträge: GREFENSTETTE, G.: The Problem of Cross-Language Information Retrieval; DAVIS, M.W.: On the Effective Use of Large Parallel Corpora in Cross-Language Text Retrieval; BALLESTEROS, L. u. W.B. CROFT: Statistical Methods for Cross-Language Information Retrieval; Distributed Cross-Lingual Information Retrieval; Automatic Cross-Language Information Retrieval Using Latent Semantic Indexing; EVANS, D.A. u.a.: Mapping Vocabularies Using Latent Semantics; PICCHI, E. u. C. PETERS: Cross-Language Information Retrieval: A System for Comparable Corpus Querying; YAMABANA, K. u.a.: A Language Conversion Front-End for Cross-Language Information Retrieval; GACHOT, D.A. u.a.: The Systran NLP Browser: An Application of Machine Translation Technology in Cross-Language Information Retrieval; HULL, D.: A Weighted Boolean Model for Cross-Language Text Retrieval; SHERIDAN, P. u.a. Building a Large Multilingual Test Collection from Comparable News Documents; OARD; D.W. u. B.J. DORR: Evaluating Cross-Language Text Filtering Effectiveness

Editor

Grefenstette, G.

Footnote

Rez. in: Machine translation review: 1999, no.10, S.26-27 (D. Lewis): "Cross Language Information Retrieval (CLIR) addresses the growing need to access large volumes of data across language boundaries. The typical requirement is for the user to input a free form query, usually a brief description of a topic, into a search or retrieval engine which returns a list, in ranked order, of documents or web pages that are relevant to the topic. The search engine matches the terms in the query to indexed terms, usually keywords previously derived from the target documents. Unlike monolingual information retrieval, CLIR requires query terms in one language to be matched to indexed terms in another. Matching can be done by bilingual dictionary lookup, full machine translation, or by applying statistical methods. A query's success is measured in terms of recall (how many potentially relevant target documents are found) and precision (what proportion of documents found are relevant). Issues in CLIR are how to translate query terms into index terms, how to eliminate alternative translations (e.g. to decide that French 'traitement' in a query means 'treatment' and not 'salary'), and how to rank or weight translation alternatives that are retained (e.g. how to order the French terms 'aventure', 'business', 'affaire', and 'liaison' as relevant translations of English 'affair'). Grefenstette provides a lucid and useful overview of the field and the problems. The volume brings together a number of experiments and projects in CLIR. Mark Davies (New Mexico State University) describes Recuerdo, a Spanish retrieval engine which reduces translation ambiguities by scanning indexes for parallel texts; it also uses either a bilingual dictionary or direct equivalents from a parallel corpus in order to compare results for queries on parallel texts. Lisa Ballesteros and Bruce Croft (University of Massachusetts) use a 'local feedback' technique which automatically enhances a query by adding extra terms to it both before and after translation; such terms can be derived from documents known to be relevant to the query.
Christian Fluhr at al (DIST/SMTI, France) outline the EMIR (European Multilingual Information Retrieval) and ESPRIT projects. They found that using SYSTRAN to machine translate queries and to access material from various multilingual databases produced less relevant results than a method referred to as 'multilingual reformulation' (the mechanics of which are only hinted at). An interesting technique is Latent Semantic Indexing (LSI), described by Michael Littman et al (Brown University) and, most clearly, by David Evans et al (Carnegie Mellon University). LSI involves creating matrices of documents and the terms they contain and 'fitting' related documents into a reduced matrix space. This effectively allows queries to be mapped onto a common semantic representation of the documents. Eugenio Picchi and Carol Peters (Pisa) report on a procedure to create links between translation equivalents in an Italian-English parallel corpus. The links are used to construct parallel linguistic contexts in real-time for any term or combination of terms that is being searched for in either language. Their interest is primarily lexicographic but they plan to apply the same procedure to comparable corpora, i.e. to texts which are not translations of each other but which share the same domain. Kiyoshi Yamabana et al (NEC, Japan) address the issue of how to disambiguate between alternative translations of query terms. Their DMAX (double maximise) method looks at co-occurrence frequencies between both source language words and target language words in order to arrive at the most probable translation. The statistical data for the decision are derived, not from the translation texts but independently from monolingual corpora in each language. An interactive user interface allows the user to influence the selection of terms during the matching process. Denis Gachot et al (SYSTRAN) describe the SYSTRAN NLP browser, a prototype tool which collects parsing information derived from a text or corpus previously translated with SYSTRAN. The user enters queries into the browser in either a structured or free form and receives grammatical and lexical information about the source text and/or its translation.
The retrieved output from a query including the phrase 'big rockets' may be, for instance, a sentence containing 'giant rocket' which is semantically ranked above 'military ocket'. David Hull (Xerox Research Centre, Grenoble) describes an implementation of a weighted Boolean model for Spanish-English CLIR. Users construct Boolean-type queries, weighting each term in the query, which is then translated by an on-line dictionary before being applied to the database. Comparisons with the performance of unweighted free-form queries ('vector space' models) proved encouraging. Two contributions consider the evaluation of CLIR systems. In order to by-pass the time-consuming and expensive process of assembling a standard collection of documents and of user queries against which the performance of an CLIR system is manually assessed, Páriac Sheridan et al (ETH Zurich) propose a method based on retrieving 'seed documents'. This involves identifying a unique document in a database (the 'seed document') and, for a number of queries, measuring how fast it is retrieved. The authors have also assembled a large database of multilingual news documents for testing purposes. By storing the (fairly short) documents in a structured form tagged with descriptor codes (e.g. for topic, country and area), the test suite is easily expanded while remaining consistent for the purposes of testing. Douglas Ouard and Bonne Dorr (University of Maryland) describe an evaluation methodology which appears to apply LSI techniques in order to filter and rank incoming documents designed for testing CLIR systems. The volume provides the reader an excellent overview of several projects in CLIR. It is well supported with references and is intended as a secondary text for researchers and practitioners. It highlights the need for a good, general tutorial introduction to the field."

Salton, G.: ¬A simple blueprint for automatic Boolean query processing (1988) 0.03

0.029913833 = product of:
  0.10469841 = sum of:
    0.09238517 = weight(_text_:g in 6774) [ClassicSimilarity], result of:
      0.09238517 = score(doc=6774,freq=2.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.663964 = fieldWeight in 6774, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.125 = fieldNorm(doc=6774)
    0.012313238 = weight(_text_:a in 6774) [ClassicSimilarity], result of:
      0.012313238 = score(doc=6774,freq=4.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.28826174 = fieldWeight in 6774, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.125 = fieldNorm(doc=6774)
  0.2857143 = coord(2/7)

Type: a

Salton, G.; Buckley, C.: Parallel text search methods (1988) 0.03

0.028883414 = product of:
  0.10109194 = sum of:
    0.09238517 = weight(_text_:g in 404) [ClassicSimilarity], result of:
      0.09238517 = score(doc=404,freq=2.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.663964 = fieldWeight in 404, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.125 = fieldNorm(doc=404)
    0.008706774 = weight(_text_:a in 404) [ClassicSimilarity], result of:
      0.008706774 = score(doc=404,freq=2.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.20383182 = fieldWeight in 404, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.125 = fieldNorm(doc=404)
  0.2857143 = coord(2/7)

Type: a

Wu, H.; Salton, G.: ¬The estimation of term relevance weights using relevance feedback (1981) 0.03

0.028883414 = product of:
  0.10109194 = sum of:
    0.09238517 = weight(_text_:g in 4728) [ClassicSimilarity], result of:
      0.09238517 = score(doc=4728,freq=2.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.663964 = fieldWeight in 4728, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.125 = fieldNorm(doc=4728)
    0.008706774 = weight(_text_:a in 4728) [ClassicSimilarity], result of:
      0.008706774 = score(doc=4728,freq=2.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.20383182 = fieldWeight in 4728, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.125 = fieldNorm(doc=4728)
  0.2857143 = coord(2/7)

Type: a

Srinivasan, P.: Query expansion and MEDLINE (1996) 0.03

0.027848696 = product of:
  0.06498029 = sum of:
    0.021165324 = product of:
      0.04233065 = sum of:
        0.04233065 = weight(_text_:p in 8453) [ClassicSimilarity], result of:
          0.04233065 = score(doc=8453,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.31780142 = fieldWeight in 8453, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.0625 = fieldNorm(doc=8453)
      0.5 = coord(1/2)
    0.035108197 = weight(_text_:u in 8453) [ClassicSimilarity], result of:
      0.035108197 = score(doc=8453,freq=2.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.28942272 = fieldWeight in 8453, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.0625 = fieldNorm(doc=8453)
    0.008706774 = weight(_text_:a in 8453) [ClassicSimilarity], result of:
      0.008706774 = score(doc=8453,freq=8.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.20383182 = fieldWeight in 8453, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0625 = fieldNorm(doc=8453)
  0.42857143 = coord(3/7)

Abstract: Evaluates the retrieval effectiveness of query expansion strategies on a test collection of the medical database MEDLINE using Cornell University's SMART retrieval system. Tests 3 expansion strategies for their ability to identify appropriate MeSH terms for user queries. Compares retrieval effectiveness using the original unexpanded and the alternative expanded user queries on a collection of 75 queries and 2.334 Medline citations. Recommends query expansions using retrieval feedback for adding MeSH search terms to a user's initial query
Theme: Semantisches Umfeld in Indexierung u. Retrieval
Type: a

Faloutsos, C.: Signature files (1992) 0.03

0.027382165 = product of:
  0.06389172 = sum of:
    0.035108197 = weight(_text_:u in 3499) [ClassicSimilarity], result of:
      0.035108197 = score(doc=3499,freq=2.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.28942272 = fieldWeight in 3499, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.0625 = fieldNorm(doc=3499)
    0.008706774 = weight(_text_:a in 3499) [ClassicSimilarity], result of:
      0.008706774 = score(doc=3499,freq=8.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.20383182 = fieldWeight in 3499, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0625 = fieldNorm(doc=3499)
    0.020076746 = product of:
      0.040153492 = sum of:
        0.040153492 = weight(_text_:22 in 3499) [ClassicSimilarity], result of:
          0.040153492 = score(doc=3499,freq=2.0), product of:
            0.12972787 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03704574 = queryNorm
            0.30952093 = fieldWeight in 3499, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=3499)
      0.5 = coord(1/2)
  0.42857143 = coord(3/7)

Abstract: Presents a survey and discussion on signature-based text retrieval methods. It describes the main idea behind the signature approach and its advantages over other text retrieval methods, it provides a classification of the signature methods that have appeared in the literature, it describes the main representatives of each class, together with the relative advantages and drawbacks, and it gives a list of applications as well as commercial or university prototypes that use the signature approach
Date: 7. 5.1999 15:22:48
Source: Information retrieval: data structures and algorithms. Ed.: W.B. Frakes u. R. Baeza-Yates
Type: a

Ciocca, G.; Schettini, R.: ¬A relevance feedback mechanism for content-based image retrieval (1999) 0.03

0.026174603 = product of:
  0.09161111 = sum of:
    0.080837026 = weight(_text_:g in 6498) [ClassicSimilarity], result of:
      0.080837026 = score(doc=6498,freq=2.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.5809685 = fieldWeight in 6498, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.109375 = fieldNorm(doc=6498)
    0.010774084 = weight(_text_:a in 6498) [ClassicSimilarity], result of:
      0.010774084 = score(doc=6498,freq=4.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.25222903 = fieldWeight in 6498, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.109375 = fieldNorm(doc=6498)
  0.2857143 = coord(2/7)

Type: a

Karlsson, A.; Hammarfelt, B.; Steinhauer, H.J.; Falkman, G.; Olson, N.; Nelhans, G.; Nolin, J.: Modeling uncertainty in bibliometrics and information retrieval : an information fusion approach (2015) 0.03

0.025529573 = product of:
  0.0893535 = sum of:
    0.08165773 = weight(_text_:g in 1696) [ClassicSimilarity], result of:
      0.08165773 = score(doc=1696,freq=4.0), product of:
        0.13914184 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.03704574 = queryNorm
        0.5868668 = fieldWeight in 1696, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.078125 = fieldNorm(doc=1696)
    0.007695774 = weight(_text_:a in 1696) [ClassicSimilarity], result of:
      0.007695774 = score(doc=1696,freq=4.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.18016359 = fieldWeight in 1696, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.078125 = fieldNorm(doc=1696)
  0.2857143 = coord(2/7)

Type: a

Bhogal, J.; Macfarlane, A.; Smith, P.: ¬A review of ontology based query expansion (2007) 0.02

0.024752997 = product of:
  0.05775699 = sum of:
    0.018519659 = product of:
      0.037039317 = sum of:
        0.037039317 = weight(_text_:p in 919) [ClassicSimilarity], result of:
          0.037039317 = score(doc=919,freq=2.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.27807623 = fieldWeight in 919, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.0546875 = fieldNorm(doc=919)
      0.5 = coord(1/2)
    0.030719671 = weight(_text_:u in 919) [ClassicSimilarity], result of:
      0.030719671 = score(doc=919,freq=2.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.25324488 = fieldWeight in 919, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.0546875 = fieldNorm(doc=919)
    0.008517661 = weight(_text_:a in 919) [ClassicSimilarity], result of:
      0.008517661 = score(doc=919,freq=10.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.19940455 = fieldWeight in 919, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0546875 = fieldNorm(doc=919)
  0.42857143 = coord(3/7)

Abstract: This paper examines the meaning of context in relation to ontology based query expansion and contains a review of query expansion approaches. The various query expansion approaches include relevance feedback, corpus dependent knowledge models and corpus independent knowledge models. Case studies detailing query expansion using domain-specific and domain-independent ontologies are also included. The penultimate section attempts to synthesise the information obtained from the review and provide success factors in using an ontology for query expansion. Finally the area of further research in applying context from an ontology to query expansion within a newswire domain is described.
Theme: Semantisches Umfeld in Indexierung u. Retrieval
Type: a

Fox, E.; Betrabet, S.; Koushik, M.; Lee, W.: Extended Boolean models (1992) 0.02

0.023704477 = product of:
  0.055310443 = sum of:
    0.022449218 = product of:
      0.044898435 = sum of:
        0.044898435 = weight(_text_:p in 3512) [ClassicSimilarity], result of:
          0.044898435 = score(doc=3512,freq=4.0), product of:
            0.13319843 = queryWeight, product of:
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.03704574 = queryNorm
            0.33707932 = fieldWeight in 3512, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5955126 = idf(docFreq=3298, maxDocs=44218)
              0.046875 = fieldNorm(doc=3512)
      0.5 = coord(1/2)
    0.026331145 = weight(_text_:u in 3512) [ClassicSimilarity], result of:
      0.026331145 = score(doc=3512,freq=2.0), product of:
        0.121304214 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03704574 = queryNorm
        0.21706703 = fieldWeight in 3512, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.046875 = fieldNorm(doc=3512)
    0.006530081 = weight(_text_:a in 3512) [ClassicSimilarity], result of:
      0.006530081 = score(doc=3512,freq=8.0), product of:
        0.04271548 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.03704574 = queryNorm
        0.15287387 = fieldWeight in 3512, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=3512)
  0.42857143 = coord(3/7)

Abstract: The classical interpretation of Boolean operators in an information retrieval system is in general too strict. A standard Boolean query rarely comes close to retrieving all and only those documents which are relevant to a query. Many models have been proposed with the aim of softening the interpretation of the Boolean operators in order to improve the precision and recall of the search results. This chapter discusses 3 such models: the Mixed Min and Max (MMM), the Paice, and the P-noem models. The MMM and Paice models are essentially variations of the classical fuzzy-set model, while the P-norm scheme is a distance-based approach. Our experimental results indicate that each of the above models provide better performance than the classical Boolean model in terms of retrieval effectiveness
Source: Information retrieval: data structures and algorithms. Ed.: W.B. Frakes u. R. Baeza-Yates
Type: a

Search (358 results, page 1 of 18)

Authors

Years

Languages

Types

Themes

Subjects

Classifications