Search (16 results, page 1 of 1)

  • theme_ss:"Retrievalalgorithmen"
  • year_i:[2000 TO 2010}
  1. Cannane, A.; Williams, H.E.: General-purpose compression for efficient retrieval (2001) 0.14
    0.13827354 = product of:
      0.27654707 = sum of:
        0.27654707 = product of:
          0.55309415 = sum of:
            0.55309415 = weight(_text_:compression in 5705) [ClassicSimilarity], result of:
              0.55309415 = score(doc=5705,freq=20.0), product of:
                0.36069217 = queryWeight, product of:
                  7.314861 = idf(docFreq=79, maxDocs=44218)
                  0.049309507 = queryNorm
                1.5334243 = fieldWeight in 5705, product of:
                  4.472136 = tf(freq=20.0), with freq of:
                    20.0 = termFreq=20.0
                  7.314861 = idf(docFreq=79, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5705)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Compression of databases not only reduces space requirements but can also reduce overall retrieval times. In text databases, compression of documents based on semistatic modeling with words has been shown to be both practical and fast. Similarly, for specific applications, such as databases of integers or scientific databases, specially designed semistatic compression schemes work well. We propose a scheme for general-purpose compression that can be applied to all types of data stored in large collections. We describe our approach, which we call RAY, in detail, and show experimentally the compression available, compression and decompression costs, and performance as a stream and random-access technique. We show that, in many cases, RAY achieves better compression than an efficient Huffman scheme and popular adaptive compression techniques, and that it can be used as an efficient general-purpose compression scheme.
  2. Cheng, C.-S.; Chung, C.-P.; Shann, J.J.-J.: Fast query evaluation through document identifier assignment for inverted file-based information retrieval systems (2006) 0.05
    0.0515315 = product of:
      0.103063 = sum of:
        0.103063 = product of:
          0.206126 = sum of:
            0.206126 = weight(_text_:compression in 979) [ClassicSimilarity], result of:
              0.206126 = score(doc=979,freq=4.0), product of:
                0.36069217 = queryWeight, product of:
                  7.314861 = idf(docFreq=79, maxDocs=44218)
                  0.049309507 = queryNorm
                0.5714735 = fieldWeight in 979, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  7.314861 = idf(docFreq=79, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=979)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Compressing an inverted file can greatly improve query performance of an information retrieval system (IRS) by reducing disk I/Os. We observe that a good document identifier assignment (DIA) can make the document identifiers in the posting lists more clustered, and result in better compression as well as shorter query processing time. In this paper, we tackle the NP-complete problem of finding an optimal DIA to minimize the average query processing time in an IRS when the probability distribution of query terms is given. We indicate that the greedy nearest neighbor (Greedy-NN) algorithm can provide excellent performance for this problem. However, the Greedy-NN algorithm is inappropriate if used in large-scale IRSs, due to its high complexity O(N² × n), where N denotes the number of documents and n denotes the number of distinct terms. In real-world IRSs, the distribution of query terms is skewed. Based on this fact, we propose a fast O(N × n) heuristic, called partition-based document identifier assignment (PBDIA) algorithm, which can efficiently assign consecutive document identifiers to those documents containing frequently used query terms, and improve compression efficiency of the posting lists for those terms. This can result in reduced query processing time. The experimental results show that the PBDIA algorithm can yield a competitive performance versus the Greedy-NN for the DIA problem, and that this optimization problem has significant advantages for both long queries and parallel information retrieval (IR).
  3. Back, J.: An evaluation of relevancy ranking techniques used by Internet search engines (2000) 0.02
    0.02338265 = product of:
      0.0467653 = sum of:
        0.0467653 = product of:
          0.0935306 = sum of:
            0.0935306 = weight(_text_:22 in 3445) [ClassicSimilarity], result of:
              0.0935306 = score(doc=3445,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.5416616 = fieldWeight in 3445, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=3445)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    25. 8.2005 17:42:22
  4. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing for passage retrieval (2004) 0.01
    0.0133615155 = product of:
      0.026723031 = sum of:
        0.026723031 = product of:
          0.053446062 = sum of:
            0.053446062 = weight(_text_:22 in 5108) [ClassicSimilarity], result of:
              0.053446062 = score(doc=5108,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.30952093 = fieldWeight in 5108, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5108)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    20. 1.2007 18:30:22
  5. Losada, D.E.; Barreiro, A.: Embedding term similarity and inverse document frequency into a logical model of information retrieval (2003) 0.01
    0.0133615155 = product of:
      0.026723031 = sum of:
        0.026723031 = product of:
          0.053446062 = sum of:
            0.053446062 = weight(_text_:22 in 1422) [ClassicSimilarity], result of:
              0.053446062 = score(doc=1422,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.30952093 = fieldWeight in 1422, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1422)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2003 19:27:23
  6. Kanaeva, Z.: Ranking: Google und CiteSeer (2005) 0.01
    0.011691325 = product of:
      0.02338265 = sum of:
        0.02338265 = product of:
          0.0467653 = sum of:
            0.0467653 = weight(_text_:22 in 3276) [ClassicSimilarity], result of:
              0.0467653 = score(doc=3276,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.2708308 = fieldWeight in 3276, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3276)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    20. 3.2005 16:23:22
  7. Crestani, F.; Dominich, S.; Lalmas, M.; Rijsbergen, C.J.K. van: Mathematical, logical, and formal methods in information retrieval : an introduction to the special issue (2003) 0.01
    0.010021136 = product of:
      0.020042272 = sum of:
        0.020042272 = product of:
          0.040084545 = sum of:
            0.040084545 = weight(_text_:22 in 1451) [ClassicSimilarity], result of:
              0.040084545 = score(doc=1451,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.23214069 = fieldWeight in 1451, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1451)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2003 19:27:36
  8. Fan, W.; Fox, E.A.; Pathak, P.; Wu, H.: The effects of fitness functions on genetic programming-based ranking discovery for Web search (2004) 0.01
    0.010021136 = product of:
      0.020042272 = sum of:
        0.020042272 = product of:
          0.040084545 = sum of:
            0.040084545 = weight(_text_:22 in 2239) [ClassicSimilarity], result of:
              0.040084545 = score(doc=2239,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.23214069 = fieldWeight in 2239, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2239)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    31. 5.2004 19:22:06
  9. Furner, J.: A unifying model of document relatedness for hybrid search engines (2003) 0.01
    0.010021136 = product of:
      0.020042272 = sum of:
        0.020042272 = product of:
          0.040084545 = sum of:
            0.040084545 = weight(_text_:22 in 2717) [ClassicSimilarity], result of:
              0.040084545 = score(doc=2717,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.23214069 = fieldWeight in 2717, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2717)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    11. 9.2004 17:32:22
  10. Witschel, H.F.: Global term weights in distributed environments (2008) 0.01
    0.010021136 = product of:
      0.020042272 = sum of:
        0.020042272 = product of:
          0.040084545 = sum of:
            0.040084545 = weight(_text_:22 in 2096) [ClassicSimilarity], result of:
              0.040084545 = score(doc=2096,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.23214069 = fieldWeight in 2096, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2096)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 8.2008 9:44:22
  11. Klas, C.-P.; Fuhr, N.; Schaefer, A.: Evaluating strategic support for information access in the DAFFODIL system (2004) 0.01
    0.010021136 = product of:
      0.020042272 = sum of:
        0.020042272 = product of:
          0.040084545 = sum of:
            0.040084545 = weight(_text_:22 in 2419) [ClassicSimilarity], result of:
              0.040084545 = score(doc=2419,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.23214069 = fieldWeight in 2419, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2419)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    16.11.2008 16:22:48
  12. Campos, L.M. de; Fernández-Luna, J.M.; Huete, J.F.: Implementing relevance feedback in the Bayesian network retrieval model (2003) 0.01
    0.010021136 = product of:
      0.020042272 = sum of:
        0.020042272 = product of:
          0.040084545 = sum of:
            0.040084545 = weight(_text_:22 in 825) [ClassicSimilarity], result of:
              0.040084545 = score(doc=825,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.23214069 = fieldWeight in 825, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=825)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2003 19:30:19
  13. Song, D.; Bruza, P.D.: Towards context sensitive information inference (2003) 0.01
    0.008350947 = product of:
      0.016701894 = sum of:
        0.016701894 = product of:
          0.033403788 = sum of:
            0.033403788 = weight(_text_:22 in 1428) [ClassicSimilarity], result of:
              0.033403788 = score(doc=1428,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.19345059 = fieldWeight in 1428, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1428)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2003 19:35:46
  14. Shiri, A.A.; Revie, C.: Query expansion behavior within a thesaurus-enhanced search environment : a user-centered evaluation (2006) 0.01
    0.008350947 = product of:
      0.016701894 = sum of:
        0.016701894 = product of:
          0.033403788 = sum of:
            0.033403788 = weight(_text_:22 in 56) [ClassicSimilarity], result of:
              0.033403788 = score(doc=56,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.19345059 = fieldWeight in 56, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=56)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 16:32:43
  15. Dominich, S.: Mathematical foundations of information retrieval (2001) 0.01
    0.008350947 = product of:
      0.016701894 = sum of:
        0.016701894 = product of:
          0.033403788 = sum of:
            0.033403788 = weight(_text_:22 in 1753) [ClassicSimilarity], result of:
              0.033403788 = score(doc=1753,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.19345059 = fieldWeight in 1753, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1753)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2008 12:26:32
  16. Khoo, C.S.G.; Wan, K.-W.: A simple relevancy-ranking strategy for an interface to Boolean OPACs (2004) 0.01
    0.0058456627 = product of:
      0.011691325 = sum of:
        0.011691325 = product of:
          0.02338265 = sum of:
            0.02338265 = weight(_text_:22 in 2509) [ClassicSimilarity], result of:
              0.02338265 = score(doc=2509,freq=2.0), product of:
                0.1726735 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049309507 = queryNorm
                0.1354154 = fieldWeight in 2509, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=2509)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Electronic library. 22(2004) no.2, pp.112-120
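
The score breakdowns in this listing have the shape of a Lucene ClassicSimilarity (TF-IDF) explanation. The sketch below recomputes one such tree from its leaf values. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the tf and idf figures shown above; queryNorm, fieldNorm, and the two coord factors are copied from the listing rather than derived. The helper name classic_score is hypothetical and not part of any library.

```python
import math


def classic_score(freq, doc_freq, max_docs, query_norm, field_norm, coords=(0.5, 0.5)):
    """Recompute one explain tree from the listing (hypothetical helper)."""
    tf = math.sqrt(freq)                              # e.g. 4.472136 for freq=20
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))   # e.g. 7.314861 for docFreq=79
    query_weight = idf * query_norm                   # "queryWeight" line
    field_weight = tf * idf * field_norm              # "fieldWeight" line
    score = query_weight * field_weight               # "weight(...)" line
    for c in coords:                                  # the two coord(1/2) factors
        score *= c
    return score


# Result 1 (doc 5705, term "compression"); the listing shows 0.13827354.
print(classic_score(20.0, 79, 44218, 0.049309507, 0.046875))
# Result 3 (doc 3445, term "22"); the listing shows 0.02338265.
print(classic_score(2.0, 3622, 44218, 0.049309507, 0.109375))
```

The engine works in single precision, so the printed values should agree with the listing only to about six decimal places. For results 3 to 16 the term, freq, docFreq, and queryNorm are identical, so under these assumptions their ranking differs only through fieldNorm.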