Search (120 results, page 1 of 6)

  • × theme_ss:"Retrievalalgorithmen"
  1. Ding, Y.; Yan, E.; Frazho, A.; Caverlee, J.: PageRank for ranking authors in co-citation networks (2009) 0.05
    0.0536729 = sum of:
      0.043685135 = product of:
        0.17474054 = sum of:
          0.17474054 = weight(_text_:authors in 3161) [ClassicSimilarity], result of:
            0.17474054 = score(doc=3161,freq=12.0), product of:
              0.2360532 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.05177952 = queryNorm
              0.7402591 = fieldWeight in 3161, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.046875 = fieldNorm(doc=3161)
        0.25 = coord(1/4)
      0.009987764 = product of:
        0.029963292 = sum of:
          0.029963292 = weight(_text_:h in 3161) [ClassicSimilarity], result of:
            0.029963292 = score(doc=3161,freq=4.0), product of:
              0.12864359 = queryWeight, product of:
                2.4844491 = idf(docFreq=10020, maxDocs=44218)
                0.05177952 = queryNorm
              0.2329171 = fieldWeight in 3161, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4844491 = idf(docFreq=10020, maxDocs=44218)
                0.046875 = fieldNorm(doc=3161)
        0.33333334 = coord(1/3)
    
    Abstract
    This paper studies how varied damping factors in the PageRank algorithm influence the ranking of authors and proposes weighted PageRank algorithms. We selected the 108 most highly cited authors in the information retrieval (IR) area from the 1970s to 2008 to form the author co-citation network. We calculated the ranks of these 108 authors based on PageRank with the damping factor ranging from 0.05 to 0.95. In order to test the relationship between different measures, we compared PageRank and weighted PageRank results with the citation ranking, h-index, and centrality measures. We found that in our author co-citation network, citation rank is highly correlated with PageRank with different damping factors and also with different weighted PageRank algorithms; citation rank and PageRank are not significantly correlated with centrality measures; and h-index rank does not significantly correlate with centrality measures but does significantly correlate with other measures. The key factors that have impact on the PageRank of authors in the author co-citation network are being co-cited with important authors.
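
The score breakdown shown for result 1 can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function name `classic_term_score` is hypothetical; the numeric inputs are copied from the explain output above.

```python
import math


def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    """Score of one term in one document, following the explain tree layout."""
    tf = math.sqrt(freq)                              # tf(freq)
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))   # idf(docFreq, maxDocs)
    query_weight = idf * query_norm                   # queryWeight
    field_weight = tf * idf * field_norm              # fieldWeight
    return query_weight * field_weight


# Values taken from the explain output of result 1 (doc=3161).
MAX_DOCS = 44218
QUERY_NORM = 0.05177952
FIELD_NORM = 0.046875

authors = classic_term_score(12.0, 1258, MAX_DOCS, QUERY_NORM, FIELD_NORM)
h = classic_term_score(4.0, 10020, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# Each clause is scaled by its coord factor before the clauses are summed.
total = authors * (1 / 4) + h * (1 / 3)
print(round(total, 5))  # agrees with the listed 0.0536729 to within rounding
```

Small differences in the last digits are expected, because Lucene computes these values in 32-bit floats while Python uses 64-bit floats.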
  2. Khoo, C.S.G.; Wan, K.-W.: ¬A simple relevancy-ranking strategy for an interface to Boolean OPACs (2004) 0.04
    0.04378338 = sum of:
      0.010403389 = product of:
        0.041613556 = sum of:
          0.041613556 = weight(_text_:authors in 2509) [ClassicSimilarity], result of:
            0.041613556 = score(doc=2509,freq=2.0), product of:
              0.2360532 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.05177952 = queryNorm
              0.17628889 = fieldWeight in 2509, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.02734375 = fieldNorm(doc=2509)
        0.25 = coord(1/4)
      0.033379994 = product of:
        0.050069988 = sum of:
          0.025516056 = weight(_text_:k in 2509) [ClassicSimilarity], result of:
            0.025516056 = score(doc=2509,freq=2.0), product of:
              0.1848414 = queryWeight, product of:
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.05177952 = queryNorm
              0.13804297 = fieldWeight in 2509, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.02734375 = fieldNorm(doc=2509)
          0.024553934 = weight(_text_:22 in 2509) [ClassicSimilarity], result of:
            0.024553934 = score(doc=2509,freq=2.0), product of:
              0.18132305 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.05177952 = queryNorm
              0.1354154 = fieldWeight in 2509, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.02734375 = fieldNorm(doc=2509)
        0.6666667 = coord(2/3)
    
    Abstract
    A relevancy-ranking algorithm for a natural language interface to Boolean online public access catalogs (OPACs) was formulated and compared with that currently used in a knowledge-based search interface called the E-Referencer, being developed by the authors. The algorithm makes use of seven well-known ranking criteria: breadth of match, section weighting, proximity of query words, variant word forms (stemming), document frequency, term frequency and document length. The algorithm converts a natural language query into a series of increasingly broader Boolean search statements. In a small experiment with ten subjects in which the algorithm was simulated by hand, the algorithm obtained good results with a mean overall precision of 0.42 and mean average precision of 0.62, representing a 27 percent improvement in precision and 41 percent improvement in average precision compared to the E-Referencer. The usefulness of each step in the algorithm was analyzed and suggestions are made for improving the algorithm.
    Source
    Electronic library. 22(2004) no.2, S.112-120
  3. Fuhr, N.: Ranking-Experimente mit gewichteter Indexierung (1986) 0.04
    0.04218647 = product of:
      0.08437294 = sum of:
        0.08437294 = product of:
          0.1265594 = sum of:
            0.04237449 = weight(_text_:h in 58) [ClassicSimilarity], result of:
              0.04237449 = score(doc=58,freq=2.0), product of:
                0.12864359 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.05177952 = queryNorm
                0.32939452 = fieldWeight in 58, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.09375 = fieldNorm(doc=58)
            0.084184915 = weight(_text_:22 in 58) [ClassicSimilarity], result of:
              0.084184915 = score(doc=58,freq=2.0), product of:
                0.18132305 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05177952 = queryNorm
                0.46428138 = fieldWeight in 58, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=58)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    14. 6.2015 22:12:44
    Source
    Deutscher Dokumentartag 1985, Nürnberg, 1.-4.10.1985: Fachinformation: Methodik - Management - Markt; neue Entwicklungen, Berufe, Produkte. Bearb.: H. Strohl-Goebel
  4. Ding, Y.: Topic-based PageRank on author cocitation networks (2011) 0.04
    0.03795247 = sum of:
      0.030890055 = product of:
        0.12356022 = sum of:
          0.12356022 = weight(_text_:authors in 4348) [ClassicSimilarity], result of:
            0.12356022 = score(doc=4348,freq=6.0), product of:
              0.2360532 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.05177952 = queryNorm
              0.52344227 = fieldWeight in 4348, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.046875 = fieldNorm(doc=4348)
        0.25 = coord(1/4)
      0.0070624156 = product of:
        0.021187246 = sum of:
          0.021187246 = weight(_text_:h in 4348) [ClassicSimilarity], result of:
            0.021187246 = score(doc=4348,freq=2.0), product of:
              0.12864359 = queryWeight, product of:
                2.4844491 = idf(docFreq=10020, maxDocs=44218)
                0.05177952 = queryNorm
              0.16469726 = fieldWeight in 4348, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4844491 = idf(docFreq=10020, maxDocs=44218)
                0.046875 = fieldNorm(doc=4348)
        0.33333334 = coord(1/3)
    
    Abstract
    Ranking authors is vital for identifying a researcher's impact and standing within a scientific field. There are many different ranking methods (e.g., citations, publications, h-index, PageRank, and weighted PageRank), but most of them are topic-independent. This paper proposes topic-dependent ranks based on the combination of a topic model and a weighted PageRank algorithm. The author-conference-topic (ACT) model was used to extract topic distribution of individual authors. Two ways for combining the ACT model with the PageRank algorithm are proposed: simple combination (I_PR) or using a topic distribution as a weighted vector for PageRank (PR_t). Information retrieval was chosen as the test field and representative authors for different topics at different time phases were identified. Principal component analysis (PCA) was applied to analyze the ranking difference between I_PR and PR_t.
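
The idea of using a topic distribution as a weighted vector for PageRank can be illustrated with a generic personalized PageRank. This is a minimal sketch, not the paper's ACT model or its exact PR_t formulation: the function `pagerank`, the 3-author co-citation matrix, and the topic vector are all hypothetical values chosen for illustration.

```python
def pagerank(adj, damping=0.85, teleport=None, iters=100):
    """Power-iteration PageRank on a weighted adjacency matrix.

    adj[i][j] is the (co-citation) weight of the link from node i to node j.
    teleport is the jump distribution; uniform if not given.
    """
    n = len(adj)
    if teleport is None:
        teleport = [1.0 / n] * n
    rank = [1.0 / n] * n
    for _ in range(iters):
        # Random-jump part, distributed according to the teleport vector.
        new = [(1.0 - damping) * t for t in teleport]
        for i, row in enumerate(adj):
            total = sum(row)
            if total == 0:
                # Dangling node: redistribute its rank via the teleport vector.
                for j in range(n):
                    new[j] += damping * rank[i] * teleport[j]
            else:
                for j, w in enumerate(row):
                    new[j] += damping * rank[i] * w / total
        rank = new
    return rank


# Hypothetical symmetric co-citation counts among three authors.
cocitation = [
    [0, 3, 1],
    [3, 0, 1],
    [1, 1, 0],
]

plain = pagerank(cocitation)
# Hypothetical topic weights: author 2 is strongly associated with the topic.
topical = pagerank(cocitation, teleport=[0.1, 0.1, 0.8])
print(plain)
print(topical)
```

With a uniform jump vector the weakly co-cited third author ranks lowest; biasing the jump vector toward that author raises its rank, which is the general effect a topic-weighted vector is meant to produce.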
  5. Chen, Z.; Fu, B.: On the complexity of Rocchio's similarity-based relevance feedback algorithm (2007) 0.03
    0.033168525 = sum of:
      0.02101802 = product of:
        0.08407208 = sum of:
          0.08407208 = weight(_text_:authors in 578) [ClassicSimilarity], result of:
            0.08407208 = score(doc=578,freq=4.0), product of:
              0.2360532 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.05177952 = queryNorm
              0.35615736 = fieldWeight in 578, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=578)
        0.25 = coord(1/4)
      0.012150502 = product of:
        0.036451504 = sum of:
          0.036451504 = weight(_text_:k in 578) [ClassicSimilarity], result of:
            0.036451504 = score(doc=578,freq=2.0), product of:
              0.1848414 = queryWeight, product of:
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.05177952 = queryNorm
              0.19720423 = fieldWeight in 578, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0390625 = fieldNorm(doc=578)
        0.33333334 = coord(1/3)
    
    Abstract
    Rocchio's similarity-based relevance feedback algorithm, one of the most important query reformation methods in information retrieval, is essentially an adaptive learning algorithm from examples in searching for documents represented by a linear classifier. Despite its popularity in various applications, there is little rigorous analysis of its learning complexity in the literature. In this article, the authors prove for the first time that the learning complexity of Rocchio's algorithm is O(d + d**2(log d + log n)) over the discretized vector space {0, ... , n - 1}**d when the inner product similarity measure is used. The upper bound on the learning complexity for searching for documents represented by a monotone linear classifier (q, 0) over {0, ... , n - 1}**d can be improved to, at most, 1 + 2k (n - 1) (log d + log(n - 1)), where k is the number of nonzero components in q. Several lower bounds on the learning complexity are also obtained for Rocchio's algorithm. For example, the authors prove that Rocchio's algorithm has a lower bound Omega((d choose 2) log n) on its learning complexity over the Boolean vector space {0,1}**d.
  6. Soulier, L.; Jabeur, L.B.; Tamine, L.; Bahsoun, W.: On ranking relevant entities in heterogeneous networks using a language-based model (2013) 0.03
    0.032710373 = sum of:
      0.02101802 = product of:
        0.08407208 = sum of:
          0.08407208 = weight(_text_:authors in 664) [ClassicSimilarity], result of:
            0.08407208 = score(doc=664,freq=4.0), product of:
              0.2360532 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.05177952 = queryNorm
              0.35615736 = fieldWeight in 664, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=664)
        0.25 = coord(1/4)
      0.011692351 = product of:
        0.03507705 = sum of:
          0.03507705 = weight(_text_:22 in 664) [ClassicSimilarity], result of:
            0.03507705 = score(doc=664,freq=2.0), product of:
              0.18132305 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.05177952 = queryNorm
              0.19345059 = fieldWeight in 664, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=664)
        0.33333334 = coord(1/3)
    
    Abstract
    A new challenge, accessing multiple relevant entities, arises from the availability of linked heterogeneous data. In this article, we address more specifically the problem of accessing relevant entities, such as publications and authors within a bibliographic network, given an information need. We propose a novel algorithm, called BibRank, that estimates a joint relevance of documents and authors within a bibliographic network. This model ranks each type of entity using a score propagation algorithm with respect to the query topic and the structure of the underlying bi-type information entity network. Evidence sources, namely content-based and network-based scores, are both used to estimate the topical similarity between connected entities. For this purpose, authorship relationships are analyzed through a language model-based score on the one hand and on the other hand, non topically related entities of the same type are detected through marginal citations. The article reports the results of experiments using the Bibrank algorithm for an information retrieval task. The CiteSeerX bibliographic data set forms the basis for the topical query automatic generation and evaluation. We show that a statistically significant improvement over closely related ranking models is achieved.
    Date
    22. 3.2013 19:34:49
  7. Weller, K.; Stock, W.G.: Transitive meronymy : automatic concept-based query expansion using weighted transitive part-whole relations (2008) 0.03
    0.025250189 = product of:
      0.050500378 = sum of:
        0.050500378 = product of:
          0.07575057 = sum of:
            0.05103211 = weight(_text_:k in 1835) [ClassicSimilarity], result of:
              0.05103211 = score(doc=1835,freq=2.0), product of:
                0.1848414 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.05177952 = queryNorm
                0.27608594 = fieldWeight in 1835, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1835)
            0.024718454 = weight(_text_:h in 1835) [ClassicSimilarity], result of:
              0.024718454 = score(doc=1835,freq=2.0), product of:
                0.12864359 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.05177952 = queryNorm
                0.19214681 = fieldWeight in 1835, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1835)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    Information - Wissenschaft und Praxis. 59(2008) H.3, S.165-170
  8. Behnert, C.; Plassmeier, K.; Borst, T.; Lewandowski, D.: Evaluierung von Rankingverfahren für bibliothekarische Informationssysteme (2019) 0.03
    0.025250189 = product of:
      0.050500378 = sum of:
        0.050500378 = product of:
          0.07575057 = sum of:
            0.05103211 = weight(_text_:k in 5023) [ClassicSimilarity], result of:
              0.05103211 = score(doc=5023,freq=2.0), product of:
                0.1848414 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.05177952 = queryNorm
                0.27608594 = fieldWeight in 5023, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5023)
            0.024718454 = weight(_text_:h in 5023) [ClassicSimilarity], result of:
              0.024718454 = score(doc=5023,freq=2.0), product of:
                0.12864359 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.05177952 = queryNorm
                0.19214681 = fieldWeight in 5023, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5023)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    Information - Wissenschaft und Praxis. 70(2019) H.1, S.14-23
  9. Chang, C.-H.; Hsu, C.-C.: Integrating query expansion and conceptual relevance feedback for personalized Web information retrieval (1998) 0.02
    0.024608774 = product of:
      0.04921755 = sum of:
        0.04921755 = product of:
          0.07382632 = sum of:
            0.024718454 = weight(_text_:h in 1319) [ClassicSimilarity], result of:
              0.024718454 = score(doc=1319,freq=2.0), product of:
                0.12864359 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.05177952 = queryNorm
                0.19214681 = fieldWeight in 1319, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1319)
            0.04910787 = weight(_text_:22 in 1319) [ClassicSimilarity], result of:
              0.04910787 = score(doc=1319,freq=2.0), product of:
                0.18132305 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05177952 = queryNorm
                0.2708308 = fieldWeight in 1319, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1319)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    1. 8.1996 22:08:06
  10. Kanaeva, Z.: Ranking: Google und CiteSeer (2005) 0.02
    0.024608774 = product of:
      0.04921755 = sum of:
        0.04921755 = product of:
          0.07382632 = sum of:
            0.024718454 = weight(_text_:h in 3276) [ClassicSimilarity], result of:
              0.024718454 = score(doc=3276,freq=2.0), product of:
                0.12864359 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.05177952 = queryNorm
                0.19214681 = fieldWeight in 3276, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3276)
            0.04910787 = weight(_text_:22 in 3276) [ClassicSimilarity], result of:
              0.04910787 = score(doc=3276,freq=2.0), product of:
                0.18132305 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05177952 = queryNorm
                0.2708308 = fieldWeight in 3276, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3276)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    20. 3.2005 16:23:22
    Source
    Information - Wissenschaft und Praxis. 56(2005) H.2, S.87-92
  11. Fan, W.; Fox, E.A.; Pathak, P.; Wu, H.: ¬The effects of fitness functions on genetic programming-based ranking discovery for Web search (2004) 0.02
    0.021093234 = product of:
      0.04218647 = sum of:
        0.04218647 = product of:
          0.0632797 = sum of:
            0.021187246 = weight(_text_:h in 2239) [ClassicSimilarity], result of:
              0.021187246 = score(doc=2239,freq=2.0), product of:
                0.12864359 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.05177952 = queryNorm
                0.16469726 = fieldWeight in 2239, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2239)
            0.042092457 = weight(_text_:22 in 2239) [ClassicSimilarity], result of:
              0.042092457 = score(doc=2239,freq=2.0), product of:
                0.18132305 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05177952 = queryNorm
                0.23214069 = fieldWeight in 2239, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2239)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    31. 5.2004 19:22:06
  12. Sparck Jones, K.: Search term relevance weighting given little relevance information (1979) 0.02
    0.020620085 = product of:
      0.04124017 = sum of:
        0.04124017 = product of:
          0.123720504 = sum of:
            0.123720504 = weight(_text_:k in 1939) [ClassicSimilarity], result of:
              0.123720504 = score(doc=1939,freq=4.0), product of:
                0.1848414 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.05177952 = queryNorm
                0.66933334 = fieldWeight in 1939, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1939)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Footnote
    Reprinted in: Readings in information retrieval. Ed.: K. Sparck Jones and P. Willett. San Francisco: Morgan Kaufmann 1997. pp. 329-338.
  13. Sparck Jones, K.: ¬A statistical interpretation of term specificity and its application in retrieval (1972) 0.02
    0.019440804 = product of:
      0.038881607 = sum of:
        0.038881607 = product of:
          0.11664482 = sum of:
            0.11664482 = weight(_text_:k in 5187) [ClassicSimilarity], result of:
              0.11664482 = score(doc=5187,freq=2.0), product of:
                0.1848414 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.05177952 = queryNorm
                0.63105357 = fieldWeight in 5187, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.125 = fieldNorm(doc=5187)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  14. Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.02
    0.01870776 = product of:
      0.03741552 = sum of:
        0.03741552 = product of:
          0.11224656 = sum of:
            0.11224656 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
              0.11224656 = score(doc=402,freq=2.0), product of:
                0.18132305 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05177952 = queryNorm
                0.61904186 = fieldWeight in 402, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=402)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 22(1986) no.6, S.465-476
  15. Tsai, C.-F.; Hu, Y.-H.; Chen, Z.-Y.: Factors affecting rocchio-based pseudorelevance feedback in image retrieval (2015) 0.02
    0.018035848 = product of:
      0.036071695 = sum of:
        0.036071695 = product of:
          0.054107543 = sum of:
            0.036451504 = weight(_text_:k in 1607) [ClassicSimilarity], result of:
              0.036451504 = score(doc=1607,freq=2.0), product of:
                0.1848414 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.05177952 = queryNorm
                0.19720423 = fieldWeight in 1607, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1607)
            0.01765604 = weight(_text_:h in 1607) [ClassicSimilarity], result of:
              0.01765604 = score(doc=1607,freq=2.0), product of:
                0.12864359 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.05177952 = queryNorm
                0.13724773 = fieldWeight in 1607, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1607)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    Pseudorelevance feedback (PRF) was proposed to solve the limitation of relevance feedback (RF), which is based on the user-in-the-loop process. In PRF, the top-k retrieved images are regarded as PRF. Although the PRF set contains noise, PRF has proven effective for automatically improving the overall retrieval result. To implement PRF, the Rocchio algorithm has been considered as a reasonable and well-established baseline. However, the performance of Rocchio-based PRF is subject to various representation choices (or factors). In this article, we examine these factors that affect the performance of Rocchio-based PRF, including image-feature representation, the number of top-ranked images, the weighting parameters of Rocchio, and similarity measure. We offer practical insights on how to optimize the performance of Rocchio-based PRF by choosing appropriate representation choices. Our extensive experiments on NUS-WIDE-LITE and Caltech 101 + Corel 5000 data sets show that the optimal feature representation is color moment + wavelet texture in terms of retrieval efficiency and effectiveness. Other representation choices are that using top-20 ranked images as pseudopositive and pseudonegative feedback sets with the equal weight (i.e., 0.5) by the correlation and cosine distance functions can produce the optimal retrieval result.
  16. Bhansali, D.; Desai, H.; Deulkar, K.: ¬A study of different ranking approaches for semantic search (2015) 0.02
    0.018035848 = product of:
      0.036071695 = sum of:
        0.036071695 = product of:
          0.054107543 = sum of:
            0.036451504 = weight(_text_:k in 2696) [ClassicSimilarity], result of:
              0.036451504 = score(doc=2696,freq=2.0), product of:
                0.1848414 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.05177952 = queryNorm
                0.19720423 = fieldWeight in 2696, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2696)
            0.01765604 = weight(_text_:h in 2696) [ClassicSimilarity], result of:
              0.01765604 = score(doc=2696,freq=2.0), product of:
                0.12864359 = queryWeight, product of:
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.05177952 = queryNorm
                0.13724773 = fieldWeight in 2696, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4844491 = idf(docFreq=10020, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2696)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  17. Yu, K.; Tresp, V.; Yu, S.: ¬A nonparametric hierarchical Bayesian framework for information filtering (2004) 0.02
    0.017183406 = product of:
      0.034366813 = sum of:
        0.034366813 = product of:
          0.103100434 = sum of:
            0.103100434 = weight(_text_:k in 4117) [ClassicSimilarity], result of:
              0.103100434 = score(doc=4117,freq=4.0), product of:
                0.1848414 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.05177952 = queryNorm
                0.5577778 = fieldWeight in 4117, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4117)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    SIGIR'04: Proceedings of the 27th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. Ed.: K. Järvelin, et al.
  18. Niemi, T.; Junkkari, M.; Järvelin, K.; Viita, S.: Advanced query language for manipulating complex entities (2004) 0.02
    0.017010704 = product of:
      0.034021407 = sum of:
        0.034021407 = product of:
          0.10206422 = sum of:
            0.10206422 = weight(_text_:k in 4218) [ClassicSimilarity], result of:
              0.10206422 = score(doc=4218,freq=2.0), product of:
                0.1848414 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.05177952 = queryNorm
                0.5521719 = fieldWeight in 4218, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4218)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  19. Jones, K.: Linguistic searching versus relevance ranking : DR-LINK and TARGET (1999) 0.02
    0.017010704 = product of:
      0.034021407 = sum of:
        0.034021407 = product of:
          0.10206422 = sum of:
            0.10206422 = weight(_text_:k in 6423) [ClassicSimilarity], result of:
              0.10206422 = score(doc=6423,freq=2.0), product of:
                0.1848414 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.05177952 = queryNorm
                0.5521719 = fieldWeight in 6423, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6423)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  20. Smeaton, A.F.; Rijsbergen, C.J. van: ¬The retrieval effects of query expansion on a feedback document retrieval system (1983) 0.02
    0.01636929 = product of:
      0.03273858 = sum of:
        0.03273858 = product of:
          0.09821574 = sum of:
            0.09821574 = weight(_text_:22 in 2134) [ClassicSimilarity], result of:
              0.09821574 = score(doc=2134,freq=2.0), product of:
                0.18132305 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05177952 = queryNorm
                0.5416616 = fieldWeight in 2134, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=2134)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    30. 3.2001 13:32:22

Languages

  • e 93
  • d 26
  • m 1

Types

  • a 107
  • m 8
  • s 3
  • x 2
  • el 1
  • r 1