Search (32 results, page 1 of 2)

  • × theme_ss:"Retrievalalgorithmen"
  1. Kang, I.-H.; Kim, G.C.: Integration of multiple evidences based on a query type for web search (2004) 0.04
    0.044735108 = product of:
      0.089470215 = sum of:
        0.089470215 = product of:
          0.17894043 = sum of:
            0.17894043 = weight(_text_:homepage in 2568) [ClassicSimilarity], result of:
              0.17894043 = score(doc=2568,freq=4.0), product of:
                0.33761188 = queryWeight, product of:
                  6.784232 = idf(docFreq=135, maxDocs=44218)
                  0.0497642 = queryNorm
                0.53001815 = fieldWeight in 2568, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  6.784232 = idf(docFreq=135, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2568)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The massive and heterogeneous Web exacerbates IR problems and short user queries make them worse. The contents of web pages are not enough to find answer pages. PageRank compensates for the insufficiencies of content information. The content information and PageRank are combined to get better results. However, static combination of multiple evidences may lower the retrieval performance. We have to use different strategies to meet the need of a user. We can classify user queries as three categories according to users' intent, the topic relevance task, the homepage finding task, and the service finding task. In this paper, we present a user query classification method. The difference of distribution, mutual information, the usage rate as anchor texts and the POS information are used for the classification. After we classified a user query, we apply different algorithms and information for the better results. For the topic relevance task, we emphasize the content information, on the other hand, for the homepage finding task, we emphasize the Link information and the URL information. We could get the best performance when our proposed classification method with the OKAPI scoring algorithm was used.
  2. Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.03
    0.02696945 = product of:
      0.0539389 = sum of:
        0.0539389 = product of:
          0.1078778 = sum of:
            0.1078778 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
              0.1078778 = score(doc=402,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.61904186 = fieldWeight in 402, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=402)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 22(1986) no.6, S.465-476
  3. Wills, R.S.: Google's PageRank : the math behind the search engine (2006) 0.03
    0.025305996 = product of:
      0.05061199 = sum of:
        0.05061199 = product of:
          0.10122398 = sum of:
            0.10122398 = weight(_text_:homepage in 5954) [ClassicSimilarity], result of:
              0.10122398 = score(doc=5954,freq=2.0), product of:
                0.33761188 = queryWeight, product of:
                  6.784232 = idf(docFreq=135, maxDocs=44218)
                  0.0497642 = queryNorm
                0.29982352 = fieldWeight in 5954, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.784232 = idf(docFreq=135, maxDocs=44218)
                  0.03125 = fieldNorm(doc=5954)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Approximately 91 million American adults use the Internet on a typical day The number-one Internet activity is reading and writing e-mail. Search engine use is next in line and continues to increase in popularity. In fact, survey findings indicate that nearly 60 million American adults use search engines on a given day. Even though there are many Internet search engines, Google, Yahoo!, and MSN receive over 81% of all search requests. Despite claims that the quality of search provided by Yahoo! and MSN now equals that of Google, Google continues to thrive as the search engine of choice, receiving over 46% of all search requests, nearly double the volume of Yahoo! and over four times that of MSN. I use Google's search engine on a daily basis and rarely request information from other search engines. One day, I decided to visit the homepages of Google. Yahoo!, and MSN to compare the quality of search results. Coffee was on my mind that day, so I entered the simple query "coffee" in the search box at each homepage. Table 1 shows the top ten (unsponsored) results returned by each search engine. Although ordered differently, two webpages, www.peets.com and www.coffeegeek.com, appear in all three top ten lists. In addition, each pairing of top ten lists has two additional results in common. Depending on the information I hoped to obtain about coffee by using the search engines, I could argue that any one of the three returned better results: however, I was not looking for a particular webpage, so all three listings of search results seemed of equal quality. Thus, I plan to continue using Google. My decision is indicative of the problem Yahoo!, MSN, and other search engine companies face in the quest to obtain a larger percentage of Internet search volume. Search engine users are loyal to one or a few search engines and are generally happy with search results. Thus, as long as Google continues to provide results deemed high in quality, Google likely will remain the top search engine. But what set Google apart from its competitors in the first place? The answer is PageRank. In this article I explain this simple mathematical algorithm that revolutionized Web search.
  4. Henzinger, M.R.: Link analysis in Web information retrieval (2000) 0.03
    0.025305996 = product of:
      0.05061199 = sum of:
        0.05061199 = product of:
          0.10122398 = sum of:
            0.10122398 = weight(_text_:homepage in 801) [ClassicSimilarity], result of:
              0.10122398 = score(doc=801,freq=2.0), product of:
                0.33761188 = queryWeight, product of:
                  6.784232 = idf(docFreq=135, maxDocs=44218)
                  0.0497642 = queryNorm
                0.29982352 = fieldWeight in 801, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.784232 = idf(docFreq=135, maxDocs=44218)
                  0.03125 = fieldNorm(doc=801)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Content
    The goal of information retrieval is to find all documents relevant for a user query in a collection of documents. Decades of research in information retrieval were successful in developing and refining techniques that are solely word-based (see e.g., [2]). With the advent of the web new sources of information became available, one of them being the hyperlinks between documents and records of user behavior. To be precise, hypertexts (i.e., collections of documents connected by hyperlinks) have existed and have been studied for a long time. What was new was the large number of hyperlinks created by independent individuals. Hyperlinks provide a valuable source of information for web information retrieval as we will show in this article. This area of information retrieval is commonly called link analysis. Why would one expect hyperlinks to be useful? Ahyperlink is a reference of a web page B that is contained in a web page A. When the hyperlink is clicked on in a web browser, the browser displays page B. This functionality alone is not helpful for web information retrieval. However, the way hyperlinks are typically used by authors of web pages can give them valuable information content. Typically, authors create links because they think they will be useful for the readers of the pages. Thus, links are usually either navigational aids that, for example, bring the reader back to the homepage of the site, or links that point to pages whose content augments the content of the current page. The second kind of links tend to point to high-quality pages that might be on the same topic as the page containing the link.
  5. Smeaton, A.F.; Rijsbergen, C.J. van: ¬The retrieval effects of query expansion on a feedback document retrieval system (1983) 0.02
    0.023598269 = product of:
      0.047196537 = sum of:
        0.047196537 = product of:
          0.094393075 = sum of:
            0.094393075 = weight(_text_:22 in 2134) [ClassicSimilarity], result of:
              0.094393075 = score(doc=2134,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.5416616 = fieldWeight in 2134, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=2134)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    30. 3.2001 13:32:22
  6. Back, J.: ¬An evaluation of relevancy ranking techniques used by Internet search engines (2000) 0.02
    0.023598269 = product of:
      0.047196537 = sum of:
        0.047196537 = product of:
          0.094393075 = sum of:
            0.094393075 = weight(_text_:22 in 3445) [ClassicSimilarity], result of:
              0.094393075 = score(doc=3445,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.5416616 = fieldWeight in 3445, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=3445)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    25. 8.2005 17:42:22
  7. Fuhr, N.: Ranking-Experimente mit gewichteter Indexierung (1986) 0.02
    0.020227086 = product of:
      0.04045417 = sum of:
        0.04045417 = product of:
          0.08090834 = sum of:
            0.08090834 = weight(_text_:22 in 58) [ClassicSimilarity], result of:
              0.08090834 = score(doc=58,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.46428138 = fieldWeight in 58, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=58)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    14. 6.2015 22:12:44
  8. Fuhr, N.: Rankingexperimente mit gewichteter Indexierung (1986) 0.02
    0.020227086 = product of:
      0.04045417 = sum of:
        0.04045417 = product of:
          0.08090834 = sum of:
            0.08090834 = weight(_text_:22 in 2051) [ClassicSimilarity], result of:
              0.08090834 = score(doc=2051,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.46428138 = fieldWeight in 2051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=2051)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    14. 6.2015 22:12:56
  9. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing for passage retrieval (2004) 0.01
    0.013484725 = product of:
      0.02696945 = sum of:
        0.02696945 = product of:
          0.0539389 = sum of:
            0.0539389 = weight(_text_:22 in 5108) [ClassicSimilarity], result of:
              0.0539389 = score(doc=5108,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.30952093 = fieldWeight in 5108, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5108)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    20. 1.2007 18:30:22
  10. Faloutsos, C.: Signature files (1992) 0.01
    0.013484725 = product of:
      0.02696945 = sum of:
        0.02696945 = product of:
          0.0539389 = sum of:
            0.0539389 = weight(_text_:22 in 3499) [ClassicSimilarity], result of:
              0.0539389 = score(doc=3499,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.30952093 = fieldWeight in 3499, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3499)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    7. 5.1999 15:22:48
  11. Losada, D.E.; Barreiro, A.: Emebedding term similarity and inverse document frequency into a logical model of information retrieval (2003) 0.01
    0.013484725 = product of:
      0.02696945 = sum of:
        0.02696945 = product of:
          0.0539389 = sum of:
            0.0539389 = weight(_text_:22 in 1422) [ClassicSimilarity], result of:
              0.0539389 = score(doc=1422,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.30952093 = fieldWeight in 1422, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1422)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2003 19:27:23
  12. Bornmann, L.; Mutz, R.: From P100 to P100' : a new citation-rank approach (2014) 0.01
    0.013484725 = product of:
      0.02696945 = sum of:
        0.02696945 = product of:
          0.0539389 = sum of:
            0.0539389 = weight(_text_:22 in 1431) [ClassicSimilarity], result of:
              0.0539389 = score(doc=1431,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.30952093 = fieldWeight in 1431, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1431)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 8.2014 17:05:18
  13. Tober, M.; Hennig, L.; Furch, D.: SEO Ranking-Faktoren und Rang-Korrelationen 2014 : Google Deutschland (2014) 0.01
    0.013484725 = product of:
      0.02696945 = sum of:
        0.02696945 = product of:
          0.0539389 = sum of:
            0.0539389 = weight(_text_:22 in 1484) [ClassicSimilarity], result of:
              0.0539389 = score(doc=1484,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.30952093 = fieldWeight in 1484, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1484)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    13. 9.2014 14:45:22
  14. Ravana, S.D.; Rajagopal, P.; Balakrishnan, V.: Ranking retrieval systems using pseudo relevance judgments (2015) 0.01
    0.011918926 = product of:
      0.023837851 = sum of:
        0.023837851 = product of:
          0.047675703 = sum of:
            0.047675703 = weight(_text_:22 in 2591) [ClassicSimilarity], result of:
              0.047675703 = score(doc=2591,freq=4.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.27358043 = fieldWeight in 2591, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2591)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    20. 1.2015 18:30:22
    18. 9.2018 18:22:56
  15. Chang, C.-H.; Hsu, C.-C.: Integrating query expansion and conceptual relevance feedback for personalized Web information retrieval (1998) 0.01
    0.011799134 = product of:
      0.023598269 = sum of:
        0.023598269 = product of:
          0.047196537 = sum of:
            0.047196537 = weight(_text_:22 in 1319) [ClassicSimilarity], result of:
              0.047196537 = score(doc=1319,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.2708308 = fieldWeight in 1319, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1319)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 8.1996 22:08:06
  16. Kanaeva, Z.: Ranking: Google und CiteSeer (2005) 0.01
    0.011799134 = product of:
      0.023598269 = sum of:
        0.023598269 = product of:
          0.047196537 = sum of:
            0.047196537 = weight(_text_:22 in 3276) [ClassicSimilarity], result of:
              0.047196537 = score(doc=3276,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.2708308 = fieldWeight in 3276, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3276)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    20. 3.2005 16:23:22
  17. Joss, M.W.; Wszola, S.: ¬The engines that can : text search and retrieval software, their strategies, and vendors (1996) 0.01
    0.010113543 = product of:
      0.020227086 = sum of:
        0.020227086 = product of:
          0.04045417 = sum of:
            0.04045417 = weight(_text_:22 in 5123) [ClassicSimilarity], result of:
              0.04045417 = score(doc=5123,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.23214069 = fieldWeight in 5123, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5123)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    12. 9.1996 13:56:22
  18. Kelledy, F.; Smeaton, A.F.: Signature files and beyond (1996) 0.01
    0.010113543 = product of:
      0.020227086 = sum of:
        0.020227086 = product of:
          0.04045417 = sum of:
            0.04045417 = weight(_text_:22 in 6973) [ClassicSimilarity], result of:
              0.04045417 = score(doc=6973,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.23214069 = fieldWeight in 6973, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=6973)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information retrieval: new systems and current research. Proceedings of the 16th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Drymen, Scotland, 22-23 Mar 94. Ed.: R. Leon
  19. Crestani, F.; Dominich, S.; Lalmas, M.; Rijsbergen, C.J.K. van: Mathematical, logical, and formal methods in information retrieval : an introduction to the special issue (2003) 0.01
    0.010113543 = product of:
      0.020227086 = sum of:
        0.020227086 = product of:
          0.04045417 = sum of:
            0.04045417 = weight(_text_:22 in 1451) [ClassicSimilarity], result of:
              0.04045417 = score(doc=1451,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.23214069 = fieldWeight in 1451, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1451)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2003 19:27:36
  20. Fan, W.; Fox, E.A.; Pathak, P.; Wu, H.: ¬The effects of fitness functions an genetic programming-based ranking discovery for Web search (2004) 0.01
    0.010113543 = product of:
      0.020227086 = sum of:
        0.020227086 = product of:
          0.04045417 = sum of:
            0.04045417 = weight(_text_:22 in 2239) [ClassicSimilarity], result of:
              0.04045417 = score(doc=2239,freq=2.0), product of:
                0.17426576 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0497642 = queryNorm
                0.23214069 = fieldWeight in 2239, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2239)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    31. 5.2004 19:22:06