Search (657 results, page 1 of 33)

Li, L.; Shang, Y.; Zhang, W.: Improvement of HITS-based algorithms on Web documents 0.17

0.16532192 = product of:
  0.38575113 = sum of:
    0.06973943 = product of:
      0.20921828 = sum of:
        0.20921828 = weight(_text_:3a in 2514) [ClassicSimilarity], result of:
          0.20921828 = score(doc=2514,freq=2.0), product of:
            0.37226257 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.043909185 = queryNorm
            0.56201804 = fieldWeight in 2514, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=2514)
      0.33333334 = coord(1/3)
    0.020132389 = weight(_text_:of in 2514) [ClassicSimilarity], result of:
      0.020132389 = score(doc=2514,freq=16.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.2932045 = fieldWeight in 2514, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2514)
    0.2958793 = weight(_text_:2f in 2514) [ClassicSimilarity], result of:
      0.2958793 = score(doc=2514,freq=4.0), product of:
        0.37226257 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.043909185 = queryNorm
        0.7948135 = fieldWeight in 2514, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=2514)
  0.42857143 = coord(3/7)

Abstract: In this paper, we present two ways to improve the precision of HITS-based algorithms onWeb documents. First, by analyzing the limitations of current HITS-based algorithms, we propose a new weighted HITS-based method that assigns appropriate weights to in-links of root documents. Then, we combine content analysis with HITS-based algorithms and study the effects of four representative relevance scoring methods, VSM, Okapi, TLS, and CDR, using a set of broad topic queries. Our experimental results show that our weighted HITS-based method performs significantly better than Bharat's improved HITS algorithm. When we combine our weighted HITS-based method or Bharat's HITS algorithm with any of the four relevance scoring methods, the combined methods are only marginally better than our weighted HITS-based method. Between the four relevance scoring methods, there is no significant quality difference when they are combined with a HITS-based algorithm.
Content: Vgl.: http%3A%2F%2Fdelab.csd.auth.gr%2F~dimitris%2Fcourses%2Fir_spring06%2Fpage_rank_computing%2Fp527-li.pdf. Vgl. auch: http://www2002.org/CDROM/refereed/643/.
Source: WWW '02: Proceedings of the 11th International Conference on World Wide Web, May 7-11, 2002, Honolulu, Hawaii, USA

Sachse, J.: ¬The influence of snippet length on user behavior in mobile web search (2019) 0.06

0.0584074 = product of:
  0.13628393 = sum of:
    0.018757246 = weight(_text_:of in 5493) [ClassicSimilarity], result of:
      0.018757246 = score(doc=5493,freq=20.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.27317715 = fieldWeight in 5493, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5493)
    0.102653965 = weight(_text_:distribution in 5493) [ClassicSimilarity], result of:
      0.102653965 = score(doc=5493,freq=4.0), product of:
        0.24019864 = queryWeight, product of:
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.043909185 = queryNorm
        0.42737114 = fieldWeight in 5493, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5493)
    0.014872721 = product of:
      0.029745443 = sum of:
        0.029745443 = weight(_text_:22 in 5493) [ClassicSimilarity], result of:
          0.029745443 = score(doc=5493,freq=2.0), product of:
            0.15376249 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043909185 = queryNorm
            0.19345059 = fieldWeight in 5493, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5493)
      0.5 = coord(1/2)
  0.42857143 = coord(3/7)

Abstract: Purpose Web search is more and more moving into mobile contexts. However, screen size of mobile devices is limited and search engine result pages face a trade-off between offering informative snippets and optimal use of space. One factor clearly influencing this trade-off is snippet length. The purpose of this paper is to find out what snippet size to use in mobile web search. Design/methodology/approach For this purpose, an eye-tracking experiment was conducted showing participants search interfaces with snippets of one, three or five lines on a mobile device to analyze 17 dependent variables. In total, 31 participants took part in the study. Each of the participants solved informational and navigational tasks. Findings Results indicate a strong influence of page fold on scrolling behavior and attention distribution across search results. Regardless of query type, short snippets seem to provide too little information about the result, so that search performance and subjective measures are negatively affected. Long snippets of five lines lead to better performance than medium snippets for navigational queries, but to worse performance for informational queries. Originality/value Although space in mobile search is limited, this study shows that longer snippets improve usability and user experience. It further emphasizes that page fold plays a stronger role in mobile than in desktop search for attention distribution.
Date: 20. 1.2015 18:30:22
Source: Aslib journal of information management. 71(2019) no.3, S.325-343

Mowshowitz, A.; Kawaguchi, A.: Assessing bias in search engines (2002) 0.05

0.049850646 = product of:
  0.17447725 = sum of:
    0.023607321 = weight(_text_:of in 2574) [ClassicSimilarity], result of:
      0.023607321 = score(doc=2574,freq=22.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.34381276 = fieldWeight in 2574, product of:
          4.690416 = tf(freq=22.0), with freq of:
            22.0 = termFreq=22.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2574)
    0.15086992 = weight(_text_:distribution in 2574) [ClassicSimilarity], result of:
      0.15086992 = score(doc=2574,freq=6.0), product of:
        0.24019864 = queryWeight, product of:
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.043909185 = queryNorm
        0.6281048 = fieldWeight in 2574, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.046875 = fieldNorm(doc=2574)
  0.2857143 = coord(2/7)

Abstract: This paper deals with the measurement of bias in search engines on the World Wide Web. Bias is taken to mean the balance and representativeness of items in a collection retrieved from a database for a set of queries. This calls for assessing the degree to which the distribution of items in a collection deviates from the ideal. Ascertaining this ideal poses problems similar to those associated with determining relevance in the measurement of recall and precision. Instead of enlisting subject experts or users to determine such an ideal, a family of comparable search engines is used to approximate it for a set of queries. The distribution is obtained by computing the frequencies of occurrence of the uniform resource locators (URLs) in the collection retrieved by several search engines for the given queries. Bias is assessed by measuring the deviation from the ideal of the distribution produced by a particular search engine.

Mowshowitz, A.; Kawaguchi, A.: Measuring search engine bias (2005) 0.05

0.04953675 = product of:
  0.17337862 = sum of:
    0.022508696 = weight(_text_:of in 1045) [ClassicSimilarity], result of:
      0.022508696 = score(doc=1045,freq=20.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.32781258 = fieldWeight in 1045, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=1045)
    0.15086992 = weight(_text_:distribution in 1045) [ClassicSimilarity], result of:
      0.15086992 = score(doc=1045,freq=6.0), product of:
        0.24019864 = queryWeight, product of:
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.043909185 = queryNorm
        0.6281048 = fieldWeight in 1045, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.046875 = fieldNorm(doc=1045)
  0.2857143 = coord(2/7)

Abstract: This paper examines a real-time measure of bias in Web search engines. The measure captures the degree to which the distribution of URLs, retrieved in response to a query, deviates from an ideal or fair distribution for that query. This ideal is approximated by the distribution produced by a collection of search engines. Differences between bias and classical retrieval measures are highlighted by examining the possibilities for bias in four extreme cases of recall and precision. The results of experiments examining the influence on bias measurement of subject domains, search engines, and search terms are presented. Three general conclusions are drawn: (1) the performance of search engines can be distinguished with the aid of the bias measure; (2) bias values depend on the subject matter under consideration; (3) choice of search terms does not account for much of the variance in bias values. These conclusions underscore the need to develop "bias profiles" for search engines.

Loia, V.; Pedrycz, W.; Senatore, S.; Sessa, M.I.: Web navigation support by means of proximity-driven assistant agents (2006) 0.05

0.04628896 = product of:
  0.10800757 = sum of:
    0.020547535 = weight(_text_:of in 5283) [ClassicSimilarity], result of:
      0.020547535 = score(doc=5283,freq=24.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.2992506 = fieldWeight in 5283, product of:
          4.8989797 = tf(freq=24.0), with freq of:
            24.0 = termFreq=24.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5283)
    0.07258732 = weight(_text_:distribution in 5283) [ClassicSimilarity], result of:
      0.07258732 = score(doc=5283,freq=2.0), product of:
        0.24019864 = queryWeight, product of:
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.043909185 = queryNorm
        0.30219704 = fieldWeight in 5283, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5283)
    0.014872721 = product of:
      0.029745443 = sum of:
        0.029745443 = weight(_text_:22 in 5283) [ClassicSimilarity], result of:
          0.029745443 = score(doc=5283,freq=2.0), product of:
            0.15376249 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043909185 = queryNorm
            0.19345059 = fieldWeight in 5283, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5283)
      0.5 = coord(1/2)
  0.42857143 = coord(3/7)

Abstract: The explosive growth of the Web and the consequent exigency of the Web personalization domain have gained a key position in the direction of customization of the Web information to the needs of specific users, taking advantage of the knowledge acquired from the analysis of the user's navigational behavior (usage data) in correlation with other information collected in the Web context, namely, structure, content, and user profile data. This work presents an agent-based framework designed to help a user in achieving personalized navigation, by recommending related documents according to the user's responses in similar-pages searching mode. Our agent-based approach is grounded in the integration of different techniques and methodologies into a unique platform featuring user profiling, fuzzy multisets, proximity-oriented fuzzy clustering, and knowledge-based discovery technologies. Each of these methodologies serves to solve one facet of the general problem (discovering documents relevant to the user by searching the Web) and is treated by specialized agents that ultimately achieve the final functionality through cooperation and task distribution.
Date: 22. 7.2006 16:59:13
Source: Journal of the American Society for Information Science and Technology. 57(2006) no.4, S.515-527

Price, A.: NOVAGate : a Nordic gateway to electronic resources in the forestry, veterinary and agricultural sciences (2000) 0.04

0.041834794 = product of:
  0.14642178 = sum of:
    0.016608374 = weight(_text_:of in 4874) [ClassicSimilarity], result of:
      0.016608374 = score(doc=4874,freq=8.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.24188137 = fieldWeight in 4874, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4874)
    0.1298134 = sum of:
      0.08816978 = weight(_text_:service in 4874) [ClassicSimilarity], result of:
        0.08816978 = score(doc=4874,freq=4.0), product of:
          0.18813887 = queryWeight, product of:
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.043909185 = queryNorm
          0.46864203 = fieldWeight in 4874, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.0546875 = fieldNorm(doc=4874)
      0.04164362 = weight(_text_:22 in 4874) [ClassicSimilarity], result of:
        0.04164362 = score(doc=4874,freq=2.0), product of:
          0.15376249 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.043909185 = queryNorm
          0.2708308 = fieldWeight in 4874, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=4874)
  0.2857143 = coord(2/7)

Abstract: NOVAGate is a subject-based information gateway covering electronic resources in the agricultural, veterinary and related fields. The service, which opened in July 1998, is produced by the veterinary and agricultural libraries of the 5 Nordic countries - Denmark, Finland, Iceland, Norway and Sweden - which serve the NOVA University. The gateway covers Nordic and European resources as well as the resources of international organizations, but being planned is a network of subject gateways which will give access to a wide range of international quality resources within the agricultural, veterinary and related fields. The service uses the ROADS software
Date: 22. 6.2002 19:41:00

Peereboom, M.: DutchESS : Dutch Electronic Subject Service - a Dutch national collaborative effort (2000) 0.04

0.040018875 = product of:
  0.14006606 = sum of:
    0.021221403 = weight(_text_:of in 4869) [ClassicSimilarity], result of:
      0.021221403 = score(doc=4869,freq=10.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.3090647 = fieldWeight in 4869, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=4869)
    0.11884465 = sum of:
      0.07125194 = weight(_text_:service in 4869) [ClassicSimilarity], result of:
        0.07125194 = score(doc=4869,freq=2.0), product of:
          0.18813887 = queryWeight, product of:
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.043909185 = queryNorm
          0.37871996 = fieldWeight in 4869, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.0625 = fieldNorm(doc=4869)
      0.047592707 = weight(_text_:22 in 4869) [ClassicSimilarity], result of:
        0.047592707 = score(doc=4869,freq=2.0), product of:
          0.15376249 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.043909185 = queryNorm
          0.30952093 = fieldWeight in 4869, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=4869)
  0.2857143 = coord(2/7)

Abstract: This article gives an overview of the design and organisation of DutchESS, a Dutch information subject gateway created as a national collaborative effort of the National Library and a number of academic libraries. The combined centralised and distributed model of DutchESS is discussed, as well as its selection policy, its metadata format, classification scheme and retrieval options. Also some options for future collaboration on an international level are explored
Date: 22. 6.2002 19:39:23

Campbell, D.: Australian subject gateways : political and strategic issues (2000) 0.04

0.03937876 = product of:
  0.13782565 = sum of:
    0.018981 = weight(_text_:of in 4875) [ClassicSimilarity], result of:
      0.018981 = score(doc=4875,freq=8.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.27643585 = fieldWeight in 4875, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=4875)
    0.11884465 = sum of:
      0.07125194 = weight(_text_:service in 4875) [ClassicSimilarity], result of:
        0.07125194 = score(doc=4875,freq=2.0), product of:
          0.18813887 = queryWeight, product of:
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.043909185 = queryNorm
          0.37871996 = fieldWeight in 4875, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.0625 = fieldNorm(doc=4875)
      0.047592707 = weight(_text_:22 in 4875) [ClassicSimilarity], result of:
        0.047592707 = score(doc=4875,freq=2.0), product of:
          0.15376249 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.043909185 = queryNorm
          0.30952093 = fieldWeight in 4875, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=4875)
  0.2857143 = coord(2/7)

Abstract: The key political and strategic issues which needs to be addressed for the future development of the Australian subject gateways are: continued quality of content creation, integration of access to print and electronic resources, archiving and persistent identification, sustainability of services and service integration. These issues will be more effectively tackled internationally, and the Australian subject gateways are keen to work with international collaborators to achieve a mutually beneficial outcome
Date: 22. 6.2002 19:41:16

Ardo, A.; Lundberg, S.: ¬A regional distributed WWW search and indexing service : the DESIRE way (1998) 0.04

0.03867754 = product of:
  0.13537139 = sum of:
    0.007117875 = weight(_text_:of in 4190) [ClassicSimilarity], result of:
      0.007117875 = score(doc=4190,freq=2.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.103663445 = fieldWeight in 4190, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=4190)
    0.12825352 = sum of:
      0.092558995 = weight(_text_:service in 4190) [ClassicSimilarity], result of:
        0.092558995 = score(doc=4190,freq=6.0), product of:
          0.18813887 = queryWeight, product of:
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.043909185 = queryNorm
          0.49197167 = fieldWeight in 4190, product of:
            2.4494898 = tf(freq=6.0), with freq of:
              6.0 = termFreq=6.0
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.046875 = fieldNorm(doc=4190)
      0.035694532 = weight(_text_:22 in 4190) [ClassicSimilarity], result of:
        0.035694532 = score(doc=4190,freq=2.0), product of:
          0.15376249 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.043909185 = queryNorm
          0.23214069 = fieldWeight in 4190, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=4190)
  0.2857143 = coord(2/7)

Abstract: Creates an open, metadata aware system for distributed, collaborative WWW indexing. The system has 3 main components: a harvester (for collecting information), a database (for making the collection searchable), and a user interface (for making the information available). all components can be distributed across networked computers, thus supporting scalability. The system is metadata aware and thus allows searches on several fields including title, document author and URL. Nordic Web Index (NWI) is an application using this system to create a regional Nordic Web-indexing service. NWI is built using 5 collaborating service points within the Nordic countries. The NWI databases can be used to build additional services
Date: 1. 8.1996 22:08:06
Footnote: Contribution to a special issue devoted to the Proceedings of the 7th International World Wide Web Conference, held 14-18 April 1998, Brisbane, Australia

Dempsey, L.: ¬The subject gateway : experiences and issues based on the emergence of the Resource Discovery Network (2000) 0.04

0.038652197 = product of:
  0.13528268 = sum of:
    0.016438028 = weight(_text_:of in 628) [ClassicSimilarity], result of:
      0.016438028 = score(doc=628,freq=6.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.23940048 = fieldWeight in 628, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=628)
    0.11884465 = sum of:
      0.07125194 = weight(_text_:service in 628) [ClassicSimilarity], result of:
        0.07125194 = score(doc=628,freq=2.0), product of:
          0.18813887 = queryWeight, product of:
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.043909185 = queryNorm
          0.37871996 = fieldWeight in 628, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.0625 = fieldNorm(doc=628)
      0.047592707 = weight(_text_:22 in 628) [ClassicSimilarity], result of:
        0.047592707 = score(doc=628,freq=2.0), product of:
          0.15376249 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.043909185 = queryNorm
          0.30952093 = fieldWeight in 628, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=628)
  0.2857143 = coord(2/7)

Abstract: Charts the history and development of the UK's Resource Discovery Network, which brings together under a common business, technical and service framework a range of subject gateways and other services for the academic and research community. Considers its future relationship to other services, and position within the information ecology
Date: 22. 6.2002 19:36:13

Kamvar, S.; Haveliwala, T.; Golub, G.: Adaptive methods for the computation of PageRank (2003) 0.03

0.03434028 = product of:
  0.12019098 = sum of:
    0.018568728 = weight(_text_:of in 2560) [ClassicSimilarity], result of:
      0.018568728 = score(doc=2560,freq=10.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.2704316 = fieldWeight in 2560, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2560)
    0.101622246 = weight(_text_:distribution in 2560) [ClassicSimilarity], result of:
      0.101622246 = score(doc=2560,freq=2.0), product of:
        0.24019864 = queryWeight, product of:
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.043909185 = queryNorm
        0.42307585 = fieldWeight in 2560, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2560)
  0.2857143 = coord(2/7)

Abstract: We observe that the convergence patterns of pages in the PageRank algorithm have a nonuniform distribution. Specifically, many pages converge to their true PageRank quickly, while relatively few pages take a much longer time to converge. Furthermore, we observe that these slow-converging pages are generally those pages with high PageRank.We use this observation to devise a simple algorithm to speed up the computation of PageRank, in which the PageRank of pages that have converged are not recomputed at each iteration after convergence. This algorithm, which we call Adaptive PageRank, speeds up the computation of PageRank by nearly 30%.

Wang, P.; Berry, M.W.; Yang, Y.: Mining longitudinal Web queries : trends and patterns (2003) 0.03

0.03378018 = product of:
  0.11823062 = sum of:
    0.016608374 = weight(_text_:of in 6561) [ClassicSimilarity], result of:
      0.016608374 = score(doc=6561,freq=8.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.24188137 = fieldWeight in 6561, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6561)
    0.101622246 = weight(_text_:distribution in 6561) [ClassicSimilarity], result of:
      0.101622246 = score(doc=6561,freq=2.0), product of:
        0.24019864 = queryWeight, product of:
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.043909185 = queryNorm
        0.42307585 = fieldWeight in 6561, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6561)
  0.2857143 = coord(2/7)

Abstract: This project analyzed 541,920 user queries submitted to and executed in an academic Website during a four-year period (May 1997 to May 2001) using a relational database. The purpose of the study is three-fold: (1) to understand Web users' query behavior; (2) to identify problems encountered by these Web users; (3) to develop appropriate techniques for optimization of query analysis and mining. The linguistic analyses focus an query structures, lexicon, and word associations using statistical measures such as Zipf distribution and mutual information. A data model with finest granularity is used for data storage and iterative analyses. Patterns and trends of querying behavior are identified and compared with previous studies.
Source: Journal of the American Society for Information Science and technology. 54(2003) no.8, S.743-758

Lawrence, S.; Giles, C.L.: Accessibility and distribution of information on the Web (1999) 0.03
```
0.03249641 = product of:
  0.113737434 = sum of:
    0.026632648 = weight(_text_:of in 4952) [ClassicSimilarity], result of:
      0.026632648 = score(doc=4952,freq=28.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.38787308 = fieldWeight in 4952, product of:
          5.2915025 = tf(freq=28.0), with freq of:
            28.0 = termFreq=28.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=4952)
    0.08710478 = weight(_text_:distribution in 4952) [ClassicSimilarity], result of:
      0.08710478 = score(doc=4952,freq=2.0), product of:
        0.24019864 = queryWeight, product of:
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.043909185 = queryNorm
        0.36263645 = fieldWeight in 4952, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.046875 = fieldNorm(doc=4952)
  0.2857143 = coord(2/7)
```
Abstract

Search engine coverage relative to the estimated size of the publicly indexable web has decreased substantially since December 97, with no engine indexing more than about 16% of the estimated size of the publicly indexable web. (Note that many queries can be satisfied with a relatively small database). Search engines are typically more likely to index sites that have more links to them (more 'popular' sites). They are also typically more likely to index US sites than non-US sites (AltaVista is an exception), and more likely to index commercial sites than educational sites. Indexing of new or modified pages byjust one of the major search engines can take months. 83% of sites contain commercial content and 6% contain scientific or educational content. Only 1.5% of sites contain pornographic content. The publicly indexable web contains an estimated 800 million pages as of February 1999, encompassing about 15 terabytes of information or about 6 terabytes of text after removing HTML tags, comments, and extra whitespace. The simple HTML "keywords" and "description" metatags are only used on the homepages of 34% of sites. Only 0.3% of sites use the Dublin Core metadata standard.
Waller, V.: Not just information : who searches for what on the search engine Google? (2011) 0.03
```
0.032219615 = product of:
  0.11276864 = sum of:
    0.025663862 = weight(_text_:of in 4373) [ClassicSimilarity], result of:
      0.025663862 = score(doc=4373,freq=26.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.37376386 = fieldWeight in 4373, product of:
          5.0990195 = tf(freq=26.0), with freq of:
            26.0 = termFreq=26.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=4373)
    0.08710478 = weight(_text_:distribution in 4373) [ClassicSimilarity], result of:
      0.08710478 = score(doc=4373,freq=2.0), product of:
        0.24019864 = queryWeight, product of:
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.043909185 = queryNorm
        0.36263645 = fieldWeight in 4373, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.046875 = fieldNorm(doc=4373)
  0.2857143 = coord(2/7)
```
Abstract

This paper reports on a transaction log analysis of the type and topic of search queries entered into the search engine Google (Australia). Two aspects, in particular, set this apart from previous studies: the sampling and analysis take account of the distribution of search queries, and lifestyle information of the searcher was matched with each search query. A surprising finding was that there was no observed statistically significant difference in search type or topics for different segments of the online population. It was found that queries about popular culture and Ecommerce accounted for almost half of all search engine queries and that half of the queries were entered with a particular Website in mind. The findings of this study also suggest that the Internet search engine is not only an interface to information or a shortcut to Websites, it is equally a site of leisure. This study has implications for the design and evaluation of search engines as well as our understanding of search engine use.

Source

Journal of the American Society for Information Science and Technology. 62(2011) no.4, S.761-775
Dominich, S.; Skrop, A.: PageRank and interaction information retrieval (2005) 0.03
```
0.030639194 = product of:
  0.107237175 = sum of:
    0.020132389 = weight(_text_:of in 3268) [ClassicSimilarity], result of:
      0.020132389 = score(doc=3268,freq=16.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.2932045 = fieldWeight in 3268, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=3268)
    0.08710478 = weight(_text_:distribution in 3268) [ClassicSimilarity], result of:
      0.08710478 = score(doc=3268,freq=2.0), product of:
        0.24019864 = queryWeight, product of:
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.043909185 = queryNorm
        0.36263645 = fieldWeight in 3268, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.046875 = fieldNorm(doc=3268)
  0.2857143 = coord(2/7)
```
Abstract

The PageRank method is used by the Google Web search engine to compute the importance of Web pages. Two different views have been developed for the Interpretation of the PageRank method and values: (a) stochastic (random surfer): the PageRank values can be conceived as the steady-state distribution of a Markov chain, and (b) algebraic: the PageRank values form the eigenvector corresponding to eigenvalue 1 of the Web link matrix. The Interaction Information Retrieval (1**2 R) method is a nonclassical information retrieval paradigm, which represents a connectionist approach based an dynamic systems. In the present paper, a different Interpretation of PageRank is proposed, namely, a dynamic systems viewpoint, by showing that the PageRank method can be formally interpreted as a particular case of the Interaction Information Retrieval method; and thus, the PageRank values may be interpreted as neutral equilibrium points of the Web.

Source

Journal of the American Society for Information Science and Technology. 56(2005) no.1, S.63-69

Duval, B.K.; Main, L.: Searching the Internet : part 2 trail-blazers (1997) 0.03

0.030448187 = product of:
  0.10656865 = sum of:
    0.01743516 = weight(_text_:of in 858) [ClassicSimilarity], result of:
      0.01743516 = score(doc=858,freq=12.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.25392252 = fieldWeight in 858, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=858)
    0.08913349 = sum of:
      0.05343896 = weight(_text_:service in 858) [ClassicSimilarity], result of:
        0.05343896 = score(doc=858,freq=2.0), product of:
          0.18813887 = queryWeight, product of:
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.043909185 = queryNorm
          0.28403997 = fieldWeight in 858, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.284727 = idf(docFreq=1655, maxDocs=44218)
            0.046875 = fieldNorm(doc=858)
      0.035694532 = weight(_text_:22 in 858) [ClassicSimilarity], result of:
        0.035694532 = score(doc=858,freq=2.0), product of:
          0.15376249 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.043909185 = queryNorm
          0.23214069 = fieldWeight in 858, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=858)
  0.2857143 = coord(2/7)

Abstract: Presents a guide to searching for information on the Internet covering Research-It; familiar quotations: a collection of passages, phrases and proverbs traced to their sources in ancient and modern literature by John Bartlett; the Internet Public Library Reference Center; SearchERIC Database; Britannica Online; Britannica's Lives; The complete works of William Shakespeare; Flicks/Movie Schedules and Reviews; the Electronic Newsstand; CNN Interactive; Time Warner's Pathfinder; Electronic Newspapers from all 50 States; Yahoo, News; Newspapers; Techweb; ZDNet; the On-line Books Page; Columbia University Bartleby Library; the Children's Literature Web Guide; National Institutes of Health; US Census Bureau; Earthquake Info; US Postal Service Zip+4 Lookup; the Federal Web Locator; World Wide Web Virtual Library; US Government Information Sources; Index of the Constitution of the US; US States Code; Find California Code; Dearch for Bills; California Tenant's Rights; The Online Career Center; QuickAID Home Page; City.Net; Netscape's Destinations Button; International Telephone Directory; World Alumni Net; Archives of Adoptees and Birth Parents; and World Wide Registry Matching Adoptees with Birth Parents
Date: 6. 3.1997 16:22:15

Golderman, G.M.; Connolly, B.: Between the book covers : going beyond OPAC keyword searching with the deep linking capabilities of Google Scholar and Google Book Search (2004/05) 0.03

0.030147359 = product of:
  0.07034384 = sum of:
    0.017794685 = weight(_text_:of in 731) [ClassicSimilarity], result of:
      0.017794685 = score(doc=731,freq=18.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.25915858 = fieldWeight in 731, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=731)
    0.03767643 = weight(_text_:cataloging in 731) [ClassicSimilarity], result of:
      0.03767643 = score(doc=731,freq=2.0), product of:
        0.17305137 = queryWeight, product of:
          3.9411201 = idf(docFreq=2334, maxDocs=44218)
          0.043909185 = queryNorm
        0.21771818 = fieldWeight in 731, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9411201 = idf(docFreq=2334, maxDocs=44218)
          0.0390625 = fieldNorm(doc=731)
    0.014872721 = product of:
      0.029745443 = sum of:
        0.029745443 = weight(_text_:22 in 731) [ClassicSimilarity], result of:
          0.029745443 = score(doc=731,freq=2.0), product of:
            0.15376249 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043909185 = queryNorm
            0.19345059 = fieldWeight in 731, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=731)
      0.5 = coord(1/2)
  0.42857143 = coord(3/7)

Abstract: One finding of the 2006 OCLC study of College Students' Perceptions of Libraries and Information Resources was that students expressed equal levels of trust in libraries and search engines when it came to meeting their information needs in a way that they felt was authoritative. Seeking to incorporate this insight into our own instructional methodology, Schaffer Library at Union College has attempted to engineer a shift from Google to Google Scholar among our student users by representing Scholar as a viable adjunct to the catalog and to snore traditional electronic resources. By attempting to engage student researchers on their own terms, we have discovered that most of them react enthusiastically to the revelation that the Google they think they know so well is, it turns out, a multifaceted resource that is capable of delivering the sort of scholarly information that will meet with their professors' approval. Specifically, this article focuses on the fact that many Google Scholar searches link hack to our own Web catalog where they identify useful book titles that direct OPAC keyword searches have missed.
Date: 2.12.2007 19:39:22
Source: Journal of Internet cataloging. 7(2004/05) nos.3/4, S.16-24

Raeder, A.: Finding Web sites (1995) 0.03

0.029929973 = product of:
  0.1047549 = sum of:
    0.016438028 = weight(_text_:of in 2230) [ClassicSimilarity], result of:
      0.016438028 = score(doc=2230,freq=6.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.23940048 = fieldWeight in 2230, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=2230)
    0.08831687 = weight(_text_:congress in 2230) [ClassicSimilarity], result of:
      0.08831687 = score(doc=2230,freq=2.0), product of:
        0.20946044 = queryWeight, product of:
          4.7703104 = idf(docFreq=1018, maxDocs=44218)
          0.043909185 = queryNorm
        0.42163986 = fieldWeight in 2230, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.7703104 = idf(docFreq=1018, maxDocs=44218)
          0.0625 = fieldNorm(doc=2230)
  0.2857143 = coord(2/7)

Abstract: WWW sites provide graphical hyperlinked views of Internet information. Reviews selected sites that offer access to the Internet. Discusses the services offered by O'Reilly and Associates Inc Whole Internet Guide; Webcrawler from Washington University; Yahoo's Guide to WWW; Library of Congress' Global Electronic Library; The Internet Scout Report; Commerce Net; Commercial Yellow pages; the Virtual Tourist; Geographic Directory of WWW servers; and the Hot, Hot List

Bladow, N.; Dorey, C.; Frederickson, L.; Grover, P.; Knudtson, Y.; Krishnamurthy, S.; Lazarou, V.: What's the Buzz about? : An empirical examination of Search on Yahoo! (2005) 0.03
```
0.029434524 = product of:
  0.10302083 = sum of:
    0.015916053 = weight(_text_:of in 3072) [ClassicSimilarity], result of:
      0.015916053 = score(doc=3072,freq=10.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.23179851 = fieldWeight in 3072, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=3072)
    0.08710478 = weight(_text_:distribution in 3072) [ClassicSimilarity], result of:
      0.08710478 = score(doc=3072,freq=2.0), product of:
        0.24019864 = queryWeight, product of:
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.043909185 = queryNorm
        0.36263645 = fieldWeight in 3072, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.4703507 = idf(docFreq=505, maxDocs=44218)
          0.046875 = fieldNorm(doc=3072)
  0.2857143 = coord(2/7)
```
Abstract

We present an analysis of the Yahoo Buzz Index over a period of 45 weeks. Our key findings are that: (1) It is most common for a search term to show up on the index for one week, followed by two weeks, three weeks, etc. Only two terms persist for all 45 weeks studied - Britney Spears and Jennifer Lopez. Search term longevity follows a power-law distribution or a winner-take-all structure; (2) Most search terms focus on entertainment. Search terms related to serious topics are found less often. The Buzz Index does not necessarily follow the "news cycle"; and, (3) We provide two ways to determine "star power" of various search terms - one that emphasizes staying power on the Index and another that emphasizes rank. In general, the methods lead to dramatically different results. Britney Spears performs well in both methods. We conclude that the data available on the Index is symptomatic of a celebrity-crazed, entertainment-centered culture.

Callery, A.; Tracy-Proulx, D.: Yahoo! : Cataloging the Web (1997) 0.03

0.028192466 = product of:
  0.09867363 = sum of:
    0.013421593 = weight(_text_:of in 3405) [ClassicSimilarity], result of:
      0.013421593 = score(doc=3405,freq=4.0), product of:
        0.06866331 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.043909185 = queryNorm
        0.19546966 = fieldWeight in 3405, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=3405)
    0.08525203 = weight(_text_:cataloging in 3405) [ClassicSimilarity], result of:
      0.08525203 = score(doc=3405,freq=4.0), product of:
        0.17305137 = queryWeight, product of:
          3.9411201 = idf(docFreq=2334, maxDocs=44218)
          0.043909185 = queryNorm
        0.49264002 = fieldWeight in 3405, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.9411201 = idf(docFreq=2334, maxDocs=44218)
          0.0625 = fieldNorm(doc=3405)
  0.2857143 = coord(2/7)

Abstract: Discusses the ways in which Yahoo! the Internet subject guide and search engine approaches the enormous task to cataloguing resources on the Internet, how its approach differs from traditional library methods of information organization and how Yahoo! is different from most WWW search engines. Demonstrates Yahoo!'s entire cataloguing process
Source: Journal of Internet cataloging. 1(1997) no.1, S.57-64

Search (657 results, page 1 of 33)

Authors

Years

Languages

Types

Themes

Subjects

Classifications