Search (110 results, page 1 of 6)

Jansen, B.J.; Spink, A.: How are we searching the World Wide Web? : A comparison of nine search engine transaction logs (2006) 0.06

0.05844656 = product of:
  0.13637531 = sum of:
    0.025943318 = weight(_text_:management in 968) [ClassicSimilarity], result of:
      0.025943318 = score(doc=968,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.18620178 = fieldWeight in 968, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.0390625 = fieldNorm(doc=968)
    0.0847222 = weight(_text_:europe in 968) [ClassicSimilarity], result of:
      0.0847222 = score(doc=968,freq=2.0), product of:
        0.25178367 = queryWeight, product of:
          6.091085 = idf(docFreq=271, maxDocs=44218)
          0.041336425 = queryNorm
        0.33648807 = fieldWeight in 968, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.091085 = idf(docFreq=271, maxDocs=44218)
          0.0390625 = fieldNorm(doc=968)
    0.025709787 = product of:
      0.051419575 = sum of:
        0.051419575 = weight(_text_:studies in 968) [ClassicSimilarity], result of:
          0.051419575 = score(doc=968,freq=4.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.3117402 = fieldWeight in 968, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0390625 = fieldNorm(doc=968)
      0.5 = coord(1/2)
  0.42857143 = coord(3/7)

Abstract: The Web and especially major Web search engines are essential tools in the quest to locate online information for many people. This paper reports results from research that examines characteristics and changes in Web searching from nine studies of five Web search engines based in the US and Europe. We compare interactions occurring between users and Web search engines from the perspectives of session length, query length, query complexity, and content viewed among the Web search engines. The results of our research shows (1) users are viewing fewer result pages, (2) searchers on US-based Web search engines use more query operators than searchers on European-based search engines, (3) there are statistically significant differences in the use of Boolean operators and result pages viewed, and (4) one cannot necessary apply results from studies of one particular Web search engine to another Web search engine. The wide spread use of Web search engines, employment of simple queries, and decreased viewing of result pages may have resulted from algorithmic enhancements by Web search engine companies. We discuss the implications of the findings for the development of Web search engines and design of online content.
Source: Information processing and management. 42(2006) no.1, S.248-263

MacLeod, R.: Promoting a subject gateway : a case study from EEVL (Edinburgh Engineering Virtual Library) (2000) 0.05

0.046983022 = product of:
  0.16444057 = sum of:
    0.124838956 = weight(_text_:case in 4872) [ClassicSimilarity], result of:
      0.124838956 = score(doc=4872,freq=4.0), product of:
        0.18173204 = queryWeight, product of:
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.041336425 = queryNorm
        0.6869397 = fieldWeight in 4872, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.078125 = fieldNorm(doc=4872)
    0.03960162 = product of:
      0.07920324 = sum of:
        0.07920324 = weight(_text_:22 in 4872) [ClassicSimilarity], result of:
          0.07920324 = score(doc=4872,freq=4.0), product of:
            0.14475311 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.041336425 = queryNorm
            0.54716086 = fieldWeight in 4872, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=4872)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)

Abstract: Describes the development of EEVL and outlines the services offered. The potential market for EEVL is discussed, and a case study of promotional activities is presented
Date: 22. 6.2002 19:40:22

Bar-Ilan, J.: Evaluating the stability of the search tools Hotbot and Snap : a case study (2000) 0.03

0.03223962 = product of:
  0.11283866 = sum of:
    0.08738727 = weight(_text_:case in 1180) [ClassicSimilarity], result of:
      0.08738727 = score(doc=1180,freq=4.0), product of:
        0.18173204 = queryWeight, product of:
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.041336425 = queryNorm
        0.48085782 = fieldWeight in 1180, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1180)
    0.02545139 = product of:
      0.05090278 = sum of:
        0.05090278 = weight(_text_:studies in 1180) [ClassicSimilarity], result of:
          0.05090278 = score(doc=1180,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.30860704 = fieldWeight in 1180, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1180)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)

Abstract: Discusses the results of a case study in which 20 random queries were presented for ten consecutive days to Hotbot and Snap, two search tools that draw their results from the database of Inktomi. The results show huge daily fluctuations in the number of hits retrieved by Hotbot, and high stability in the hits displayed by Snap. These findings are to alert users of Hotbot of its instability as of October 1999, and they raise questions about the reliability of previous studies estimating the size of Hotbot based on its overlap with other search engines.

Brophy, J.; Bawden, D.: Is Google enough? : Comparison of an internet search engine with academic library resources (2005) 0.03
```
0.030838823 = product of:
  0.107935876 = sum of:
    0.07644795 = weight(_text_:case in 648) [ClassicSimilarity], result of:
      0.07644795 = score(doc=648,freq=6.0), product of:
        0.18173204 = queryWeight, product of:
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.041336425 = queryNorm
        0.420663 = fieldWeight in 648, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.0390625 = fieldNorm(doc=648)
    0.03148793 = product of:
      0.06297586 = sum of:
        0.06297586 = weight(_text_:studies in 648) [ClassicSimilarity], result of:
          0.06297586 = score(doc=648,freq=6.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.3818022 = fieldWeight in 648, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0390625 = fieldNorm(doc=648)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

Purpose - The purpose of the study was to compare an internet search engine, Google, with appropriate library databases and systems, in order to assess the relative value, strengths and weaknesses of the two sorts of system. Design/methodology/approach - A case study approach was used, with detailed analysis and failure checking of results. The performance of the two systems was assessed in terms of coverage, unique records, precision, and quality and accessibility of results. A novel form of relevance assessment, based on the work of Saracevic and others was devised. Findings - Google is superior for coverage and accessibility. Library systems are superior for quality of results. Precision is similar for both systems. Good coverage requires use of both, as both have many unique items. Improving the skills of the searcher is likely to give better results from the library systems, but not from Google. Research limitations/implications - Only four case studies were included. These were limited to the kind of queries likely to be searched by university students. Library resources were limited to those in two UK academic libraries. Only the basic Google web search functionality was used, and only the top ten records examined. Practical implications - The results offer guidance for those providing support and training for use of these retrieval systems, and also provide evidence for debates on the "Google phenomenon". Originality/value - This is one of the few studies which provide evidence on the relative performance of internet search engines and library databases, and the only one to conduct such in-depth case studies. The method for the assessment of relevance is novel.

Serrano Cobos, J.; Quintero Orta, A.: Design, development and management of an information recovery system for an Internet Website : from documentary theory to practice (2003) 0.03

0.027711991 = product of:
  0.09699196 = sum of:
    0.044027276 = weight(_text_:management in 2726) [ClassicSimilarity], result of:
      0.044027276 = score(doc=2726,freq=4.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.31599492 = fieldWeight in 2726, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.046875 = fieldNorm(doc=2726)
    0.052964687 = weight(_text_:case in 2726) [ClassicSimilarity], result of:
      0.052964687 = score(doc=2726,freq=2.0), product of:
        0.18173204 = queryWeight, product of:
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.041336425 = queryNorm
        0.29144385 = fieldWeight in 2726, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.046875 = fieldNorm(doc=2726)
  0.2857143 = coord(2/7)

Abstract: A real case study is shown, explaining in a timeline the whole process of design, development and evaluation of a search engine used as a navigational help tool for end users and clients an a content website, e-commerce driven. The nature of the website is a community website, which will determine the core design of the information service. This study will involve several steps, such as information recovery system analysis, comparative analysis of other commercial search engines, service design, functionalities and scope; software selection, design of the project, project management, future service administration and conclusions.

Couvering, E. van: ¬The economy of navigation : search engines, search optimisation and search results (2007) 0.02
```
0.022998964 = product of:
  0.08049637 = sum of:
    0.04413724 = weight(_text_:case in 379) [ClassicSimilarity], result of:
      0.04413724 = score(doc=379,freq=2.0), product of:
        0.18173204 = queryWeight, product of:
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.041336425 = queryNorm
        0.24286987 = fieldWeight in 379, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.0390625 = fieldNorm(doc=379)
    0.03635913 = product of:
      0.07271826 = sum of:
        0.07271826 = weight(_text_:studies in 379) [ClassicSimilarity], result of:
          0.07271826 = score(doc=379,freq=8.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.44086722 = fieldWeight in 379, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0390625 = fieldNorm(doc=379)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

The political economy of communication focuses critically on what structural issues in mass media - ownership, labour practices, professional ethics, and so on - mean for products of those mass media and thus for society more generally. In the case of new media, recent political economic studies have looked at the technical infrastructure of the Internet and also at Internet usage. However, political economic studies of internet content are only beginning. Recent studies on the phenomenology of the Web, that is, the way the Web is experienced from an individual user's perspective, highlight the centrality of the search engine to most users' experiences of the Web, particularly when they venture beyond familiar Web sites. Search engines are therefore an obvi ous place to begin the analysis of Web content. An important assumption of this chapter is that internet search engines are media businesses and that the tools developed in media studies can be profitably brought to bear on them. This focus on search engine as industry comes from the critical tradition of the political economy of communications in rejecting the notion that the market alone should be the arbiter of the structure of the media industry, as might be appropriate for other types of products.
Vise, D.A.; Malseed, M.: ¬The Google story (2005) 0.02
```
0.0197447 = product of:
  0.069106445 = sum of:
    0.05930554 = weight(_text_:europe in 5937) [ClassicSimilarity], result of:
      0.05930554 = score(doc=5937,freq=2.0), product of:
        0.25178367 = queryWeight, product of:
          6.091085 = idf(docFreq=271, maxDocs=44218)
          0.041336425 = queryNorm
        0.23554166 = fieldWeight in 5937, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.091085 = idf(docFreq=271, maxDocs=44218)
          0.02734375 = fieldNorm(doc=5937)
    0.009800901 = product of:
      0.019601801 = sum of:
        0.019601801 = weight(_text_:22 in 5937) [ClassicSimilarity], result of:
          0.019601801 = score(doc=5937,freq=2.0), product of:
            0.14475311 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.041336425 = queryNorm
            0.1354154 = fieldWeight in 5937, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=5937)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

Social phenomena happen, and the historians follow. So it goes with Google, the latest star shooting through the universe of trend-setting businesses. This company has even entered our popular lexicon: as many note, "Google" has moved beyond noun to verb, becoming an action which most tech-savvy citizens at the turn of the twenty-first century recognize and in fact do, on a daily basis. It's this wide societal impact that fascinated authors David Vise and Mark Malseed, who came to the book with well-established reputations in investigative reporting. Vise authored the bestselling The Bureau and the Mole, and Malseed contributed significantly to two Bob Woodward books, Bush at War and Plan of Attack. The kind of voluminous research and behind-the-scenes insight in which both writers specialize, and on which their earlier books rested, comes through in The Google Story. The strength of the book comes from its command of many small details, and its focus on the human side of the Google story, as opposed to the merely academic one. Some may prefer a dryer, more analytic approach to Google's impact on the Internet, like The Search or books that tilt more heavily towards bits and bytes on the spectrum between technology and business, like The Singularity is Near. Those wanting to understand the motivations and personal growth of founders Larry Page and Sergey Brin and CEO Eric Schmidt, however, will enjoy this book. Vise and Malseed interviewed over 150 people, including numerous Google employees, Wall Street analysts, Stanford professors, venture capitalists, even Larry Page's Cub Scout leader, and their comprehensiveness shows. As the narrative unfolds, readers learn how Google grew out of the intellectually fertile and not particularly directed friendship between Page and Brin; how the founders attempted to peddle early versions of their search technology to different Silicon Valley firms for $1 million; how Larry and Sergey celebrated their first investor's check with breakfast at Burger King; how the pair initially housed their company in a Palo Alto office, then eventually moved to a futuristic campus dubbed the "Googleplex"; how the company found its financial footing through keyword-targeted Web ads; how various products like Google News, Froogle, and others were cooked up by an inventive staff; how Brin and Page proved their mettle as tough businessmen through negotiations with AOL Europe and their controversial IPO process, among other instances; and how the company's vision for itself continues to grow, such as geographic expansion to China and cooperation with Craig Venter on the Human Genome Project. Like the company it profiles, The Google Story is a bit of a wild ride, and fun, too. Its first appendix lists 23 "tips" which readers can use to get more utility out of Google. The second contains the intelligence test which Google Research offers to prospective job applicants, and shows the sometimes zany methods of this most unusual business. Through it all, Vise and Malseed synthesize a variety of fascinating anecdotes and speculation about Google, and readers seeking a first draft of the history of the company will enjoy an easy read.

Date

3. 5.1997 8:44:22
Stacey, Alison; Stacey, Adrian: Effective information retrieval from the Internet : an advanced user's guide (2004) 0.02
```
0.018422639 = product of:
  0.06447923 = sum of:
    0.049935583 = weight(_text_:case in 4497) [ClassicSimilarity], result of:
      0.049935583 = score(doc=4497,freq=4.0), product of:
        0.18173204 = queryWeight, product of:
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.041336425 = queryNorm
        0.2747759 = fieldWeight in 4497, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.03125 = fieldNorm(doc=4497)
    0.014543652 = product of:
      0.029087303 = sum of:
        0.029087303 = weight(_text_:studies in 4497) [ClassicSimilarity], result of:
          0.029087303 = score(doc=4497,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.17634688 = fieldWeight in 4497, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.03125 = fieldNorm(doc=4497)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Content

Key Features - Importantly, the book enables readers to develop strategies which will continue to be useful despite the rapidly-evolving state of the Internet and Internet technologies - it is not about technological `tricks'. - Enables readers to be aware of and compensate for bias and errors which are ubiquitous an the Internet. - Provides contemporary information an the deficiencies in web skills of novice users as well as practical techniques for teaching such users. The Authors Dr Alison Stacey works at the Learning Resource Centre, Cambridge Regional College. Dr Adrian Stacey, formerly based at Cambridge University, is a software programmer. Readership The book is aimed at a wide range of librarians and other information professionals who need to retrieve information from the Internet efficiently, to evaluate their confidence in the information they retrieve and/or to train others to use the Internet. It is primarily aimed at intermediate to advanced users of the Internet. Contents Fundamentals of information retrieval from the Internet - why learn web searching technique; types of information requests; patterns for information retrieval; leveraging the technology: Search term choice: pinpointing information an the web - why choose queries carefully; making search terms work together; how to pick search terms; finding the 'unfindable': Blas an the Internet - importance of bias; sources of bias; usergenerated bias: selecting information with which you already agree; assessing and compensating for bias; case studies: Query reformulation and longer term strategies - how to interact with your search engine; foraging for information; long term information retrieval: using the Internet to find trends; automating searches: how to make your machine do your work: Assessing the quality of results- how to assess and ensure quality: The novice user and teaching internet skills - novice users and their problems with the web; case study: research in a college library; interpreting 'second hand' web information.
Maurer, H.; Balke, T.; Kappe,, F.; Kulathuramaiyer, N.; Weber, S.; Zaka, B.: Report on dangers and opportunities posed by large search engines, particularly Google (2007) 0.02
```
0.017640304 = product of:
  0.06174106 = sum of:
    0.050833322 = weight(_text_:europe in 754) [ClassicSimilarity], result of:
      0.050833322 = score(doc=754,freq=2.0), product of:
        0.25178367 = queryWeight, product of:
          6.091085 = idf(docFreq=271, maxDocs=44218)
          0.041336425 = queryNorm
        0.20189285 = fieldWeight in 754, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.091085 = idf(docFreq=271, maxDocs=44218)
          0.0234375 = fieldNorm(doc=754)
    0.0109077385 = product of:
      0.021815477 = sum of:
        0.021815477 = weight(_text_:studies in 754) [ClassicSimilarity], result of:
          0.021815477 = score(doc=754,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.13226016 = fieldWeight in 754, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0234375 = fieldNorm(doc=754)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

The preliminary intended and approved list was: Section 1: To concentrate on Google as virtual monopoly, and Google's reported support of Wikipedia. To find experimental evidence of this support or show that the reports are not more than rumours. Section 2: To address the copy-past syndrome with socio-cultural consequences associated with it. Section 3: To deal with plagiarism and IPR violations as two intertwined topics: how they affect various players (teachers and pupils in school; academia; corporations; governmental studies, etc.). To establish that not enough is done concerning these issues, partially due to just plain ignorance. We will propose some ways to alleviate the problem. Section 4: To discuss the usual tools to fight plagiarism and their shortcomings. Section 5: To propose ways to overcome most of above problems according to proposals by Maurer/Zaka. To examples, but to make it clear that do this more seriously a pilot project is necessary beyond this particular study. Section 6: To briefly analyze various views of plagiarism as it is quite different in different fields (journalism, engineering, architecture, painting, .) and to present a concept that avoids plagiarism from the very beginning. Section 7: To point out the many other dangers of Google or Google-like undertakings: opportunistic ranking, analysis of data as window into commercial future. Section 8: To outline the need of new international laws. Section 9: To mention the feeble European attempts to fight Google, despite Google's growing power. Section 10. To argue that there is no way to catch up with Google in a frontal attack.
We believe that the importance has shifted considerably since the approval of the project. We thus will emphasize some aspects much more than ever planned, and treat others in a shorter fashion. We believe and hope that this is also seen as unexpected benefit by BMVIT. This report is structured as follows: After an Executive Summary that will highlight why the topic is of such paramount importance we explain in an introduction possible optimal ways how to study the report and its appendices. We can report with some pride that many of the ideas have been accepted by the international scene at conferences and by journals as of such crucial importance that a number of papers (constituting the appendices and elaborating the various sections) have been considered high quality material for publication. We want to thank the Austrian Federal Ministry of Transport, Innovation and Technology (BMVIT) for making this study possible. We would be delighted if the study can be distributed widely to European decision makers, as some of the issues involved do indeed involve all of Europe, if not the world.

Koch, T.: Quality-controlled subject gateways : definitions, typologies, empirical overview (2000) 0.02

0.015977843 = product of:
  0.05592245 = sum of:
    0.036320645 = weight(_text_:management in 631) [ClassicSimilarity], result of:
      0.036320645 = score(doc=631,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.2606825 = fieldWeight in 631, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.0546875 = fieldNorm(doc=631)
    0.019601801 = product of:
      0.039203603 = sum of:
        0.039203603 = weight(_text_:22 in 631) [ClassicSimilarity], result of:
          0.039203603 = score(doc=631,freq=2.0), product of:
            0.14475311 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.041336425 = queryNorm
            0.2708308 = fieldWeight in 631, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=631)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)

Abstract: 'Quality-controlled subject gateways' are Internet services which apply a rich set of quality measures to support systematic resource discovery. Considerable manual effort is used to secure a selection of resources which meet quality criteria and to display a rich description of these resources with standards-based metadata. Regular checking and updating ensure good collection management. A main goal is to provide a high quality of subject access through indexing resources using controlled vocabularies and by offering a deep classification structure for advanced searching and browsing. This article provides an initial empirical overview of existing services of this kind, their approaches and technologies, based on proposed working definitions and typologies of subject gateways
Date: 22. 6.2002 19:37:55

Song, R.; Luo, Z.; Nie, J.-Y.; Yu, Y.; Hon, H.-W.: Identification of ambiguous queries in web search (2009) 0.02

0.015127847 = product of:
  0.05294746 = sum of:
    0.031131983 = weight(_text_:management in 2441) [ClassicSimilarity], result of:
      0.031131983 = score(doc=2441,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.22344214 = fieldWeight in 2441, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.046875 = fieldNorm(doc=2441)
    0.021815477 = product of:
      0.043630954 = sum of:
        0.043630954 = weight(_text_:studies in 2441) [ClassicSimilarity], result of:
          0.043630954 = score(doc=2441,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.26452032 = fieldWeight in 2441, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.046875 = fieldNorm(doc=2441)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)

Abstract: It is widely believed that many queries submitted to search engines are inherently ambiguous (e.g., java and apple). However, few studies have tried to classify queries based on ambiguity and to answer "what the proportion of ambiguous queries is". This paper deals with these issues. First, we clarify the definition of ambiguous queries by constructing the taxonomy of queries from being ambiguous to specific. Second, we ask human annotators to manually classify queries. From manually labeled results, we observe that query ambiguity is to some extent predictable. Third, we propose a supervised learning approach to automatically identify ambiguous queries. Experimental results show that we can correctly identify 87% of labeled queries with the approach. Finally, by using our approach, we estimate that about 16% of queries in a real search log are ambiguous.
Source: Information processing and management. 45(2009) no.2, S.216-229

Spink, A.; Park, M.; Jansen, B.J.; Pedersen, J.: Elicitation and use of relevance feedback information (2006) 0.01
```
0.012606538 = product of:
  0.044122882 = sum of:
    0.025943318 = weight(_text_:management in 967) [ClassicSimilarity], result of:
      0.025943318 = score(doc=967,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.18620178 = fieldWeight in 967, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.0390625 = fieldNorm(doc=967)
    0.018179566 = product of:
      0.03635913 = sum of:
        0.03635913 = weight(_text_:studies in 967) [ClassicSimilarity], result of:
          0.03635913 = score(doc=967,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.22043361 = fieldWeight in 967, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0390625 = fieldNorm(doc=967)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

A user's single session with a Web search engine or information retrieval (IR) system may consist of seeking information on single or multiple topics, and switch between tasks or multitasking information behavior. Most Web search sessions consist of two queries of approximately two words. However, some Web search sessions consist of three or more queries. We present findings from two studies. First, a study of two-query search sessions on the AltaVista Web search engine, and second, a study of three or more query search sessions on the AltaVista Web search engine. We examine the degree of multitasking search and information task switching during these two sets of AltaVista Web search sessions. A sample of two-query and three or more query sessions were filtered from AltaVista transaction logs from 2002 and qualitatively analyzed. Sessions ranged in duration from less than a minute to a few hours. Findings include: (1) 81% of two-query sessions included multiple topics, (2) 91.3% of three or more query sessions included multiple topics, (3) there are a broad variety of topics in multitasking search sessions, and (4) three or more query sessions sometimes contained frequent topic changes. Multitasking is found to be a growing element in Web searching. This paper proposes an approach to interactive information retrieval (IR) contextually within a multitasking framework. The implications of our findings for Web design and further research are discussed.

Source

Information processing and management. 42(2006) no.1, S.264-275
Thatcher, A.: Web search strategies : the influence of Web experience and task type (2008) 0.01
```
0.012606538 = product of:
  0.044122882 = sum of:
    0.025943318 = weight(_text_:management in 2095) [ClassicSimilarity], result of:
      0.025943318 = score(doc=2095,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.18620178 = fieldWeight in 2095, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2095)
    0.018179566 = product of:
      0.03635913 = sum of:
        0.03635913 = weight(_text_:studies in 2095) [ClassicSimilarity], result of:
          0.03635913 = score(doc=2095,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.22043361 = fieldWeight in 2095, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2095)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

Despite a number of studies looking at Web experience and Web searching tactics and behaviours, the specific relationships between experience and cognitive search strategies have not been widely researched. This study investigates how the cognitive search strategies of 80 participants might vary with Web experience as they engaged in two researcher-defined tasks and two participant-defined information seeking tasks. Each of the two researcher-defined tasks and participant-defined tasks included a directed search task and a general-purpose browsing task. While there were almost no significant performance differences between experience levels on any of the four tasks, there were significant differences in the use of cognitive search strategies. Participants with higher levels of Web experience were more likely to use "Parallel player", "Parallel hub-and-spoke", "Known address search domain" and "Known address" strategies, whereas participants with lower levels of Web experience were more likely to use "Virtual tourist", "Link-dependent", "To-the-point", "Sequential player", "Search engine narrowing", and "Broad first" strategies. The patterns of use and differences between researcher-defined and participant-defined tasks and between directed search tasks and general-purpose browsing tasks are also discussed, although the distribution of search strategies by Web experience were not statistically significant for each individual task.

Source

Information processing and management. 44(2008) no.3, S.1308-1329
Bilal, D.: Children's use of the Yahooligans! Web search engine : III. Cognitive and physical behaviors on fully self-generated search tasks (2002) 0.01
```
0.011033435 = product of:
  0.077234045 = sum of:
    0.077234045 = sum of:
      0.043630954 = weight(_text_:studies in 5228) [ClassicSimilarity], result of:
        0.043630954 = score(doc=5228,freq=2.0), product of:
          0.16494368 = queryWeight, product of:
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.041336425 = queryNorm
          0.26452032 = fieldWeight in 5228, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.046875 = fieldNorm(doc=5228)
      0.033603087 = weight(_text_:22 in 5228) [ClassicSimilarity], result of:
        0.033603087 = score(doc=5228,freq=2.0), product of:
          0.14475311 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.041336425 = queryNorm
          0.23214069 = fieldWeight in 5228, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=5228)
  0.14285715 = coord(1/7)
```
Abstract

Bilal, in this third part of her Yahooligans! study looks at children's performance with self-generated search tasks, as compared to previously assigned search tasks looking for differences in success, cognitive behavior, physical behavior, and task preference. Lotus ScreenCam was used to record interactions and post search interviews to record impressions. The subjects, the same 22 seventh grade children in the previous studies, generated topics of interest that were mediated with the researcher into more specific topics where necessary. Fifteen usable sessions form the basis of the study. Eleven children were successful in finding information, a rate of 73% compared to 69% in assigned research questions, and 50% in assigned fact-finding questions. Eighty-seven percent began using one or two keyword searches. Spelling was a problem. Successful children made fewer keyword searches and the number of search moves averaged 5.5 as compared to 2.4 on the research oriented task and 3.49 on the factual. Backtracking and looping were common. The self-generated task was preferred by 47% of the subjects.

Taylor, M.: Using the Google search appliance for federated searching : a case study (2005) 0.01

0.010088513 = product of:
  0.07061958 = sum of:
    0.07061958 = weight(_text_:case in 355) [ClassicSimilarity], result of:
      0.07061958 = score(doc=355,freq=2.0), product of:
        0.18173204 = queryWeight, product of:
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.041336425 = queryNorm
        0.3885918 = fieldWeight in 355, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.0625 = fieldNorm(doc=355)
  0.14285715 = coord(1/7)

Spink, A.; Jansen, B.J.; Blakely, C.; Koshman, S.: ¬A study of results overlap and uniqueness among major Web search engines (2006) 0.01
```
0.010085231 = product of:
  0.035298306 = sum of:
    0.020754656 = weight(_text_:management in 993) [ClassicSimilarity], result of:
      0.020754656 = score(doc=993,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.14896142 = fieldWeight in 993, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.03125 = fieldNorm(doc=993)
    0.014543652 = product of:
      0.029087303 = sum of:
        0.029087303 = weight(_text_:studies in 993) [ClassicSimilarity], result of:
          0.029087303 = score(doc=993,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.17634688 = fieldWeight in 993, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.03125 = fieldNorm(doc=993)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

The performance and capabilities of Web search engines is an important and significant area of research. Millions of people world wide use Web search engines very day. This paper reports the results of a major study examining the overlap among results retrieved by multiple Web search engines for a large set of more than 10,000 queries. Previous smaller studies have discussed a lack of overlap in results returned by Web search engines for the same queries. The goal of the current study was to conduct a large-scale study to measure the overlap of search results on the first result page (both non-sponsored and sponsored) across the four most popular Web search engines, at specific points in time using a large number of queries. The Web search engines included in the study were MSN Search, Google, Yahoo! and Ask Jeeves. Our study then compares these results with the first page results retrieved for the same queries by the metasearch engine Dogpile.com. Two sets of randomly selected user-entered queries, one set was 10,316 queries and the other 12,570 queries, from Infospace's Dogpile.com search engine (the first set was from Dogpile, the second was from across the Infospace Network of search properties were submitted to the four single Web search engines). Findings show that the percent of total results unique to only one of the four Web search engines was 84.9%, shared by two of the three Web search engines was 11.4%, shared by three of the Web search engines was 2.6%, and shared by all four Web search engines was 1.1%. This small degree of overlap shows the significant difference in the way major Web search engines retrieve and rank results in response to given queries. Results point to the value of metasearch engines in Web retrieval to overcome the biases of individual search engines.

Source

Information processing and management. 42(2006) no.5, S.1379-1391
Jansen, B.J.; Pooch , U.: ¬A review of Web searching studies and a framework for future research (2001) 0.01
```
0.008906132 = product of:
  0.06234292 = sum of:
    0.06234292 = product of:
      0.12468584 = sum of:
        0.12468584 = weight(_text_:studies in 5186) [ClassicSimilarity], result of:
          0.12468584 = score(doc=5186,freq=12.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.75592977 = fieldWeight in 5186, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5186)
      0.5 = coord(1/2)
  0.14285715 = coord(1/7)
```
Abstract

Jansen and Pooch review three major search engine studies and compare them to three traditional search system studies and three OPAC search studies, to determine if user search characteristics differ. The web search engine studies indicate that most searchers use two, two search term queries per session, no boolean operators, and look only at the top ten items returned, while reporting the location of relevant information. In traditional search systems we find seven to 16 queries of six to nine terms, while about ten documents per session were viewed. The OPAC studies indicated two to five queries per session of two or less terms, with Boolean search about 1% and less than 50 documents viewed.

Dempsey, B.J.: Design and empirical evaluation of search software for legal professionals on the WWW (2000) 0.01

0.008894852 = product of:
  0.062263966 = sum of:
    0.062263966 = weight(_text_:management in 6274) [ClassicSimilarity], result of:
      0.062263966 = score(doc=6274,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.44688427 = fieldWeight in 6274, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.09375 = fieldNorm(doc=6274)
  0.14285715 = coord(1/7)

Source: Information processing and management. 36(2000) no.2, S.253-273

Web work : Information seeking and knowledge work on the World Wide Web (2000) 0.01

0.008894852 = product of:
  0.062263966 = sum of:
    0.062263966 = weight(_text_:management in 1190) [ClassicSimilarity], result of:
      0.062263966 = score(doc=1190,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.44688427 = fieldWeight in 1190, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.09375 = fieldNorm(doc=1190)
  0.14285715 = coord(1/7)

Series: Information science and knowledge management; vol.1

Thelwall, M.; Stuart, D.: Web crawling ethics revisited : cost, privacy, and denial of service (2006) 0.01
```
0.008827448 = product of:
  0.061792135 = sum of:
    0.061792135 = weight(_text_:case in 6098) [ClassicSimilarity], result of:
      0.061792135 = score(doc=6098,freq=2.0), product of:
        0.18173204 = queryWeight, product of:
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.041336425 = queryNorm
        0.34001783 = fieldWeight in 6098, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6098)
  0.14285715 = coord(1/7)
```
Abstract

Ethical aspects of the employment of Web crawlers for information science research and other contexts are reviewed. The difference between legal and ethical uses of communications technologies is emphasized as well as the changing boundary between ethical and unethical conduct. A review of the potential impacts on Web site owners is used to underpin a new framework for ethical crawling, and it is argued that delicate human judgment is required for each individual case, with verdicts likely to change over time. Decisions can be based upon an approximate cost-benefit analysis, but it is crucial that crawler owners find out about the technological issues affecting the owners of the sites being crawled in order to produce an informed assessment.

Search (110 results, page 1 of 6)

Authors

Types

Themes

Subjects

Classifications