Search (35 results, page 2 of 2)

  • author_ss:"Spink, A."
  • year_i:[2000 TO 2010}
  1. Jansen, B.J.; Booth, D.L.; Spink, A.: Patterns of query reformulation during Web searching (2009) 0.05
    0.05470205 = product of:
      0.1094041 = sum of:
        0.06981198 = weight(_text_:web in 2936) [ClassicSimilarity], result of:
          0.06981198 = score(doc=2936,freq=8.0), product of:
            0.16134618 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.049439456 = queryNorm
            0.43268442 = fieldWeight in 2936, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.046875 = fieldNorm(doc=2936)
        0.03959212 = weight(_text_:search in 2936) [ClassicSimilarity], result of:
          0.03959212 = score(doc=2936,freq=2.0), product of:
            0.17183559 = queryWeight, product of:
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.049439456 = queryNorm
            0.230407 = fieldWeight in 2936, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.046875 = fieldNorm(doc=2936)
      0.5 = coord(2/4)
    
    Abstract
    Query reformulation is a key user behavior during Web search. Our research goal is to develop predictive models of query reformulation during Web searching. This article reports results from a study in which we automatically classified the query-reformulation patterns for 964,780 Web searching sessions, composed of 1,523,072 queries, to predict the next query reformulation. We employed an n-gram modeling approach to describe the probability of users transitioning from one query-reformulation state to another to predict their next state. We developed first-, second-, third-, and fourth-order models and evaluated each model for accuracy of prediction, coverage of the dataset, and complexity of the possible pattern set. The results show that Reformulation and Assistance account for approximately 45% of all query reformulations; furthermore, the results demonstrate that the first- and second-order models provide the best predictability, between 28 and 40% overall and higher than 70% for some patterns. Implications are that the n-gram approach can be used for improving searching systems and searching assistance.
  2. Spink, A.: Multitasking information behavior and information task switching : an exploratory study (2004) 0.05
    0.05174078 = product of:
      0.10348156 = sum of:
        0.03490599 = weight(_text_:web in 4426) [ClassicSimilarity], result of:
          0.03490599 = score(doc=4426,freq=2.0), product of:
            0.16134618 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.049439456 = queryNorm
            0.21634221 = fieldWeight in 4426, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.046875 = fieldNorm(doc=4426)
        0.068575576 = weight(_text_:search in 4426) [ClassicSimilarity], result of:
          0.068575576 = score(doc=4426,freq=6.0), product of:
            0.17183559 = queryWeight, product of:
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.049439456 = queryNorm
            0.39907667 = fieldWeight in 4426, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.046875 = fieldNorm(doc=4426)
      0.5 = coord(2/4)
    
    Abstract
    Recent studies show that humans engage in multitasking information behaviors, often in libraries, as they seek and search for information on more than one information task. Multitasking information behaviors may consist of library search and use behaviors, or database or Web search sessions on multiple information tasks. However, few human information behavior models of seeking, searching or use, or library use models, include considerations of multitasking information behavior. This paper reports results from a case study exploring multitasking information behavior by an information seeker in a public library using diary, observation and interview data collection techniques. The information seeker sought information on four unrelated personal information tasks during two public library visits. Findings include a taxonomy of information behaviors; a sequential flowchart of the information seeker's complex and iterative processes, including multitasking information behavior, electronic searches, physical library searches, serendipitous browsing, and successive searches; and that the information seeker engaged in a process of 17 information task switches over two library visits. A model of information multitasking and information task switching is presented. Implications for library services and bibliographic instruction are also discussed.
  3. Jansen, B.J.; Spink, A.; Blakely, C.; Koshman, S.: Defining a session on Web search engines (2007) 0.04
    0.041687947 = product of:
      0.08337589 = sum of:
        0.050382458 = weight(_text_:web in 285) [ClassicSimilarity], result of:
          0.050382458 = score(doc=285,freq=6.0), product of:
            0.16134618 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.049439456 = queryNorm
            0.3122631 = fieldWeight in 285, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.0390625 = fieldNorm(doc=285)
        0.032993436 = weight(_text_:search in 285) [ClassicSimilarity], result of:
          0.032993436 = score(doc=285,freq=2.0), product of:
            0.17183559 = queryWeight, product of:
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.049439456 = queryNorm
            0.19200584 = fieldWeight in 285, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.0390625 = fieldNorm(doc=285)
      0.5 = coord(2/4)
    
    Abstract
    Detecting query reformulations within a session by a Web searcher is an important area of research for designing more helpful searching systems and targeting content to particular users. Methods explored by other researchers include both qualitative (i.e., the use of human judges to manually analyze query patterns on usually small samples) and nondeterministic algorithms, typically using large amounts of training data to predict query modification during sessions. In this article, we explore three alternative methods for detection of session boundaries. All three methods are computationally straightforward and therefore easily implemented for detection of session changes. We examine 2,465,145 interactions from 534,507 users of Dogpile.com on May 6, 2005. We compare session analysis using (a) Internet Protocol address and cookie; (b) Internet Protocol address, cookie, and a temporal limit on intrasession interactions; and (c) Internet Protocol address, cookie, and query reformulation patterns. Overall, our analysis shows that defining sessions by query reformulation along with Internet Protocol address and cookie provides the best measure, resulting in an 82% increase in the count of sessions. Regardless of the method used, the mean session length was fewer than three queries, and the mean session duration was less than 30 min. Searchers most often modified their query by changing query terms (nearly 23% of all query modifications) rather than adding or deleting terms. Implications are that for measuring searching traffic, unique sessions may be a better indicator than the common metric of unique visitors. This research also sheds light on the more complex aspects of Web searching involving query modifications and may lead to advances in searching tools.
  4. Spink, A.; Cole, C.: New directions in cognitive information retrieval : introduction (2005) 0.04
    0.035118748 = product of:
      0.070237495 = sum of:
        0.032909684 = weight(_text_:web in 647) [ClassicSimilarity], result of:
          0.032909684 = score(doc=647,freq=4.0), product of:
            0.16134618 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.049439456 = queryNorm
            0.2039694 = fieldWeight in 647, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.03125 = fieldNorm(doc=647)
        0.03732781 = weight(_text_:search in 647) [ClassicSimilarity], result of:
          0.03732781 = score(doc=647,freq=4.0), product of:
            0.17183559 = queryWeight, product of:
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.049439456 = queryNorm
            0.21722981 = fieldWeight in 647, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.03125 = fieldNorm(doc=647)
      0.5 = coord(2/4)
    
    Abstract
    Humans have used electronic information retrieval (IR) systems for more than 50 years as they evolved from experimental systems to full-scale Web search engines and digital libraries. The fields of library and information science (LIS), cognitive science, human factors and computer science have historically been the leading disciplines in conducting research that seeks to model human interaction with IR systems for all kinds of information related behaviors. As technology problems have been mastered, the theoretical and applied framework for studying human interaction with IR systems has evolved from systems-centered to more user-centered, or cognitive-centered approaches. However, cognitive information retrieval (CIR) research that focuses on user interaction with IR systems is still largely under-funded and is often not included at computing and systems design oriented conferences. But CIR-focused research continues, and there are signs that some IR systems designers in academia and the Web search business are realizing that user behavior research can provide valuable insights into systems design and evaluation. The goal of our book is to provide an overview of new CIR research directions. This book does not provide a history of the research field of CIR. Instead, the book confronts new ways of looking at the human information condition with regard to our increasing need to interact with IR systems. The need has grown due to a number of factors, including the increased importance of information to more people in this information age. Also, IR was once considered document-oriented, but has now evolved to include multimedia, text, and other information objects. As a result, IR systems and their complexity have proliferated as users and user purposes for using them have also proliferated. Human interaction with IR systems can often be frustrating as people often lack an understanding of IR system functionality.
  5. Ozmutlu, S.; Spink, A.; Ozmutlu, H.C.: Multimedia Web searching trends : 1997-2001 (2003) 0.02
    0.024938045 = product of:
      0.09975218 = sum of:
        0.09975218 = weight(_text_:web in 1072) [ClassicSimilarity], result of:
          0.09975218 = score(doc=1072,freq=12.0), product of:
            0.16134618 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.049439456 = queryNorm
            0.6182494 = fieldWeight in 1072, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1072)
      0.25 = coord(1/4)
    
    Abstract
    Multimedia is proliferating on Web sites, as the Web continues to enhance the integration of multimedia and textual information. In this paper we examine trends in multimedia Web searching by Excite users from 1997 to 2001. Results from an analysis of 1,025,910 Excite queries from 2001 are compared to similar Excite datasets from 1997 to 1999. Findings include: (1) queries per multimedia session have decreased since 1997 as a proportion of general queries due to the introduction of multimedia buttons near the query box, (2) multimedia queries identified are longer than non-multimedia queries, and (3) audio queries are more prevalent than image or video queries in identified multimedia queries. Overall, we see multimedia Web searching undergoing major changes as Web content and searching evolves.
  6. Spink, A.; Wolfram, D.; Jansen, B.J.; Saracevic, T.: Searching the Web : the public and their queries (2001) 0.02
    0.023986846 = product of:
      0.095947385 = sum of:
        0.095947385 = weight(_text_:web in 6980) [ClassicSimilarity], result of:
          0.095947385 = score(doc=6980,freq=34.0), product of:
            0.16134618 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.049439456 = queryNorm
            0.59466785 = fieldWeight in 6980, product of:
              5.8309517 = tf(freq=34.0), with freq of:
                34.0 = termFreq=34.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.03125 = fieldNorm(doc=6980)
      0.25 = coord(1/4)
    
    Abstract
    In previous articles, we reported the state of Web searching in 1997 (Jansen, Spink, & Saracevic, 2000) and in 1999 (Spink, Wolfram, Jansen, & Saracevic, 2001). Such snapshot studies and statistics on Web use appear regularly (OCLC, 1999), but provide little information about Web searching trends. In this article, we compare and contrast results from our two previous studies of Excite queries' data sets, each containing over 1 million queries submitted by over 200,000 Excite users collected on 16 September 1997 and 20 December 1999. We examine how public Web searching changed during that 2-year time period. As Table 1 shows, the overall structure of Web queries in some areas did not change, while in others we see change from 1997 to 1999. Our comparison shows how Web searching changed incrementally and also dramatically. We see some moves toward greater simplicity, including shorter queries (i.e., fewer terms) and shorter sessions (i.e., fewer queries per user), with little modification (addition or deletion) of terms in subsequent queries. The trend toward shorter queries suggests that Web information content should target specific terms in order to reach Web users. Another trend was to view fewer pages of results per query. Most Excite users examined only one page of results per query, since an Excite results page contains ten ranked Web sites. Were users satisfied with the results and did not need to view more pages? It appears that the public continues to have a low tolerance of wading through retrieved sites. This decline in interactivity levels is a disturbing finding for the future of Web searching. Queries that included Boolean operators were in the minority, but the percentage increased between the two time periods. Most Boolean use involved the AND operator with many mistakes. The use of relevance feedback almost doubled from 1997 to 1999, but overall use was still small. 
An unusually large number of terms were used with low frequency, such as personal names, spelling errors, non-English words, and Web-specific terms, such as URLs. Web query vocabulary contains more words than found in large English texts in general. The public language of Web queries has its own and unique characteristics. How did Web searching topics change from 1997 to 1999? We classified a random sample of 2,414 queries from 1997 and 2,539 queries from 1999 into 11 categories (Table 2). From 1997 to 1999, Web searching shifted from entertainment, recreation and sex, and pornography, preferences to e-commerce-related topics under commerce, travel, employment, and economy. This shift coincided with changes in information distribution on the publicly indexed Web.
  7. Spink, A.; Gunar, O.: E-Commerce Web queries : Excite and AskJeeves study (2001) 0.02
    0.023270661 = product of:
      0.093082644 = sum of:
        0.093082644 = weight(_text_:web in 910) [ClassicSimilarity], result of:
          0.093082644 = score(doc=910,freq=2.0), product of:
            0.16134618 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.049439456 = queryNorm
            0.5769126 = fieldWeight in 910, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.125 = fieldNorm(doc=910)
      0.25 = coord(1/4)
    
  8. Spink, A.; Wilson, T.D.; Ford, N.; Foster, A.; Ellis, D.: Information seeking and mediated searching : Part 3: successive searching (2002) 0.02
    0.020204272 = product of:
      0.08081709 = sum of:
        0.08081709 = weight(_text_:search in 5242) [ClassicSimilarity], result of:
          0.08081709 = score(doc=5242,freq=12.0), product of:
            0.17183559 = queryWeight, product of:
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.049439456 = queryNorm
            0.47031635 = fieldWeight in 5242, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5242)
      0.25 = coord(1/4)
    
    Abstract
    In "Part 3. Successive Searching," where Spink is the primary author, after a review of the work on successive searching, a portion of the Texas-generated data is reviewed for insights on how frequently successive searching occurred, the motivation for its occurrence, and any distinctive characteristics of the successive search pattern. Of 18 mediated searches, half requested a second search and a quarter a third search. All but one seeker reported a need to refine and enhance the previous results. Second searches, while characterized as refinements, included a significantly higher number of items retrieved and more search cycles. Third searches had the most cycles but fewer retrieved items than the second. Number of terms utilized did not change significantly and overlap was limited to about one in five terms between first and second searches. No overlap occurred between the second and third searches. Problem solving stage shifts did occur with 2 moving to a later stage after the first search, 5 remaining in the same stage and one reverting to a previous stage. Precision did not increase over successive searches, but partially relevant judgments decreased between the second and third search.
  9. Ellis, D.; Wilson, T.D.; Ford, N.; Foster, A.; Lam, H.M.; Burton, R.; Spink, A.: Information seeking and mediated searching : Part 5: user-intermediary interaction (2002) 0.02
    0.01979606 = product of:
      0.07918424 = sum of:
        0.07918424 = weight(_text_:search in 5233) [ClassicSimilarity], result of:
          0.07918424 = score(doc=5233,freq=8.0), product of:
            0.17183559 = queryWeight, product of:
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.049439456 = queryNorm
            0.460814 = fieldWeight in 5233, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.046875 = fieldNorm(doc=5233)
      0.25 = coord(1/4)
    
    Abstract
    Ellis, et alia, now provide part five of their study on mediated searching which is treated separately here because of the presence of additional authors. The data source remains cases collected from 198 individuals, 87 in Texas and 111 in Sheffield in the U.K. but the focus here is on seeker/intermediary interaction utilizing the Saracevic triadic IR model, and the method is the analysis of discourse. While the pre-search interview stressed problem definition, interaction during the search in terms of relevance and magnitude continued to develop the problem statement. The user and intermediary focused on search tactics, review and relevance, while the intermediary interaction with the system was comprised of terminology and answers. The interaction clearly affected the search process. Users and intermediaries considered the process effective and users felt the intermediary increased their overall satisfaction.
  10. Wolfram, D.; Spink, A.; Jansen, B.J.; Saracevic, T.: Vox populi : the public searching of the Web (2001) 0.02
    0.017452994 = product of:
      0.06981198 = sum of:
        0.06981198 = weight(_text_:web in 6949) [ClassicSimilarity], result of:
          0.06981198 = score(doc=6949,freq=2.0), product of:
            0.16134618 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.049439456 = queryNorm
            0.43268442 = fieldWeight in 6949, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.09375 = fieldNorm(doc=6949)
      0.25 = coord(1/4)
    
  11. Jansen, B.J.; Spink, A.; Saracevic, T.: Real life, real users and real needs : a study and analysis of users queries on the Web (2000) 0.02
    0.017452994 = product of:
      0.06981198 = sum of:
        0.06981198 = weight(_text_:web in 411) [ClassicSimilarity], result of:
          0.06981198 = score(doc=411,freq=2.0), product of:
            0.16134618 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.049439456 = queryNorm
            0.43268442 = fieldWeight in 411, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.09375 = fieldNorm(doc=411)
      0.25 = coord(1/4)
    
  12. Wilson, T.D.; Ford, N.; Ellis, D.; Foster, A.; Spink, A.: Information seeking and mediated searching : Part 2: uncertainty and Its correlates (2002) 0.01
    0.013997929 = product of:
      0.055991717 = sum of:
        0.055991717 = weight(_text_:search in 5232) [ClassicSimilarity], result of:
          0.055991717 = score(doc=5232,freq=4.0), product of:
            0.17183559 = queryWeight, product of:
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.049439456 = queryNorm
            0.3258447 = fieldWeight in 5232, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.046875 = fieldNorm(doc=5232)
      0.25 = coord(1/4)
    
    Abstract
    In "Part 2. Uncertainty and Its Correlates," where Wilson is the primary author, after a review of uncertainty as a concept in information seeking and decision research, it is hypothesized that if the Kuhlthau problem solving stage model is appropriate the searchers will recognize the stage in which they currently are operating. Second, to test Wilson's contention that operationalized uncertainty would be useful in characterizing users, it is hypothesized that uncertainty will decrease as the searcher proceeds through problem stages and after the completion of the search. A review of pre- and post-search interviews reveals that uncertainty can be operationalized, and that academic researchers have no difficulty with a stage model of the information seeking process. Uncertainty is unrelated to sex, age, or discipline, but is related to problem stage and domain knowledge. Both concepts appear robust.
  13. Spink, A.; Wilson, T.D.; Ford, N.; Foster, A.; Ellis, D.: Information seeking and mediated searching : Part 1: theoretical framework and research design (2002) 0.01
    0.013997929 = product of:
      0.055991717 = sum of:
        0.055991717 = weight(_text_:search in 5240) [ClassicSimilarity], result of:
          0.055991717 = score(doc=5240,freq=4.0), product of:
            0.17183559 = queryWeight, product of:
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.049439456 = queryNorm
            0.3258447 = fieldWeight in 5240, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.046875 = fieldNorm(doc=5240)
      0.25 = coord(1/4)
    
    Abstract
    In this issue we begin with the first of four parts of a five-part series of papers by Spink, Wilson, Ford, Foster, and Ellis. Spink, et alia, in the first section of this report set forth the design of a project to test whether existing models of the information search process are appropriate for an environment of mediated successive searching which they believe characterizes much information seeking behavior. Their goal is to develop an integrated model of the process. Data were collected from 198 individuals, 87 in Texas and 111 in Sheffield in the U.K., with individuals with real information needs engaged in interaction with operational information retrieval systems by use of transaction logs, recordings of interactions with intermediaries, pre- and post-search interviews, questionnaire responses, relevance judgments of retrieved text, and responses to a test of cognitive styles. Questionnaires were based upon the Kuhlthau model, the Saracevic model, the Ellis model, and incorporated a visual analog scale to avoid a consistency bias.
  14. Spink, A.; Jansen, B.J.: Web searching : public searching of the Web (2004) 0.01
    0.012059384 = product of:
      0.048237536 = sum of:
        0.048237536 = weight(_text_:web in 1443) [ClassicSimilarity], result of:
          0.048237536 = score(doc=1443,freq=22.0), product of:
            0.16134618 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.049439456 = queryNorm
            0.29896918 = fieldWeight in 1443, product of:
              4.690416 = tf(freq=22.0), with freq of:
                22.0 = termFreq=22.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.01953125 = fieldNorm(doc=1443)
      0.25 = coord(1/4)
    
    Footnote
    The commercial search engines AltaVista, Excite, and All the Web made large datasets available to the authors. Each of the analyzed files contained all queries submitted to the respective search engine on a particular day. The data were collected between 199'] and 2002; however, data are not available from every search engine for every year, so some of the observed differences in user behavior can probably be attributed to the different user groups of the individual search engines. In one case the user groups are even explicitly separated by search engine, so that the behavior of the European users of All the Web is compared with that of the US users. The log files are analyzed at several levels: the search terms entered, the complete queries, the search sessions, and the number of result pages viewed are all determined. For the search terms, it is particularly interesting that the range of information needs has widened markedly over the years. Although 20 percent of all search terms entered are used regularly, ten percent occurred only once. The topical interests of search engine users have also changed over recent years. Whereas in the early years many queries came from the two subject areas of sex and technology, these are now declining, while queries in the area of e-commerce are increasing. Non-English terms, as well as numbers and acronyms, have also increased. The popularity of search terms is also seasonal and is influenced by current news. At the query level, the well-documented fact is again confirmed that queries to Web search engines are extremely short. The average query contains between 2.3 and 2.9 terms, depending on the search engine. 
This agrees with other studies on the topic. Query length has risen slightly in recent years; larger jumps toward longer queries are not to be expected, however. The same holds for the use of operators: they appear in only about one query in ten, with phrase search being used most often. That search engine users must still largely be regarded as beginners is also shown by the fact that they actually look at only three or four documents from the result list per query.
    The relatively high figure of 17 percent, however, dates from 1997; since then there has been a marked decline. It must also be stressed that queries for sexual content cannot be equated with queries for pornography. The search for multimedia content has shifted from the general search interfaces of the search engines to special search forms, which are now offered by all major search engines. The most important finding from the data examined is that searching for multimedia content is more complex and, above all, more interactive than ordinary Web searching. The queries are longer and contain operators to a considerably greater extent. In image search, sexually oriented queries still make up the largest share. In image and video search, queries are considerably longer than in regular search; in audio search, by contrast, they are shorter. The present work offers the most comprehensive analysis to date of user behavior in Web searching; in particular, no comprehensive long-term studies whose results are directly comparable, as they are here, had previously been presented. The results are valid and enable search engine providers as well as researchers to align future developments more closely than before with the actual behavior of users. However, the book is limited to US search engines and their users, and includes European users only for All the Web. In particular, the question of whether European or German-speaking users search differently from American users remains unanswered. Further research would be needed here.
    LCSH
    Web usage mining
    RSWK
    World Wide Web / Suchmaschine
    Subject
    World Wide Web / Suchmaschine
    Web usage mining
  15. Spink, A.; Cole, C.: Introduction (2004) 0.01
    0.009331953 = product of:
      0.03732781 = sum of:
        0.03732781 = weight(_text_:search in 2389) [ClassicSimilarity], result of:
          0.03732781 = score(doc=2389,freq=4.0), product of:
            0.17183559 = queryWeight, product of:
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.049439456 = queryNorm
            0.21722981 = fieldWeight in 2389, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.475677 = idf(docFreq=3718, maxDocs=44218)
              0.03125 = fieldNorm(doc=2389)
      0.25 = coord(1/4)
    
    Abstract
    This is the second part of a two-part special topic JASIST issue on information seeking. The first part presented papers on the topics of health information seeking and everyday life information seeking or ELIS (i.e., information seeking outside of work or school). This second issue presents papers on the topics of information retrieval and information seeking in industry environments. Information retrieval involves a specific kind of information seeking, as the user is in direct contact with an information interface and with potential sources of information from the system's database. The user conducts the search using various strategies, tactics, etc., but there is also the possibility that information processes will occur resulting in a change in the way the user thinks about the topic of the search. If this occurs, the user is, in effect, using the found data, turning it into an informational element of some kind. Such processes can be facilitated in the design of the information retrieval system. Information seeking in industry environments takes up more and more of our working day. Even companies producing industrial products are in fact mainly producing informational elements of some kind, often for the purpose of making decisions or as starting positions for further information seeking. While there may be company mechanisms in place to aid such information seeking, and to make it more efficient, if better information seeking structures were in place, not only would workers waste less time in informational pursuits, but they would also find things, discover new processes, etc., that would benefit the corporation's bottom line. In Figure 1, we plot the six papers in this issue on an information behavior continuum, following a taxonomy of information behavior terms from Spink and Cole (2001). Information Behavior is a broad term covering all aspects of information seeking, including passive or undetermined information behavior. 
Information-Seeking Behavior is usually thought of as active or conscious information behavior. Information-Searching Behavior describes the interactive elements between a user and an information system. Information-Use Behavior is about the user's acquisition and incorporation of data in some kind of information process. This leads to the production of information, but also back to the broad range of Information Behavior in the first part of the continuum. Though we plot all papers in this issue along this continuum, they take into account more than their general framework. The three information retrieval reports veer from the traditional information-searching approach of user-system interaction, while the three industry environment articles veer from the traditional information-seeking approach of specific context information-seeking studies.
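
Note on the score explanations: the [ClassicSimilarity] trees listed with each result can be checked by hand. The sketch below is a minimal reconstruction, not the search engine's own code. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which is what the listed tf and idf values appear to follow, and it takes queryNorm, fieldNorm, docFreq and maxDocs directly from the listing. The function names (idf, term_score, document_score) are hypothetical helpers introduced only for this illustration.

```python
import math

# Constants copied from the explain output in the listing.
MAX_DOCS = 44218
QUERY_NORM = 0.049439456


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency; formula assumed from the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                       # tf(freq) line in the tree
    term_idf = idf(doc_freq)                   # idf(docFreq, maxDocs) line
    query_weight = term_idf * query_norm       # queryWeight line
    field_weight = tf * term_idf * field_norm  # fieldWeight line
    return query_weight * field_weight


def document_score(terms, matched, total_clauses):
    """Sum the term scores, then scale by coord(matched/total_clauses)."""
    coord = matched / total_clauses
    return coord * sum(term_score(*term) for term in terms)


# Result 1 (doc 2936): "web" with freq 8 and "search" with freq 2,
# both with fieldNorm 0.046875, and coord(2/4).
score_2936 = document_score(
    [(8.0, 4597, 0.046875), (2.0, 3718, 0.046875)],
    matched=2,
    total_clauses=4,
)
print(score_2936)
```

The printed value should agree with the 0.05470205 shown for result 1 up to small rounding differences, since the listed numbers look like single-precision values while Python computes in double precision. The same functions can be applied to any other entry by substituting its freq, docFreq, fieldNorm and coord values.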