Search (191 results, page 2 of 10)

Song, R.; Luo, Z.; Nie, J.-Y.; Yu, Y.; Hon, H.-W.: Identification of ambiguous queries in web search (2009) 0.02

0.015127847 = product of:
  0.05294746 = sum of:
    0.031131983 = weight(_text_:management in 2441) [ClassicSimilarity], result of:
      0.031131983 = score(doc=2441,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.22344214 = fieldWeight in 2441, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.046875 = fieldNorm(doc=2441)
    0.021815477 = product of:
      0.043630954 = sum of:
        0.043630954 = weight(_text_:studies in 2441) [ClassicSimilarity], result of:
          0.043630954 = score(doc=2441,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.26452032 = fieldWeight in 2441, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.046875 = fieldNorm(doc=2441)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)

Abstract: It is widely believed that many queries submitted to search engines are inherently ambiguous (e.g., java and apple). However, few studies have tried to classify queries based on ambiguity and to answer "what the proportion of ambiguous queries is". This paper deals with these issues. First, we clarify the definition of ambiguous queries by constructing the taxonomy of queries from being ambiguous to specific. Second, we ask human annotators to manually classify queries. From manually labeled results, we observe that query ambiguity is to some extent predictable. Third, we propose a supervised learning approach to automatically identify ambiguous queries. Experimental results show that we can correctly identify 87% of labeled queries with the approach. Finally, by using our approach, we estimate that about 16% of queries in a real search log are ambiguous.
Source: Information processing and management. 45(2009) no.2, S.216-229

Gencosman, B.C.; Ozmutlu, H.C.; Ozmutlu, S.: Character n-gram application for automatic new topic identification (2014) 0.01
```
0.014758031 = product of:
  0.051653106 = sum of:
    0.025943318 = weight(_text_:management in 2688) [ClassicSimilarity], result of:
      0.025943318 = score(doc=2688,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.18620178 = fieldWeight in 2688, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2688)
    0.025709787 = product of:
      0.051419575 = sum of:
        0.051419575 = weight(_text_:studies in 2688) [ClassicSimilarity], result of:
          0.051419575 = score(doc=2688,freq=4.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.3117402 = fieldWeight in 2688, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2688)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

The widespread availability of the Internet and the variety of Internet-based applications have resulted in a significant increase in the amount of web pages. Determining the behaviors of search engine users has become a critical step in enhancing search engine performance. Search engine user behaviors can be determined by content-based or content-ignorant algorithms. Although many content-ignorant studies have been performed to automatically identify new topics, previous results have demonstrated that spelling errors can cause significant errors in topic shift estimates. In this study, we focused on minimizing the number of wrong estimates that were based on spelling errors. We developed a new hybrid algorithm combining character n-gram and neural network methodologies, and compared the experimental results with results from previous studies. For the FAST and Excite datasets, the proposed algorithm improved topic shift estimates by 6.987% and 2.639%, respectively. Moreover, we analyzed the performance of the character n-gram method in different aspects including the comparison with Levenshtein edit-distance method. The experimental results demonstrated that the character n-gram method outperformed to the Levensthein edit distance method in terms of topic identification.

Source

Information processing and management. 50(2014) no.6, S.821-856

Huvila, I.: Affective capitalism of knowing and the society of search engine (2016) 0.01

0.013695294 = product of:
  0.047933526 = sum of:
    0.031131983 = weight(_text_:management in 3246) [ClassicSimilarity], result of:
      0.031131983 = score(doc=3246,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.22344214 = fieldWeight in 3246, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.046875 = fieldNorm(doc=3246)
    0.016801544 = product of:
      0.033603087 = sum of:
        0.033603087 = weight(_text_:22 in 3246) [ClassicSimilarity], result of:
          0.033603087 = score(doc=3246,freq=2.0), product of:
            0.14475311 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.041336425 = queryNorm
            0.23214069 = fieldWeight in 3246, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=3246)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)

Date: 20. 1.2015 18:30:22
Source: Aslib journal of information management. 68(2016) no.5, S.566-588

Hancock, B.: Subject-specific search engines : using the Harvest system to gather and maintain information on the Internet (1998) 0.01

0.012872341 = product of:
  0.09010638 = sum of:
    0.09010638 = sum of:
      0.05090278 = weight(_text_:studies in 3238) [ClassicSimilarity], result of:
        0.05090278 = score(doc=3238,freq=2.0), product of:
          0.16494368 = queryWeight, product of:
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.041336425 = queryNorm
          0.30860704 = fieldWeight in 3238, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.0546875 = fieldNorm(doc=3238)
      0.039203603 = weight(_text_:22 in 3238) [ClassicSimilarity], result of:
        0.039203603 = score(doc=3238,freq=2.0), product of:
          0.14475311 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.041336425 = queryNorm
          0.2708308 = fieldWeight in 3238, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=3238)
  0.14285715 = coord(1/7)

Abstract: The increasing expansion of the Internet has made resources available to users in sometimes unmanageable abundance. To help users manage this proliferation of information, librarians have begun to add URLs to their home pages. As well, specialized search engines are being used to retrieve information from selected sources in aneffort to return pertinent results. Describes the Harvest system which has been used to develop Index Antiquus, a specialized engine, for the classics and mediaeval studies. Presents a working example of how to search Index Antiquus
Date: 6. 3.1997 16:22:15

Spink, A.; Park, M.; Jansen, B.J.; Pedersen, J.: Elicitation and use of relevance feedback information (2006) 0.01
```
0.012606538 = product of:
  0.044122882 = sum of:
    0.025943318 = weight(_text_:management in 967) [ClassicSimilarity], result of:
      0.025943318 = score(doc=967,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.18620178 = fieldWeight in 967, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.0390625 = fieldNorm(doc=967)
    0.018179566 = product of:
      0.03635913 = sum of:
        0.03635913 = weight(_text_:studies in 967) [ClassicSimilarity], result of:
          0.03635913 = score(doc=967,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.22043361 = fieldWeight in 967, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0390625 = fieldNorm(doc=967)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

A user's single session with a Web search engine or information retrieval (IR) system may consist of seeking information on single or multiple topics, and switch between tasks or multitasking information behavior. Most Web search sessions consist of two queries of approximately two words. However, some Web search sessions consist of three or more queries. We present findings from two studies. First, a study of two-query search sessions on the AltaVista Web search engine, and second, a study of three or more query search sessions on the AltaVista Web search engine. We examine the degree of multitasking search and information task switching during these two sets of AltaVista Web search sessions. A sample of two-query and three or more query sessions were filtered from AltaVista transaction logs from 2002 and qualitatively analyzed. Sessions ranged in duration from less than a minute to a few hours. Findings include: (1) 81% of two-query sessions included multiple topics, (2) 91.3% of three or more query sessions included multiple topics, (3) there are a broad variety of topics in multitasking search sessions, and (4) three or more query sessions sometimes contained frequent topic changes. Multitasking is found to be a growing element in Web searching. This paper proposes an approach to interactive information retrieval (IR) contextually within a multitasking framework. The implications of our findings for Web design and further research are discussed.

Source

Information processing and management. 42(2006) no.1, S.264-275
Thatcher, A.: Web search strategies : the influence of Web experience and task type (2008) 0.01
```
0.012606538 = product of:
  0.044122882 = sum of:
    0.025943318 = weight(_text_:management in 2095) [ClassicSimilarity], result of:
      0.025943318 = score(doc=2095,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.18620178 = fieldWeight in 2095, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2095)
    0.018179566 = product of:
      0.03635913 = sum of:
        0.03635913 = weight(_text_:studies in 2095) [ClassicSimilarity], result of:
          0.03635913 = score(doc=2095,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.22043361 = fieldWeight in 2095, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2095)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

Despite a number of studies looking at Web experience and Web searching tactics and behaviours, the specific relationships between experience and cognitive search strategies have not been widely researched. This study investigates how the cognitive search strategies of 80 participants might vary with Web experience as they engaged in two researcher-defined tasks and two participant-defined information seeking tasks. Each of the two researcher-defined tasks and participant-defined tasks included a directed search task and a general-purpose browsing task. While there were almost no significant performance differences between experience levels on any of the four tasks, there were significant differences in the use of cognitive search strategies. Participants with higher levels of Web experience were more likely to use "Parallel player", "Parallel hub-and-spoke", "Known address search domain" and "Known address" strategies, whereas participants with lower levels of Web experience were more likely to use "Virtual tourist", "Link-dependent", "To-the-point", "Sequential player", "Search engine narrowing", and "Broad first" strategies. The patterns of use and differences between researcher-defined and participant-defined tasks and between directed search tasks and general-purpose browsing tasks are also discussed, although the distribution of search strategies by Web experience were not statistically significant for each individual task.

Source

Information processing and management. 44(2008) no.3, S.1308-1329
Roy, R.S.; Agarwal, S.; Ganguly, N.; Choudhury, M.: Syntactic complexity of Web search queries through the lenses of language models, networks and users (2016) 0.01
```
0.012606538 = product of:
  0.044122882 = sum of:
    0.025943318 = weight(_text_:management in 3188) [ClassicSimilarity], result of:
      0.025943318 = score(doc=3188,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.18620178 = fieldWeight in 3188, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3188)
    0.018179566 = product of:
      0.03635913 = sum of:
        0.03635913 = weight(_text_:studies in 3188) [ClassicSimilarity], result of:
          0.03635913 = score(doc=3188,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.22043361 = fieldWeight in 3188, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3188)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

Across the world, millions of users interact with search engines every day to satisfy their information needs. As the Web grows bigger over time, such information needs, manifested through user search queries, also become more complex. However, there has been no systematic study that quantifies the structural complexity of Web search queries. In this research, we make an attempt towards understanding and characterizing the syntactic complexity of search queries using a multi-pronged approach. We use traditional statistical language modeling techniques to quantify and compare the perplexity of queries with natural language (NL). We then use complex network analysis for a comparative analysis of the topological properties of queries issued by real Web users and those generated by statistical models. Finally, we conduct experiments to study whether search engine users are able to identify real queries, when presented along with model-generated ones. The three complementary studies show that the syntactic structure of Web queries is more complex than what n-grams can capture, but simpler than NL. Queries, thus, seem to represent an intermediate stage between syntactic and non-syntactic communication.

Source

Information processing and management. 52(2016) no.5, S.923-948

Alqaraleh, S.; Ramadan, O.; Salamah, M.: Efficient watcher based web crawler design (2015) 0.01

0.011412744 = product of:
  0.039944604 = sum of:
    0.025943318 = weight(_text_:management in 1627) [ClassicSimilarity], result of:
      0.025943318 = score(doc=1627,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.18620178 = fieldWeight in 1627, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1627)
    0.0140012875 = product of:
      0.028002575 = sum of:
        0.028002575 = weight(_text_:22 in 1627) [ClassicSimilarity], result of:
          0.028002575 = score(doc=1627,freq=2.0), product of:
            0.14475311 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.041336425 = queryNorm
            0.19345059 = fieldWeight in 1627, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1627)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)

Date: 20. 1.2015 18:30:22
Source: Aslib journal of information management. 67(2015) no.6, S.663-686

Sachse, J.: ¬The influence of snippet length on user behavior in mobile web search (2019) 0.01

0.011412744 = product of:
  0.039944604 = sum of:
    0.025943318 = weight(_text_:management in 5493) [ClassicSimilarity], result of:
      0.025943318 = score(doc=5493,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.18620178 = fieldWeight in 5493, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5493)
    0.0140012875 = product of:
      0.028002575 = sum of:
        0.028002575 = weight(_text_:22 in 5493) [ClassicSimilarity], result of:
          0.028002575 = score(doc=5493,freq=2.0), product of:
            0.14475311 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.041336425 = queryNorm
            0.19345059 = fieldWeight in 5493, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5493)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)

Date: 20. 1.2015 18:30:22
Source: Aslib journal of information management. 71(2019) no.3, S.325-343

Aloteibi, S.; Sanderson, M.: Analyzing geographic query reformulation : an exploratory study (2014) 0.01
```
0.011346022 = product of:
  0.079422146 = sum of:
    0.079422146 = sum of:
      0.051419575 = weight(_text_:studies in 1177) [ClassicSimilarity], result of:
        0.051419575 = score(doc=1177,freq=4.0), product of:
          0.16494368 = queryWeight, product of:
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.041336425 = queryNorm
          0.3117402 = fieldWeight in 1177, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1177)
      0.028002575 = weight(_text_:22 in 1177) [ClassicSimilarity], result of:
        0.028002575 = score(doc=1177,freq=2.0), product of:
          0.14475311 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.041336425 = queryNorm
          0.19345059 = fieldWeight in 1177, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1177)
  0.14285715 = coord(1/7)
```
Abstract

Search engine users typically engage in multiquery sessions in their quest to fulfill their information needs. Despite a plethora of research findings suggesting that a significant group of users look for information within a specific geographical scope, existing reformulation studies lack a focused analysis of how users reformulate geographic queries. This study comprehensively investigates the ways in which users reformulate such needs in an attempt to fill this gap in the literature. Reformulated sessions were sampled from a query log of a major search engine to extract 2,400 entries that were manually inspected to filter geo sessions. This filter identified 471 search sessions that included geographical intent, and these sessions were analyzed quantitatively and qualitatively. The results revealed that one in five of the users who reformulated their queries were looking for geographically related information. They reformulated their queries by changing the content of the query rather than the structure. Users were not following a unified sequence of modifications and instead performed a single reformulation action. However, in some cases it was possible to anticipate their next move. A number of tasks in geo modifications were identified, including standard, multi-needs, multi-places, and hybrid approaches. The research concludes that it is important to specialize query reformulation studies to focus on particular query types rather than generically analyzing them, as it is apparent that geographic queries have their special reformulation characteristics.

Date

26. 1.2014 18:48:22
Bilal, D.: Children's use of the Yahooligans! Web search engine : III. Cognitive and physical behaviors on fully self-generated search tasks (2002) 0.01
```
0.011033435 = product of:
  0.077234045 = sum of:
    0.077234045 = sum of:
      0.043630954 = weight(_text_:studies in 5228) [ClassicSimilarity], result of:
        0.043630954 = score(doc=5228,freq=2.0), product of:
          0.16494368 = queryWeight, product of:
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.041336425 = queryNorm
          0.26452032 = fieldWeight in 5228, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.046875 = fieldNorm(doc=5228)
      0.033603087 = weight(_text_:22 in 5228) [ClassicSimilarity], result of:
        0.033603087 = score(doc=5228,freq=2.0), product of:
          0.14475311 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.041336425 = queryNorm
          0.23214069 = fieldWeight in 5228, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=5228)
  0.14285715 = coord(1/7)
```
Abstract

Bilal, in this third part of her Yahooligans! study looks at children's performance with self-generated search tasks, as compared to previously assigned search tasks looking for differences in success, cognitive behavior, physical behavior, and task preference. Lotus ScreenCam was used to record interactions and post search interviews to record impressions. The subjects, the same 22 seventh grade children in the previous studies, generated topics of interest that were mediated with the researcher into more specific topics where necessary. Fifteen usable sessions form the basis of the study. Eleven children were successful in finding information, a rate of 73% compared to 69% in assigned research questions, and 50% in assigned fact-finding questions. Eighty-seven percent began using one or two keyword searches. Spelling was a problem. Successful children made fewer keyword searches and the number of search moves averaged 5.5 as compared to 2.4 on the research oriented task and 3.49 on the factual. Backtracking and looping were common. The self-generated task was preferred by 47% of the subjects.
Blake, P.: AltaVista and Notes for the web (1996) 0.01
```
0.010088513 = product of:
  0.07061958 = sum of:
    0.07061958 = weight(_text_:case in 4537) [ClassicSimilarity], result of:
      0.07061958 = score(doc=4537,freq=2.0), product of:
        0.18173204 = queryWeight, product of:
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.041336425 = queryNorm
        0.3885918 = fieldWeight in 4537, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.0625 = fieldNorm(doc=4537)
  0.14285715 = coord(1/7)
```
Footnote

Briefly reviews the AltaVista and Notes search software for searching the WWW. In the case of AltaVista, Digital claims that this web crawler has been crawling the WWW at the rate of 2,5 million pages per day and already accounts for the indexing of 16 million pages and 13.000 newsgroups. Suggests that AltaVista pulls of significantly more on obscure or specialist subjects than rivals like InfoSeek and Excite. concludes with details of IBM's development of the Lotus WWW searcher designed to cope with the increasing complexity of web applications

Taylor, M.: Using the Google search appliance for federated searching : a case study (2005) 0.01

0.010088513 = product of:
  0.07061958 = sum of:
    0.07061958 = weight(_text_:case in 355) [ClassicSimilarity], result of:
      0.07061958 = score(doc=355,freq=2.0), product of:
        0.18173204 = queryWeight, product of:
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.041336425 = queryNorm
        0.3885918 = fieldWeight in 355, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.3964143 = idf(docFreq=1480, maxDocs=44218)
          0.0625 = fieldNorm(doc=355)
  0.14285715 = coord(1/7)

Spink, A.; Jansen, B.J.; Blakely, C.; Koshman, S.: ¬A study of results overlap and uniqueness among major Web search engines (2006) 0.01
```
0.010085231 = product of:
  0.035298306 = sum of:
    0.020754656 = weight(_text_:management in 993) [ClassicSimilarity], result of:
      0.020754656 = score(doc=993,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.14896142 = fieldWeight in 993, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.03125 = fieldNorm(doc=993)
    0.014543652 = product of:
      0.029087303 = sum of:
        0.029087303 = weight(_text_:studies in 993) [ClassicSimilarity], result of:
          0.029087303 = score(doc=993,freq=2.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.17634688 = fieldWeight in 993, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.03125 = fieldNorm(doc=993)
      0.5 = coord(1/2)
  0.2857143 = coord(2/7)
```
Abstract

The performance and capabilities of Web search engines is an important and significant area of research. Millions of people world wide use Web search engines very day. This paper reports the results of a major study examining the overlap among results retrieved by multiple Web search engines for a large set of more than 10,000 queries. Previous smaller studies have discussed a lack of overlap in results returned by Web search engines for the same queries. The goal of the current study was to conduct a large-scale study to measure the overlap of search results on the first result page (both non-sponsored and sponsored) across the four most popular Web search engines, at specific points in time using a large number of queries. The Web search engines included in the study were MSN Search, Google, Yahoo! and Ask Jeeves. Our study then compares these results with the first page results retrieved for the same queries by the metasearch engine Dogpile.com. Two sets of randomly selected user-entered queries, one set was 10,316 queries and the other 12,570 queries, from Infospace's Dogpile.com search engine (the first set was from Dogpile, the second was from across the Infospace Network of search properties were submitted to the four single Web search engines). Findings show that the percent of total results unique to only one of the four Web search engines was 84.9%, shared by two of the three Web search engines was 11.4%, shared by three of the Web search engines was 2.6%, and shared by all four Web search engines was 1.1%. This small degree of overlap shows the significant difference in the way major Web search engines retrieve and rank results in response to given queries. Results point to the value of metasearch engines in Web retrieval to overcome the biases of individual search engines.

Source

Information processing and management. 42(2006) no.5, S.1379-1391
Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.01
```
0.00919453 = product of:
  0.06436171 = sum of:
    0.06436171 = sum of:
      0.03635913 = weight(_text_:studies in 1605) [ClassicSimilarity], result of:
        0.03635913 = score(doc=1605,freq=2.0), product of:
          0.16494368 = queryWeight, product of:
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.041336425 = queryNorm
          0.22043361 = fieldWeight in 1605, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1605)
      0.028002575 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
        0.028002575 = score(doc=1605,freq=2.0), product of:
          0.14475311 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.041336425 = queryNorm
          0.19345059 = fieldWeight in 1605, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1605)
  0.14285715 = coord(1/7)
```
Abstract

Numerous studies have explored the possibility of uncovering information from web search queries but few have examined the factors that affect web query data sources. We conducted a study that investigated this issue by comparing Google Trends and Baidu Index. Data from these two services are based on queries entered by users into Google and Baidu, two of the largest search engines in the world. We first compared the features and functions of the two services based on documents and extensive testing. We then carried out an empirical study that collected query volume data from the two sources. We found that data from both sources could be used to predict the quality of Chinese universities and companies. Despite the differences between the two services in terms of technology, such as differing methods of language processing, the search volume data from the two were highly correlated and combining the two data sources did not improve the predictive power of the data. However, there was a major difference between the two in terms of data availability. Baidu Index was able to provide more search volume data than Google Trends did. Our analysis showed that the disadvantage of Google Trends in this regard was due to Google's smaller user base in China. The implication of this finding goes beyond China. Google's user bases in many countries are smaller than that in China, so the search volume data related to those countries could result in the same issue as that related to China.

Source

Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
Gossen, T.: Search engines for children : search user interfaces and information-seeking behaviour (2016) 0.01
```
0.009097843 = product of:
  0.0636849 = sum of:
    0.0636849 = sum of:
      0.0440831 = weight(_text_:studies in 2752) [ClassicSimilarity], result of:
        0.0440831 = score(doc=2752,freq=6.0), product of:
          0.16494368 = queryWeight, product of:
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.041336425 = queryNorm
          0.26726153 = fieldWeight in 2752, product of:
            2.4494898 = tf(freq=6.0), with freq of:
              6.0 = termFreq=6.0
            3.9902744 = idf(docFreq=2222, maxDocs=44218)
            0.02734375 = fieldNorm(doc=2752)
      0.019601801 = weight(_text_:22 in 2752) [ClassicSimilarity], result of:
        0.019601801 = score(doc=2752,freq=2.0), product of:
          0.14475311 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.041336425 = queryNorm
          0.1354154 = fieldWeight in 2752, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.02734375 = fieldNorm(doc=2752)
  0.14285715 = coord(1/7)
```
Abstract

The doctoral thesis of Tatiana Gossen formulates criteria and guidelines on how to design the user interfaces of search engines for children. In her work, the author identifies the conceptual challenges based on own and previous user studies and addresses the changing characteristics of the users by providing a means of adaptation. Additionally, a novel type of search result visualisation for children with cartoon style characters is developed taking children's preference for visual information into account.

Content

Inhalt: Acknowledgments; Abstract; Zusammenfassung; Contents; List of Figures; List of Tables; List of Acronyms; Chapter 1 Introduction ; 1.1 Research Questions; 1.2 Thesis Outline; Part I Fundamentals ; Chapter 2 Information Retrieval for Young Users ; 2.1 Basics of Information Retrieval; 2.1.1 Architecture of an IR System; 2.1.2 Relevance Ranking; 2.1.3 Search User Interfaces; 2.1.4 Targeted Search Engines; 2.2 Aspects of Child Development Relevant for Information Retrieval Tasks; 2.2.1 Human Cognitive Development; 2.2.2 Information Processing Theory; 2.2.3 Psychosocial Development 2.3 User Studies and Evaluation2.3.1 Methods in User Studies; 2.3.2 Types of Evaluation; 2.3.3 Evaluation with Children; 2.4 Discussion; Chapter 3 State of the Art ; 3.1 Children's Information-Seeking Behaviour; 3.1.1 Querying Behaviour; 3.1.2 Search Strategy; 3.1.3 Navigation Style; 3.1.4 User Interface; 3.1.5 Relevance Judgement; 3.2 Existing Algorithms and User Interface Concepts for Children; 3.2.1 Query; 3.2.2 Content; 3.2.3 Ranking; 3.2.4 Search Result Visualisation; 3.3 Existing Information Retrieval Systems for Children; 3.3.1 Digital Book Libraries; 3.3.2 Web Search Engines 3.4 Summary and DiscussionPart II Studying Open Issues ; Chapter 4 Usability of Existing Search Engines for Young Users ; 4.1 Assessment Criteria; 4.1.1 Criteria for Matching the Motor Skills; 4.1.2 Criteria for Matching the Cognitive Skills; 4.2 Results; 4.2.1 Conformance with Motor Skills; 4.2.2 Conformance with the Cognitive Skills; 4.2.3 Presentation of Search Results; 4.2.4 Browsing versus Searching; 4.2.5 Navigational Style; 4.3 Summary and Discussion; Chapter 5 Large-scale Analysis of Children's Queries and Search Interactions; 5.1 Dataset; 5.2 Results; 5.3 Summary and Discussion Chapter 6 Differences in Usability and Perception of Targeted Web Search Engines between Children and Adults 6.1 Related Work; 6.2 User Study; 6.3 Study Results; 6.4 Summary and Discussion; Part III Tackling the Challenges ; Chapter 7 Search User Interface Design for Children ; 7.1 Conceptual Challenges and Possible Solutions; 7.2 Knowledge Journey Design; 7.3 Evaluation; 7.3.1 Study Design; 7.3.2 Study Results; 7.4 Voice-Controlled Search: Initial Study; 7.4.1 User Study; 7.5 Summary and Discussion; Chapter 8 Addressing User Diversity ; 8.1 Evolving Search User Interface 8.1.1 Mapping Function8.1.2 Evolving Skills; 8.1.3 Detection of User Abilities; 8.1.4 Design Concepts; 8.2 Adaptation of a Search User Interface towards User Needs; 8.2.1 Design & Implementation; 8.2.2 Search Input; 8.2.3 Result Output; 8.2.4 General Properties; 8.2.5 Configuration and Further Details; 8.3 Evaluation; 8.3.1 Study Design; 8.3.2 Study Results; 8.3.3 Preferred UI Settings; 8.3.4 User satisfaction; 8.4 Knowledge Journey Exhibit; 8.4.1 Hardware; 8.4.2 Frontend; 8.4.3 Backend; 8.5 Summary and Discussion; Chapter 9 Supporting Visual Searchers in Processing Search Results 9.1 Related Work

Date

1. 2.2016 18:25:22
Jansen, B.J.; Pooch , U.: ¬A review of Web searching studies and a framework for future research (2001) 0.01
```
0.008906132 = product of:
  0.06234292 = sum of:
    0.06234292 = product of:
      0.12468584 = sum of:
        0.12468584 = weight(_text_:studies in 5186) [ClassicSimilarity], result of:
          0.12468584 = score(doc=5186,freq=12.0), product of:
            0.16494368 = queryWeight, product of:
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.041336425 = queryNorm
            0.75592977 = fieldWeight in 5186, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.9902744 = idf(docFreq=2222, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5186)
      0.5 = coord(1/2)
  0.14285715 = coord(1/7)
```
Abstract

Jansen and Pooch review three major search engine studies and compare them to three traditional search system studies and three OPAC search studies, to determine if user search characteristics differ. The web search engine studies indicate that most searchers use two, two search term queries per session, no boolean operators, and look only at the top ten items returned, while reporting the location of relevant information. In traditional search systems we find seven to 16 queries of six to nine terms, while about ten documents per session were viewed. The OPAC studies indicated two to five queries per session of two or less terms, with Boolean search about 1% and less than 50 documents viewed.

Dempsey, B.J.: Design and empirical evaluation of search software for legal professionals on the WWW (2000) 0.01

0.008894852 = product of:
  0.062263966 = sum of:
    0.062263966 = weight(_text_:management in 6274) [ClassicSimilarity], result of:
      0.062263966 = score(doc=6274,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.44688427 = fieldWeight in 6274, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.09375 = fieldNorm(doc=6274)
  0.14285715 = coord(1/7)

Source: Information processing and management. 36(2000) no.2, S.253-273

Web work : Information seeking and knowledge work on the World Wide Web (2000) 0.01

0.008894852 = product of:
  0.062263966 = sum of:
    0.062263966 = weight(_text_:management in 1190) [ClassicSimilarity], result of:
      0.062263966 = score(doc=1190,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.44688427 = fieldWeight in 1190, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.09375 = fieldNorm(doc=1190)
  0.14285715 = coord(1/7)

Series: Information science and knowledge management; vol.1

Gordon, M.; Pathak, P.: Finding information on the World Wide Web : the retrieval effectiveness of search engines. (1999) 0.01

0.008894852 = product of:
  0.062263966 = sum of:
    0.062263966 = weight(_text_:management in 3941) [ClassicSimilarity], result of:
      0.062263966 = score(doc=3941,freq=2.0), product of:
        0.13932906 = queryWeight, product of:
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.041336425 = queryNorm
        0.44688427 = fieldWeight in 3941, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.3706124 = idf(docFreq=4130, maxDocs=44218)
          0.09375 = fieldNorm(doc=3941)
  0.14285715 = coord(1/7)

Source: Information processing and management. 35(1999) no.2, S.141-180

Search (191 results, page 2 of 10)

Authors

Years

Types

Themes

Subjects

Classifications