Search (79 results, page 1 of 4)

Mohan, K.C.: Boolean and nearest neighbour text searching in a multi-strategy retrieval system (1996) 0.11

```
0.11417843 = product of:
  0.22835685 = sum of:
    0.17447734 = weight(_text_:vector in 7255) [ClassicSimilarity], result of:
      0.17447734 = score(doc=7255,freq=2.0), product of:
        0.30654848 = queryWeight, product of:
          6.439392 = idf(docFreq=191, maxDocs=44218)
          0.047605187 = queryNorm
        0.5691672 = fieldWeight in 7255, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.439392 = idf(docFreq=191, maxDocs=44218)
          0.0625 = fieldNorm(doc=7255)
    0.053879518 = product of:
      0.107759036 = sum of:
        0.107759036 = weight(_text_:model in 7255) [ClassicSimilarity], result of:
          0.107759036 = score(doc=7255,freq=6.0), product of:
            0.1830527 = queryWeight, product of:
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.047605187 = queryNorm
            0.58867764 = fieldWeight in 7255, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.0625 = fieldNorm(doc=7255)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```

Abstract

Information retrieval systems based on the Boolean model have been popular for some time. A major challenge to this model has come from the development of approaches based on the vector processing model. Both search strategies are explained and evaluated. Describes an experimental study in an operational environment to compare the retrieval effectiveness of Boolean and nearest neighbour searching in a multi-strategy retrieval system based on query characteristic variables. Considers the significance of the results of the study.
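
The score explanation for this first hit can be reproduced by hand. The sketch below is illustrative only: it assumes the classic Lucene TF-IDF definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))), which agree with the figures printed in the explanation. The `queryNorm` value is copied from the listing rather than derived, and the function names are hypothetical.

```python
import math

MAX_DOCS = 44218          # maxDocs, as printed in the explanation
QUERY_NORM = 0.047605187  # copied from the listing, not derived here


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed classic formula; it matches idf=6.439392 for docFreq=191.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    tf = math.sqrt(freq)                          # tf(freq)
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm          # queryWeight
    field_weight = tf * term_idf * field_norm     # fieldWeight
    return query_weight * field_weight


# Term statistics for doc 7255, read off the explanation tree.
vector = term_score(freq=2.0, doc_freq=191, field_norm=0.0625)
model = term_score(freq=6.0, doc_freq=2569, field_norm=0.0625) * 0.5  # inner coord(1/2)
total = (vector + model) * 0.5                                        # outer coord(2/4)

print(f"vector={vector:.6f} model={model:.6f} total={total:.6f}")
```

The result agrees with the listed 0.11417843 to within rounding; the engine works in single-precision floats, so the last digits can differ slightly.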

Sachse, J.: The influence of snippet length on user behavior in mobile web search (2019) 0.06
```
0.05870133 = product of:
  0.11740266 = sum of:
    0.101278044 = weight(_text_:space in 5493) [ClassicSimilarity], result of:
      0.101278044 = score(doc=5493,freq=4.0), product of:
        0.24842183 = queryWeight, product of:
          5.2183776 = idf(docFreq=650, maxDocs=44218)
          0.047605187 = queryNorm
        0.40768576 = fieldWeight in 5493, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.2183776 = idf(docFreq=650, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5493)
    0.016124614 = product of:
      0.032249227 = sum of:
        0.032249227 = weight(_text_:22 in 5493) [ClassicSimilarity], result of:
          0.032249227 = score(doc=5493,freq=2.0), product of:
            0.16670525 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.047605187 = queryNorm
            0.19345059 = fieldWeight in 5493, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5493)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
Abstract

Purpose - Web search is more and more moving into mobile contexts. However, screen size of mobile devices is limited and search engine result pages face a trade-off between offering informative snippets and optimal use of space. One factor clearly influencing this trade-off is snippet length. The purpose of this paper is to find out what snippet size to use in mobile web search. Design/methodology/approach - For this purpose, an eye-tracking experiment was conducted showing participants search interfaces with snippets of one, three or five lines on a mobile device to analyze 17 dependent variables. In total, 31 participants took part in the study. Each of the participants solved informational and navigational tasks. Findings - Results indicate a strong influence of page fold on scrolling behavior and attention distribution across search results. Regardless of query type, short snippets seem to provide too little information about the result, so that search performance and subjective measures are negatively affected. Long snippets of five lines lead to better performance than medium snippets for navigational queries, but to worse performance for informational queries. Originality/value - Although space in mobile search is limited, this study shows that longer snippets improve usability and user experience. It further emphasizes that page fold plays a stronger role in mobile than in desktop search for attention distribution.

Date

20. 1.2015 18:30:22
Lin, S.-j.; Belkin, N.: Validation of a model of information seeking over multiple search sessions (2005) 0.04
```
0.038248695 = product of:
  0.15299478 = sum of:
    0.15299478 = sum of:
      0.11429571 = weight(_text_:model in 3450) [ClassicSimilarity], result of:
        0.11429571 = score(doc=3450,freq=12.0), product of:
          0.1830527 = queryWeight, product of:
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.047605187 = queryNorm
          0.6243869 = fieldWeight in 3450, product of:
            3.4641016 = tf(freq=12.0), with freq of:
              12.0 = termFreq=12.0
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.046875 = fieldNorm(doc=3450)
      0.03869907 = weight(_text_:22 in 3450) [ClassicSimilarity], result of:
        0.03869907 = score(doc=3450,freq=2.0), product of:
          0.16670525 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.047605187 = queryNorm
          0.23214069 = fieldWeight in 3450, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=3450)
  0.25 = coord(1/4)
```
Abstract

Most information systems share a common assumption: information seeking is discrete. Such an assumption neither reflects real-life information seeking processes nor conforms to the perspective of phenomenology, "life is a journey constituted by continuous acquisition of knowledge." Thus, this study develops and validates a theoretical model that explains successive search experience for essentially the same information problem. The proposed model is called Multiple Information Seeking Episodes (MISE), which consists of four dimensions: problematic situation, information problem, information seeking process, episodes. Eight modes of multiple information seeking episodes are identified and specified with properties of the four dimensions of MISE. The results partially validate MISE by finding that the original MISE model is highly accurate, but less sufficient in characterizing successive searches; all factors in the MISE model are empirically confirmed, but new factors are identified as well. The revised MISE model is shifted from the user-centered to the interaction-centered perspective, taking into account factors of searcher, system, search activity, search context, information attainment, and information use activities.

Date

10. 4.2005 14:52:22
Toms, E.G.: What motivates the browser? (1999) 0.04
```
0.035095602 = product of:
  0.070191205 = sum of:
    0.05729151 = weight(_text_:space in 292) [ClassicSimilarity], result of:
      0.05729151 = score(doc=292,freq=2.0), product of:
        0.24842183 = queryWeight, product of:
          5.2183776 = idf(docFreq=650, maxDocs=44218)
          0.047605187 = queryNorm
        0.23062189 = fieldWeight in 292, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.2183776 = idf(docFreq=650, maxDocs=44218)
          0.03125 = fieldNorm(doc=292)
    0.012899691 = product of:
      0.025799382 = sum of:
        0.025799382 = weight(_text_:22 in 292) [ClassicSimilarity], result of:
          0.025799382 = score(doc=292,freq=2.0), product of:
            0.16670525 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.047605187 = queryNorm
            0.15476047 = fieldWeight in 292, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=292)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
Abstract

Browsing is considered to be unstructured and human-driven, although not a cognitively intensive process. It is conducted using systems that facilitate considerable user-system interactivity. Cued by the content, people immerse themselves in a topic of interest and meander from topic to topic while concurrently recognising interesting and informative information en route. They seem to seek and gather information in a purposeless, illogical and indiscriminate manner. Typical examples of these ostensibly random acts are scanning a non-fiction book, examining the morning newspaper, perusing the contents of a business report and scavenging the World Wide Web. Often the result is the acquisition of new information, the rejection or confirmation of an idea, or the genesis of new, perhaps not-wholly-formed thoughts about a topic. Noteworthy about this approach is that people explore information without having consciously structured queries or explicit goals. This form of passive information interaction behaviour is defined as acquiring and gathering information while scanning an information space without a specific goal in mind (Waterworth & Chignell, 1991; Toms, 1997), and for the purposes of this study, is called browsing. Traditionally, browsing is thought of in two ways: as a physical process - the action taken when one scans a list, a document, or a set of linked information nodes (e.g., Fox & Palay, 1979; Thompson & Croft, 1989; Ellis, 1989), and as a conceptual process, information seeking when the goal is ill-defined (e.g., Cove & Walsh, 1987). Browsing is also combined with searching in an integrated information-seeking process for retrieving information (e.g., Ellis, 1989; Belkin, Marchetti & Cool, 1993; Marchionini, 1995; Chang, 1995). Each of these cases focuses primarily on seeking information when the objective ranges from fuzzy to explicit.

Date

22. 3.2002 9:44:47
Ren, P.; Chen, Z.; Ma, J.; Zhang, Z.; Si, L.; Wang, S.: Detecting temporal patterns of user queries (2017) 0.03
```
0.0327145 = product of:
  0.130858 = sum of:
    0.130858 = weight(_text_:vector in 3315) [ClassicSimilarity], result of:
      0.130858 = score(doc=3315,freq=2.0), product of:
        0.30654848 = queryWeight, product of:
          6.439392 = idf(docFreq=191, maxDocs=44218)
          0.047605187 = queryNorm
        0.4268754 = fieldWeight in 3315, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.439392 = idf(docFreq=191, maxDocs=44218)
          0.046875 = fieldNorm(doc=3315)
  0.25 = coord(1/4)
```
Abstract

Query classification is an important part of exploring the characteristics of web queries. Existing studies are mainly based on Broder's classification scheme and classify user queries into navigational, informational, and transactional categories according to users' information needs. In this article, we present a novel classification scheme from the perspective of queries' temporal patterns. Queries' temporal patterns are inherent time series patterns of the search volumes of queries that reflect the evolution of the popularity of a query over time. By analyzing the temporal patterns of queries, search engines can more deeply understand the users' search intents and thus improve performance. Furthermore, we extract three groups of features based on the queries' search volume time series and use a support vector machine (SVM) to automatically detect the temporal patterns of user queries. Extensive experiments on the Million Query Track data sets of the Text REtrieval Conference (TREC) demonstrate the effectiveness of our approach.
Mansourian, Y.: Contextual elements and conceptual components of information visibility on the web (2008) 0.02
```
0.024899656 = product of:
  0.099598624 = sum of:
    0.099598624 = sum of:
      0.0673494 = weight(_text_:model in 2603) [ClassicSimilarity], result of:
        0.0673494 = score(doc=2603,freq=6.0), product of:
          0.1830527 = queryWeight, product of:
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.047605187 = queryNorm
          0.36792353 = fieldWeight in 2603, product of:
            2.4494898 = tf(freq=6.0), with freq of:
              6.0 = termFreq=6.0
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.0390625 = fieldNorm(doc=2603)
      0.032249227 = weight(_text_:22 in 2603) [ClassicSimilarity], result of:
        0.032249227 = score(doc=2603,freq=2.0), product of:
          0.16670525 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.047605187 = queryNorm
          0.19345059 = fieldWeight in 2603, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=2603)
  0.25 = coord(1/4)
```
Abstract

Purpose - This paper aims to report the result of follow-up research on end-users' conceptions of information visibility on the web and their conceptualizations of success and failure in web searching. Design/methodology/approach - The data were collected by a questionnaire followed by a brief interview with the participants. The questionnaire was developed based on the information visibility model suggested by the author in the original study. Fifty-two library and information sciences students from Tarbiat Mollem University (TMU) and Iran University of Medical Sciences (IUMS) in Tehran took part in the study. Findings - The model of information visibility can enable web users to gain a better understanding of their information seeking (IS) outcomes and it can assist them to improve their information literacy skills. The model can provide a theoretical framework to investigate web users' IS behavior and can be used as a diagnostic tool to explore the contextual and conceptual elements affecting the visibility of information for end-users. Research limitations/implications - The paper suggests a visibility learning diary (VLD), which might be useful to measure the efficiency of information literacy training courses. Originality/value - The contextual and conceptual approach of the paper provides a deeper insight into the issue of information visibility, which has received little attention by IS and information retrieval researchers until now.

Date

1. 1.2009 10:22:40
Hopkins, M.E.; Zavalina, O.L.: Evaluating physicians' serendipitous knowledge discovery in online discovery systems : a new approach (2019) 0.02
```
0.021809943 = product of:
  0.08723977 = sum of:
    0.08723977 = sum of:
      0.054990545 = weight(_text_:model in 5842) [ClassicSimilarity], result of:
        0.054990545 = score(doc=5842,freq=4.0), product of:
          0.1830527 = queryWeight, product of:
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.047605187 = queryNorm
          0.30040827 = fieldWeight in 5842, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.0390625 = fieldNorm(doc=5842)
      0.032249227 = weight(_text_:22 in 5842) [ClassicSimilarity], result of:
        0.032249227 = score(doc=5842,freq=2.0), product of:
          0.16670525 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.047605187 = queryNorm
          0.19345059 = fieldWeight in 5842, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=5842)
  0.25 = coord(1/4)
```
Abstract

Purpose - A new approach to investigate serendipitous knowledge discovery (SKD) of health information is developed and tested to evaluate the information flow-serendipitous knowledge discovery (IF-SKD) model. The purpose of this paper is to determine the degree to which IF-SKD reflects physicians' information behaviour in a clinical setting and explore how the information system, Spark, designed to support physicians' SKD, meets its goals. Design/methodology/approach - The proposed pre-experimental study design employs an adapted version of McCay-Peet's (2013) and McCay-Peet et al.'s (2015) serendipitous digital environment (SDE) questionnaire research tool to address the complexity associated with defining the way in which SKD is understood and applied in system design. To test the IF-SKD model, a new data analysis approach combining confirmatory factor analysis, data imputation and Monte Carlo simulations was developed. Findings - The piloting of the proposed novel analysis approach demonstrated that small sample information behaviour survey data can be meaningfully examined using a confirmatory factor analysis technique. Research limitations/implications - This method makes it possible to improve the reliability of SKD measurement and the generalisability of findings. Originality/value - This paper makes an original contribution to developing and refining methods and tools of research into information-system-supported serendipitous discovery of information by health providers.

Date

20. 1.2015 18:30:22

Lee, S.-S.; Theng, Y.-L.; Goh, D.H.-L.: Creative information seeking : part II: empirical verification (2007) 0.02

```
0.021340024 = product of:
  0.085360095 = sum of:
    0.085360095 = sum of:
      0.046661027 = weight(_text_:model in 813) [ClassicSimilarity], result of:
        0.046661027 = score(doc=813,freq=2.0), product of:
          0.1830527 = queryWeight, product of:
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.047605187 = queryNorm
          0.25490487 = fieldWeight in 813, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.046875 = fieldNorm(doc=813)
      0.03869907 = weight(_text_:22 in 813) [ClassicSimilarity], result of:
        0.03869907 = score(doc=813,freq=2.0), product of:
          0.16670525 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.047605187 = queryNorm
          0.23214069 = fieldWeight in 813, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=813)
  0.25 = coord(1/4)
```

Abstract

Purpose - This is part II of on-going research, the purpose being to establish a creative information-seeking model. Design/methodology/approach - Two studies were conducted to examine the subjects' creative information seeking behaviours and the extent to which they exhibited the proposed stages in creative information seeking when accomplishing a directed and an open-ended information-seeking task respectively. Findings - Findings seemed to indicate that all the subjects underwent the proposed stages although they seemed to embrace characteristics of these stages in varying degrees. Findings also showed that if subjects performed the proposed stages more iteratively or non-sequentially, then a greater amount of creativity was needed to accomplish the information-seeking task. Originality/value - The paper offers a discussion on the relationships between creativity, complexity of tasks, and levels of expertise in domain knowledge.

Date

23.12.2007 12:22:16

Pontis, S.; Blandford, A.; Greifeneder, E.; Attalla, H.; Neal, D.: Keeping up to date : an academic researcher's information journey (2017) 0.02
```
0.017783355 = product of:
  0.07113342 = sum of:
    0.07113342 = sum of:
      0.03888419 = weight(_text_:model in 3340) [ClassicSimilarity], result of:
        0.03888419 = score(doc=3340,freq=2.0), product of:
          0.1830527 = queryWeight, product of:
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.047605187 = queryNorm
          0.21242073 = fieldWeight in 3340, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.0390625 = fieldNorm(doc=3340)
      0.032249227 = weight(_text_:22 in 3340) [ClassicSimilarity], result of:
        0.032249227 = score(doc=3340,freq=2.0), product of:
          0.16670525 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.047605187 = queryNorm
          0.19345059 = fieldWeight in 3340, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=3340)
  0.25 = coord(1/4)
```
Abstract

Keeping up to date with research developments is a central activity of academic researchers, but researchers face difficulties in managing the rapid growth of available scientific information. This study examined how researchers stay up to date, using the information journey model as a framework for analysis and investigating which dimensions influence information behaviors. We designed a 2-round study involving semistructured interviews and prototype testing with 61 researchers with 3 levels of seniority (PhD student to professor). Data were analyzed following a semistructured qualitative approach. Five key dimensions that influence information behaviors were identified: level of seniority, information sources, state of the project, level of familiarity, and how well defined the relevant community is. These dimensions are interrelated and their values determine the flow of the information journey. Across all levels of professional expertise, researchers used similar hard (formal) sources to access content, while soft (interpersonal) sources were used to filter information. An important "pain point" that future information tools should address is helping researchers filter information at the point of need.

Source

Journal of the Association for Information Science and Technology. 68(2017) no.1, S.22-35
Ennis, M.; Sutcliffe, A.G.; Watkinson, S.J.: Towards a predictive model of information seeking : empirical studies of end-user-searching (1999) 0.02
```
0.017447956 = product of:
  0.06979182 = sum of:
    0.06979182 = sum of:
      0.043992437 = weight(_text_:model in 296) [ClassicSimilarity], result of:
        0.043992437 = score(doc=296,freq=4.0), product of:
          0.1830527 = queryWeight, product of:
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.047605187 = queryNorm
          0.24032663 = fieldWeight in 296, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            3.845226 = idf(docFreq=2569, maxDocs=44218)
            0.03125 = fieldNorm(doc=296)
      0.025799382 = weight(_text_:22 in 296) [ClassicSimilarity], result of:
        0.025799382 = score(doc=296,freq=2.0), product of:
          0.16670525 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.047605187 = queryNorm
          0.15476047 = fieldWeight in 296, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.03125 = fieldNorm(doc=296)
  0.25 = coord(1/4)
```
Abstract

Previous empirical studies of searcher behaviour have drawn attention to a wide variety of factors that affect performance; for instance, the display of retrieved results can alter search strategies (Allen 1991, 1994), the information need type influences search behaviour (Elkerton et al 1984, Marchionini 1995); while the task complexity, reflected in the information need can affect user's search behaviour (Large et al 1994). Furthermore, information source selection (Bassilli 1977), and the user's model of the system and domain impact on the search process (Michel 1994); while motivation (Solomon 1993, Jacobsen et al 1992) and the importance of the information need (Wendt 1969) also influence search duration and the effort a user will employ. Rouse and Rouse (1984) in a review of empirical studies, summarise a wide variety of variables that can affect searching behaviour, including payoff, costs of searching, resources available, amount of information sought, characteristics of the data and conflicts between documents. It appears that user behaviour is inconsistent in the search strategies adopted even for the same search need and system (Davidson 1977, Iivonen 1995). Theories of searcher behaviour have been proposed that provide explanations of aspects of end-user behaviour, such as the evolution of the user's information need and the problems of articulating a query, [Bates (1979, 1989), Markey and Atherton 1978], effective search strategies in browsing and goal directed searches [Marchionini 1995, Belkin (1987, 1993)], the linguistic problem of matching search terms with indexing terms or content of target documents through an expert intermediary (Ingwersen 1982) or cognitive aspects of IR (Kuhlthau 1984, Ingwersen 1996).

Date

22. 3.2002 9:54:13

Kuhlthau, C.C.: Developing a model of the library search process : cognitive and affective aspects (1988) 0.02

```
0.015553676 = product of:
  0.062214702 = sum of:
    0.062214702 = product of:
      0.124429405 = sum of:
        0.124429405 = weight(_text_:model in 3966) [ClassicSimilarity], result of:
          0.124429405 = score(doc=3966,freq=2.0), product of:
            0.1830527 = queryWeight, product of:
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.047605187 = queryNorm
            0.6797463 = fieldWeight in 3966, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.125 = fieldNorm(doc=3966)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
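
The `fieldNorm` values in these explanations range from 0.03125 to 0.125, and this hit, which has no abstract, carries the largest one. A rough way to read them is sketched below. It assumes the default ClassicSimilarity length normalisation of roughly 1/sqrt(number of terms in the field) with no index-time boost, and it ignores the lossy single-byte encoding of the norm, so the lengths it prints are order-of-magnitude estimates only. The function name is hypothetical.

```python
def approx_field_length(field_norm):
    """Invert fieldNorm ~ 1/sqrt(num_terms); an estimate, not an exact term count."""
    return round(1.0 / field_norm ** 2)


# (doc id, fieldNorm) pairs copied from the explanations on this page.
norms = [
    (3966, 0.125),
    (7255, 0.0625),
    (6566, 0.0546875),
    (3450, 0.046875),
    (5493, 0.0390625),
    (292, 0.03125),
]

for doc_id, norm in norms:
    print(f"doc={doc_id:<5} fieldNorm={norm:<10} ~{approx_field_length(norm)} terms")
```

Under these assumptions a short record is penalised less per matching term, which is consistent with doc 3966 reaching a fieldWeight of 0.6797463 with only two occurrences of "model".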

Barrio, P.; Gravano, L.: Sampling strategies for information extraction over the deep web (2017) 0.01
```
0.014322878 = product of:
  0.05729151 = sum of:
    0.05729151 = weight(_text_:space in 3412) [ClassicSimilarity], result of:
      0.05729151 = score(doc=3412,freq=2.0), product of:
        0.24842183 = queryWeight, product of:
          5.2183776 = idf(docFreq=650, maxDocs=44218)
          0.047605187 = queryNorm
        0.23062189 = fieldWeight in 3412, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.2183776 = idf(docFreq=650, maxDocs=44218)
          0.03125 = fieldNorm(doc=3412)
  0.25 = coord(1/4)
```
Abstract

Information extraction systems discover structured information in natural language text. Having information in structured form enables much richer querying and data mining than possible over the natural language text. However, information extraction is a computationally expensive task, and hence improving the efficiency of the extraction process over large text collections is of critical interest. In this paper, we focus on an especially valuable family of text collections, namely, the so-called deep-web text collections, whose contents are not crawlable and are only available via querying. Important steps for efficient information extraction over deep-web text collections (e.g., selecting the collections on which to focus the extraction effort, based on their contents; or learning which documents within these collections-and in which order-to process, based on their words and phrases) require having a representative document sample from each collection. These document samples have to be collected by querying the deep-web text collections, an expensive process that renders impractical the existing sampling approaches developed for other data scenarios. In this paper, we systematically study the space of query-based document sampling techniques for information extraction over the deep web. Specifically, we consider (i) alternative query execution schedules, which vary on how they account for the query effectiveness, and (ii) alternative document retrieval and processing schedules, which vary on how they distribute the extraction effort over documents. We report the results of the first large-scale experimental evaluation of sampling techniques for information extraction over the deep web. Our results show the merits and limitations of the alternative query execution and document retrieval and processing strategies, and provide a roadmap for addressing this critically important building block for efficient, scalable information extraction.
Whitmire, E.: Disciplinary differences and undergraduates' information-seeking behavior (2002) 0.01
```
0.013609466 = product of:
  0.054437865 = sum of:
    0.054437865 = product of:
      0.10887573 = sum of:
        0.10887573 = weight(_text_:model in 6566) [ClassicSimilarity], result of:
          0.10887573 = score(doc=6566,freq=8.0), product of:
            0.1830527 = queryWeight, product of:
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.047605187 = queryNorm
            0.59477806 = fieldWeight in 6566, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6566)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

This study applied the Biglan model of disciplinary differences to the information-seeking behavior patterns of 5,175 undergraduates responding to questions on the College Student Experiences Questionnaire (CSEQ). The Biglan model categorizes academic disciplines along three dimensions: (1) hard-soft, (2) pure-applied, and (3) life-nonlife systems. Using t-tests, this model proved to be valid for distinguishing differences in undergraduates' information-seeking behavior patterns among various academic disciplines. The results indicate that the Biglan model has implications for the redesign of academic library services and use as a valid theoretical framework for future library and information science research.

Morse, P.M.: Search theory and browsing (1970) 0.01

```
0.012899691 = product of:
  0.051598765 = sum of:
    0.051598765 = product of:
      0.10319753 = sum of:
        0.10319753 = weight(_text_:22 in 1448) [ClassicSimilarity], result of:
          0.10319753 = score(doc=1448,freq=2.0), product of:
            0.16670525 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.047605187 = queryNorm
            0.61904186 = fieldWeight in 1448, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=1448)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```

Date

22. 5.2005 19:53:09

Lin, S.-j.: Internetworking of factors affecting successive searches over multiple episodes (2005) 0.01
```
0.011665257 = product of:
  0.046661027 = sum of:
    0.046661027 = product of:
      0.09332205 = sum of:
        0.09332205 = weight(_text_:model in 3330) [ClassicSimilarity], result of:
          0.09332205 = score(doc=3330,freq=8.0), product of:
            0.1830527 = queryWeight, product of:
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.047605187 = queryNorm
            0.50980973 = fieldWeight in 3330, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.046875 = fieldNorm(doc=3330)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
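The explain tree above can be reproduced by hand. The sketch below assumes the usual ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and plugs in the values listed for doc 3330. The function and variable names are illustrative and not part of the search system.

```python
import math

def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float) -> float:
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm                     # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight

# Values copied from the explain tree for doc 3330 (term "model").
weight = term_score(freq=8.0, doc_freq=2569, max_docs=44218,
                    query_norm=0.047605187, field_norm=0.046875)

# Inner coord(1/2) and outer coord(1/4) scale the single matching clause.
score = 0.25 * (0.5 * weight)
print(score)  # close to the listed 0.011665257, up to float rounding
```

The two coord factors explain why the displayed score is only one eighth of the term weight: one of two inner clauses and one of four outer clauses matched.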
Abstract

Successive information searches are fairly common. To enhance the understanding of this behavior, this study attempted to improve both the descriptive and explanatory power of the Multiple Information Seeking Episodes (MISE) model, a conceptual model characterizing factors affecting successive searches. It empirically observed how the key factors in the information seeking process in the MISE model evolve over multiple search sessions and explained how those factors are affected by other factors associated with searchers, search activity, search context, systems, information attainment, and information-use activities. The validated and enriched MISE model can be extended to serve as the basis for future studies of other complex search processes, such as multi-tasking and collaborative searches, and can also help identify problems that users face and thus derive requirements for system support.

Spink, A.; Wilson, T.D.; Ford, N.; Foster, A.; Ellis, D.: Information seeking and mediated searching : Part 1: theoretical framework and research design (2002) 0.01
```
0.011665257 = product of:
  0.046661027 = sum of:
    0.046661027 = product of:
      0.09332205 = sum of:
        0.09332205 = weight(_text_:model in 5240) [ClassicSimilarity], result of:
          0.09332205 = score(doc=5240,freq=8.0), product of:
            0.1830527 = queryWeight, product of:
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.047605187 = queryNorm
            0.50980973 = fieldWeight in 5240, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.046875 = fieldNorm(doc=5240)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

In this issue we begin with the first of four parts of a five part series of papers by Spink, Wilson, Ford, Foster, and Ellis. Spink, et alia, in the first section of this report set forth the design of a project to test whether existing models of the information search process are appropriate for an environment of mediated successive searching, which they believe characterizes much information seeking behavior. Their goal is to develop an integrated model of the process. Data were collected from 198 individuals with real information needs, 87 in Texas and 111 in Sheffield in the U.K., as they interacted with operational information retrieval systems. The data comprise transaction logs, recordings of interactions with intermediaries, pre- and post-search interviews, questionnaire responses, relevance judgments of retrieved text, and responses to a test of cognitive styles. Questionnaires were based upon the Kuhlthau model, the Saracevic model, and the Ellis model, and incorporated a visual analog scale to avoid a consistency bias.

Morse, P.M.: Browsing and search theory (1973) 0.01

0.011287229 = product of:
  0.045148917 = sum of:
    0.045148917 = product of:
      0.09029783 = sum of:
        0.09029783 = weight(_text_:22 in 3339) [ClassicSimilarity], result of:
          0.09029783 = score(doc=3339,freq=2.0), product of:
            0.16670525 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.047605187 = queryNorm
            0.5416616 = fieldWeight in 3339, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=3339)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Date: 22. 5.2005 19:52:29

Branch, J.L.: Investigating the information-seeking process of adolescents : the value of using think alouds and think afters (2000) 0.01

0.011287229 = product of:
  0.045148917 = sum of:
    0.045148917 = product of:
      0.09029783 = sum of:
        0.09029783 = weight(_text_:22 in 3924) [ClassicSimilarity], result of:
          0.09029783 = score(doc=3924,freq=2.0), product of:
            0.16670525 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.047605187 = queryNorm
            0.5416616 = fieldWeight in 3924, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=3924)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Source: Library and information science research. 22(2000) no.4, S.371-382

Spink, A.: Towards a theoretical framework for information retrieval in an information seeking context (1999) 0.01
```
0.010868462 = product of:
  0.043473847 = sum of:
    0.043473847 = product of:
      0.086947694 = sum of:
        0.086947694 = weight(_text_:model in 300) [ClassicSimilarity], result of:
          0.086947694 = score(doc=300,freq=10.0), product of:
            0.1830527 = queryWeight, product of:
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.047605187 = queryNorm
            0.4749872 = fieldWeight in 300, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.0390625 = fieldNorm(doc=300)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
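Doc 300 matches "model" ten times yet scores below doc 3330, which matches it eight times. The small sketch below recomputes the two fieldWeight values from the explain trees to show that the smaller fieldNorm outweighs the higher term frequency. Reading fieldNorm as a length normalization (longer fields get smaller norms) is an assumption based on how ClassicSimilarity generally works, not something the listing states, and the helper name is hypothetical.

```python
import math

def field_weight(freq: float, idf: float, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    return math.sqrt(freq) * idf * field_norm

IDF_MODEL = 3.845226  # idf for the term "model", copied from the listing

fw_doc_3330 = field_weight(8.0, IDF_MODEL, 0.046875)    # freq=8, larger norm
fw_doc_300 = field_weight(10.0, IDF_MODEL, 0.0390625)   # freq=10, smaller norm

# tf grows only with the square root of freq, so the norm difference dominates.
print(fw_doc_3330, fw_doc_300, fw_doc_3330 > fw_doc_300)
```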
Abstract

This paper presents the initial stages of the development of a three-dimensional model as a theoretical framework for conceptualizing and exploring interactive information retrieval (IR) within an information seeking context. The model, displayed in Figure 1, includes a Plane of Judgment within a Plane of Interaction within a Plane of Time. The Plane of Judgment includes levels and regions of relevance judgments, and other user judgments during interactive IR, e.g., magnitude or strategy feedback, tactics, search strategies, or search terms. The Plane of Judgment exists within a Plane of Interaction. The Plane of Interaction consists of interactive IR models, including Ingwersen (1992, 1996), Belkin, Cool, Stein and Thiel (1995), and Saracevic (1996b, 1997). The Plane of Interaction includes movement or shifts within interactions or search episodes, e.g., tactics, information problem, strategies, terms, feedback, goal states, or uncertainty. IR interactions that occur within a Plane of Interaction exist within a Plane of Time. The Plane of Time includes users' information seeking stages, represented in the model by Kuhlthau's Information Search Process Model (1993), and users' successive searches over time related to the same or evolving information problem (Spink, 1996). The three-dimensional model is a framework for the development of theoretical and empirical research to: (1) integrate interactive IR research within an information-seeking context; (2) explore users' interactive IR episodes within their changing information-seeking contexts; (3) examine relevance judgments within users' information seeking processes; and (4) broaden relevance research to include the concurrent exploration of relevance judgment level, region and time.

Kuhlthau, C.C.: Investigating patterns in information seeking : concepts in context (1999) 0.01
```
0.010868462 = product of:
  0.043473847 = sum of:
    0.043473847 = product of:
      0.086947694 = sum of:
        0.086947694 = weight(_text_:model in 301) [ClassicSimilarity], result of:
          0.086947694 = score(doc=301,freq=10.0), product of:
            0.1830527 = queryWeight, product of:
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.047605187 = queryNorm
            0.4749872 = fieldWeight in 301, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.845226 = idf(docFreq=2569, maxDocs=44218)
              0.0390625 = fieldNorm(doc=301)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
