Search (36 results, page 1 of 2)

Cool, C.; Spink, A.: Issues of context in information retrieval (IR) : an introduction to the special issue (2002) 0.02

0.024061583 = product of:
  0.09624633 = sum of:
    0.07648118 = weight(_text_:great in 2587) [ClassicSimilarity], result of:
      0.07648118 = score(doc=2587,freq=2.0), product of:
        0.20489426 = queryWeight, product of:
          5.6307793 = idf(docFreq=430, maxDocs=44218)
          0.03638826 = queryNorm
        0.37327147 = fieldWeight in 2587, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.6307793 = idf(docFreq=430, maxDocs=44218)
          0.046875 = fieldNorm(doc=2587)
    0.01976515 = product of:
      0.0395303 = sum of:
        0.0395303 = weight(_text_:processing in 2587) [ClassicSimilarity], result of:
          0.0395303 = score(doc=2587,freq=2.0), product of:
            0.14730503 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03638826 = queryNorm
            0.26835677 = fieldWeight in 2587, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.046875 = fieldNorm(doc=2587)
      0.5 = coord(1/2)
  0.25 = coord(2/8)

Abstract: The subject of context has received a great deal of attention in the information retrieval (IR) literature over the past decade, primarily in studies of information seeking and IR interactions. Recently, attention to context in IR has expanded to address new problems in new environments. In this paper we outline five overlapping dimensions of context which we believe to be important constituent elements and we discuss how they are related to different issues in IR research. The papers in this special issue are summarized with respect to how they represent work that is being conducted within these dimensions of context. We conclude with future areas of research which are needed in order to fully understand the multidimensional nature of context in IR.
Source: Information processing and management. 38(2002) no.5, S.605-611

Kelly, D.: Measuring online information seeking context : Part 1: background and method (2006) 0.02
```
0.020958323 = product of:
  0.08383329 = sum of:
    0.063734315 = weight(_text_:great in 206) [ClassicSimilarity], result of:
      0.063734315 = score(doc=206,freq=2.0), product of:
        0.20489426 = queryWeight, product of:
          5.6307793 = idf(docFreq=430, maxDocs=44218)
          0.03638826 = queryNorm
        0.31105953 = fieldWeight in 206, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.6307793 = idf(docFreq=430, maxDocs=44218)
          0.0390625 = fieldNorm(doc=206)
    0.020098975 = weight(_text_:data in 206) [ClassicSimilarity], result of:
      0.020098975 = score(doc=206,freq=2.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.17468026 = fieldWeight in 206, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=206)
  0.25 = coord(2/8)
```
Abstract

Context is one of the most important concepts in information seeking and retrieval research. However, the challenges of studying context are great; thus, it is more common for researchers to use context as a post hoc explanatory factor, rather than as a concept that drives inquiry. The purposes of this study were to develop a method for collecting data about information seeking context in natural online environments, and identify which aspects of context should be considered when studying online information seeking. The study is reported in two parts. In this, the first part, the background and method are presented. Results and implications of this research are presented in Part 2 (Kelly, in press). Part 1 discusses previous literature on information seeking context and behavior and situates the current work within this literature. This part further describes the naturalistic, longitudinal research design that was used to examine and measure the online information seeking contexts of users during a 14-week period. In this design, information seeking context was characterized by a user's self-identified tasks and topics, and several attributes of these, such as the length of time the user expected to work on a task and the user's familiarity with a topic. At weekly intervals, users evaluated the usefulness of the documents that they viewed, and classified these documents according to their tasks and topics. At the end of the study, users provided feedback about the study method.
Kelly, D.: Measuring online information seeking context : Part 2: Findings and discussion (2006) 0.02
```
0.020958323 = product of:
  0.08383329 = sum of:
    0.063734315 = weight(_text_:great in 215) [ClassicSimilarity], result of:
      0.063734315 = score(doc=215,freq=2.0), product of:
        0.20489426 = queryWeight, product of:
          5.6307793 = idf(docFreq=430, maxDocs=44218)
          0.03638826 = queryNorm
        0.31105953 = fieldWeight in 215, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.6307793 = idf(docFreq=430, maxDocs=44218)
          0.0390625 = fieldNorm(doc=215)
    0.020098975 = weight(_text_:data in 215) [ClassicSimilarity], result of:
      0.020098975 = score(doc=215,freq=2.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.17468026 = fieldWeight in 215, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=215)
  0.25 = coord(2/8)
```
Abstract

Context is one of the most important concepts in information seeking and retrieval research. However, the challenges of studying context are great; thus, it is more common for researchers to use context as a post hoc explanatory factor, rather than as a concept that drives inquiry. The purpose of this study was to develop a method for collecting data about information seeking context in natural online environments, and identify which aspects of context should be considered when studying online information seeking. The study is reported in two parts. In this, the second part, results and implications of this research are presented. Part 1 (Kelly, 2006) discussed previous literature on information seeking context and behavior, situated the current study within this literature, and described the naturalistic, longitudinal research design that was used to examine and measure the online information seeking context of seven users during a 14-week period. Results provide support for the value of the method in studying online information seeking context, the relative importance of various measures of context, how these measures change over time, and, finally, the relationship between these measures. In particular, results demonstrate significant differences in distributions of usefulness ratings according to task and topic.
Bilal, D.; Kirby, J.: Differences and similarities in information seeking : children and adults as Web users (2002) 0.01
```
0.014386982 = product of:
  0.057547927 = sum of:
    0.044371158 = weight(_text_:higher in 2591) [ClassicSimilarity], result of:
      0.044371158 = score(doc=2591,freq=2.0), product of:
        0.19113865 = queryWeight, product of:
          5.252756 = idf(docFreq=628, maxDocs=44218)
          0.03638826 = queryNorm
        0.23214121 = fieldWeight in 2591, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.252756 = idf(docFreq=628, maxDocs=44218)
          0.03125 = fieldNorm(doc=2591)
    0.013176767 = product of:
      0.026353534 = sum of:
        0.026353534 = weight(_text_:processing in 2591) [ClassicSimilarity], result of:
          0.026353534 = score(doc=2591,freq=2.0), product of:
            0.14730503 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03638826 = queryNorm
            0.17890452 = fieldWeight in 2591, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03125 = fieldNorm(doc=2591)
      0.5 = coord(1/2)
  0.25 = coord(2/8)
```
Abstract

This study examined the success and information seeking behaviors of seventh-grade science students and graduate students in information science in using Yahooligans! Web search engine/directory. It investigated these users' cognitive, affective, and physical behaviors as they sought the answer for a fact-finding task. It analyzed and compared the overall patterns of children's and graduate students' Web activities, including searching moves, browsing moves, backtracking moves, looping moves, screen scrolling, target location and deviation moves, and the time they took to complete the task. The authors applied Bilal's Web Traversal Measure to quantify these users' effectiveness, efficiency, and quality of moves they made. Results were based on 14 children's Web sessions and nine graduate students' sessions. Both groups' Web activities were captured online using Lotus ScreenCam, a software package that records and replays online activities in Web browsers. Children's affective states were captured via exit interviews. Graduate students' affective states were extracted from the journal writings they kept during the traversal process. The study findings reveal that 89% of the graduate students found the correct answer to the search task as opposed to 50% of the children. Based on the Measure, graduate students' weighted effectiveness, efficiency, and quality of the Web moves they made were much higher than those of the children. Regardless of success and weighted scores, however, similarities and differences in information seeking were found between the two groups. Yahooligans! poor structure of keyword searching was a major factor that contributed to the "breakdowns" children and graduate students experienced. Unlike children, graduate students were able to recover from "breakdowns" quickly and effectively. Three main factors influenced these users' performance: ability to recover from "breakdowns", navigational style, and focus on task. Children and graduate students made recommendations for improving Yahooligans! interface design. Implications for Web user training and system design improvements are made.

Source

Information processing and management. 38(2002) no.5, S.649-670
Wolfram, D.; Xie, H.I.: Traditional IR for web users : a context for general audience digital libraries (2002) 0.01
```
0.012820851 = product of:
  0.051283404 = sum of:
    0.034812447 = weight(_text_:data in 2589) [ClassicSimilarity], result of:
      0.034812447 = score(doc=2589,freq=6.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.30255508 = fieldWeight in 2589, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2589)
    0.01647096 = product of:
      0.03294192 = sum of:
        0.03294192 = weight(_text_:processing in 2589) [ClassicSimilarity], result of:
          0.03294192 = score(doc=2589,freq=2.0), product of:
            0.14730503 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03638826 = queryNorm
            0.22363065 = fieldWeight in 2589, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2589)
      0.5 = coord(1/2)
  0.25 = coord(2/8)
```
Abstract

The emergence of general audience digital libraries (GADLs) defines a context that represents a hybrid of both "traditional" IR, using primarily bibliographic resources provided by database vendors, and "popular" IR, exemplified by public search systems available on the World Wide Web. Findings of a study investigating end-user searching and response to a GADL are reported. Data collected from a Web-based end-user survey and data logs of resource usage for a Web-based GADL were analyzed for user characteristics, patterns of access and use, and user feedback. Cross-tabulations using respondent demographics revealed several key differences in how the system was used and valued by users of different age groups. Older users valued the service more than younger users and engaged in different searching and viewing behaviors. The GADL more closely resembles traditional retrieval systems in terms of content and purpose of use, but is more similar to popular IR systems in terms of user behavior and accessibility. A model that defines the dual context of the GADL environment is derived from the data analysis and existing IR models in general and other specific contexts. The authors demonstrate the distinguishing characteristics of this IR context, and discuss implications for the development and evaluation of future GADLs to accommodate a variety of user needs and expectations.

Source

Information processing and management. 38(2002) no.5, S.627-648

Gao, J.; Zhang, J.: Clustered SVD strategies in latent semantic indexing (2005) 0.01

0.012799477 = product of:
  0.05119791 = sum of:
    0.028138565 = weight(_text_:data in 1166) [ClassicSimilarity], result of:
      0.028138565 = score(doc=1166,freq=2.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.24455236 = fieldWeight in 1166, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1166)
    0.023059342 = product of:
      0.046118684 = sum of:
        0.046118684 = weight(_text_:processing in 1166) [ClassicSimilarity], result of:
          0.046118684 = score(doc=1166,freq=2.0), product of:
            0.14730503 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03638826 = queryNorm
            0.3130829 = fieldWeight in 1166, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1166)
      0.5 = coord(1/2)
  0.25 = coord(2/8)

Abstract: The text retrieval method using latent semantic indexing (LSI) technique with truncated singular value decomposition (SVD) has been intensively studied in recent years. The SVD reduces the noise contained in the original representation of the term-document matrix and improves the information retrieval accuracy. Recent studies indicate that SVD is mostly useful for small homogeneous data collections. For large inhomogeneous datasets, the performance of the SVD based text retrieval technique may deteriorate. We propose to partition a large inhomogeneous dataset into several smaller ones with clustered structure, on which we apply the truncated SVD. Our experimental results show that the clustered SVD strategies may enhance the retrieval accuracy and reduce the computing and storage costs.
Source: Information processing and management. 41(2005) no.5, S.1051-1064

Quiroga, L.M.; Mostafa, J.: ¬An experiment in building profiles in information filtering : the role of context of user relevance feedback (2002) 0.01
```
0.0112238005 = product of:
  0.044895202 = sum of:
    0.028424243 = weight(_text_:data in 2579) [ClassicSimilarity], result of:
      0.028424243 = score(doc=2579,freq=4.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.24703519 = fieldWeight in 2579, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2579)
    0.01647096 = product of:
      0.03294192 = sum of:
        0.03294192 = weight(_text_:processing in 2579) [ClassicSimilarity], result of:
          0.03294192 = score(doc=2579,freq=2.0), product of:
            0.14730503 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03638826 = queryNorm
            0.22363065 = fieldWeight in 2579, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2579)
      0.5 = coord(1/2)
  0.25 = coord(2/8)
```
Abstract

An experiment was conducted to see how relevance feedback could be used to build and adjust profiles to improve the performance of filtering systems. Data was collected during the system interaction of 18 graduate students with SIFTER (Smart Information Filtering Technology for Electronic Resources), a filtering system that ranks incoming information based on users' profiles. The data set came from a collection of 6000 records concerning consumer health. In the first phase of the study, three different modes of profile acquisition were compared. The explicit mode allowed users to directly specify the profile; the implicit mode utilized relevance feedback to create and refine the profile; and the combined mode allowed users to initialize the profile and to continuously refine it using relevance feedback. Filtering performance, measured in terms of Normalized Precision, showed that the three approaches were significantly different ( [small alpha, Greek] =0.05 and p =0.012). The explicit mode of profile acquisition consistently produced superior results. Exclusive reliance on relevance feedback in the implicit mode resulted in inferior performance. The low performance obtained by the implicit acquisition mode motivated the second phase of the study, which aimed to clarify the role of context in relevance feedback judgments. An inductive content analysis of thinking aloud protocols showed dimensions that were highly situational, establishing the importance context plays in feedback relevance assessments. Results suggest the need for better representation of documents, profiles, and relevance feedback mechanisms that incorporate dimensions identified in this research.

Source

Information processing and management. 38(2002) no.5, S.671-694

Lin, J.; DiCuccio, M.; Grigoryan, V.; Wilbur, W.J.: Navigating information spaces : a case study of related article search in PubMed (2008) 0.01

0.01097098 = product of:
  0.04388392 = sum of:
    0.02411877 = weight(_text_:data in 2124) [ClassicSimilarity], result of:
      0.02411877 = score(doc=2124,freq=2.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.2096163 = fieldWeight in 2124, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=2124)
    0.01976515 = product of:
      0.0395303 = sum of:
        0.0395303 = weight(_text_:processing in 2124) [ClassicSimilarity], result of:
          0.0395303 = score(doc=2124,freq=2.0), product of:
            0.14730503 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03638826 = queryNorm
            0.26835677 = fieldWeight in 2124, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.046875 = fieldNorm(doc=2124)
      0.5 = coord(1/2)
  0.25 = coord(2/8)

Abstract: The concept of an "information space" provides a powerful metaphor for guiding the design of interactive retrieval systems. We present a case study of related article search, a browsing tool designed to help users navigate the information space defined by results of the PubMed® search engine. This feature leverages content-similarity links that tie MEDLINE® citations together in a vast document network. We examine the effectiveness of related article search from two perspectives: a topological analysis of networks generated from information needs represented in the TREC 2005 genomics track and a query log analysis of real PubMed users. Together, data suggest that related article search is a useful feature and that browsing related articles has become an integral part of how users interact with PubMed.
Source: Information processing and management. 44(2008) no.5, S.1771-1783

Shiri, A.A.; Revie, C.: Query expansion behavior within a thesaurus-enhanced search environment : a user-centered evaluation (2006) 0.01
```
0.010187378 = product of:
  0.040749513 = sum of:
    0.028424243 = weight(_text_:data in 56) [ClassicSimilarity], result of:
      0.028424243 = score(doc=56,freq=4.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.24703519 = fieldWeight in 56, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=56)
    0.012325268 = product of:
      0.024650536 = sum of:
        0.024650536 = weight(_text_:22 in 56) [ClassicSimilarity], result of:
          0.024650536 = score(doc=56,freq=2.0), product of:
            0.12742549 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03638826 = queryNorm
            0.19345059 = fieldWeight in 56, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=56)
      0.5 = coord(1/2)
  0.25 = coord(2/8)
```
Abstract

The study reported here investigated the query expansion behavior of end-users interacting with a thesaurus-enhanced search system on the Web. Two groups, namely academic staff and postgraduate students, were recruited into this study. Data were collected from 90 searches performed by 30 users using the OVID interface to the CAB abstracts database. Data-gathering techniques included questionnaires, screen capturing software, and interviews. The results presented here relate to issues of search-topic and search-term characteristics, number and types of expanded queries, usefulness of thesaurus terms, and behavioral differences between academic staff and postgraduate students in their interaction. The key conclusions drawn were that (a) academic staff chose more narrow and synonymous terms than did postgraduate students, who generally selected broader and related terms; (b) topic complexity affected users' interaction with the thesaurus in that complex topics required more query expansion and search term selection; (c) users' prior topic-search experience appeared to have a significant effect on their selection and evaluation of thesaurus terms; (d) in 50% of the searches where additional terms were suggested from the thesaurus, users stated that they had not been aware of the terms at the beginning of the search; this observation was particularly noticeable in the case of postgraduate students.

Date

22. 7.2006 16:32:43
Klas, C.-P.; Fuhr, N.; Schaefer, A.: Evaluating strategic support for information access in the DAFFODIL system (2004) 0.01
```
0.009727273 = product of:
  0.038909093 = sum of:
    0.02411877 = weight(_text_:data in 2419) [ClassicSimilarity], result of:
      0.02411877 = score(doc=2419,freq=2.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.2096163 = fieldWeight in 2419, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=2419)
    0.014790321 = product of:
      0.029580642 = sum of:
        0.029580642 = weight(_text_:22 in 2419) [ClassicSimilarity], result of:
          0.029580642 = score(doc=2419,freq=2.0), product of:
            0.12742549 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03638826 = queryNorm
            0.23214069 = fieldWeight in 2419, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2419)
      0.5 = coord(1/2)
  0.25 = coord(2/8)
```
Abstract

The digital library system Daffodil is targeted at strategic support of users during the information search process. For searching, exploring and managing digital library objects it provides user-customisable information seeking patterns over a federation of heterogeneous digital libraries. In this paper evaluation results with respect to retrieval effectiveness, efficiency and user satisfaction are presented. The analysis focuses on strategic support for the scientific work-flow. Daffodil supports the whole work-flow, from data source selection over information seeking to the representation, organisation and reuse of information. By embedding high level search functionality into the scientific work-flow, the user experiences better strategic system support due to a more systematic work process. These ideas have been implemented in Daffodil followed by a qualitative evaluation. The evaluation has been conducted with 28 participants, ranging from information seeking novices to experts. The results are promising, as they support the chosen model.

Date

16.11.2008 16:22:48
Kruschwitz, U.; AI-Bakour, H.: Users want more sophisticated search assistants : results of a task-based evaluation (2005) 0.01
```
0.009142484 = product of:
  0.036569934 = sum of:
    0.020098975 = weight(_text_:data in 4575) [ClassicSimilarity], result of:
      0.020098975 = score(doc=4575,freq=2.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.17468026 = fieldWeight in 4575, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4575)
    0.01647096 = product of:
      0.03294192 = sum of:
        0.03294192 = weight(_text_:processing in 4575) [ClassicSimilarity], result of:
          0.03294192 = score(doc=4575,freq=2.0), product of:
            0.14730503 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03638826 = queryNorm
            0.22363065 = fieldWeight in 4575, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4575)
      0.5 = coord(1/2)
  0.25 = coord(2/8)
```
Abstract

The Web provides a massive knowledge source, as do intranets and other electronic document collections. However, much of that knowledge is encoded implicitly and cannot be applied directly without processing into some more appropriate structures. Searching, browsing, question answering, for example, could all benefit from domain-specific knowledge contained in the documents, and in applications such as simple search we do not actually need very "deep" knowledge structures such as ontologies, but we can get a long way with a model of the domain that consists of term hierarchies. We combine domain knowledge automatically acquired by exploiting the documents' markup structure with knowledge extracted an the fly to assist a user with ad hoc search requests. Such a search system can suggest query modification options derived from the actual data and thus guide a user through the space of documents. This article gives a detailed account of a task-based evaluation that compares a search system that uses the outlined domain knowledge with a standard search system. We found that users do use the query modification suggestions proposed by the system. The main conclusion we can draw from this evaluation, however, is that users prefer a system that can suggest query modifications over a standard search engine, which simply presents a ranked list of documents. Most interestingly, we observe this user preference despite the fact that the baseline system even performs slightly better under certain criteria.
Lehtokangas, R.; Järvelin, K.: Consistency of textual expression in newspaper articles : an argument for semantically based query expansion (2001) 0.01
```
0.0069329934 = product of:
  0.055463947 = sum of:
    0.055463947 = weight(_text_:higher in 4485) [ClassicSimilarity], result of:
      0.055463947 = score(doc=4485,freq=2.0), product of:
        0.19113865 = queryWeight, product of:
          5.252756 = idf(docFreq=628, maxDocs=44218)
          0.03638826 = queryNorm
        0.2901765 = fieldWeight in 4485, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.252756 = idf(docFreq=628, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4485)
  0.125 = coord(1/8)
```
Abstract

This article investigates how consistent different newspapers are in their choice of words when writing about the same news events. News articles on the same news events were taken from three Finnish newspapers and compared in regard to their central concepts and words representing the concepts in the news texts. Consistency figures were calculated for each set of three articles (the total number of sets was sixty). Inconsistency in words and concepts was found between news articles from different newspapers. The mean value of consistency calculated on the basis of words was 65 per cent; this however depended on the article length. For short news wires consistency was 83 per cent while for long articles it was only 47 per cent. At the concept level, consistency was considerably higher, ranging from 92 per cent to 97 per cent between short and long articles. The articles also represented three categories of topic (event, process and opinion). Statistically significant differences in consistency were found in regard to length but not in regard to the categories of topic. We argue that the expression inconsistency is a clear sign of a retrieval problem and that query expansion based on semantic relationships can significantly improve retrieval performance on free-text sources.

Bradford, R.B.: Relationship discovery in large text collections using Latent Semantic Indexing (2006) 0.01

0.0064848484 = product of:
  0.025939394 = sum of:
    0.01607918 = weight(_text_:data in 1163) [ClassicSimilarity], result of:
      0.01607918 = score(doc=1163,freq=2.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.1397442 = fieldWeight in 1163, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03125 = fieldNorm(doc=1163)
    0.009860214 = product of:
      0.019720428 = sum of:
        0.019720428 = weight(_text_:22 in 1163) [ClassicSimilarity], result of:
          0.019720428 = score(doc=1163,freq=2.0), product of:
            0.12742549 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03638826 = queryNorm
            0.15476047 = fieldWeight in 1163, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=1163)
      0.5 = coord(1/2)
  0.25 = coord(2/8)

Source: Proceedings of the Fourth Workshop on Link Analysis, Counterterrorism, and Security, SIAM Data Mining Conference, Bethesda, MD, 20-22 April, 2006. [http://www.siam.org/meetings/sdm06/workproceed/Link%20Analysis/15.pdf]

Niemi, T.; Jämsen , J.: ¬A query language for discovering semantic associations, part I : approach and formal definition of query primitives (2007) 0.01
```
0.0056178346 = product of:
  0.044942677 = sum of:
    0.044942677 = weight(_text_:data in 591) [ClassicSimilarity], result of:
      0.044942677 = score(doc=591,freq=10.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.39059696 = fieldWeight in 591, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.0390625 = fieldNorm(doc=591)
  0.125 = coord(1/8)
```
Abstract

In contemporary query languages, the user is responsible for navigation among semantically related data. Because of the huge amount of data and the complex structural relationships among data in modern applications, it is unrealistic to suppose that the user could know completely the content and structure of the available information. There are several query languages whose purpose is to facilitate navigation in unknown structures of databases. However, the background assumption of these languages is that the user knows how data are related to each other semantically in the structure at hand. So far only little attention has been paid to how unknown semantic associations among available data can be discovered. We address this problem in this article. A semantic association between two entities can be constructed if a sequence of relationships expressed explicitly in a database can be found that connects these entities to each other. This sequence may contain several other entities through which the original entities are connected to each other indirectly. We introduce an expressive and declarative query language for discovering semantic associations. Our query language is able, for example, to discover semantic associations between entities for which only some of the characteristics are known. Further, it integrates the manipulation of semantic associations with the manipulation of documents that may contain information on entities in semantic associations.

Mandala, R.; Tokunaga, T.; Tanaka, H.: Query expansion using heterogeneous thesauri (2000) 0.00

0.0049412875 = product of:
  0.0395303 = sum of:
    0.0395303 = product of:
      0.0790606 = sum of:
        0.0790606 = weight(_text_:processing in 1277) [ClassicSimilarity], result of:
          0.0790606 = score(doc=1277,freq=2.0), product of:
            0.14730503 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03638826 = queryNorm
            0.53671354 = fieldWeight in 1277, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.09375 = fieldNorm(doc=1277)
      0.5 = coord(1/2)
  0.125 = coord(1/8)

Source: Information processing and management. 36(2000) no.3, S.361-378

Boyack, K.W.; Wylie,B.N.; Davidson, G.S.: Information Visualization, Human-Computer Interaction, and Cognitive Psychology : Domain Visualizations (2002) 0.00

0.00435764 = product of:
  0.03486112 = sum of:
    0.03486112 = product of:
      0.06972224 = sum of:
        0.06972224 = weight(_text_:22 in 1352) [ClassicSimilarity], result of:
          0.06972224 = score(doc=1352,freq=4.0), product of:
            0.12742549 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03638826 = queryNorm
            0.54716086 = fieldWeight in 1352, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1352)
      0.5 = coord(1/2)
  0.125 = coord(1/8)

Date: 22. 2.2003 17:25:39
22. 2.2003 18:17:40

Greenberg, J.: Optimal query expansion (QE) processing methods with semantically encoded structured thesaurus terminology (2001) 0.00
```
0.004279281 = product of:
  0.034234248 = sum of:
    0.034234248 = product of:
      0.068468496 = sum of:
        0.068468496 = weight(_text_:processing in 5750) [ClassicSimilarity], result of:
          0.068468496 = score(doc=5750,freq=6.0), product of:
            0.14730503 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03638826 = queryNorm
            0.4648076 = fieldWeight in 5750, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.046875 = fieldNorm(doc=5750)
      0.5 = coord(1/2)
  0.125 = coord(1/8)
```
Abstract

While researchers have explored the value of structured thesauri as controlled vocabularies for general information retrieval (IR) activities, they have not identified the optimal query expansion (QE) processing methods for taking advantage of the semantic encoding underlying the terminology in these tools. The study reported on in this article addresses this question, and examined whether QE via semantically encoded thesauri terminology is more effective in the automatic or interactive processing environment. The research found that, regardless of end-users' retrieval goals, synonyms and partial synonyms (SYNs) and narrower terms (NTs) are generally good candidates for automatic QE and that related (RTs) are better candidates for interactive QE. The study also examined end-users' selection of semantically encoded thesauri terms for interactive QE, and explored how retrieval goals and QE processes may be combined in future thesauri-supported IR systems

Sacco, G.M.: Dynamic taxonomies and guided searches (2006) 0.00

0.0030503483 = product of:
  0.024402786 = sum of:
    0.024402786 = product of:
      0.048805572 = sum of:
        0.048805572 = weight(_text_:22 in 5295) [ClassicSimilarity], result of:
          0.048805572 = score(doc=5295,freq=4.0), product of:
            0.12742549 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03638826 = queryNorm
            0.38301262 = fieldWeight in 5295, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5295)
      0.5 = coord(1/2)
  0.125 = coord(1/8)

Date: 22. 7.2006 17:56:22

Sanderson, M.; Lawrie, D.: Building, testing, and applying concept hierarchies (2000) 0.00
```
0.0030148462 = product of:
  0.02411877 = sum of:
    0.02411877 = weight(_text_:data in 37) [ClassicSimilarity], result of:
      0.02411877 = score(doc=37,freq=2.0), product of:
        0.115061514 = queryWeight, product of:
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.03638826 = queryNorm
        0.2096163 = fieldWeight in 37, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.1620505 = idf(docFreq=5088, maxDocs=44218)
          0.046875 = fieldNorm(doc=37)
  0.125 = coord(1/8)
```
Abstract

A means of automatically deriving a hierarchical organization of concepts from a set of documents without use of training data or standard clustering techniques is presented. Using a process that extracts salient words and phrases from the documents, these terms are organized hierarchically using a type of co-occurrence known as subsumption. The resulting structure is displayed as a series of hierarchical menus. When generated from a set of retrieved documents, a user browsing the menus gains an overview of their content in a manner distinct from existing techniques. The methods used to build the structure are simple and appear to be effective. The formation and presentation of the hierarchy is described along with a study of some of its properties, including a preliminary experiment, which indicates that users may find the hierarchy a more efficient means of locating relevant documents than the classic method of scanning a ranked document list

Hetzler, B.: Visual analysis and exploration of relationships (2002) 0.00

0.0028824178 = product of:
  0.023059342 = sum of:
    0.023059342 = product of:
      0.046118684 = sum of:
        0.046118684 = weight(_text_:processing in 1189) [ClassicSimilarity], result of:
          0.046118684 = score(doc=1189,freq=2.0), product of:
            0.14730503 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.03638826 = queryNorm
            0.3130829 = fieldWeight in 1189, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1189)
      0.5 = coord(1/2)
  0.125 = coord(1/8)

Abstract: Relationships can provide a rich and powerful set of information and can be used to accomplish application goals, such as information retrieval and natural language processing. A growing trend in the information science community is the use of information visualization-taking advantage of people's natural visual capabilities to perceive and understand complex information. This chapter explores how visualization and visual exploration can help users gain insight from known relationships and discover evidence of new relationships not previously anticipated.

Search (36 results, page 1 of 2)

Authors

Languages

Types

Themes

Subjects

Classifications