Search (232 results, page 1 of 12)

Smeaton, A.F.; Rijsbergen, C.J. van: ¬The retrieval effects of query expansion on a feedback document retrieval system (1983) 0.04

0.043452375 = product of:
  0.06517856 = sum of:
    0.018583227 = weight(_text_:of in 2134) [ClassicSimilarity], result of:
      0.018583227 = score(doc=2134,freq=2.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.24188137 = fieldWeight in 2134, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.109375 = fieldNorm(doc=2134)
    0.046595335 = product of:
      0.09319067 = sum of:
        0.09319067 = weight(_text_:22 in 2134) [ClassicSimilarity], result of:
          0.09319067 = score(doc=2134,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.5416616 = fieldWeight in 2134, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=2134)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Date: 30. 3.2001 13:32:22

Efthimiadis, E.N.: End-users' understanding of thesaural knowledge structures in interactive query expansion (1994) 0.03

0.033580456 = product of:
  0.050370682 = sum of:
    0.023744777 = weight(_text_:of in 5693) [ClassicSimilarity], result of:
      0.023744777 = score(doc=5693,freq=10.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.3090647 = fieldWeight in 5693, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=5693)
    0.026625905 = product of:
      0.05325181 = sum of:
        0.05325181 = weight(_text_:22 in 5693) [ClassicSimilarity], result of:
          0.05325181 = score(doc=5693,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.30952093 = fieldWeight in 5693, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=5693)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The process of term selection for query expansion by end-users is discussed within the context of a study of interactive query expansion in a relevance feedback environment. This user study focuses on how users' perceive and understand term relationships, such as hierarchical and associative relationships, in their searches
Date: 30. 3.2001 13:35:22
Source: Knowledge organization and quality management: Proc. of the 3rd International ISKO Conference, 20-24 June 1994, Copenhagen, Denmark. Ed.: H. Albrechtsen et al

Fieldhouse, M.; Hancock-Beaulieu, M.: ¬The design of a graphical user interface for a highly interactive information retrieval system (1996) 0.03

0.033052213 = product of:
  0.049578317 = sum of:
    0.02628065 = weight(_text_:of in 6958) [ClassicSimilarity], result of:
      0.02628065 = score(doc=6958,freq=16.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.34207192 = fieldWeight in 6958, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6958)
    0.023297668 = product of:
      0.046595335 = sum of:
        0.046595335 = weight(_text_:22 in 6958) [ClassicSimilarity], result of:
          0.046595335 = score(doc=6958,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.2708308 = fieldWeight in 6958, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6958)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: Reports on the design of a GUI for the Okapi 'best match' retrieval system developed at the Centre for Interactive Systems Research, City University, UK, for online library catalogues. The X-Windows interface includes an interactive query expansion (IQE) facilty which involves the user in the selection of query terms to reformulate a search. Presents the design rationale, based on a game board metaphor, and describes the features of each of the stages of the search interaction. Reports on the early operational field trial and discusses relevant evaluation issues and objectives
Source: Information retrieval: new systems and current research. Proceedings of the 16th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Drymen, Scotland, 22-23 Mar 94. Ed.: R. Leon

Sacco, G.M.: Dynamic taxonomies and guided searches (2006) 0.03

0.032694288 = product of:
  0.049041428 = sum of:
    0.016093547 = weight(_text_:of in 5295) [ClassicSimilarity], result of:
      0.016093547 = score(doc=5295,freq=6.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.20947541 = fieldWeight in 5295, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5295)
    0.03294788 = product of:
      0.06589576 = sum of:
        0.06589576 = weight(_text_:22 in 5295) [ClassicSimilarity], result of:
          0.06589576 = score(doc=5295,freq=4.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.38301262 = fieldWeight in 5295, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5295)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: A new search paradigm, in which the primary user activity is the guided exploration of a complex information space rather than the retrieval of items based on precise specifications, is proposed. The author claims that this paradigm is the norm in most practical applications, and that solutions based on traditional search methods are not effective in this context. He then presents a solution based on dynamic taxonomies, a knowledge management model that effectively guides users to reach their goal while giving them total freedom in exploring the information base. Applications, benefits, and current research are discussed.
Date: 22. 7.2006 17:56:22
Source: Journal of the American Society for Information Science and Technology. 57(2006) no.6, S.792-796

Kozikowski, P. et al.: Support of part-whole relations in query answering (2016) 0.03

0.031037413 = product of:
  0.04655612 = sum of:
    0.013273734 = weight(_text_:of in 2754) [ClassicSimilarity], result of:
      0.013273734 = score(doc=2754,freq=2.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.17277241 = fieldWeight in 2754, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.078125 = fieldNorm(doc=2754)
    0.033282384 = product of:
      0.06656477 = sum of:
        0.06656477 = weight(_text_:22 in 2754) [ClassicSimilarity], result of:
          0.06656477 = score(doc=2754,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.38690117 = fieldWeight in 2754, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=2754)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Date: 1. 2.2016 18:25:22

Kopácsi, S. et al.: Development of a classification server to support metadata harmonization in a long term preservation system (2016) 0.03

0.031037413 = product of:
  0.04655612 = sum of:
    0.013273734 = weight(_text_:of in 3280) [ClassicSimilarity], result of:
      0.013273734 = score(doc=3280,freq=2.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.17277241 = fieldWeight in 3280, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.078125 = fieldNorm(doc=3280)
    0.033282384 = product of:
      0.06656477 = sum of:
        0.06656477 = weight(_text_:22 in 3280) [ClassicSimilarity], result of:
          0.06656477 = score(doc=3280,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.38690117 = fieldWeight in 3280, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=3280)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Source: Metadata and semantics research: 10th International Conference, MTSR 2016, Göttingen, Germany, November 22-25, 2016, Proceedings. Eds.: E. Garoufallou

Mlodzka-Stybel, A.: Towards continuous improvement of users' access to a library catalogue (2014) 0.03

0.03070492 = product of:
  0.046057377 = sum of:
    0.02275971 = weight(_text_:of in 1466) [ClassicSimilarity], result of:
      0.02275971 = score(doc=1466,freq=12.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.29624295 = fieldWeight in 1466, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1466)
    0.023297668 = product of:
      0.046595335 = sum of:
        0.046595335 = weight(_text_:22 in 1466) [ClassicSimilarity], result of:
          0.046595335 = score(doc=1466,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.2708308 = fieldWeight in 1466, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1466)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: The paper discusses the issue of increasing users' access to library records by their publication in Google. Data from the records, converted into html format, have been indexed by Google. The process covered basic formal description fields of the records, description of the content, supported with a thesaurus, as well as an abstract, if present in the record. In addition to monitoring the end users' statistics, the pilot testing covered visibility of library records in Google search results.
Source: Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Lund, K.; Burgess, C.; Atchley, R.A.: Semantic and associative priming in high-dimensional semantic space (1995) 0.03

0.03070492 = product of:
  0.046057377 = sum of:
    0.02275971 = weight(_text_:of in 2151) [ClassicSimilarity], result of:
      0.02275971 = score(doc=2151,freq=12.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.29624295 = fieldWeight in 2151, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2151)
    0.023297668 = product of:
      0.046595335 = sum of:
        0.046595335 = weight(_text_:22 in 2151) [ClassicSimilarity], result of:
          0.046595335 = score(doc=2151,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.2708308 = fieldWeight in 2151, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2151)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: We present a model of semantic memory that utilizes a high dimensional semantic space constructed from a co-occurrence matrix. This matrix was formed by analyzing a lot) million word corpus. Word vectors were then obtained by extracting rows and columns of this matrix, These vectors were subjected to multidimensional scaling. Words were found to cluster semantically. suggesting that interword distance may be interpretable as a measure of semantic similarity, In attempting to replicate with our simulation the semantic and ...
Source: Proceedings of the Seventeenth Annual Conference of the Cognitive Science Society: July 22 - 25, 1995, University of Pittsburgh / ed. by Johanna D. Moore and Jill Fain Lehmann

Faaborg, A.; Lagoze, C.: Semantic browsing (2003) 0.03

0.027920596 = product of:
  0.041880894 = sum of:
    0.018583227 = weight(_text_:of in 1026) [ClassicSimilarity], result of:
      0.018583227 = score(doc=1026,freq=8.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.24188137 = fieldWeight in 1026, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1026)
    0.023297668 = product of:
      0.046595335 = sum of:
        0.046595335 = weight(_text_:22 in 1026) [ClassicSimilarity], result of:
          0.046595335 = score(doc=1026,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.2708308 = fieldWeight in 1026, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1026)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: We have created software applications that allow users to both author and use Semantic Web metadata. To create and use a layer of semantic content on top of the existing Web, we have (1) implemented a user interface that expedites the task of attributing metadata to resources on the Web, and (2) augmented a Web browser to leverage this semantic metadata to provide relevant information and tasks to the user. This project provides a framework for annotating and reorganizing existing files, pages, and sites on the Web that is similar to Vannevar Bushrsquos original concepts of trail blazing and associative indexing.
Source: Research and advanced technology for digital libraries : 7th European Conference, proceedings / ECDL 2003, Trondheim, Norway, August 17-22, 2003

Efthimiadis, E.N.: User choices : a new yardstick for the evaluation of ranking algorithms for interactive query expansion (1995) 0.03
```
0.026421316 = product of:
  0.039631974 = sum of:
    0.022990782 = weight(_text_:of in 5697) [ClassicSimilarity], result of:
      0.022990782 = score(doc=5697,freq=24.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.2992506 = fieldWeight in 5697, product of:
          4.8989797 = tf(freq=24.0), with freq of:
            24.0 = termFreq=24.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5697)
    0.016641192 = product of:
      0.033282384 = sum of:
        0.033282384 = weight(_text_:22 in 5697) [ClassicSimilarity], result of:
          0.033282384 = score(doc=5697,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.19345059 = fieldWeight in 5697, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5697)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

The performance of 8 ranking algorithms was evaluated with respect to their effectiveness in ranking terms for query expansion. The evaluation was conducted within an investigation of interactive query expansion and relevance feedback in a real operational environment. Focuses on the identification of algorithms that most effectively take cognizance of user preferences. user choices (i.e. the terms selected by the searchers for the query expansion search) provided the yardstick for the evaluation of the 8 ranking algorithms. This methodology introduces a user oriented approach in evaluating ranking algorithms for query expansion in contrast to the standard, system oriented approaches. Similarities in the performance of the 8 algorithms and the ways these algorithms rank terms were the main focus of this evaluation. The findings demonstrate that the r-lohi, wpq, enim, and porter algorithms have similar performance in bringing good terms to the top of a ranked list of terms for query expansion. However, further evaluation of the algorithms in different (e.g. full text) environments is needed before these results can be generalized beyond the context of the present study

Date

22. 2.1996 13:14:10
Song, D.; Bruza, P.D.: Towards context sensitive information inference (2003) 0.03
```
0.025085872 = product of:
  0.037628807 = sum of:
    0.020987613 = weight(_text_:of in 1428) [ClassicSimilarity], result of:
      0.020987613 = score(doc=1428,freq=20.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.27317715 = fieldWeight in 1428, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1428)
    0.016641192 = product of:
      0.033282384 = sum of:
        0.033282384 = weight(_text_:22 in 1428) [ClassicSimilarity], result of:
          0.033282384 = score(doc=1428,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.19345059 = fieldWeight in 1428, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1428)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

Humans can make hasty, but generally robust judgements about what a text fragment is, or is not, about. Such judgements are termed information inference. This article furnishes an account of information inference from a psychologistic stance. By drawing an theories from nonclassical logic and applied cognition, an information inference mechanism is proposed that makes inferences via computations of information flow through an approximation of a conceptual space. Within a conceptual space information is represented geometrically. In this article, geometric representations of words are realized as vectors in a high dimensional semantic space, which is automatically constructed from a text corpus. Two approaches were presented for priming vector representations according to context. The first approach uses a concept combination heuristic to adjust the vector representation of a concept in the light of the representation of another concept. The second approach computes a prototypical concept an the basis of exemplar trace texts and moves it in the dimensional space according to the context. Information inference is evaluated by measuring the effectiveness of query models derived by information flow computations. Results show that information flow contributes significantly to query model effectiveness, particularly with respect to precision. Moreover, retrieval effectiveness compares favorably with two probabilistic query models, and another based an semantic association. More generally, this article can be seen as a contribution towards realizing operational systems that mimic text-based human reasoning.

Date

22. 3.2003 19:35:46

Source

Journal of the American Society for Information Science and technology. 54(2003) no.4, S.321-334
Shiri, A.A.; Revie, C.: Query expansion behavior within a thesaurus-enhanced search environment : a user-centered evaluation (2006) 0.03
```
0.025085872 = product of:
  0.037628807 = sum of:
    0.020987613 = weight(_text_:of in 56) [ClassicSimilarity], result of:
      0.020987613 = score(doc=56,freq=20.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.27317715 = fieldWeight in 56, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=56)
    0.016641192 = product of:
      0.033282384 = sum of:
        0.033282384 = weight(_text_:22 in 56) [ClassicSimilarity], result of:
          0.033282384 = score(doc=56,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.19345059 = fieldWeight in 56, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=56)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

The study reported here investigated the query expansion behavior of end-users interacting with a thesaurus-enhanced search system on the Web. Two groups, namely academic staff and postgraduate students, were recruited into this study. Data were collected from 90 searches performed by 30 users using the OVID interface to the CAB abstracts database. Data-gathering techniques included questionnaires, screen capturing software, and interviews. The results presented here relate to issues of search-topic and search-term characteristics, number and types of expanded queries, usefulness of thesaurus terms, and behavioral differences between academic staff and postgraduate students in their interaction. The key conclusions drawn were that (a) academic staff chose more narrow and synonymous terms than did postgraduate students, who generally selected broader and related terms; (b) topic complexity affected users' interaction with the thesaurus in that complex topics required more query expansion and search term selection; (c) users' prior topic-search experience appeared to have a significant effect on their selection and evaluation of thesaurus terms; (d) in 50% of the searches where additional terms were suggested from the thesaurus, users stated that they had not been aware of the terms at the beginning of the search; this observation was particularly noticeable in the case of postgraduate students.

Date

22. 7.2006 16:32:43

Source

Journal of the American Society for Information Science and Technology. 57(2006) no.4, S.462-478
Brandão, W.C.; Santos, R.L.T.; Ziviani, N.; Moura, E.S. de; Silva, A.S. da: Learning to expand queries using entities (2014) 0.03
```
0.025085872 = product of:
  0.037628807 = sum of:
    0.020987613 = weight(_text_:of in 1343) [ClassicSimilarity], result of:
      0.020987613 = score(doc=1343,freq=20.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.27317715 = fieldWeight in 1343, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1343)
    0.016641192 = product of:
      0.033282384 = sum of:
        0.033282384 = weight(_text_:22 in 1343) [ClassicSimilarity], result of:
          0.033282384 = score(doc=1343,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.19345059 = fieldWeight in 1343, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1343)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

A substantial fraction of web search queries contain references to entities, such as persons, organizations, and locations. Recently, methods that exploit named entities have been shown to be more effective for query expansion than traditional pseudorelevance feedback methods. In this article, we introduce a supervised learning approach that exploits named entities for query expansion using Wikipedia as a repository of high-quality feedback documents. In contrast with existing entity-oriented pseudorelevance feedback approaches, we tackle query expansion as a learning-to-rank problem. As a result, not only do we select effective expansion terms but we also weigh these terms according to their predicted effectiveness. To this end, we exploit the rich structure of Wikipedia articles to devise discriminative term features, including each candidate term's proximity to the original query terms, as well as its frequency across multiple article fields and in category and infobox descriptors. Experiments on three Text REtrieval Conference web test collections attest the effectiveness of our approach, with gains of up to 23.32% in terms of mean average precision, 19.49% in terms of precision at 10, and 7.86% in terms of normalized discounted cumulative gain compared with a state-of-the-art approach for entity-oriented query expansion.

Date

22. 8.2014 17:07:50

Source

Journal of the Association for Information Science and Technology. 65(2014) no.9, S.1870-1883

Chang, C.-H.; Hsu, C.-C.: Integrating query expansion and conceptual relevance feedback for personalized Web information retrieval (1998) 0.02

0.024291996 = product of:
  0.036437992 = sum of:
    0.013140325 = weight(_text_:of in 1319) [ClassicSimilarity], result of:
      0.013140325 = score(doc=1319,freq=4.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.17103596 = fieldWeight in 1319, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1319)
    0.023297668 = product of:
      0.046595335 = sum of:
        0.046595335 = weight(_text_:22 in 1319) [ClassicSimilarity], result of:
          0.046595335 = score(doc=1319,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.2708308 = fieldWeight in 1319, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1319)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: Keyword based querying has been an immediate and efficient way to specify and retrieve related information that the user inquired. However, conventional document ranking based on an automatic assessment of document relevance to the query may not be the best approach when little information is given. Proposes an idea to integrate 2 existing techniques, query expansion and relevance feedback to achieve a concept-based information search for the Web
Date: 1. 8.1996 22:08:06
Footnote: Contribution to a special issue devoted to the Proceedings of the 7th International World Wide Web Conference, held 14-18 April 1998, Brisbane, Australia

Bradford, R.B.: Relationship discovery in large text collections using Latent Semantic Indexing (2006) 0.02
```
0.023892816 = product of:
  0.035839222 = sum of:
    0.022526272 = weight(_text_:of in 1163) [ClassicSimilarity], result of:
      0.022526272 = score(doc=1163,freq=36.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.2932045 = fieldWeight in 1163, product of:
          6.0 = tf(freq=36.0), with freq of:
            36.0 = termFreq=36.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03125 = fieldNorm(doc=1163)
    0.013312953 = product of:
      0.026625905 = sum of:
        0.026625905 = weight(_text_:22 in 1163) [ClassicSimilarity], result of:
          0.026625905 = score(doc=1163,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.15476047 = fieldWeight in 1163, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=1163)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

This paper addresses the problem of information discovery in large collections of text. For users, one of the key problems in working with such collections is determining where to focus their attention. In selecting documents for examination, users must be able to formulate reasonably precise queries. Queries that are too broad will greatly reduce the efficiency of information discovery efforts by overwhelming the users with peripheral information. In order to formulate efficient queries, a mechanism is needed to automatically alert users regarding potentially interesting information contained within the collection. This paper presents the results of an experiment designed to test one approach to generation of such alerts. The technique of latent semantic indexing (LSI) is used to identify relationships among entities of interest. Entity extraction software is used to pre-process the text of the collection so that the LSI space contains representation vectors for named entities in addition to those for individual terms. In the LSI space, the cosine of the angle between the representation vectors for two entities captures important information regarding the degree of association of those two entities. For appropriate choices of entities, determining the entity pairs with the highest mutual cosine values yields valuable information regarding the contents of the text collection. The test database used for the experiment consists of 150,000 news articles. The proposed approach for alert generation is tested using a counterterrorism analysis example. The approach is shown to have significant potential for aiding users in rapidly focusing on information of potential importance in large text collections. The approach also has value in identifying possible use of aliases.

Source

Proceedings of the Fourth Workshop on Link Analysis, Counterterrorism, and Security, SIAM Data Mining Conference, Bethesda, MD, 20-22 April, 2006. [http://www.siam.org/meetings/sdm06/workproceed/Link%20Analysis/15.pdf]
Klas, C.-P.; Fuhr, N.; Schaefer, A.: Evaluating strategic support for information access in the DAFFODIL system (2004) 0.02
```
0.022509266 = product of:
  0.033763897 = sum of:
    0.013794468 = weight(_text_:of in 2419) [ClassicSimilarity], result of:
      0.013794468 = score(doc=2419,freq=6.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.17955035 = fieldWeight in 2419, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2419)
    0.019969428 = product of:
      0.039938856 = sum of:
        0.039938856 = weight(_text_:22 in 2419) [ClassicSimilarity], result of:
          0.039938856 = score(doc=2419,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.23214069 = fieldWeight in 2419, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2419)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

The digital library system Daffodil is targeted at strategic support of users during the information search process. For searching, exploring and managing digital library objects it provides user-customisable information seeking patterns over a federation of heterogeneous digital libraries. In this paper evaluation results with respect to retrieval effectiveness, efficiency and user satisfaction are presented. The analysis focuses on strategic support for the scientific work-flow. Daffodil supports the whole work-flow, from data source selection over information seeking to the representation, organisation and reuse of information. By embedding high level search functionality into the scientific work-flow, the user experiences better strategic system support due to a more systematic work process. These ideas have been implemented in Daffodil followed by a qualitative evaluation. The evaluation has been conducted with 28 participants, ranging from information seeking novices to experts. The results are promising, as they support the chosen model.

Date

16.11.2008 16:22:48
Thenmalar, S.; Geetha, T.V.: Enhanced ontology-based indexing and searching (2014) 0.02
```
0.021959063 = product of:
  0.032938592 = sum of:
    0.021289758 = weight(_text_:of in 1633) [ClassicSimilarity], result of:
      0.021289758 = score(doc=1633,freq=42.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.2771099 = fieldWeight in 1633, product of:
          6.4807405 = tf(freq=42.0), with freq of:
            42.0 = termFreq=42.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.02734375 = fieldNorm(doc=1633)
    0.011648834 = product of:
      0.023297668 = sum of:
        0.023297668 = weight(_text_:22 in 1633) [ClassicSimilarity], result of:
          0.023297668 = score(doc=1633,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.1354154 = fieldWeight in 1633, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1633)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

Purpose - The purpose of this paper is to improve the conceptual-based search by incorporating structural ontological information such as concepts and relations. Generally, Semantic-based information retrieval aims to identify relevant information based on the meanings of the query terms or on the context of the terms and the performance of semantic information retrieval is carried out through standard measures-precision and recall. Higher precision leads to the (meaningful) relevant documents obtained and lower recall leads to the less coverage of the concepts. Design/methodology/approach - In this paper, the authors enhance the existing ontology-based indexing proposed by Kohler et al., by incorporating sibling information to the index. The index designed by Kohler et al., contains only super and sub-concepts from the ontology. In addition, in our approach, we focus on two tasks; query expansion and ranking of the expanded queries, to improve the efficiency of the ontology-based search. The aforementioned tasks make use of ontological concepts, and relations existing between those concepts so as to obtain semantically more relevant search results for a given query. Findings - The proposed ontology-based indexing technique is investigated by analysing the coverage of concepts that are being populated in the index. Here, we introduce a new measure called index enhancement measure, to estimate the coverage of ontological concepts being indexed. We have evaluated the ontology-based search for the tourism domain with the tourism documents and tourism-specific ontology. The comparison of search results based on the use of ontology "with and without query expansion" is examined to estimate the efficiency of the proposed query expansion task. The ranking is compared with the ORank system to evaluate the performance of our ontology-based search. From these analyses, the ontology-based search results shows better recall when compared to the other concept-based search systems. The mean average precision of the ontology-based search is found to be 0.79 and the recall is found to be 0.65, the ORank system has the mean average precision of 0.62 and the recall is found to be 0.51, while the concept-based search has the mean average precision of 0.56 and the recall is found to be 0.42. Practical implications - When the concept is not present in the domain-specific ontology, the concept cannot be indexed. When the given query term is not available in the ontology then the term-based results are retrieved. Originality/value - In addition to super and sub-concepts, we incorporate the concepts present in same level (siblings) to the ontological index. The structural information from the ontology is determined for the query expansion. The ranking of the documents depends on the type of the query (single concept query, multiple concept queries and concept with relation queries) and the ontological relations that exists in the query and the documents. With this ontological structural information, the search results showed us better coverage of concepts with respect to the query.

Date

20. 1.2015 18:30:22

Source

Aslib journal of information management. 66(2014) no.6, S.678-696

Salaba, A.; Zeng, M.L.: Extending the "Explore" user task beyond subject authority data into the linked data sphere (2014) 0.02

0.021726187 = product of:
  0.03258928 = sum of:
    0.0092916135 = weight(_text_:of in 1465) [ClassicSimilarity], result of:
      0.0092916135 = score(doc=1465,freq=2.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.120940685 = fieldWeight in 1465, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1465)
    0.023297668 = product of:
      0.046595335 = sum of:
        0.046595335 = weight(_text_:22 in 1465) [ClassicSimilarity], result of:
          0.046595335 = score(doc=1465,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.2708308 = fieldWeight in 1465, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1465)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Source: Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Zeng, M.L.; Gracy, K.F.; Zumer, M.: Using a semantic analysis tool to generate subject access points : a study using Panofsky's theory and two research samples (2014) 0.02

0.02082171 = product of:
  0.031232564 = sum of:
    0.011263136 = weight(_text_:of in 1464) [ClassicSimilarity], result of:
      0.011263136 = score(doc=1464,freq=4.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.14660224 = fieldWeight in 1464, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=1464)
    0.019969428 = product of:
      0.039938856 = sum of:
        0.039938856 = weight(_text_:22 in 1464) [ClassicSimilarity], result of:
          0.039938856 = score(doc=1464,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.23214069 = fieldWeight in 1464, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=1464)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: This paper attempts to explore an approach of using an automatic semantic analysis tool to enhance the "subject" access to materials that are not included in the usual library subject cataloging process. Using two research samples the authors analyzed the access points supplied by OpenCalais, a semantic analysis tool. As an aid in understanding how computerized subject analysis might be approached, this paper suggests using the three-layer framework that has been accepted and applied in image analysis, developed by Erwin Panofsky.
Source: Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Järvelin, K.; Kristensen, J.; Niemi, T.; Sormunen, E.; Keskustalo, H.: ¬A deductive data model for query expansion (1996) 0.02

0.02082171 = product of:
  0.031232564 = sum of:
    0.011263136 = weight(_text_:of in 2230) [ClassicSimilarity], result of:
      0.011263136 = score(doc=2230,freq=4.0), product of:
        0.076827854 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.049130294 = queryNorm
        0.14660224 = fieldWeight in 2230, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2230)
    0.019969428 = product of:
      0.039938856 = sum of:
        0.039938856 = weight(_text_:22 in 2230) [ClassicSimilarity], result of:
          0.039938856 = score(doc=2230,freq=2.0), product of:
            0.17204592 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049130294 = queryNorm
            0.23214069 = fieldWeight in 2230, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2230)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Abstract: We present a deductive data model for concept-based query expansion. It is based on three abstraction levels: the conceptual, linguistic and occurrence levels. Concepts and relationships among them are represented at the conceptual level. The expression level represents natural language expressions for concepts. Each expression has one or more matching models at the occurrence level. Each model specifies the matching of the expression in database indices built in varying ways. The data model supports a concept-based query expansion and formulation tool, the ExpansionTool, for environments providing heterogeneous IR systems. Expansion is controlled by adjustable matching reliability.
Source: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR '96), Zürich, Switzerland, August 18-22, 1996. Eds.: H.P. Frei et al

Search (232 results, page 1 of 12)

Authors

Years

Languages

Types

Themes

Subjects

Classifications