Search (50 results, page 1 of 3)

Brown, M.E.: By any other name : accounting for failure in the naming of subject categories (1995) 0.05

0.052284636 = product of:
  0.10456927 = sum of:
    0.08294755 = weight(_text_:headings in 5598) [ClassicSimilarity], result of:
      0.08294755 = score(doc=5598,freq=2.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.37509373 = fieldWeight in 5598, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5598)
    0.021621715 = product of:
      0.04324343 = sum of:
        0.04324343 = weight(_text_:22 in 5598) [ClassicSimilarity], result of:
          0.04324343 = score(doc=5598,freq=2.0), product of:
            0.15966953 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045596033 = queryNorm
            0.2708308 = fieldWeight in 5598, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5598)
      0.5 = coord(1/2)
  0.5 = coord(2/4)

Abstract: Research shows that 65-80% of subject search terms fail to match the appropriate subject heading and one third to one half of subject searches result in no references being retrieved. Examines the subject search terms geberated by 82 school and college students in Princeton, NJ, evaluated the match between the named terms and the expected subject headings, proposes an explanation for match failures in relation to 3 invariant properties common to all search terms: concreteness, complexity, and syndeticity. Suggests that match failure is a consequence of developmental naming patterns and that these patterns can be overcome through the use of metacognitive naming skills
Date: 2.11.1996 13:08:22

Voorbij, H.: ¬Een goede titel behoeft geen trefwoord, of toch wel? : een vergelijkend oderzoek titelwoorden - trefwoorden (1997) 0.05
```
0.0507948 = product of:
  0.2031792 = sum of:
    0.2031792 = weight(_text_:headings in 1446) [ClassicSimilarity], result of:
      0.2031792 = score(doc=1446,freq=12.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.91878825 = fieldWeight in 1446, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1446)
  0.25 = coord(1/4)
```
Abstract

A recent survey at the Royal Library in the Netherlands showed that subject headings are more efficient than title keywords for retrieval purposes. 475 Dutch publications were selected at random and assigned subject headings. The study showed that subject headings provided additional useful information in 56% of titles. Subsequent searching of the library's online catalogue showed that 88% of titles were retrieved via subject headings against 57% through title keywords. Further precision may be achieved with the help of indexing staff, but at considerable cost

Footnote

Übers. d. Titels: A good title has no need of subject headings, or does it?: a comparative study of title keywords against subject headings

Voorbij, H.: Titelwoorden - trefwoorden : een vergelijkend onderzoek (1997) 0.04

0.041473776 = product of:
  0.1658951 = sum of:
    0.1658951 = weight(_text_:headings in 3175) [ClassicSimilarity], result of:
      0.1658951 = score(doc=3175,freq=2.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.75018746 = fieldWeight in 3175, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.109375 = fieldNorm(doc=3175)
  0.25 = coord(1/4)

Footnote: Übers. d. Titels: Title words - subject headings: a comparative research

Schabas, A.H.: ¬A comparative evaluation of the retrieval effectiveness of titles, Library of Congress Subject Headings and PRECIS strings for computer searching of UK MARC data (1979) 0.04

0.03554895 = product of:
  0.1421958 = sum of:
    0.1421958 = weight(_text_:headings in 5277) [ClassicSimilarity], result of:
      0.1421958 = score(doc=5277,freq=2.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.6430178 = fieldWeight in 5277, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.09375 = fieldNorm(doc=5277)
  0.25 = coord(1/4)

Drabenstott, K.M.; Vizine-Goetz, D.: Using subject headings for online retrieval : theory, practice and potential (1994) 0.03
```
0.030786294 = product of:
  0.12314518 = sum of:
    0.12314518 = weight(_text_:headings in 386) [ClassicSimilarity], result of:
      0.12314518 = score(doc=386,freq=6.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.55686975 = fieldWeight in 386, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.046875 = fieldNorm(doc=386)
  0.25 = coord(1/4)
```
Abstract

Using subject headings for Online Retrieval is an indispensable tool for online system desingners who are developing new systems or refining exicting ones. The book describes subject analysis and subject searching in online catalogs, including the limitations of retrieval, and demonstrates how such limitations can be overcome through system design and programming. The book describes the Library of Congress Subject headings system and system characteristics, shows how information is stored in machine readable files, and offers examples of and recommendations for successful methods. Tables are included to support these recommendations, and diagrams, graphs, and bar charts are used to provide results of data analyses.
Byrne, J.R.: Relative effectiveness of titles, abstracts, and subject headings for machine retrieval from the COMPENDEX services (1975) 0.03
```
0.029326389 = product of:
  0.117305554 = sum of:
    0.117305554 = weight(_text_:headings in 1604) [ClassicSimilarity], result of:
      0.117305554 = score(doc=1604,freq=4.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.5304626 = fieldWeight in 1604, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1604)
  0.25 = coord(1/4)
```
Abstract

We have investigated the relative merits of searching on titles, subject headings, abstracts, free-language terms, and combinations of these elements. The COMPENDEX data base was used for this study since it combined all of these data elements of interest. In general, the results obtained from the experiments indicate that, as expected, titles alone are not satisfactory for efficient retrieval. The combination of titles and abstracts came the closest to 100% retrieval, with searching of abstracts alone doing almost as well. Indexer input, although necessary for 100% retrieval in almost all cases, was found to be relatively unimportant
Schultz Jr., W.N.; Braddy, L.: ¬A librarian-centered study of perceptions of subject terms and controlled vocabulary (2017) 0.02
```
0.020736888 = product of:
  0.08294755 = sum of:
    0.08294755 = weight(_text_:headings in 5156) [ClassicSimilarity], result of:
      0.08294755 = score(doc=5156,freq=2.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.37509373 = fieldWeight in 5156, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5156)
  0.25 = coord(1/4)
```
Abstract

Controlled vocabulary and subject headings in OPAC records have proven to be useful in improving search results. The authors used a survey to gather information about librarian opinions and professional use of controlled vocabulary. Data from a range of backgrounds and expertise were examined, including academic and public libraries, and technical services as well as public services professionals. Responses overall demonstrated positive opinions of the value of controlled vocabulary, including in reference interactions as well as during bibliographic instruction sessions. Results are also examined based upon factors such as age and type of librarian.
Tibbo, H.R.: ¬The epic struggle : subject retrieval from large bibliographic databases (1994) 0.02
```
0.017774476 = product of:
  0.0710979 = sum of:
    0.0710979 = weight(_text_:headings in 2179) [ClassicSimilarity], result of:
      0.0710979 = score(doc=2179,freq=2.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.3215089 = fieldWeight in 2179, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.046875 = fieldNorm(doc=2179)
  0.25 = coord(1/4)
```
Abstract

Discusses a retrieval study that focused on collection level archival records in the OCLC OLUC, made accessible through the EPIC online search system. Data were also collected from the local OPAC at North Carolina University at Chapel Hill (UNC-CH) in which UNC-CH produced OCLC records are loaded. The chief objective was to explore the retrieval environments in which a random sample of USMARC AMC records produced at UNC-CH were found: specifically to obtain a picture of the density of these databases in regard to each subject heading applied and, more generally, for each records. Key questions were: how many records would be retrieved for each subject heading attached to each of the records; and what was the nature of these subject headings vis a vis the numer of hits associated with them. Results show that large retrieval sets are a potential problem with national bibliographic utilities and that the local and national retrieval environments can vary greatly. The need for specifity in indexing is emphasized
McJunkin, M.C.: Precision and recall in title keyword searching (1995) 0.02
```
0.017774476 = product of:
  0.0710979 = sum of:
    0.0710979 = weight(_text_:headings in 3351) [ClassicSimilarity], result of:
      0.0710979 = score(doc=3351,freq=2.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.3215089 = fieldWeight in 3351, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.046875 = fieldNorm(doc=3351)
  0.25 = coord(1/4)
```
Abstract

Investigates the extent to which title keywords convey subject content and compares the relative effectiveness of searching title keywords using 2 search strategies to examine whether adjacency operators in title keyword searches are effective in improving recall and precision of online searching. Title keywords from a random sample of titles in the field of economics were searched on FirstSearch, using the WorldCat database, which is equivalent in coverage to the OCLC OLUC, with and without adjacency of the keywords specified. The LCSH of the items retrieved were compared with the sample title subject headings to determine the degree of match or relevance and the values for precision and recall were calculated. Results indicated that, when keywords were discipline specific, adjacency operators improved precision with little degradation of recall. Systems that allow positional operators or rank output by proximity of terms may increase search success
Abdou, S.; Savoy, J.: Searching in Medline : query expansion and manual indexing evaluation (2008) 0.02
```
0.017774476 = product of:
  0.0710979 = sum of:
    0.0710979 = weight(_text_:headings in 2062) [ClassicSimilarity], result of:
      0.0710979 = score(doc=2062,freq=2.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.3215089 = fieldWeight in 2062, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.046875 = fieldNorm(doc=2062)
  0.25 = coord(1/4)
```
Abstract

Based on a relatively large subset representing one third of the Medline collection, this paper evaluates ten different IR models, including recent developments in both probabilistic and language models. We show that the best performing IR models is a probabilistic model developed within the Divergence from Randomness framework [Amati, G., & van Rijsbergen, C.J. (2002) Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM-Transactions on Information Systems 20(4), 357-389], which result in 170% enhancements in mean average precision when compared to the classical tf idf vector-space model. This paper also reports on our impact evaluations on the retrieval effectiveness of manually assigned descriptors (MeSH or Medical Subject Headings), showing that by including these terms retrieval performance can improve from 2.4% to 13.5%, depending on the underling IR model. Finally, we design a new general blind-query expansion approach showing improved retrieval performances compared to those obtained using the Rocchio approach.
Bhattacharyya, K.: ¬The effectiveness of natural language in science indexing and retrieval (1974) 0.01
```
0.014869518 = product of:
  0.05947807 = sum of:
    0.05947807 = product of:
      0.11895614 = sum of:
        0.11895614 = weight(_text_:terminology in 2628) [ClassicSimilarity], result of:
          0.11895614 = score(doc=2628,freq=4.0), product of:
            0.24053115 = queryWeight, product of:
              5.2752647 = idf(docFreq=614, maxDocs=44218)
              0.045596033 = queryNorm
            0.49455607 = fieldWeight in 2628, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.2752647 = idf(docFreq=614, maxDocs=44218)
              0.046875 = fieldNorm(doc=2628)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

This paper examines the implications of the findings of evaluative tests regarding the retrieval performance of natural language in various subject fields. It suggests parallel investigations into the structure of natural language, with particular reference to terminology, as used in the different branches of basic science. The criteria for defining the terminological consistency of a subject are formulated and a measure suggested for determining the degree of terminological consistency. The terminological and information structures of specific disciplines such as, chemistry, physics, botany, zoology, and geology; the circumstances in which terms originate; and the efforts made by the international scientific community to standardize the terminology in their respective disciplines - are examined in detail. This investigation shows why and how an artificially created scientific language finds it impossible to keep pace with current developments and thus points to the source of strength of natural language
Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2017) 0.01
```
0.014812064 = product of:
  0.059248257 = sum of:
    0.059248257 = weight(_text_:headings in 3868) [ClassicSimilarity], result of:
      0.059248257 = score(doc=3868,freq=2.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.2679241 = fieldWeight in 3868, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3868)
  0.25 = coord(1/4)
```
Abstract

Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This paper reports on an analysis of the loss levels that would result if a bibliographic database, namely the Australian Education Index (AEI), were missing the subject descriptors and identifiers assigned by its professional indexers, employing the methodology developed by Gross and Taylor (2005), and later by Gross et al. (2015). The results indicate that AEI users would lose a similar proportion of hits per query to that experienced by library catalog users: on average, 27% of the resources found by a sample of keyword queries on the AEI database would not have been found without the subject indexing, based on the Australian Thesaurus of Education Descriptors (ATED). The paper also discusses the methodological limitations of these studies, pointing out that real-life users might still find some of the resources missed by a particular query through follow-up searches, while additional resources might also be found through iterative searching on the subject vocabulary. The paper goes on to describe a new research design, based on a before - and - after experiment, which addresses some of these limitations. It is argued that this alternative design will provide a more realistic picture of the value that professionally assigned subject indexing and controlled subject vocabularies can add to literature searching of a more scholarly and thorough kind.
Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2018) 0.01
```
0.014812064 = product of:
  0.059248257 = sum of:
    0.059248257 = weight(_text_:headings in 4300) [ClassicSimilarity], result of:
      0.059248257 = score(doc=4300,freq=2.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.2679241 = fieldWeight in 4300, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4300)
  0.25 = coord(1/4)
```
Abstract

Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This article reports on an investigation of the search value that subject descriptors and identifiers assigned by professional indexers add to a bibliographic database, namely the Australian Education Index (AEI). First, a similar methodology to that developed by Gross et al. (2015) was applied, with keyword searches representing a range of educational topics run on the AEI database with and without its subject indexing. The results indicated that AEI users would also lose, on average, about a quarter of hits per query. Second, an alternative research design was applied in which an experienced literature searcher was asked to find resources on a set of educational topics on an AEI database stripped of its subject indexing and then asked to search for additional resources on the same topics after the subject indexing had been reinserted. In this study, the proportion of additional resources that would have been lost had it not been for the subject indexing was again found to be about a quarter of the total resources found for each topic, on average.
Cleverdon, C.W.; Mills, J.: ¬The testing of index language devices (1985) 0.01
```
0.01184965 = product of:
  0.0473986 = sum of:
    0.0473986 = weight(_text_:headings in 3643) [ClassicSimilarity], result of:
      0.0473986 = score(doc=3643,freq=2.0), product of:
        0.22113821 = queryWeight, product of:
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.045596033 = queryNorm
        0.21433927 = fieldWeight in 3643, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.849944 = idf(docFreq=940, maxDocs=44218)
          0.03125 = fieldNorm(doc=3643)
  0.25 = coord(1/4)
```
Abstract

A landmark event in the twentieth-century development of subject analysis theory was a retrieval experiment, begun in 1957, by Cyril Cleverdon, Librarian of the Cranfield Institute of Technology. For this work he received the Professional Award of the Special Libraries Association in 1962 and the Award of Merit of the American Society for Information Science in 1970. The objective of the experiment, called Cranfield I, was to test the ability of four indexing systems-UDC, Facet, Uniterm, and Alphabetic-Subject Headings-to retrieve material responsive to questions addressed to a collection of documents. The experiment was ambitious in scale, consisting of eighteen thousand documents and twelve hundred questions. Prior to Cranfield I, the question of what constitutes good indexing was approached subjectively and reference was made to assumptions in the form of principles that should be observed or user needs that should be met. Cranfield I was the first large-scale effort to use objective criteria for determining the parameters of good indexing. Its creative impetus was the definition of user satisfaction in terms of precision and recall. Out of the experiment emerged the definition of recall as the percentage of relevant documents retrieved and precision as the percentage of retrieved documents that were relevant. Operationalizing the concept of user satisfaction, that is, making it measurable, meant that it could be studied empirically and manipulated as a variable in mathematical equations. Much has been made of the fact that the experimental methodology of Cranfield I was seriously flawed. This is unfortunate as it tends to diminish Cleverdon's contribu tion, which was not methodological-such contributions can be left to benchmark researchers-but rather creative: the introduction of a new paradigm, one that proved to be eminently productive. The criticism leveled at the methodological shortcomings of Cranfield I underscored the need for more precise definitions of the variables involved in information retrieval. Particularly important was the need for a definition of the dependent variable index language. Like the definitions of precision and recall, that of index language provided a new way of looking at the indexing process. It was a re-visioning that stimulated research activity and led not only to a better understanding of indexing but also the design of better retrieval systems." Cranfield I was followed by Cranfield II. While Cranfield I was a wholesale comparison of four indexing "systems," Cranfield II aimed to single out various individual factors in index languages, called "indexing devices," and to measure how variations in these affected retrieval performance. The following selection represents the thinking at Cranfield midway between these two notable retrieval experiments.

Fuhr, N.; Niewelt, B.: ¬Ein Retrievaltest mit automatisch indexierten Dokumenten (1984) 0.01

0.010810858 = product of:
  0.04324343 = sum of:
    0.04324343 = product of:
      0.08648686 = sum of:
        0.08648686 = weight(_text_:22 in 262) [ClassicSimilarity], result of:
          0.08648686 = score(doc=262,freq=2.0), product of:
            0.15966953 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045596033 = queryNorm
            0.5416616 = fieldWeight in 262, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=262)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Date: 20.10.2000 12:22:23

Tomaiuolo, N.G.; Parker, J.: Maximizing relevant retrieval : keyword and natural language searching (1998) 0.01

0.010810858 = product of:
  0.04324343 = sum of:
    0.04324343 = product of:
      0.08648686 = sum of:
        0.08648686 = weight(_text_:22 in 6418) [ClassicSimilarity], result of:
          0.08648686 = score(doc=6418,freq=2.0), product of:
            0.15966953 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045596033 = queryNorm
            0.5416616 = fieldWeight in 6418, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=6418)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Source: Online. 22(1998) no.6, S.57-58

Voorhees, E.M.; Harman, D.: Overview of the Sixth Text REtrieval Conference (TREC-6) (2000) 0.01

0.010810858 = product of:
  0.04324343 = sum of:
    0.04324343 = product of:
      0.08648686 = sum of:
        0.08648686 = weight(_text_:22 in 6438) [ClassicSimilarity], result of:
          0.08648686 = score(doc=6438,freq=2.0), product of:
            0.15966953 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045596033 = queryNorm
            0.5416616 = fieldWeight in 6438, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=6438)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Date: 11. 8.2001 16:22:19

Dalrymple, P.W.: Retrieval by reformulation in two library catalogs : toward a cognitive model of searching behavior (1990) 0.01

0.010810858 = product of:
  0.04324343 = sum of:
    0.04324343 = product of:
      0.08648686 = sum of:
        0.08648686 = weight(_text_:22 in 5089) [ClassicSimilarity], result of:
          0.08648686 = score(doc=5089,freq=2.0), product of:
            0.15966953 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045596033 = queryNorm
            0.5416616 = fieldWeight in 5089, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=5089)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Date: 22. 7.2006 18:43:54

Tamine, L.; Chouquet, C.; Palmer, T.: Analysis of biomedical and health queries : lessons learned from TREC and CLEF evaluation benchmarks (2015) 0.01
```
0.008761947 = product of:
  0.03504779 = sum of:
    0.03504779 = product of:
      0.07009558 = sum of:
        0.07009558 = weight(_text_:terminology in 2341) [ClassicSimilarity], result of:
          0.07009558 = score(doc=2341,freq=2.0), product of:
            0.24053115 = queryWeight, product of:
              5.2752647 = idf(docFreq=614, maxDocs=44218)
              0.045596033 = queryNorm
            0.29141995 = fieldWeight in 2341, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2752647 = idf(docFreq=614, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2341)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

A large body of research work examined, from both the query side and the user behavior side, the characteristics of medical- and health-related searches. One of the core issues in medical information retrieval (IR) is diversity of tasks that lead to diversity of categories of information needs and queries. From the evaluation perspective, another related and challenging issue is the limited availability of appropriate test collections allowing the experimental validation of medically task oriented IR techniques and systems. In this paper, we explore the peculiarities of TREC and CLEF medically oriented tasks and queries through the analysis of the differences and the similarities between queries across tasks, with respect to length, specificity, and clarity features and then study their effect on retrieval performance. We show that, even for expert oriented queries, language specificity level varies significantly across tasks as well as search difficulty. Additional findings highlight that query clarity factors are task dependent and that query terms specificity based on domain-specific terminology resources is not significantly linked to term rareness in the document collection. The lessons learned from our study could serve as starting points for the design of future task-based medical information retrieval frameworks.

Allan, J.; Callan, J.P.; Croft, W.B.; Ballesteros, L.; Broglio, J.; Xu, J.; Shu, H.: INQUERY at TREC-5 (1997) 0.01

0.007722041 = product of:
  0.030888164 = sum of:
    0.030888164 = product of:
      0.06177633 = sum of:
        0.06177633 = weight(_text_:22 in 3103) [ClassicSimilarity], result of:
          0.06177633 = score(doc=3103,freq=2.0), product of:
            0.15966953 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045596033 = queryNorm
            0.38690117 = fieldWeight in 3103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=3103)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Date: 27. 2.1999 20:55:22

Search (50 results, page 1 of 3)

Authors

Years

Languages

Types

Themes