Search (87 results, page 5 of 5)

Lu, K.; Mao, J.; Li, G.: Toward effective automated weighted subject indexing : a comparison of different approaches in different environments (2018) 0.00
```
0.0019158293 = product of:
  0.005747488 = sum of:
    0.005747488 = weight(_text_:a in 4292) [ClassicSimilarity], result of:
      0.005747488 = score(doc=4292,freq=6.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.11032722 = fieldWeight in 4292, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4292)
  0.33333334 = coord(1/3)
```
Abstract

Subject indexing plays an important role in supporting subject access to information resources. Current subject indexing systems do not make adequate distinctions on the importance of assigned subject descriptors. Assigning numeric weights to subject descriptors to distinguish their importance to the documents can strengthen the role of subject metadata. Automated methods are more cost-effective. This study compares different automated weighting methods in different environments. Two evaluation methods were used to assess the performance. Experiments on three datasets in the biomedical domain suggest the performance of different weighting methods depends on whether it is an abstract or full text environment. Mutual information with bag-of-words representation shows the best average performance in the full text environment, while cosine with bag-of-words representation is the best in an abstract environment. The cosine measure has relatively consistent and robust performance. A direct weighting method, IDF (Inverse Document Frequency), can produce quick and reasonable estimates of the weights. Bag-of-words representation generally outperforms the concept-based representation. Further improvement in performance can be obtained by using the learning-to-rank method to integrate different weighting methods. This study follows up Lu and Mao (Journal of the Association for Information Science and Technology, 66, 1776-1784, 2015), in which an automated weighted subject indexing method was proposed and validated. The findings from this study contribute to more effective weighted subject indexing.

Type

a
Braam, R.R.; Bruil, J.: Quality of indexing information : authors' views on indexing of their articles in chemical abstracts online CA-file (1992) 0.00
```
0.0018771215 = product of:
  0.0056313644 = sum of:
    0.0056313644 = weight(_text_:a in 2638) [ClassicSimilarity], result of:
      0.0056313644 = score(doc=2638,freq=4.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.10809815 = fieldWeight in 2638, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=2638)
  0.33333334 = coord(1/3)
```
Abstract

Studies the quality of subject indexing by Chemical Abstracts Indexing Service by confronting authors with the particular indexing terms attributed to their computer, for 270 articles published in 54 journals, 5 articles out of each journal. Responses (80%) indicate the superior quality of keywords, both as content descriptors and as retrieval tools. Author judgements on these 2 different aspects do not always converge, however. CAS's indexing policy to cover only 'new' aspects is reflected in author's judgements that index lists are somewhat incomplete, in particular in the case of thesaurus terms (index headings). The large effort expanded by CAS in maintaining and using a subject thesuaurs, in order to select valid index headings, as compared to quick and cheap keyword postings, does not lead to clear superior quality of thesaurus terms for document description nor in retrieval. Some 20% of papers were not placed in 'proper' CA main section, according to authors. As concerns the use of indexing data by third parties, in bibliometrics, users should be aware of the indexing policies behind the data, in order to prevent invalid interpretations

Type

a
Hudon, M.: Conceptual compatibility in controlled language tools used to index and access the content of moving image collections (2004) 0.00
```
0.0018771215 = product of:
  0.0056313644 = sum of:
    0.0056313644 = weight(_text_:a in 2655) [ClassicSimilarity], result of:
      0.0056313644 = score(doc=2655,freq=4.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.10809815 = fieldWeight in 2655, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=2655)
  0.33333334 = coord(1/3)
```
Abstract

Five controlled vocabularies currently used for content representation in collections of non art moving images were examined to determine their level of conceptual compatibility. Methods borrowed from previous research in the area of indexing language compatibility were used. Quantitative data and qualitative observations allowed us to estimate more precisely and realistically the actual degree of conceptual redundancy in these indexing languages. It was found that the conceptual overlap is high enough to justify the pursuit of research and development work an a common basic indexing and access language that could be used to name objects, events, categories of persons, and relations most frequently depicted in non art moving image collections.

Type

a

Chan, L.M.: Alphabetical arrangement and subject collocation in Library of Congress Subject Headings (1977) 0.00

0.0017697671 = product of:
  0.0053093014 = sum of:
    0.0053093014 = weight(_text_:a in 2268) [ClassicSimilarity], result of:
      0.0053093014 = score(doc=2268,freq=2.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.10191591 = fieldWeight in 2268, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0625 = fieldNorm(doc=2268)
  0.33333334 = coord(1/3)

Type: a

Iivonen, M.: ¬The impact of the indexing environment on interindexer consistency (1990) 0.00

0.0017697671 = product of:
  0.0053093014 = sum of:
    0.0053093014 = weight(_text_:a in 4779) [ClassicSimilarity], result of:
      0.0053093014 = score(doc=4779,freq=2.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.10191591 = fieldWeight in 4779, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0625 = fieldNorm(doc=4779)
  0.33333334 = coord(1/3)

Type: a

Prasher, R.G.: Evaluation of indexing system (1989) 0.00

0.0017697671 = product of:
  0.0053093014 = sum of:
    0.0053093014 = weight(_text_:a in 4998) [ClassicSimilarity], result of:
      0.0053093014 = score(doc=4998,freq=2.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.10191591 = fieldWeight in 4998, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0625 = fieldNorm(doc=4998)
  0.33333334 = coord(1/3)

Type: a

Iivonen, M.; Kivimäki, K.: Common entities and missing properties : similarities and differences in the indexing of concepts (1998) 0.00

0.0013273255 = product of:
  0.0039819763 = sum of:
    0.0039819763 = weight(_text_:a in 3074) [ClassicSimilarity], result of:
      0.0039819763 = score(doc=3074,freq=2.0), product of:
        0.05209492 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.045180224 = queryNorm
        0.07643694 = fieldWeight in 3074, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=3074)
  0.33333334 = coord(1/3)

Type: a

Search (87 results, page 5 of 5)

Authors

Years

Types

Themes