Search (32 results, page 1 of 2)

Veenema, F.: To index or not to index (1996) 0.04

0.04254054 = product of:
  0.08508108 = sum of:
    0.08508108 = sum of:
      0.028259892 = weight(_text_:2 in 7247) [ClassicSimilarity], result of:
        0.028259892 = score(doc=7247,freq=2.0), product of:
          0.1294644 = queryWeight, product of:
            2.4695914 = idf(docFreq=10170, maxDocs=44218)
            0.05242341 = queryNorm
          0.2182831 = fieldWeight in 7247, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.4695914 = idf(docFreq=10170, maxDocs=44218)
            0.0625 = fieldNorm(doc=7247)
      0.056821186 = weight(_text_:22 in 7247) [ClassicSimilarity], result of:
        0.056821186 = score(doc=7247,freq=2.0), product of:
          0.18357785 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05242341 = queryNorm
          0.30952093 = fieldWeight in 7247, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=7247)
  0.5 = coord(1/2)

Source: Canadian journal of information and library science. 21(1996) no.2, S.1-22

Booth, A.: How consistent is MEDLINE indexing? (1990) 0.04

0.042344183 = product of:
  0.084688365 = sum of:
    0.084688365 = sum of:
      0.034969833 = weight(_text_:2 in 3510) [ClassicSimilarity], result of:
        0.034969833 = score(doc=3510,freq=4.0), product of:
          0.1294644 = queryWeight, product of:
            2.4695914 = idf(docFreq=10170, maxDocs=44218)
            0.05242341 = queryNorm
          0.27011156 = fieldWeight in 3510, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            2.4695914 = idf(docFreq=10170, maxDocs=44218)
            0.0546875 = fieldNorm(doc=3510)
      0.049718536 = weight(_text_:22 in 3510) [ClassicSimilarity], result of:
        0.049718536 = score(doc=3510,freq=2.0), product of:
          0.18357785 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05242341 = queryNorm
          0.2708308 = fieldWeight in 3510, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=3510)
  0.5 = coord(1/2)

Abstract: A known-item search for abstracts to previously retrieved references revealed that 2 documents from the same annual volume had been indexed twice. Working from the premise that the whole volume may have been double-indexed, a search strategy was devised that limited the journal code to the year in question. 57 references were retrieved, comprising 28 pairs of duplicates plus a citation for the whole volume. Author, title, source and descriptors were requested off-line and the citations were paired with their duplicates. The 4 categories of descriptors-major descriptors, minor descriptors, subheadings and check-tags-were compared for depth and consistency of indexing and lessons that might be learnt from the study are discussed.
Date: 17.12.1995 2:15:31
Source: Health libraries review. 7(1990) no.1, S.22-26

Subrahmanyam, B.: Library of Congress Classification numbers : issues of consistency and their implications for union catalogs (2006) 0.03

0.026587836 = product of:
  0.053175673 = sum of:
    0.053175673 = sum of:
      0.017662432 = weight(_text_:2 in 5784) [ClassicSimilarity], result of:
        0.017662432 = score(doc=5784,freq=2.0), product of:
          0.1294644 = queryWeight, product of:
            2.4695914 = idf(docFreq=10170, maxDocs=44218)
            0.05242341 = queryNorm
          0.13642694 = fieldWeight in 5784, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.4695914 = idf(docFreq=10170, maxDocs=44218)
            0.0390625 = fieldNorm(doc=5784)
      0.03551324 = weight(_text_:22 in 5784) [ClassicSimilarity], result of:
        0.03551324 = score(doc=5784,freq=2.0), product of:
          0.18357785 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05242341 = queryNorm
          0.19345059 = fieldWeight in 5784, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=5784)
  0.5 = coord(1/2)

Date: 10. 9.2000 17:38:22
Source: Library resources and technical services. 50(2006) no.2, S.111-119

Cleverdon, C.W.: ASLIB Cranfield Research Project : Report on the first stage of an investigation into the comparative efficiency of indexing systems (1960) 0.02

0.021307945 = product of:
  0.04261589 = sum of:
    0.04261589 = product of:
      0.08523178 = sum of:
        0.08523178 = weight(_text_:22 in 6158) [ClassicSimilarity], result of:
          0.08523178 = score(doc=6158,freq=2.0), product of:
            0.18357785 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05242341 = queryNorm
            0.46428138 = fieldWeight in 6158, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=6158)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Footnote: Rez. in: College and research libraries 22(1961) no.3, S.228 (G. Jahoda)

Tinker, F.F.: Imprecision in meaning measured by inconsistency of indexing (1966-68) 0.02

0.0152961165 = product of:
  0.030592233 = sum of:
    0.030592233 = product of:
      0.061184466 = sum of:
        0.061184466 = weight(_text_:2 in 2275) [ClassicSimilarity], result of:
          0.061184466 = score(doc=2275,freq=6.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.47259682 = fieldWeight in 2275, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.078125 = fieldNorm(doc=2275)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Content: Ergebnisse: (1) Wenn SW frei gewählt, Recherche um so schwieriger, je mehr SW; (2) 'ältere' SW häufiger und weniger genau verwendet als 'jüngere'; (3) viele Wörter mit ungenauer Bedeutung
Issue: T.1-2.
Source: American documentation. 17(1966) no.2, S.96-102 (T.1); 19(1988) no.3, S

Neshat, N.; Horri, A.: ¬A study of subject indexing consistency between the National Library of Iran and Humanities Libraries in the area of Iranian studies (2006) 0.01

0.012429634 = product of:
  0.024859268 = sum of:
    0.024859268 = product of:
      0.049718536 = sum of:
        0.049718536 = weight(_text_:22 in 230) [ClassicSimilarity], result of:
          0.049718536 = score(doc=230,freq=2.0), product of:
            0.18357785 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05242341 = queryNorm
            0.2708308 = fieldWeight in 230, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=230)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 4. 1.2007 10:22:26

Cleverdon, C.W.; Mills, J.; Keen, M.: Factors determining the performance of indexing systems : ASLIB Cranfield research project (1966) 0.01

0.012363703 = product of:
  0.024727406 = sum of:
    0.024727406 = product of:
      0.049454812 = sum of:
        0.049454812 = weight(_text_:2 in 5363) [ClassicSimilarity], result of:
          0.049454812 = score(doc=5363,freq=2.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.38199544 = fieldWeight in 5363, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.109375 = fieldNorm(doc=5363)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Issue: Vol.1: Design; Vol.2: Test results

Taniguchi, S.: Recording evidence in bibliographic records and descriptive metadata (2005) 0.01

0.010653973 = product of:
  0.021307945 = sum of:
    0.021307945 = product of:
      0.04261589 = sum of:
        0.04261589 = weight(_text_:22 in 3565) [ClassicSimilarity], result of:
          0.04261589 = score(doc=3565,freq=2.0), product of:
            0.18357785 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05242341 = queryNorm
            0.23214069 = fieldWeight in 3565, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=3565)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 18. 6.2005 13:16:22

Leininger, K.: Interindexer consistency in PsychINFO (2000) 0.01

0.010653973 = product of:
  0.021307945 = sum of:
    0.021307945 = product of:
      0.04261589 = sum of:
        0.04261589 = weight(_text_:22 in 2552) [ClassicSimilarity], result of:
          0.04261589 = score(doc=2552,freq=2.0), product of:
            0.18357785 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05242341 = queryNorm
            0.23214069 = fieldWeight in 2552, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2552)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 9. 2.1997 18:44:22

Bade, D.: ¬The creation and persistence of misinformation in shared library catalogs : language and subject knowledge in a technological era (2002) 0.01
```
0.010635135 = product of:
  0.02127027 = sum of:
    0.02127027 = sum of:
      0.007064973 = weight(_text_:2 in 1858) [ClassicSimilarity], result of:
        0.007064973 = score(doc=1858,freq=2.0), product of:
          0.1294644 = queryWeight, product of:
            2.4695914 = idf(docFreq=10170, maxDocs=44218)
            0.05242341 = queryNorm
          0.054570775 = fieldWeight in 1858, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.4695914 = idf(docFreq=10170, maxDocs=44218)
            0.015625 = fieldNorm(doc=1858)
      0.0142052965 = weight(_text_:22 in 1858) [ClassicSimilarity], result of:
        0.0142052965 = score(doc=1858,freq=2.0), product of:
          0.18357785 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05242341 = queryNorm
          0.07738023 = fieldWeight in 1858, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.015625 = fieldNorm(doc=1858)
  0.5 = coord(1/2)
```
Date

22. 9.1997 19:16:05

Footnote

Rez. in JASIST 54(2003) no.4, S.356-357 (S.J. Lincicum): "Reliance upon shared cataloging in academic libraries in the United States has been driven largely by the need to reduce the expense of cataloging operations without muck regard for the Impact that this approach might have an the quality of the records included in local catalogs. In recent years, ever increasing pressures have prompted libraries to adopt practices such as "rapid" copy cataloging that purposely reduce the scrutiny applied to bibliographic records downloaded from shared databases, possibly increasing the number of errors that slip through unnoticed. Errors in bibliographic records can lead to serious problems for library catalog users. If the data contained in bibliographic records is inaccurate, users will have difficulty discovering and recognizing resources in a library's collection that are relevant to their needs. Thus, it has become increasingly important to understand the extent and nature of errors that occur in the records found in large shared bibliographic databases, such as OCLC WorldCat, to develop cataloging practices optimized for the shared cataloging environment. Although this monograph raises a few legitimate concerns about recent trends in cataloging practice, it fails to provide the "detailed look" at misinformation in library catalogs arising from linguistic errors and mistakes in subject analysis promised by the publisher. A basic premise advanced throughout the text is that a certain amount of linguistic and subject knowledge is required to catalog library materials effectively. The author emphasizes repeatedly that most catalogers today are asked to catalog an increasingly diverse array of materials, and that they are often required to work in languages or subject areas of which they have little or no knowledge. He argues that the records contributed to shared databases are increasingly being created by catalogers with inadequate linguistic or subject expertise. This adversely affects the quality of individual library catalogs because errors often go uncorrected as records are downloaded from shared databases to local catalogs by copy catalogers who possess even less knowledge. Calling misinformation an "evil phenomenon," Bade states that his main goal is to discuss, "two fundamental types of misinformation found in bibliographic and authority records in library catalogs: that arising from linguistic errors, and that caused by errors in subject analysis, including missing or wrong subject headings" (p. 2). After a superficial discussion of "other" types of errors that can occur in bibliographic records, such as typographical errors and errors in the application of descriptive cataloging rules, Bade begins his discussion of linguistic errors. He asserts that sharing bibliographic records created by catalogers with inadequate linguistic or subject knowledge has, "disastrous effects an the library community" (p. 6). To support this bold assertion, Bade provides as evidence little more than a laundry list of errors that he has personally observed in bibliographic records over the years. When he eventually cites several studies that have addressed the availability and quality of records available for materials in languages other than English, he fails to describe the findings of these studies in any detail, let alone relate the findings to his own observations in a meaningful way. Bade claims that a lack of linguistic expertise among catalogers is the "primary source for linguistic misinformation in our databases" (p. 10), but he neither cites substantive data from existing studies nor provides any new data regarding the overall level of linguistic knowledge among catalogers to support this claim. The section concludes with a brief list of eight sensible, if unoriginal, suggestions for coping with the challenge of cataloging materials in unfamiliar languages.

Pimenov, E.N.: O faktorah, vliyayushchikh na indeksirivanie : indeksirovanie i predmetnaya oblast' (2000) 0.01

0.010597459 = product of:
  0.021194918 = sum of:
    0.021194918 = product of:
      0.042389836 = sum of:
        0.042389836 = weight(_text_:2 in 898) [ClassicSimilarity], result of:
          0.042389836 = score(doc=898,freq=2.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.32742465 = fieldWeight in 898, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.09375 = fieldNorm(doc=898)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Nauchno- Tekhnicheskaya Informatsiya; Series 1. 2000, no.2, S.15-23

White, H.; Willis, C.; Greenberg, J.: HIVEing : the effect of a semantic web technology on inter-indexer consistency (2014) 0.01
```
0.00887831 = product of:
  0.01775662 = sum of:
    0.01775662 = product of:
      0.03551324 = sum of:
        0.03551324 = weight(_text_:22 in 1781) [ClassicSimilarity], result of:
          0.03551324 = score(doc=1781,freq=2.0), product of:
            0.18357785 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05242341 = queryNorm
            0.19345059 = fieldWeight in 1781, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1781)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Purpose - The purpose of this paper is to examine the effect of the Helping Interdisciplinary Vocabulary Engineering (HIVE) system on the inter-indexer consistency of information professionals when assigning keywords to a scientific abstract. This study examined first, the inter-indexer consistency of potential HIVE users; second, the impact HIVE had on consistency; and third, challenges associated with using HIVE. Design/methodology/approach - A within-subjects quasi-experimental research design was used for this study. Data were collected using a task-scenario based questionnaire. Analysis was performed on consistency results using Hooper's and Rolling's inter-indexer consistency measures. A series of t-tests was used to judge the significance between consistency measure results. Findings - Results suggest that HIVE improves inter-indexing consistency. Working with HIVE increased consistency rates by 22 percent (Rolling's) and 25 percent (Hooper's) when selecting relevant terms from all vocabularies. A statistically significant difference exists between the assignment of free-text keywords and machine-aided keywords. Issues with homographs, disambiguation, vocabulary choice, and document structure were all identified as potential challenges. Research limitations/implications - Research limitations for this study can be found in the small number of vocabularies used for the study. Future research will include implementing HIVE into the Dryad Repository and studying its application in a repository system. Originality/value - This paper showcases several features used in HIVE system. By using traditional consistency measures to evaluate a semantic web technology, this paper emphasizes the link between traditional indexing and next generation machine-aided indexing (MAI) tools.
Keen, E.M.: Designing and testing an interactive ranked retrieval system for professional searchers (1994) 0.01
```
0.008831216 = product of:
  0.017662432 = sum of:
    0.017662432 = product of:
      0.035324864 = sum of:
        0.035324864 = weight(_text_:2 in 1066) [ClassicSimilarity], result of:
          0.035324864 = score(doc=1066,freq=8.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.27285388 = fieldWeight in 1066, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1066)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Reports 3 explorations of ranked system design. 2 tests used a 'cystic fibrosis' test collection with 100 queries. Experiment 1 compared a Boolean with a ranked interactive system using a subject qualified trained searcher, and reporting recall and precision results. Experiment 2 compared 15 different ranked match algorithms in a batch mode using 2 test collections, and included some new proximate pairs and term weighting approaches. Experiment 3 is a design plan for an interactive ranked prototype offering mid search algorithm choices plus other manual search devices (such as obligatory and unwanted terms), as influenced by thinking aloud comments from experiment 1. Concludes that, in Boolean versus ranked using inverse collection frequency, the searcher inspected more records on ranked than Boolean and so achieved a higher recall but lower precision; however, the presentation order of the relevant records, was, on average, very similar in both systems. Concludes also that: query reformulation was quite strongly practised in ranked searching but does not appear to have been effective; the term pairs proximate weithing methods in experiment 2 enhanced precision on both test collections when used with inverse collection frequency weighting (ICF); and the design plan for an interactive prototype adds to a selection of match algorithms other devices, such as obligatory and unwanted term marking, evidence for this being found from think aloud comments

Broxis, P.F.: ASSIA social science information service (1989) 0.01

0.008831216 = product of:
  0.017662432 = sum of:
    0.017662432 = product of:
      0.035324864 = sum of:
        0.035324864 = weight(_text_:2 in 1511) [ClassicSimilarity], result of:
          0.035324864 = score(doc=1511,freq=2.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.27285388 = fieldWeight in 1511, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.078125 = fieldNorm(doc=1511)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Outlook on research libraries. 11(1989) no.2, S.3-8

Gretz, M.; Thomas, M.: Indexierungen in biomedizinischen Literaturdatenbanken : eine vergleichende Analyse (1991) 0.01
```
0.008742458 = product of:
  0.017484916 = sum of:
    0.017484916 = product of:
      0.034969833 = sum of:
        0.034969833 = weight(_text_:2 in 5104) [ClassicSimilarity], result of:
          0.034969833 = score(doc=5104,freq=4.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.27011156 = fieldWeight in 5104, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5104)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Auf der Grundlage von vier Originaldokumenten, d.h. dokumentarischen Bezugseinheiten (DBEs), wird die Indexierung in vier biomedizinischen Online-Datenbanken (MEDLINE, EMBASE, BIOSIS PREVIEWS, SCISEARCH) analysiert. Anhand von Beispielen werden inahltliche Erschließung, Indexierungstiefe, Indexierungsbreite, Indexierungskonsistenz, Präzision (durch syntaktisches Indexieren, Gewichtung, Proximity Operatoren) und Wiederauffindbarkeit (Recall) der in den Datenbanken gespeicherten Dokumentationseinheien (DBEs) untersucht. Die zeitaufwendigere intellektuelle Indexierung bei MEDLINE und EMBASE erweist sich als wesentlich präziser als die schneller verfügbare maschinelle Zuteilung von Deskriptoren in BIOSIS PREVIEWS und SCISEARCH. In Teil 1 der Untersuchung werden die Indexierungen in MEDLINE und EMBASE, in Teil 2 die Deskriptorenzuteilungen in BIOSIS PREVIEWS und SCISEARCH verglichen

Source

Nachrichten für Dokumentation. 42(1991) H.5, S.337-344 (T.1); 42(1991) H.6, S.405-411 (T.2)
Qin, J.: Semantic similarities between a keyword database and a controlled vocabulary database : an investigation in the antibiotic resistance literature (2000) 0.01
```
0.0076480582 = product of:
  0.0152961165 = sum of:
    0.0152961165 = product of:
      0.030592233 = sum of:
        0.030592233 = weight(_text_:2 in 4386) [ClassicSimilarity], result of:
          0.030592233 = score(doc=4386,freq=6.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.23629841 = fieldWeight in 4386, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4386)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

The 'KeyWords Plus' in the Science Citation Index database represents an approach to combining citation and semantic indexing in describing the document content. This paper explores the similariites or dissimilarities between citation-semantic and analytic indexing. The dataset consisted of over 400 matching records in the SCI and MEDLINE databases on antibiotic resistance in pneumonia. The degree of similarity in indexing terms was found to vary on a scale from completely different to completely identical with various levels in between. The within-document similarity in the 2 databases was measured by a variation on the Jaccard coefficient - the Inclusion Index. The average inclusion coefficient was 0,4134 for SCI and 0,3371 for Medline. The 20 terms occuring most frequently in each database were identified. The 2 groups of terms shared the same terms that consist of the 'intellectual base' for the subject. conceptual similarity was analyzed through scatterplots of matching and nonmatching terms vs. partially identical and broader/narrower terms. The study also found that both databases differed in assigning terms in various semantic categories. Implications of this research and further studies are suggested

Source

Journal of the American Society for Information Science. 51(2000) no.2, S.166-180
Harter, S.P.; Cheng, Y.-R.: Colinked descriptors : improving vocabulary selection for end-user searching (1996) 0.01
```
0.0074935355 = product of:
  0.014987071 = sum of:
    0.014987071 = product of:
      0.029974142 = sum of:
        0.029974142 = weight(_text_:2 in 4216) [ClassicSimilarity], result of:
          0.029974142 = score(doc=4216,freq=4.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.2315242 = fieldWeight in 4216, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.046875 = fieldNorm(doc=4216)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This article introduces a new concept and technique for information retrieval called 'colinked descriptors'. Borrowed from an analogous idea in bibliometrics - cocited references - colinked descriptors provide a theory and method for identifying search terms that, by hypothesis, will be superior to those entered initially by a searcher. The theory suggests a means of moving automatically from 2 or more initial search terms, to other terms that should be superior in retrieval performance to the 2 original terms. A research project designed to test this colinked descriptor hypothesis is reported. The results suggest that the approach is effective, although methodological problems in testing the idea are reported. Algorithms to generate colinked descriptors can be incorporated easily into system interfaces, front-end or pre-search systems, or help software, in any database that employs a thesaurus. The potential use of colinked descriptors is a strong argument for building richer and more complex thesauri that reflect as many legitimate links among descriptors as possible

Chan, L.M.: Inter-indexer consistency in subject cataloging (1989) 0.01

0.007064973 = product of:
  0.014129946 = sum of:
    0.014129946 = product of:
      0.028259892 = sum of:
        0.028259892 = weight(_text_:2 in 2276) [ClassicSimilarity], result of:
          0.028259892 = score(doc=2276,freq=2.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.2182831 = fieldWeight in 2276, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.0625 = fieldNorm(doc=2276)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Abstract: The purpose of the current study has been twofold: (1) to develop a valid methodology for studying indexing consistency in MARC records and, (2) to study such consistency in subject cataloging practice between non-LC libraries and the Library of Congress

Haanen, E.: Specificiteit en consistentie : een kwantitatief oderzoek naar trefwoordtoekenning door UBA en UBN (1991) 0.01

0.007064973 = product of:
  0.014129946 = sum of:
    0.014129946 = product of:
      0.028259892 = sum of:
        0.028259892 = weight(_text_:2 in 4778) [ClassicSimilarity], result of:
          0.028259892 = score(doc=4778,freq=2.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.2182831 = fieldWeight in 4778, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.0625 = fieldNorm(doc=4778)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Open. 23(1991) no.2, S.45-49

David, C.; Giroux, L.; Bertrand-Gastaldy, S.; Lanteigne, D.: Indexing as problem solving : a cognitive approach to consistency (1995) 0.01

0.007064973 = product of:
  0.014129946 = sum of:
    0.014129946 = product of:
      0.028259892 = sum of:
        0.028259892 = weight(_text_:2 in 3833) [ClassicSimilarity], result of:
          0.028259892 = score(doc=3833,freq=2.0), product of:
            0.1294644 = queryWeight, product of:
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.05242341 = queryNorm
            0.2182831 = fieldWeight in 3833, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4695914 = idf(docFreq=10170, maxDocs=44218)
              0.0625 = fieldNorm(doc=3833)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Abstract: Presents results of an experiment in which 8 indexers (4 beginners and 4 experts) were asked to index the same 4 documents with 2 different thesauri. The 3 kind of verbal reports provide complementary data on strategic behaviour. it is of prime importance to consider the indexing task as an ill-defined problem, where the solutionm is partly defined by the indexer

Search (32 results, page 1 of 2)

Authors

Years

Languages

Types

Themes