Document (#30440)

Author
Bloomfield, M.
Title
Indexing : neglected and poorly understood
Source
Cataloging and classification quarterly. 33(2001) no.1, pp. 63-75
Year
2001
Abstract
The growth of the Internet has highlighted the use of machine indexing. The difficulties in using the Internet as a searching device can be frustrating. The use of the term "Python" is given as an example. Machine indexing is noted as "rotten" and human indexing as "capricious." The problem seems to be a lack of a theoretical foundation for the art of indexing. What librarians have learned over the last hundred years has yet to yield a consistent approach to what really works best in preparing index terms and in the ability of our customers to search the various indexes. An attempt is made to consider the elements of indexing, their pros and cons. The argument is made that machine indexing is far too prolific in its production of index terms. Neither librarians nor computer programmers have made much progress to improve Internet indexing. Human indexing has had the same problems for over fifty years.
Footnote
See also: http://catalogingandclassificationquarterly.com/
Theme
Automatic indexing
Internet

Similar documents (content)

  1. Lancaster, F.W.: Trends in subject indexing from 1957 to 2000 (1980) 0.17
    0.16907863 = sum of:
      0.16907863 = product of:
        0.8453931 = sum of:
          0.052157704 = weight(abstract_txt:terms in 208) [ClassicSimilarity], result of:
            0.052157704 = score(doc=208,freq=2.0), product of:
              0.09728264 = queryWeight, product of:
                1.1112505 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.021648439 = queryNorm
              0.53614604 = fieldWeight in 208, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.08447355 = weight(abstract_txt:index in 208) [ClassicSimilarity], result of:
            0.08447355 = score(doc=208,freq=2.0), product of:
              0.13416429 = queryWeight, product of:
                1.3050067 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.021648439 = queryNorm
              0.6296277 = fieldWeight in 208, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.079845764 = weight(abstract_txt:made in 208) [ClassicSimilarity], result of:
            0.079845764 = score(doc=208,freq=1.0), product of:
              0.18636516 = queryWeight, product of:
                1.8837458 = boost
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.021648439 = queryNorm
              0.42843717 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.123038605 = weight(abstract_txt:machine in 208) [ClassicSimilarity], result of:
            0.123038605 = score(doc=208,freq=1.0), product of:
              0.24863242 = queryWeight, product of:
                2.1757991 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.021648439 = queryNorm
              0.49486148 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.5058775 = weight(abstract_txt:indexing in 208) [ClassicSimilarity], result of:
            0.5058775 = score(doc=208,freq=6.0), product of:
              0.5064661 = queryWeight, product of:
                5.37868 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.021648439 = queryNorm
              0.9988378 = fieldWeight in 208, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
        0.2 = coord(5/25)
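
The arithmetic in the explain tree above can be retraced by hand. The following is a minimal sketch, not the search engine's own code: it assumes the TF-IDF formulas that match the numbers shown (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm). The function names and the `TERMS_DOC_208` table are illustrative; the values are copied from the listing for doc 208.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form that reproduces the listed idf values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, *, max_docs=44218,
               query_norm=0.021648439, field_norm=0.09375):
    """Score of one matching term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) as listed for doc 208
TERMS_DOC_208 = [
    ("terms", 2.0, 2106, 1.1112505),
    ("index", 2.0, 1040, 1.3050067),
    ("made", 1.0, 1244, 1.8837458),
    ("machine", 1.0, 612, 2.1757991),
    ("indexing", 6.0, 1551, 5.37868),
]

raw_sum = sum(term_score(f, df, b) for _, f, df, b in TERMS_DOC_208)
final_score = raw_sum * (5 / 25)  # coord: 5 of 25 query terms matched
print(round(final_score, 4))
```

Small differences in the last digits against the listing are expected, because the listing appears to use single-precision floats while Python uses double precision.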
    
  2. Lauser, B.; Johannsen, G.; Caracciolo, C.; Hage, W.R. van; Keizer, J.; Mayr, P.: Comparing human and automatic thesaurus mapping approaches in the agricultural domain (2008) 0.13
    0.13071781 = sum of:
      0.13071781 = product of:
        0.5446576 = sum of:
          0.12675114 = weight(abstract_txt:cons in 2627) [ClassicSimilarity], result of:
            0.12675114 = score(doc=2627,freq=1.0), product of:
              0.19856918 = queryWeight, product of:
                1.1226263 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.021648439 = queryNorm
              0.63832235 = fieldWeight in 2627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.078125 = fieldNorm(doc=2627)
          0.12814558 = weight(abstract_txt:pros in 2627) [ClassicSimilarity], result of:
            0.12814558 = score(doc=2627,freq=1.0), product of:
              0.20002286 = queryWeight, product of:
                1.126728 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.021648439 = queryNorm
              0.6406546 = fieldWeight in 2627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.078125 = fieldNorm(doc=2627)
          0.052798025 = weight(abstract_txt:what in 2627) [ClassicSimilarity], result of:
            0.052798025 = score(doc=2627,freq=2.0), product of:
              0.110753044 = queryWeight, product of:
                1.1856927 = boost
                4.314763 = idf(docFreq=1606, maxDocs=44218)
                0.021648439 = queryNorm
              0.4767185 = fieldWeight in 2627, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.314763 = idf(docFreq=1606, maxDocs=44218)
                0.078125 = fieldNorm(doc=2627)
          0.06789256 = weight(abstract_txt:human in 2627) [ClassicSimilarity], result of:
            0.06789256 = score(doc=2627,freq=2.0), product of:
              0.13096604 = queryWeight, product of:
                1.2893584 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.021648439 = queryNorm
              0.5183982 = fieldWeight in 2627, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.078125 = fieldNorm(doc=2627)
          0.06653813 = weight(abstract_txt:made in 2627) [ClassicSimilarity], result of:
            0.06653813 = score(doc=2627,freq=1.0), product of:
              0.18636516 = queryWeight, product of:
                1.8837458 = boost
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.021648439 = queryNorm
              0.35703096 = fieldWeight in 2627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.078125 = fieldNorm(doc=2627)
          0.10253217 = weight(abstract_txt:machine in 2627) [ClassicSimilarity], result of:
            0.10253217 = score(doc=2627,freq=1.0), product of:
              0.24863242 = queryWeight, product of:
                2.1757991 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.021648439 = queryNorm
              0.41238457 = fieldWeight in 2627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.078125 = fieldNorm(doc=2627)
        0.24 = coord(6/25)
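
The weights for "cons" and "pros" in this result are large even though each word occurs only once, because both are rare in the collection. A short sketch of the idf values involved, assuming the formula idf = 1 + ln(maxDocs / (docFreq + 1)) that matches the figures listed; the names `DOC_FREQ` and `classic_idf` are illustrative.

```python
import math

MAX_DOCS = 44218

# docFreq values copied from the explain tree for doc 2627
DOC_FREQ = {
    "cons": 33,
    "pros": 32,
    "what": 1606,
    "human": 1101,
    "made": 1244,
    "machine": 612,
}


def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Rarer terms (smaller docFreq) receive a larger idf."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


# Print terms from rarest to most common
for term, df in sorted(DOC_FREQ.items(), key=lambda kv: kv[1]):
    print(f"{term:8s} docFreq={df:5d} idf={classic_idf(df):.4f}")
```

Because idf enters both queryWeight and fieldWeight, a term's idf is effectively squared in its score, which is why two incidental words can contribute about as much here as "machine".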
    
  3. Mooers, C.N.: The indexing language of an information retrieval system (1985) 0.13
    0.12615797 = sum of:
      0.12615797 = product of:
        0.39424366 = sum of:
          0.05795022 = weight(abstract_txt:hundred in 3644) [ClassicSimilarity], result of:
            0.05795022 = score(doc=3644,freq=1.0), product of:
              0.16565827 = queryWeight, product of:
                1.0253824 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.021648439 = queryNorm
              0.34981787 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.026078852 = weight(abstract_txt:terms in 3644) [ClassicSimilarity], result of:
            0.026078852 = score(doc=3644,freq=2.0), product of:
              0.09728264 = queryWeight, product of:
                1.1112505 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.021648439 = queryNorm
              0.26807302 = fieldWeight in 3644, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.021323476 = weight(abstract_txt:over in 3644) [ClassicSimilarity], result of:
            0.021323476 = score(doc=3644,freq=1.0), product of:
              0.10717457 = queryWeight, product of:
                1.1663803 = boost
                4.244485 = idf(docFreq=1723, maxDocs=44218)
                0.021648439 = queryNorm
              0.19896023 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.244485 = idf(docFreq=1723, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.022400303 = weight(abstract_txt:what in 3644) [ClassicSimilarity], result of:
            0.022400303 = score(doc=3644,freq=1.0), product of:
              0.110753044 = queryWeight, product of:
                1.1856927 = boost
                4.314763 = idf(docFreq=1606, maxDocs=44218)
                0.021648439 = queryNorm
              0.20225452 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.314763 = idf(docFreq=1606, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.028804375 = weight(abstract_txt:years in 3644) [ClassicSimilarity], result of:
            0.028804375 = score(doc=3644,freq=1.0), product of:
              0.13096604 = queryWeight, product of:
                1.2893584 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.021648439 = queryNorm
              0.21993774 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.051729284 = weight(abstract_txt:index in 3644) [ClassicSimilarity], result of:
            0.051729284 = score(doc=3644,freq=3.0), product of:
              0.13416429 = queryWeight, product of:
                1.3050067 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.021648439 = queryNorm
              0.3855667 = fieldWeight in 3644, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.039922882 = weight(abstract_txt:made in 3644) [ClassicSimilarity], result of:
            0.039922882 = score(doc=3644,freq=1.0), product of:
              0.18636516 = queryWeight, product of:
                1.8837458 = boost
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.021648439 = queryNorm
              0.21421859 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.14603426 = weight(abstract_txt:indexing in 3644) [ClassicSimilarity], result of:
            0.14603426 = score(doc=3644,freq=2.0), product of:
              0.5064661 = queryWeight, product of:
                5.37868 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.021648439 = queryNorm
              0.28833964 = fieldWeight in 3644, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
        0.32 = coord(8/25)
    
  4. Carroll, D.J.; Lele, P.: Human intervention in the networked environment : metadata alternatives (1998) 0.13
    0.12594806 = sum of:
      0.12594806 = product of:
        0.62974024 = sum of:
          0.15210138 = weight(abstract_txt:cons in 2221) [ClassicSimilarity], result of:
            0.15210138 = score(doc=2221,freq=1.0), product of:
              0.19856918 = queryWeight, product of:
                1.1226263 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.021648439 = queryNorm
              0.76598686 = fieldWeight in 2221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.09375 = fieldNorm(doc=2221)
          0.1537747 = weight(abstract_txt:pros in 2221) [ClassicSimilarity], result of:
            0.1537747 = score(doc=2221,freq=1.0), product of:
              0.20002286 = queryWeight, product of:
                1.126728 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.021648439 = queryNorm
              0.7687856 = fieldWeight in 2221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.09375 = fieldNorm(doc=2221)
          0.05760875 = weight(abstract_txt:human in 2221) [ClassicSimilarity], result of:
            0.05760875 = score(doc=2221,freq=1.0), product of:
              0.13096604 = queryWeight, product of:
                1.2893584 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.021648439 = queryNorm
              0.43987548 = fieldWeight in 2221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.09375 = fieldNorm(doc=2221)
          0.059731826 = weight(abstract_txt:index in 2221) [ClassicSimilarity], result of:
            0.059731826 = score(doc=2221,freq=1.0), product of:
              0.13416429 = queryWeight, product of:
                1.3050067 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.021648439 = queryNorm
              0.44521406 = fieldWeight in 2221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.09375 = fieldNorm(doc=2221)
          0.20652361 = weight(abstract_txt:indexing in 2221) [ClassicSimilarity], result of:
            0.20652361 = score(doc=2221,freq=1.0), product of:
              0.5064661 = queryWeight, product of:
                5.37868 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.021648439 = queryNorm
              0.40777382 = fieldWeight in 2221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=2221)
        0.2 = coord(5/25)
    
  5. Cleverdon, C.W.; Mills, J.: The testing of index language devices (1985) 0.12
    0.11524436 = sum of:
      0.11524436 = product of:
        0.48018485 = sum of:
          0.05795022 = weight(abstract_txt:hundred in 3643) [ClassicSimilarity], result of:
            0.05795022 = score(doc=3643,freq=1.0), product of:
              0.16565827 = queryWeight, product of:
                1.0253824 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.021648439 = queryNorm
              0.34981787 = fieldWeight in 3643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.046875 = fieldNorm(doc=3643)
          0.018440532 = weight(abstract_txt:terms in 3643) [ClassicSimilarity], result of:
            0.018440532 = score(doc=3643,freq=1.0), product of:
              0.09728264 = queryWeight, product of:
                1.1112505 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.021648439 = queryNorm
              0.18955624 = fieldWeight in 3643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=3643)
          0.022400303 = weight(abstract_txt:what in 3643) [ClassicSimilarity], result of:
            0.022400303 = score(doc=3643,freq=1.0), product of:
              0.110753044 = queryWeight, product of:
                1.1856927 = boost
                4.314763 = idf(docFreq=1606, maxDocs=44218)
                0.021648439 = queryNorm
              0.20225452 = fieldWeight in 3643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.314763 = idf(docFreq=1606, maxDocs=44218)
                0.046875 = fieldNorm(doc=3643)
          0.051729284 = weight(abstract_txt:index in 3643) [ClassicSimilarity], result of:
            0.051729284 = score(doc=3643,freq=3.0), product of:
              0.13416429 = queryWeight, product of:
                1.3050067 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.021648439 = queryNorm
              0.3855667 = fieldWeight in 3643, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.046875 = fieldNorm(doc=3643)
          0.05645947 = weight(abstract_txt:made in 3643) [ClassicSimilarity], result of:
            0.05645947 = score(doc=3643,freq=2.0), product of:
              0.18636516 = queryWeight, product of:
                1.8837458 = boost
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.021648439 = queryNorm
              0.3029508 = fieldWeight in 3643, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.046875 = fieldNorm(doc=3643)
          0.27320504 = weight(abstract_txt:indexing in 3643) [ClassicSimilarity], result of:
            0.27320504 = score(doc=3643,freq=7.0), product of:
              0.5064661 = queryWeight, product of:
                5.37868 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.021648439 = queryNorm
              0.539434 = fieldWeight in 3643, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.046875 = fieldNorm(doc=3643)
        0.24 = coord(6/25)
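
The order of the five results depends on the coord factor as well as on the summed term weights. The sketch below recomputes each final score from the sums and coord fractions listed above and compares the ordering with and without coord. The short labels and the `RESULTS` table are illustrative; the numbers are copied from the listing.

```python
QUERY_TERMS = 25  # denominator of every coord(n/25) in the listing

# (label, sum of term weights, number of matched query terms)
RESULTS = [
    ("Lancaster 1980", 0.8453931, 5),
    ("Lauser 2008", 0.5446576, 6),
    ("Mooers 1985", 0.39424366, 8),
    ("Carroll 1998", 0.62974024, 5),
    ("Cleverdon 1985", 0.48018485, 6),
]


def final_score(raw_sum: float, matched: int, total: int = QUERY_TERMS) -> float:
    """Final score = summed term weights * coord(matched/total)."""
    return raw_sum * matched / total


ranked_by_final = sorted(RESULTS, key=lambda r: final_score(r[1], r[2]), reverse=True)
ranked_by_raw = sorted(RESULTS, key=lambda r: r[1], reverse=True)

for label, raw, matched in ranked_by_final:
    print(f"{label:15s} raw={raw:.4f} coord={matched}/{QUERY_TERMS} "
          f"final={final_score(raw, matched):.4f}")
```

Ranked on the raw sums alone, Carroll 1998 would come second and Mooers 1985 last. The coord factor rewards documents that match more of the 25 query terms, which lifts Mooers 1985 (8 matches) to third place.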