Document (#33025)

Author
Mai, J.-E.
Title
Analysis in indexing : document and domain centered approaches
Source
Information processing and management. 41(2005) no.3, S.599-611
Year
2005
Abstract
The paper discusses the notion of steps in indexing and reveals that the document-centered approach to indexing is prevalent and argues that the document-centered approach is problematic because it blocks out context-dependent factors in the indexing process. A domain-centered approach to indexing is presented as an alternative and the paper discusses how this approach includes a broader range of analyses and how it requires a new set of actions from using this approach; analysis of the domain, users and indexers. The paper concludes that the two-step procedure to indexing is insufficient to explain the indexing process and suggests that the domain-centered approach offers a guide for indexers that can help them manage the complexity of indexing.
Theme
Inhaltsanalyse

Similar documents (content)

  1. Jens-Erik Mai, J.-E.: ¬The role of documents, domains and decisions in indexing (2004) 0.48
    0.48274782 = sum of:
      0.48274782 = product of:
        1.508587 = sum of:
          0.049311474 = weight(abstract_txt:analysis in 2653) [ClassicSimilarity], result of:
            0.049311474 = score(doc=2653,freq=4.0), product of:
              0.07198338 = queryWeight, product of:
                1.2103032 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016278844 = queryNorm
              0.6850397 = fieldWeight in 2653, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.03360973 = weight(abstract_txt:process in 2653) [ClassicSimilarity], result of:
            0.03360973 = score(doc=2653,freq=1.0), product of:
              0.08849735 = queryWeight, product of:
                1.3419712 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.016278844 = queryNorm
              0.37978232 = fieldWeight in 2653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.05475569 = weight(abstract_txt:paper in 2653) [ClassicSimilarity], result of:
            0.05475569 = score(doc=2653,freq=3.0), product of:
              0.09725153 = queryWeight, product of:
                1.7229469 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.016278844 = queryNorm
              0.5630317 = fieldWeight in 2653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.029122697 = weight(abstract_txt:that in 2653) [ClassicSimilarity], result of:
            0.029122697 = score(doc=2653,freq=3.0), product of:
              0.07569158 = queryWeight, product of:
                1.9623291 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016278844 = queryNorm
              0.38475478 = fieldWeight in 2653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.15185957 = weight(abstract_txt:domain in 2653) [ClassicSimilarity], result of:
            0.15185957 = score(doc=2653,freq=2.0), product of:
              0.24186982 = queryWeight, product of:
                3.1375012 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.016278844 = queryNorm
              0.6278566 = fieldWeight in 2653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.1380149 = weight(abstract_txt:approach in 2653) [ClassicSimilarity], result of:
            0.1380149 = score(doc=2653,freq=3.0), product of:
              0.22693643 = queryWeight, product of:
                3.7221236 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.016278844 = queryNorm
              0.60816544 = fieldWeight in 2653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.5812351 = weight(abstract_txt:centered in 2653) [ClassicSimilarity], result of:
            0.5812351 = score(doc=2653,freq=2.0), product of:
              0.63751656 = queryWeight, product of:
                5.6950006 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.016278844 = queryNorm
              0.9117177 = fieldWeight in 2653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.47067785 = weight(abstract_txt:indexing in 2653) [ClassicSimilarity], result of:
            0.47067785 = score(doc=2653,freq=8.0), product of:
              0.40809324 = queryWeight, product of:
                5.7635193 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.016278844 = queryNorm
              1.1533586 = fieldWeight in 2653, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
        0.32 = coord(8/25)
    
  2. Fidel, R.: User-centered indexing (1994) 0.28
    0.28343698 = sum of:
      0.28343698 = product of:
        1.1809875 = sum of:
          0.028008109 = weight(abstract_txt:process in 8259) [ClassicSimilarity], result of:
            0.028008109 = score(doc=8259,freq=1.0), product of:
              0.08849735 = queryWeight, product of:
                1.3419712 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.016278844 = queryNorm
              0.3164853 = fieldWeight in 8259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.078125 = fieldNorm(doc=8259)
          0.019815486 = weight(abstract_txt:that in 8259) [ClassicSimilarity], result of:
            0.019815486 = score(doc=8259,freq=2.0), product of:
              0.07569158 = queryWeight, product of:
                1.9623291 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016278844 = queryNorm
              0.26179248 = fieldWeight in 8259, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=8259)
          0.09997198 = weight(abstract_txt:document in 8259) [ClassicSimilarity], result of:
            0.09997198 = score(doc=8259,freq=4.0), product of:
              0.14905173 = queryWeight, product of:
                2.1330066 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.016278844 = queryNorm
              0.67072004 = fieldWeight in 8259, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=8259)
          0.13280489 = weight(abstract_txt:approach in 8259) [ClassicSimilarity], result of:
            0.13280489 = score(doc=8259,freq=4.0), product of:
              0.22693643 = queryWeight, product of:
                3.7221236 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.016278844 = queryNorm
              0.58520746 = fieldWeight in 8259, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=8259)
          0.48436263 = weight(abstract_txt:centered in 8259) [ClassicSimilarity], result of:
            0.48436263 = score(doc=8259,freq=2.0), product of:
              0.63751656 = queryWeight, product of:
                5.6950006 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.016278844 = queryNorm
              0.7597648 = fieldWeight in 8259, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.078125 = fieldNorm(doc=8259)
          0.41602436 = weight(abstract_txt:indexing in 8259) [ClassicSimilarity], result of:
            0.41602436 = score(doc=8259,freq=9.0), product of:
              0.40809324 = queryWeight, product of:
                5.7635193 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.016278844 = queryNorm
              1.0194346 = fieldWeight in 8259, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.078125 = fieldNorm(doc=8259)
        0.24 = coord(6/25)
    
  3. Wu, Y.: Indexing historical, political cartoons for retrieval (2013) 0.20
    0.19738452 = sum of:
      0.19738452 = product of:
        0.8224355 = sum of:
          0.020546447 = weight(abstract_txt:analysis in 1070) [ClassicSimilarity], result of:
            0.020546447 = score(doc=1070,freq=1.0), product of:
              0.07198338 = queryWeight, product of:
                1.2103032 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016278844 = queryNorm
              0.2854332 = fieldWeight in 1070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=1070)
          0.026344344 = weight(abstract_txt:paper in 1070) [ClassicSimilarity], result of:
            0.026344344 = score(doc=1070,freq=1.0), product of:
              0.09725153 = queryWeight, product of:
                1.7229469 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.016278844 = queryNorm
              0.27088875 = fieldWeight in 1070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.078125 = fieldNorm(doc=1070)
          0.028023332 = weight(abstract_txt:that in 1070) [ClassicSimilarity], result of:
            0.028023332 = score(doc=1070,freq=4.0), product of:
              0.07569158 = queryWeight, product of:
                1.9623291 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016278844 = queryNorm
              0.3702305 = fieldWeight in 1070, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=1070)
          0.24259073 = weight(abstract_txt:indexers in 1070) [ClassicSimilarity], result of:
            0.24259073 = score(doc=1070,freq=4.0), product of:
              0.23512773 = queryWeight, product of:
                2.187409 = boost
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.016278844 = queryNorm
              1.0317402 = fieldWeight in 1070, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.078125 = fieldNorm(doc=1070)
          0.06640244 = weight(abstract_txt:approach in 1070) [ClassicSimilarity], result of:
            0.06640244 = score(doc=1070,freq=1.0), product of:
              0.22693643 = queryWeight, product of:
                3.7221236 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.016278844 = queryNorm
              0.29260373 = fieldWeight in 1070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=1070)
          0.43852818 = weight(abstract_txt:indexing in 1070) [ClassicSimilarity], result of:
            0.43852818 = score(doc=1070,freq=10.0), product of:
              0.40809324 = queryWeight, product of:
                5.7635193 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.016278844 = queryNorm
              1.0745784 = fieldWeight in 1070, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.078125 = fieldNorm(doc=1070)
        0.24 = coord(6/25)
    
  4. Sigel, A.: How can user-oriented depth analysis be constructively guided? (2000) 0.19
    0.19318159 = sum of:
      0.19318159 = product of:
        0.5366155 = sum of:
          0.023759214 = weight(abstract_txt:step in 133) [ClassicSimilarity], result of:
            0.023759214 = score(doc=133,freq=1.0), product of:
              0.099916935 = queryWeight, product of:
                1.0082834 = boost
                6.087415 = idf(docFreq=272, maxDocs=44218)
                0.016278844 = queryNorm
              0.23778966 = fieldWeight in 133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.087415 = idf(docFreq=272, maxDocs=44218)
                0.0390625 = fieldNorm(doc=133)
          0.020546447 = weight(abstract_txt:analysis in 133) [ClassicSimilarity], result of:
            0.020546447 = score(doc=133,freq=4.0), product of:
              0.07198338 = queryWeight, product of:
                1.2103032 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.016278844 = queryNorm
              0.2854332 = fieldWeight in 133, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0390625 = fieldNorm(doc=133)
          0.019804724 = weight(abstract_txt:process in 133) [ClassicSimilarity], result of:
            0.019804724 = score(doc=133,freq=2.0), product of:
              0.08849735 = queryWeight, product of:
                1.3419712 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.016278844 = queryNorm
              0.22378889 = fieldWeight in 133, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.0390625 = fieldNorm(doc=133)
          0.013172172 = weight(abstract_txt:paper in 133) [ClassicSimilarity], result of:
            0.013172172 = score(doc=133,freq=1.0), product of:
              0.09725153 = queryWeight, product of:
                1.7229469 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.016278844 = queryNorm
              0.13544437 = fieldWeight in 133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.0390625 = fieldNorm(doc=133)
          0.015665518 = weight(abstract_txt:that in 133) [ClassicSimilarity], result of:
            0.015665518 = score(doc=133,freq=5.0), product of:
              0.07569158 = queryWeight, product of:
                1.9623291 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016278844 = queryNorm
              0.20696513 = fieldWeight in 133, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0390625 = fieldNorm(doc=133)
          0.024992995 = weight(abstract_txt:document in 133) [ClassicSimilarity], result of:
            0.024992995 = score(doc=133,freq=1.0), product of:
              0.14905173 = queryWeight, product of:
                2.1330066 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.016278844 = queryNorm
              0.16768001 = fieldWeight in 133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0390625 = fieldNorm(doc=133)
          0.06064768 = weight(abstract_txt:indexers in 133) [ClassicSimilarity], result of:
            0.06064768 = score(doc=133,freq=1.0), product of:
              0.23512773 = queryWeight, product of:
                2.187409 = boost
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.016278844 = queryNorm
              0.25793505 = fieldWeight in 133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.0390625 = fieldNorm(doc=133)
          0.08948412 = weight(abstract_txt:domain in 133) [ClassicSimilarity], result of:
            0.08948412 = score(doc=133,freq=4.0), product of:
              0.24186982 = queryWeight, product of:
                3.1375012 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.016278844 = queryNorm
              0.3699681 = fieldWeight in 133, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0390625 = fieldNorm(doc=133)
          0.2685426 = weight(abstract_txt:indexing in 133) [ClassicSimilarity], result of:
            0.2685426 = score(doc=133,freq=15.0), product of:
              0.40809324 = queryWeight, product of:
                5.7635193 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.016278844 = queryNorm
              0.6580422 = fieldWeight in 133, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0390625 = fieldNorm(doc=133)
        0.36 = coord(9/25)
    
  5. Cooper, W.S.: Indexing documents by Gedanken experimentation (1978) 0.17
    0.1718955 = sum of:
      0.1718955 = product of:
        0.7162313 = sum of:
          0.04851146 = weight(abstract_txt:process in 412) [ClassicSimilarity], result of:
            0.04851146 = score(doc=412,freq=3.0), product of:
              0.08849735 = queryWeight, product of:
                1.3419712 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.016278844 = queryNorm
              0.54816854 = fieldWeight in 412, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.014011666 = weight(abstract_txt:that in 412) [ClassicSimilarity], result of:
            0.014011666 = score(doc=412,freq=1.0), product of:
              0.07569158 = queryWeight, product of:
                1.9623291 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016278844 = queryNorm
              0.18511525 = fieldWeight in 412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.04998599 = weight(abstract_txt:document in 412) [ClassicSimilarity], result of:
            0.04998599 = score(doc=412,freq=1.0), product of:
              0.14905173 = queryWeight, product of:
                2.1330066 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.016278844 = queryNorm
              0.33536002 = fieldWeight in 412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.12129536 = weight(abstract_txt:indexers in 412) [ClassicSimilarity], result of:
            0.12129536 = score(doc=412,freq=1.0), product of:
              0.23512773 = queryWeight, product of:
                2.187409 = boost
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.016278844 = queryNorm
              0.5158701 = fieldWeight in 412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.06640244 = weight(abstract_txt:approach in 412) [ClassicSimilarity], result of:
            0.06640244 = score(doc=412,freq=1.0), product of:
              0.22693643 = queryWeight, product of:
                3.7221236 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.016278844 = queryNorm
              0.29260373 = fieldWeight in 412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.41602436 = weight(abstract_txt:indexing in 412) [ClassicSimilarity], result of:
            0.41602436 = score(doc=412,freq=9.0), product of:
              0.40809324 = queryWeight, product of:
                5.7635193 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.016278844 = queryNorm
              1.0194346 = fieldWeight in 412, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
        0.24 = coord(6/25)