Search (22 results, page 1 of 2)

Anderson, J.D.; Pérez-Carballo, J.: ¬The nature of indexing: how humans and machines analyze messages and texts for retrieval : Part II: Machine indexing, and the allocation of human versus machine effort (2001) 0.04

0.040969923 = product of:
  0.08193985 = sum of:
    0.08193985 = product of:
      0.1638797 = sum of:
        0.1638797 = weight(_text_:ii in 368) [ClassicSimilarity], result of:
          0.1638797 = score(doc=368,freq=2.0), product of:
            0.2745971 = queryWeight, product of:
              5.4016213 = idf(docFreq=541, maxDocs=44218)
              0.050836053 = queryNorm
            0.5968005 = fieldWeight in 368, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4016213 = idf(docFreq=541, maxDocs=44218)
              0.078125 = fieldNorm(doc=368)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Pulgarin, A.; Gil-Leiva, I.: Bibliometric analysis of the automatic indexing literature : 1956-2000 (2004) 0.03
```
0.028678946 = product of:
  0.057357892 = sum of:
    0.057357892 = product of:
      0.114715785 = sum of:
        0.114715785 = weight(_text_:ii in 2566) [ClassicSimilarity], result of:
          0.114715785 = score(doc=2566,freq=2.0), product of:
            0.2745971 = queryWeight, product of:
              5.4016213 = idf(docFreq=541, maxDocs=44218)
              0.050836053 = queryNorm
            0.41776034 = fieldWeight in 2566, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4016213 = idf(docFreq=541, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2566)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

We present a bibliometric study of a corpus of 839 bibliographic references about automatic indexing, covering the period 1956-2000. We analyse the distribution of authors and works, the obsolescence and its dispersion, and the distribution of the literature by topic, year, and source type. We conclude that: (i) there has been a constant interest on the part of researchers; (ii) the most studied topics were the techniques and methods employed and the general aspects of automatic indexing; (iii) the productivity of the authors does fit a Lotka distribution (Dmax=0.02 and critical value=0.054); (iv) the annual aging factor is 95%; and (v) the dispersion of the literature is low.

Munkelt, J.: Erstellung einer DNB-Retrieval-Testkollektion (2018) 0.03

0.028678946 = product of:
  0.057357892 = sum of:
    0.057357892 = product of:
      0.114715785 = sum of:
        0.114715785 = weight(_text_:ii in 4310) [ClassicSimilarity], result of:
          0.114715785 = score(doc=4310,freq=2.0), product of:
            0.2745971 = queryWeight, product of:
              5.4016213 = idf(docFreq=541, maxDocs=44218)
              0.050836053 = queryNorm
            0.41776034 = fieldWeight in 4310, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4016213 = idf(docFreq=541, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4310)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Pages: II, 79 S

Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.03

0.027550334 = product of:
  0.05510067 = sum of:
    0.05510067 = product of:
      0.11020134 = sum of:
        0.11020134 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
          0.11020134 = score(doc=402,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.61904186 = fieldWeight in 402, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=402)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information processing and management. 22(1986) no.6, S.465-476

Flores, F.N.; Moreira, V.P.: Assessing the impact of stemming accuracy on information retrieval : a multilingual perspective (2016) 0.02
```
0.024581954 = product of:
  0.049163908 = sum of:
    0.049163908 = product of:
      0.098327816 = sum of:
        0.098327816 = weight(_text_:ii in 3187) [ClassicSimilarity], result of:
          0.098327816 = score(doc=3187,freq=2.0), product of:
            0.2745971 = queryWeight, product of:
              5.4016213 = idf(docFreq=541, maxDocs=44218)
              0.050836053 = queryNorm
            0.3580803 = fieldWeight in 3187, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4016213 = idf(docFreq=541, maxDocs=44218)
              0.046875 = fieldNorm(doc=3187)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

The quality of stemming algorithms is typically measured in two different ways: (i) how accurately they map the variant forms of a word to the same stem; or (ii) how much improvement they bring to Information Retrieval systems. In this article, we evaluate various stemming algorithms, in four languages, in terms of accuracy and in terms of their aid to Information Retrieval. The aim is to assess whether the most accurate stemmers are also the ones that bring the biggest gain in Information Retrieval. Experiments in English, French, Portuguese, and Spanish show that this is not always the case, as stemmers with higher error rates yield better retrieval quality. As a byproduct, we also identified the most accurate stemmers and the best for Information Retrieval purposes.

Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.02

0.024106542 = product of:
  0.048213083 = sum of:
    0.048213083 = product of:
      0.09642617 = sum of:
        0.09642617 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
          0.09642617 = score(doc=6265,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.5416616 = fieldWeight in 6265, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=6265)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information outlook. 9(2005) no.8, S.22-23

Ahmed, M.: Automatic indexing for agriculture : designing a framework by deploying Agrovoc, Agris and Annif (2023) 0.02
```
0.020484962 = product of:
  0.040969923 = sum of:
    0.040969923 = product of:
      0.08193985 = sum of:
        0.08193985 = weight(_text_:ii in 1024) [ClassicSimilarity], result of:
          0.08193985 = score(doc=1024,freq=2.0), product of:
            0.2745971 = queryWeight, product of:
              5.4016213 = idf(docFreq=541, maxDocs=44218)
              0.050836053 = queryNorm
            0.29840025 = fieldWeight in 1024, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4016213 = idf(docFreq=541, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1024)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

There are several ways to employ machine learning for automating subject indexing. One popular strategy is to utilize a supervised learning algorithm to train a model on a set of documents that have been manually indexed by subject matter using a standard vocabulary. The resulting model can then predict the subject of new and previously unseen documents by identifying patterns learned from the training data. To do this, the first step is to gather a large dataset of documents and manually assign each document a set of subject keywords/descriptors from a controlled vocabulary (e.g., from Agrovoc). Next, the dataset (obtained from Agris) can be divided into - i) a training dataset, and ii) a test dataset. The training dataset is used to train the model, while the test dataset is used to evaluate the model's performance. Machine learning can be a powerful tool for automating the process of subject indexing. This research is an attempt to apply Annif (http://annif. org/), an open-source AI/ML framework, to autogenerate subject keywords/descriptors for documentary resources in the domain of agriculture. The training dataset is obtained from Agris, which applies the Agrovoc thesaurus as a vocabulary tool (https://www.fao.org/agris/download).

Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: ¬The automatic indexing system AIR/PHYS : from research to application (1988) 0.02

0.017218959 = product of:
  0.034437917 = sum of:
    0.034437917 = product of:
      0.068875834 = sum of:
        0.068875834 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
          0.068875834 = score(doc=1952,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.38690117 = fieldWeight in 1952, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1952)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 16. 8.1998 12:51:22

Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.02

0.017218959 = product of:
  0.034437917 = sum of:
    0.034437917 = product of:
      0.068875834 = sum of:
        0.068875834 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
          0.068875834 = score(doc=4157,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.38690117 = fieldWeight in 4157, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill

Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.02

0.017218959 = product of:
  0.034437917 = sum of:
    0.034437917 = product of:
      0.068875834 = sum of:
        0.068875834 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
          0.068875834 = score(doc=2759,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.38690117 = fieldWeight in 2759, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=2759)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 1. 2.2016 18:25:22

Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.01

0.013775167 = product of:
  0.027550334 = sum of:
    0.027550334 = product of:
      0.05510067 = sum of:
        0.05510067 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
          0.05510067 = score(doc=4709,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.30952093 = fieldWeight in 4709, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=4709)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 31. 7.1996 9:22:19

Riloff, E.: ¬An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.01

0.013775167 = product of:
  0.027550334 = sum of:
    0.027550334 = product of:
      0.05510067 = sum of:
        0.05510067 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
          0.05510067 = score(doc=6752,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.30952093 = fieldWeight in 6752, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6752)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 6. 3.1997 16:22:15

Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983) 0.01

0.012053271 = product of:
  0.024106542 = sum of:
    0.024106542 = product of:
      0.048213083 = sum of:
        0.048213083 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
          0.048213083 = score(doc=5001,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.2708308 = fieldWeight in 5001, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5001)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 14. 3.1996 13:22:21

Bordoni, L.; Pazienza, M.T.: Documents automatic indexing in an environmental domain (1997) 0.01

0.012053271 = product of:
  0.024106542 = sum of:
    0.024106542 = product of:
      0.048213083 = sum of:
        0.048213083 = weight(_text_:22 in 530) [ClassicSimilarity], result of:
          0.048213083 = score(doc=530,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.2708308 = fieldWeight in 530, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=530)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: International forum on information and documentation. 22(1997) no.1, S.17-28

Wolfekuhler, M.R.; Punch, W.F.: Finding salient features for personal Web pages categories (1997) 0.01

0.012053271 = product of:
  0.024106542 = sum of:
    0.024106542 = product of:
      0.048213083 = sum of:
        0.048213083 = weight(_text_:22 in 2673) [ClassicSimilarity], result of:
          0.048213083 = score(doc=2673,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.2708308 = fieldWeight in 2673, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2673)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 1. 8.1996 22:08:06

Newman, D.J.; Block, S.: Probabilistic topic decomposition of an eighteenth-century American newspaper (2006) 0.01

0.012053271 = product of:
  0.024106542 = sum of:
    0.024106542 = product of:
      0.048213083 = sum of:
        0.048213083 = weight(_text_:22 in 5291) [ClassicSimilarity], result of:
          0.048213083 = score(doc=5291,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.2708308 = fieldWeight in 5291, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5291)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 22. 7.2006 17:32:00

Ward, M.L.: ¬The future of the human indexer (1996) 0.01

0.010331375 = product of:
  0.02066275 = sum of:
    0.02066275 = product of:
      0.0413255 = sum of:
        0.0413255 = weight(_text_:22 in 7244) [ClassicSimilarity], result of:
          0.0413255 = score(doc=7244,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.23214069 = fieldWeight in 7244, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=7244)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 9. 2.1997 18:44:22

Plaunt, C.; Norgard, B.A.: ¬An association-based method for automatic indexing with a controlled vocabulary (1998) 0.01

0.008609479 = product of:
  0.017218959 = sum of:
    0.017218959 = product of:
      0.034437917 = sum of:
        0.034437917 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
          0.034437917 = score(doc=1794,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.19345059 = fieldWeight in 1794, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1794)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 11. 9.2000 19:53:22

Milstead, J.L.: Thesauri in a full-text world (1998) 0.01

0.008609479 = product of:
  0.017218959 = sum of:
    0.017218959 = product of:
      0.034437917 = sum of:
        0.034437917 = weight(_text_:22 in 2337) [ClassicSimilarity], result of:
          0.034437917 = score(doc=2337,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.19345059 = fieldWeight in 2337, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2337)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 22. 9.1997 19:16:05

Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: ¬The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.01

0.0068875835 = product of:
  0.013775167 = sum of:
    0.013775167 = product of:
      0.027550334 = sum of:
        0.027550334 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
          0.027550334 = score(doc=1441,freq=2.0), product of:
            0.1780192 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050836053 = queryNorm
            0.15476047 = fieldWeight in 1441, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=1441)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Search (22 results, page 1 of 2)

Authors

Years

Themes