Search (61 results, page 1 of 4)

  • × theme_ss:"Automatisches Indexieren"
  • × language_ss:"e"
  1. Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.04
    0.035034627 = product of:
      0.07006925 = sum of:
        0.07006925 = product of:
          0.10510387 = sum of:
            0.056446664 = weight(_text_:i in 4709) [ClassicSimilarity], result of:
              0.056446664 = score(doc=4709,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.33337694 = fieldWeight in 4709, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4709)
            0.04865721 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
              0.04865721 = score(doc=4709,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.30952093 = fieldWeight in 4709, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4709)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    31. 7.1996 9:22:19
  2. Plaunt, C.; Norgard, B.A.: ¬An association-based method for automatic indexing with a controlled vocabulary (1998) 0.02
    0.019972598 = product of:
      0.039945196 = sum of:
        0.039945196 = product of:
          0.059917793 = sum of:
            0.029507035 = weight(_text_:c in 1794) [ClassicSimilarity], result of:
              0.029507035 = score(doc=1794,freq=2.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.1905545 = fieldWeight in 1794, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1794)
            0.030410757 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
              0.030410757 = score(doc=1794,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.19345059 = fieldWeight in 1794, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1794)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    11. 9.2000 19:53:22
  3. Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: ¬The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.02
    0.019237336 = product of:
      0.03847467 = sum of:
        0.03847467 = product of:
          0.057712007 = sum of:
            0.033383403 = weight(_text_:c in 1441) [ClassicSimilarity], result of:
              0.033383403 = score(doc=1441,freq=4.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.21558782 = fieldWeight in 1441, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1441)
            0.024328604 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
              0.024328604 = score(doc=1441,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.15476047 = fieldWeight in 1441, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1441)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    This paper presents a research on syntactic structures known as noun phrases (NP) being applied to increase the effectiveness and efficiency of the mechanisms for the document's classification. Our hypothesis is the fact that the NP can be used instead of single words as a semantic aggregator to reduce the number of words that will be used for the classification system without losing its semantic coverage, increasing its efficiency. The experiment divided the documents classification process in three phases: a) NP preprocessing b) system training; and c) classification experiments. In the first step, a corpus of digitalized texts was submitted to a natural language processing platform1 in which the part-of-speech tagging was done, and them PERL scripts pertaining to the PALAVRAS package were used to extract the Noun Phrases. The preprocessing also involved the tasks of a) removing NP low meaning pre-modifiers, as quantifiers; b) identification of synonyms and corresponding substitution for common hyperonyms; and c) stemming of the relevant words contained in the NP, for similitude checking with other NPs. The first tests with the resulting documents have demonstrated its effectiveness. We have compared the structural similarity of the documents before and after the whole pre-processing steps of phase one. The texts maintained the consistency with the original and have kept the readability. The second phase involves submitting the modified documents to a SVM algorithm to identify clusters and classify the documents. The classification rules are to be established using a machine learning approach. Finally, tests will be conducted to check the effectiveness of the whole process.
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  4. Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.02
    0.01621907 = product of:
      0.03243814 = sum of:
        0.03243814 = product of:
          0.09731442 = sum of:
            0.09731442 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
              0.09731442 = score(doc=402,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.61904186 = fieldWeight in 402, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=402)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 22(1986) no.6, S.465-476
  5. Jones, K.P.: Natural-language processing and automatic indexing : a reply (1990) 0.02
    0.015737087 = product of:
      0.031474173 = sum of:
        0.031474173 = product of:
          0.09442252 = sum of:
            0.09442252 = weight(_text_:c in 394) [ClassicSimilarity], result of:
              0.09442252 = score(doc=394,freq=2.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.6097744 = fieldWeight in 394, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.125 = fieldNorm(doc=394)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Footnote
    Erwiderung auf: Korycinski, C. u. A.F. Newell
  6. Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.01
    0.014191686 = product of:
      0.028383372 = sum of:
        0.028383372 = product of:
          0.085150115 = sum of:
            0.085150115 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
              0.085150115 = score(doc=6265,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.5416616 = fieldWeight in 6265, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6265)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Information outlook. 9(2005) no.8, S.22-23
  7. Anderson, J.D.; Pérez-Carballo, J.: ¬The nature of indexing: how humans and machines analyze messages and texts for retrieval : Part I: Research and the nature of human indexing (2001) 0.01
    0.014111667 = product of:
      0.028223334 = sum of:
        0.028223334 = product of:
          0.08467 = sum of:
            0.08467 = weight(_text_:i in 3136) [ClassicSimilarity], result of:
              0.08467 = score(doc=3136,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.50006545 = fieldWeight in 3136, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.09375 = fieldNorm(doc=3136)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  8. Gibb, F.; Smart, G.: Knowledge-based indexing : the view from SIMPR (1991) 0.01
    0.01376995 = product of:
      0.0275399 = sum of:
        0.0275399 = product of:
          0.0826197 = sum of:
            0.0826197 = weight(_text_:c in 4424) [ClassicSimilarity], result of:
              0.0826197 = score(doc=4424,freq=2.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.5335526 = fieldWeight in 4424, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4424)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Libraries and expert systems. Ed. C. MacDonald et al
  9. Salton, G.; Allen, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine-readable data (1994) 0.01
    0.01376995 = product of:
      0.0275399 = sum of:
        0.0275399 = product of:
          0.0826197 = sum of:
            0.0826197 = weight(_text_:c in 1168) [ClassicSimilarity], result of:
              0.0826197 = score(doc=1168,freq=2.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.5335526 = fieldWeight in 1168, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1168)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  10. Pulgarin, A.; Gil-Leiva, I.: Bibliometric analysis of the automatic indexing literature : 1956-2000 (2004) 0.01
    0.011641531 = product of:
      0.023283063 = sum of:
        0.023283063 = product of:
          0.069849186 = sum of:
            0.069849186 = weight(_text_:i in 2566) [ClassicSimilarity], result of:
              0.069849186 = score(doc=2566,freq=4.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.41253293 = fieldWeight in 2566, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2566)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    We present a bibliometric study of a corpus of 839 bibliographic references about automatic indexing, covering the period 1956-2000. We analyse the distribution of authors and works, the obsolescence and its dispersion, and the distribution of the literature by topic, year, and source type. We conclude that: (i) there has been a constant interest on the part of researchers; (ii) the most studied topics were the techniques and methods employed and the general aspects of automatic indexing; (iii) the productivity of the authors does fit a Lotka distribution (Dmax=0.02 and critical value=0.054); (iv) the annual aging factor is 95%; and (v) the dispersion of the literature is low.
  11. Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: ¬The automatic indexing system AIR/PHYS : from research to application (1988) 0.01
    0.010136919 = product of:
      0.020273838 = sum of:
        0.020273838 = product of:
          0.060821515 = sum of:
            0.060821515 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
              0.060821515 = score(doc=1952,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.38690117 = fieldWeight in 1952, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1952)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    16. 8.1998 12:51:22
  12. Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.01
    0.010136919 = product of:
      0.020273838 = sum of:
        0.020273838 = product of:
          0.060821515 = sum of:
            0.060821515 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
              0.060821515 = score(doc=4157,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.38690117 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill
  13. Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.01
    0.010136919 = product of:
      0.020273838 = sum of:
        0.020273838 = product of:
          0.060821515 = sum of:
            0.060821515 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
              0.060821515 = score(doc=2759,freq=2.0), product of:
                0.15720168 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044891298 = queryNorm
                0.38690117 = fieldWeight in 2759, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2759)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  14. Salton, G.; Allan, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine readable texts (1994) 0.01
    0.009835679 = product of:
      0.019671358 = sum of:
        0.019671358 = product of:
          0.05901407 = sum of:
            0.05901407 = weight(_text_:c in 1949) [ClassicSimilarity], result of:
              0.05901407 = score(doc=1949,freq=2.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.381109 = fieldWeight in 1949, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1949)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  15. Siebenkäs, A.; Markscheffel, B.: Conception of a workflow for the semi-automatic construction of a thesaurus for the German printing industry (2015) 0.01
    0.009736827 = product of:
      0.019473653 = sum of:
        0.019473653 = product of:
          0.058420956 = sum of:
            0.058420956 = weight(_text_:c in 2091) [ClassicSimilarity], result of:
              0.058420956 = score(doc=2091,freq=4.0), product of:
                0.15484828 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.044891298 = queryNorm
                0.3772787 = fieldWeight in 2091, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2091)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Re:inventing information science in the networked society: Proceedings of the 14th International Symposium on Information Science, Zadar/Croatia, 19th-21st May 2015. Eds.: F. Pehar, C. Schloegl u. C. Wolff
  16. Salton, G.: Fast document classification in automatic information retrieval (1978) 0.01
    0.009407777 = product of:
      0.018815555 = sum of:
        0.018815555 = product of:
          0.056446664 = sum of:
            0.056446664 = weight(_text_:i in 2331) [ClassicSimilarity], result of:
              0.056446664 = score(doc=2331,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.33337694 = fieldWeight in 2331, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2331)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg
  17. Hlava, M.M.K.: Machine-Aided Indexing (MAI) in a multilingual environemt (1992) 0.01
    0.009407777 = product of:
      0.018815555 = sum of:
        0.018815555 = product of:
          0.056446664 = sum of:
            0.056446664 = weight(_text_:i in 2378) [ClassicSimilarity], result of:
              0.056446664 = score(doc=2378,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.33337694 = fieldWeight in 2378, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2378)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Source
    Online information 92. Proc. of the 16th Int. Online Information Meeting, London, 8-10.12.1992. Ed. by David I. Raitt
  18. Souza, R.R.; Gil-Leiva, I.: Automatic indexing of scientific texts : a methodological comparison (2016) 0.01
    0.009407777 = product of:
      0.018815555 = sum of:
        0.018815555 = product of:
          0.056446664 = sum of:
            0.056446664 = weight(_text_:i in 4913) [ClassicSimilarity], result of:
              0.056446664 = score(doc=4913,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.33337694 = fieldWeight in 4913, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4913)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  19. Gomez, I.: Coping with the problem of subject classification diversity (1996) 0.01
    0.008231806 = product of:
      0.016463611 = sum of:
        0.016463611 = product of:
          0.049390834 = sum of:
            0.049390834 = weight(_text_:i in 5074) [ClassicSimilarity], result of:
              0.049390834 = score(doc=5074,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.29170483 = fieldWeight in 5074, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5074)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  20. Gábor, K.; Zargayouna, H.; Tellier, I.; Buscaldi, D.; Charnois, T.: ¬A typology of semantic relations dedicated to scientific literature analysis (2016) 0.01
    0.008231806 = product of:
      0.016463611 = sum of:
        0.016463611 = product of:
          0.049390834 = sum of:
            0.049390834 = weight(_text_:i in 2933) [ClassicSimilarity], result of:
              0.049390834 = score(doc=2933,freq=2.0), product of:
                0.16931784 = queryWeight, product of:
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.044891298 = queryNorm
                0.29170483 = fieldWeight in 2933, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.7717297 = idf(docFreq=2765, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2933)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    

Types

  • a 59
  • el 4
  • s 1
  • More… Less…