Search (31 results, page 1 of 2)

  • × language_ss:"e"
  • × theme_ss:"Automatisches Indexieren"
  • × type_ss:"a"
  1. Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.08
    0.084564395 = product of:
      0.16912879 = sum of:
        0.16912879 = sum of:
          0.07254933 = weight(_text_:6 in 402) [ClassicSimilarity], result of:
            0.07254933 = score(doc=402,freq=2.0), product of:
              0.13521942 = queryWeight, product of:
                3.0350742 = idf(docFreq=5777, maxDocs=44218)
                0.044552263 = queryNorm
              0.5365304 = fieldWeight in 402, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.0350742 = idf(docFreq=5777, maxDocs=44218)
                0.125 = fieldNorm(doc=402)
          0.09657946 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
            0.09657946 = score(doc=402,freq=2.0), product of:
              0.15601443 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.044552263 = queryNorm
              0.61904186 = fieldWeight in 402, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.125 = fieldNorm(doc=402)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 22(1986) no.6, S.465-476
  2. Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.07
    0.07399385 = product of:
      0.1479877 = sum of:
        0.1479877 = sum of:
          0.06348066 = weight(_text_:6 in 6265) [ClassicSimilarity], result of:
            0.06348066 = score(doc=6265,freq=2.0), product of:
              0.13521942 = queryWeight, product of:
                3.0350742 = idf(docFreq=5777, maxDocs=44218)
                0.044552263 = queryNorm
              0.46946406 = fieldWeight in 6265, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.0350742 = idf(docFreq=5777, maxDocs=44218)
                0.109375 = fieldNorm(doc=6265)
          0.084507026 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
            0.084507026 = score(doc=6265,freq=2.0), product of:
              0.15601443 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.044552263 = queryNorm
              0.5416616 = fieldWeight in 6265, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.109375 = fieldNorm(doc=6265)
      0.5 = coord(1/2)
    
    Date
    6. 7.1997 18:25:04
    Source
    Information outlook. 9(2005) no.8, S.22-23
  3. Riloff, E.: ¬An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.04
    0.042282198 = product of:
      0.084564395 = sum of:
        0.084564395 = sum of:
          0.036274664 = weight(_text_:6 in 6752) [ClassicSimilarity], result of:
            0.036274664 = score(doc=6752,freq=2.0), product of:
              0.13521942 = queryWeight, product of:
                3.0350742 = idf(docFreq=5777, maxDocs=44218)
                0.044552263 = queryNorm
              0.2682652 = fieldWeight in 6752, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.0350742 = idf(docFreq=5777, maxDocs=44218)
                0.0625 = fieldNorm(doc=6752)
          0.04828973 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
            0.04828973 = score(doc=6752,freq=2.0), product of:
              0.15601443 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.044552263 = queryNorm
              0.30952093 = fieldWeight in 6752, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0625 = fieldNorm(doc=6752)
      0.5 = coord(1/2)
    
    Date
    6. 3.1997 16:22:15
  4. Newman, D.J.; Block, S.: Probabilistic topic decomposition of an eighteenth-century American newspaper (2006) 0.04
    0.036996923 = product of:
      0.07399385 = sum of:
        0.07399385 = sum of:
          0.03174033 = weight(_text_:6 in 5291) [ClassicSimilarity], result of:
            0.03174033 = score(doc=5291,freq=2.0), product of:
              0.13521942 = queryWeight, product of:
                3.0350742 = idf(docFreq=5777, maxDocs=44218)
                0.044552263 = queryNorm
              0.23473203 = fieldWeight in 5291, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.0350742 = idf(docFreq=5777, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5291)
          0.042253513 = weight(_text_:22 in 5291) [ClassicSimilarity], result of:
            0.042253513 = score(doc=5291,freq=2.0), product of:
              0.15601443 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.044552263 = queryNorm
              0.2708308 = fieldWeight in 5291, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5291)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 17:32:00
    Source
    Journal of the American Society for Information Science and Technology. 57(2006) no.6, S.753-767
  5. Mesquita, L.A.P.; Souza, R.R.; Baracho Porto, R.M.A.: Noun phrases in automatic indexing: : a structural analysis of the distribution of relevant terms in doctoral theses (2014) 0.02
    0.021141099 = product of:
      0.042282198 = sum of:
        0.042282198 = sum of:
          0.018137332 = weight(_text_:6 in 1442) [ClassicSimilarity], result of:
            0.018137332 = score(doc=1442,freq=2.0), product of:
              0.13521942 = queryWeight, product of:
                3.0350742 = idf(docFreq=5777, maxDocs=44218)
                0.044552263 = queryNorm
              0.1341326 = fieldWeight in 1442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.0350742 = idf(docFreq=5777, maxDocs=44218)
                0.03125 = fieldNorm(doc=1442)
          0.024144866 = weight(_text_:22 in 1442) [ClassicSimilarity], result of:
            0.024144866 = score(doc=1442,freq=2.0), product of:
              0.15601443 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.044552263 = queryNorm
              0.15476047 = fieldWeight in 1442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.03125 = fieldNorm(doc=1442)
      0.5 = coord(1/2)
    
    Abstract
    The main objective of this research was to analyze whether there was a characteristic distribution behavior of relevant terms over a scientific text that could contribute as a criterion for their process of automatic indexing. The terms considered in this study were only full noun phrases contained in the texts themselves. The texts were considered a total of 98 doctoral theses of the eight areas of knowledge in a same university. Initially, 20 full noun phrases were automatically extracted from each text as candidates to be the most relevant terms, and each author of each text assigned a relevance value 0-6 (not relevant and highly relevant, respectively) for each of the 20 noun phrases sent. Only, 22.1 % of noun phrases were considered not relevant. A relevance values of the terms assigned by the authors were associated with their positions in the text. Each full noun phrases found in the text was considered as a valid linear position. The results that were obtained showed values resulting from this distribution by considering two types of position: linear, with values consolidated into ten equal consecutive parts; and structural, considering parts of the text (such as introduction, development and conclusion). As a result of considerable importance, all areas of knowledge related to the Natural Sciences showed a characteristic behavior in the distribution of relevant terms, as well as all areas of knowledge related to Social Sciences showed the same characteristic behavior of distribution, but distinct from the Natural Sciences. The difference of the distribution behavior between the Natural and Social Sciences can be clearly visualized through graphs. All behaviors, including the general behavior of all areas of knowledge together, were characterized in polynomial equations and can be applied in future as criteria for automatic indexing. Until the present date this work has become inedited of for two reasons: to present a method for characterizing the distribution of relevant terms in a scientific text, and also, through this method, pointing out a quantitative trait difference between the Natural and Social Sciences.
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  6. Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: ¬The automatic indexing system AIR/PHYS : from research to application (1988) 0.02
    0.015090541 = product of:
      0.030181082 = sum of:
        0.030181082 = product of:
          0.060362164 = sum of:
            0.060362164 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
              0.060362164 = score(doc=1952,freq=2.0), product of:
                0.15601443 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044552263 = queryNorm
                0.38690117 = fieldWeight in 1952, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1952)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    16. 8.1998 12:51:22
  7. Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.02
    0.015090541 = product of:
      0.030181082 = sum of:
        0.030181082 = product of:
          0.060362164 = sum of:
            0.060362164 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
              0.060362164 = score(doc=4157,freq=2.0), product of:
                0.15601443 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044552263 = queryNorm
                0.38690117 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill
  8. Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.02
    0.015090541 = product of:
      0.030181082 = sum of:
        0.030181082 = product of:
          0.060362164 = sum of:
            0.060362164 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
              0.060362164 = score(doc=2759,freq=2.0), product of:
                0.15601443 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044552263 = queryNorm
                0.38690117 = fieldWeight in 2759, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2759)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  9. Thirion, B.; Leroy, J.P.; Baudic, F.; Douyère, M.; Piot, J.; Darmoni, S.J.: SDI selecting, decribing, and indexing : did you mean automatically? (2001) 0.01
    0.013602999 = product of:
      0.027205998 = sum of:
        0.027205998 = product of:
          0.054411996 = sum of:
            0.054411996 = weight(_text_:6 in 6198) [ClassicSimilarity], result of:
              0.054411996 = score(doc=6198,freq=2.0), product of:
                0.13521942 = queryWeight, product of:
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.044552263 = queryNorm
                0.40239778 = fieldWeight in 6198, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6198)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    6. 1.1997 18:30:28
  10. Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.01
    0.012072433 = product of:
      0.024144866 = sum of:
        0.024144866 = product of:
          0.04828973 = sum of:
            0.04828973 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
              0.04828973 = score(doc=4709,freq=2.0), product of:
                0.15601443 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044552263 = queryNorm
                0.30952093 = fieldWeight in 4709, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4709)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    31. 7.1996 9:22:19
  11. Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983) 0.01
    0.010563378 = product of:
      0.021126756 = sum of:
        0.021126756 = product of:
          0.042253513 = sum of:
            0.042253513 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
              0.042253513 = score(doc=5001,freq=2.0), product of:
                0.15601443 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044552263 = queryNorm
                0.2708308 = fieldWeight in 5001, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5001)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    14. 3.1996 13:22:21
  12. Bordoni, L.; Pazienza, M.T.: Documents automatic indexing in an environmental domain (1997) 0.01
    0.010563378 = product of:
      0.021126756 = sum of:
        0.021126756 = product of:
          0.042253513 = sum of:
            0.042253513 = weight(_text_:22 in 530) [ClassicSimilarity], result of:
              0.042253513 = score(doc=530,freq=2.0), product of:
                0.15601443 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044552263 = queryNorm
                0.2708308 = fieldWeight in 530, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=530)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    International forum on information and documentation. 22(1997) no.1, S.17-28
  13. Wolfekuhler, M.R.; Punch, W.F.: Finding salient features for personal Web pages categories (1997) 0.01
    0.010563378 = product of:
      0.021126756 = sum of:
        0.021126756 = product of:
          0.042253513 = sum of:
            0.042253513 = weight(_text_:22 in 2673) [ClassicSimilarity], result of:
              0.042253513 = score(doc=2673,freq=2.0), product of:
                0.15601443 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044552263 = queryNorm
                0.2708308 = fieldWeight in 2673, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2673)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 8.1996 22:08:06
  14. Salton, G.: Fast document classification in automatic information retrieval (1978) 0.01
    0.009068666 = product of:
      0.018137332 = sum of:
        0.018137332 = product of:
          0.036274664 = sum of:
            0.036274664 = weight(_text_:6 in 2331) [ClassicSimilarity], result of:
              0.036274664 = score(doc=2331,freq=2.0), product of:
                0.13521942 = queryWeight, product of:
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.044552263 = queryNorm
                0.2682652 = fieldWeight in 2331, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2331)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg
  15. Haas, S.; He, S.: Toward the automatic identification of sublanguage vocabulary (1993) 0.01
    0.009068666 = product of:
      0.018137332 = sum of:
        0.018137332 = product of:
          0.036274664 = sum of:
            0.036274664 = weight(_text_:6 in 4891) [ClassicSimilarity], result of:
              0.036274664 = score(doc=4891,freq=2.0), product of:
                0.13521942 = queryWeight, product of:
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.044552263 = queryNorm
                0.2682652 = fieldWeight in 4891, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4891)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 29(1993) no.6, S.721-744
  16. Pritchard-Schoch, T.: Natural language comes of age (1993) 0.01
    0.009068666 = product of:
      0.018137332 = sum of:
        0.018137332 = product of:
          0.036274664 = sum of:
            0.036274664 = weight(_text_:6 in 2570) [ClassicSimilarity], result of:
              0.036274664 = score(doc=2570,freq=2.0), product of:
                0.13521942 = queryWeight, product of:
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.044552263 = queryNorm
                0.2682652 = fieldWeight in 2570, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2570)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    6. 1.1999 10:02:00
  17. Ward, M.L.: ¬The future of the human indexer (1996) 0.01
    0.009054325 = product of:
      0.01810865 = sum of:
        0.01810865 = product of:
          0.0362173 = sum of:
            0.0362173 = weight(_text_:22 in 7244) [ClassicSimilarity], result of:
              0.0362173 = score(doc=7244,freq=2.0), product of:
                0.15601443 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044552263 = queryNorm
                0.23214069 = fieldWeight in 7244, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=7244)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    9. 2.1997 18:44:22
  18. Pfeifer, U.; Fuhr, N.; Huynh, T.: Searching structured documents with the enhanced retrieval functionality of freeWAIS-sf and SFgate (1995) 0.01
    0.007935083 = product of:
      0.015870165 = sum of:
        0.015870165 = product of:
          0.03174033 = sum of:
            0.03174033 = weight(_text_:6 in 2214) [ClassicSimilarity], result of:
              0.03174033 = score(doc=2214,freq=2.0), product of:
                0.13521942 = queryWeight, product of:
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.044552263 = queryNorm
                0.23473203 = fieldWeight in 2214, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2214)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Computer networks and ISDN systems. 27(1995) no.6, S.1027-36
  19. Shafer, K.: Scorpion Project explores using Dewey to organize the Web (1996) 0.01
    0.007935083 = product of:
      0.015870165 = sum of:
        0.015870165 = product of:
          0.03174033 = sum of:
            0.03174033 = weight(_text_:6 in 6750) [ClassicSimilarity], result of:
              0.03174033 = score(doc=6750,freq=2.0), product of:
                0.13521942 = queryWeight, product of:
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.044552263 = queryNorm
                0.23473203 = fieldWeight in 6750, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=6750)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    6. 3.1997 16:17:41
  20. Hlava, M.M.K.: Machine aided indexing (MAI) in a multilingual environment (1993) 0.01
    0.007935083 = product of:
      0.015870165 = sum of:
        0.015870165 = product of:
          0.03174033 = sum of:
            0.03174033 = weight(_text_:6 in 7405) [ClassicSimilarity], result of:
              0.03174033 = score(doc=7405,freq=2.0), product of:
                0.13521942 = queryWeight, product of:
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.044552263 = queryNorm
                0.23473203 = fieldWeight in 7405, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.0350742 = idf(docFreq=5777, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=7405)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Proceedings of the 14th National Online Meeting 1993, New York, 4-6 May 1993. Ed.: M.E. Williams