Search (22 results, page 1 of 2)

  • language_ss:"e"
  • theme_ss:"Automatisches Indexieren"
  1. Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.02
    0.024534488 = product of:
      0.049068976 = sum of:
        0.049068976 = product of:
          0.09813795 = sum of:
            0.09813795 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
              0.09813795 = score(doc=402,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.61904186 = fieldWeight in 402, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=402)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 22(1986) no.6, pp.465-476
  2. Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.02
    0.021467676 = product of:
      0.042935353 = sum of:
        0.042935353 = product of:
          0.085870706 = sum of:
            0.085870706 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
              0.085870706 = score(doc=6265,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.5416616 = fieldWeight in 6265, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6265)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information outlook. 9(2005) no.8, pp.22-23
  3. Chung, Y.M.; Lee, J.Y.: A corpus-based approach to comparative evaluation of statistical term association measures (2001) 0.02
    0.015766267 = product of:
      0.031532533 = sum of:
        0.031532533 = product of:
          0.06306507 = sum of:
            0.06306507 = weight(_text_:x in 5769) [ClassicSimilarity], result of:
              0.06306507 = score(doc=5769,freq=4.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.32989708 = fieldWeight in 5769, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5769)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Statistical association measures have been widely applied in information retrieval research, usually employing a clustering of documents or terms on the basis of their relationships. Applications of the association measures for term clustering include automatic thesaurus construction and query expansion. This research evaluates the similarity of six association measures by comparing the relationship and behavior they demonstrate in various analyses of a test corpus. Analysis techniques include comparisons of highly ranked term pairs and term clusters, analyses of the correlation among the association measures using Pearson's correlation coefficient and MDS mapping, and an analysis of the impact of a term frequency on the association values by means of z-score. The major findings of the study are as follows: First, the most similar association measures are mutual information and Yule's coefficient of colligation Y, whereas cosine and Jaccard coefficients, as well as X**2 statistic and likelihood ratio, demonstrate quite similar behavior for terms with high frequency. Second, among all the measures, the X**2 statistic is the least affected by the frequency of terms. Third, although cosine and Jaccard coefficients tend to emphasize high frequency terms, mutual information and Yule's Y seem to overestimate rare terms.
  4. Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: The automatic indexing system AIR/PHYS : from research to application (1988) 0.02
    0.015334055 = product of:
      0.03066811 = sum of:
        0.03066811 = product of:
          0.06133622 = sum of:
            0.06133622 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
              0.06133622 = score(doc=1952,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.38690117 = fieldWeight in 1952, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1952)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    16. 8.1998 12:51:22
  5. Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.02
    0.015334055 = product of:
      0.03066811 = sum of:
        0.03066811 = product of:
          0.06133622 = sum of:
            0.06133622 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
              0.06133622 = score(doc=4157,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.38690117 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Ed. by Marlies Ockenfeld and Gerhard J. Mantwill
  6. Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.02
    0.015334055 = product of:
      0.03066811 = sum of:
        0.03066811 = product of:
          0.06133622 = sum of:
            0.06133622 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
              0.06133622 = score(doc=2759,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.38690117 = fieldWeight in 2759, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2759)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  7. Mongin, L.; Fu, Y.Y.; Mostafa, J.: Open Archives data Service prototype and automated subject indexing using D-Lib archive content as a testbed (2003) 0.01
    0.013378119 = product of:
      0.026756238 = sum of:
        0.026756238 = product of:
          0.053512476 = sum of:
            0.053512476 = weight(_text_:x in 1167) [ClassicSimilarity], result of:
              0.053512476 = score(doc=1167,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.27992693 = fieldWeight in 1167, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1167)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    D-Lib magazine. 9(2003) no.12, x S
  8. Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.01
    0.012267244 = product of:
      0.024534488 = sum of:
        0.024534488 = product of:
          0.049068976 = sum of:
            0.049068976 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
              0.049068976 = score(doc=4709,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.30952093 = fieldWeight in 4709, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4709)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    31. 7.1996 9:22:19
  9. Riloff, E.: An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.01
    0.012267244 = product of:
      0.024534488 = sum of:
        0.024534488 = product of:
          0.049068976 = sum of:
            0.049068976 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
              0.049068976 = score(doc=6752,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.30952093 = fieldWeight in 6752, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6752)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    6. 3.1997 16:22:15
  10. Smiraglia, R.P.; Cai, X.: Tracking the evolution of clustering, machine learning, automatic indexing and automatic classification in knowledge organization (2017) 0.01
    0.011148433 = product of:
      0.022296866 = sum of:
        0.022296866 = product of:
          0.044593733 = sum of:
            0.044593733 = weight(_text_:x in 3627) [ClassicSimilarity], result of:
              0.044593733 = score(doc=3627,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.23327245 = fieldWeight in 3627, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3627)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  11. Li, X.; Zhang, A.; Li, C.; Ouyang, J.; Cai, Y.: Exploring coherent topics by topic modeling with term weighting (2018) 0.01
    0.011148433 = product of:
      0.022296866 = sum of:
        0.022296866 = product of:
          0.044593733 = sum of:
            0.044593733 = weight(_text_:x in 5045) [ClassicSimilarity], result of:
              0.044593733 = score(doc=5045,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.23327245 = fieldWeight in 5045, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5045)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  12. Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983) 0.01
    0.010733838 = product of:
      0.021467676 = sum of:
        0.021467676 = product of:
          0.042935353 = sum of:
            0.042935353 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
              0.042935353 = score(doc=5001,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.2708308 = fieldWeight in 5001, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5001)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    14. 3.1996 13:22:21
  13. Bordoni, L.; Pazienza, M.T.: Documents automatic indexing in an environmental domain (1997) 0.01
    0.010733838 = product of:
      0.021467676 = sum of:
        0.021467676 = product of:
          0.042935353 = sum of:
            0.042935353 = weight(_text_:22 in 530) [ClassicSimilarity], result of:
              0.042935353 = score(doc=530,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.2708308 = fieldWeight in 530, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=530)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    International forum on information and documentation. 22(1997) no.1, pp.17-28
  14. Wolfekuhler, M.R.; Punch, W.F.: Finding salient features for personal Web pages categories (1997) 0.01
    0.010733838 = product of:
      0.021467676 = sum of:
        0.021467676 = product of:
          0.042935353 = sum of:
            0.042935353 = weight(_text_:22 in 2673) [ClassicSimilarity], result of:
              0.042935353 = score(doc=2673,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.2708308 = fieldWeight in 2673, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2673)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 8.1996 22:08:06
  15. Newman, D.J.; Block, S.: Probabilistic topic decomposition of an eighteenth-century American newspaper (2006) 0.01
    0.010733838 = product of:
      0.021467676 = sum of:
        0.021467676 = product of:
          0.042935353 = sum of:
            0.042935353 = weight(_text_:22 in 5291) [ClassicSimilarity], result of:
              0.042935353 = score(doc=5291,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.2708308 = fieldWeight in 5291, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5291)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 17:32:00
  16. Ward, M.L.: The future of the human indexer (1996) 0.01
    0.009200432 = product of:
      0.018400865 = sum of:
        0.018400865 = product of:
          0.03680173 = sum of:
            0.03680173 = weight(_text_:22 in 7244) [ClassicSimilarity], result of:
              0.03680173 = score(doc=7244,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.23214069 = fieldWeight in 7244, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=7244)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    9. 2.1997 18:44:22
  17. Tavakolizadeh-Ravari, M.: Analysis of the long term dynamics in thesaurus developments and its consequences (2017) 0.01
    0.008918746 = product of:
      0.017837493 = sum of:
        0.017837493 = product of:
          0.035674985 = sum of:
            0.035674985 = weight(_text_:x in 3081) [ClassicSimilarity], result of:
              0.035674985 = score(doc=3081,freq=2.0), product of:
                0.19116588 = queryWeight, product of:
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.045271195 = queryNorm
                0.18661796 = fieldWeight in 3081, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.2226825 = idf(docFreq=1761, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3081)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    x
  18. Plaunt, C.; Norgard, B.A.: An association-based method for automatic indexing with a controlled vocabulary (1998) 0.01
    0.0076670274 = product of:
      0.015334055 = sum of:
        0.015334055 = product of:
          0.03066811 = sum of:
            0.03066811 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
              0.03066811 = score(doc=1794,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.19345059 = fieldWeight in 1794, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1794)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    11. 9.2000 19:53:22
  19. Milstead, J.L.: Thesauri in a full-text world (1998) 0.01
    0.0076670274 = product of:
      0.015334055 = sum of:
        0.015334055 = product of:
          0.03066811 = sum of:
            0.03066811 = weight(_text_:22 in 2337) [ClassicSimilarity], result of:
              0.03066811 = score(doc=2337,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.19345059 = fieldWeight in 2337, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2337)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 9.1997 19:16:05
  20. Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.01
    0.006133622 = product of:
      0.012267244 = sum of:
        0.012267244 = product of:
          0.024534488 = sum of:
            0.024534488 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
              0.024534488 = score(doc=1441,freq=2.0), product of:
                0.15853201 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045271195 = queryNorm
                0.15476047 = fieldWeight in 1441, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1441)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
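
The score breakdowns listed with each hit follow the TF-IDF arithmetic of Lucene's ClassicSimilarity: tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), and the final score is queryWeight times fieldWeight, scaled by the two coord(1/2) factors. The Python sketch below reproduces that arithmetic for illustration only. The function names are hypothetical, and queryNorm and fieldNorm are copied from the listing rather than derived, since they depend on the full query and on index-time data that this page does not show.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """idf as shown in the listing: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """tf as shown in the listing: square root of the term frequency."""
    return math.sqrt(freq)


def explain_score(freq, doc_freq, max_docs, field_norm, query_norm,
                  coords=(0.5, 0.5)):
    """Recompute one hit's score from the values in its explanation tree.

    query_norm and field_norm are taken from the listing as given
    (assumption: they cannot be reconstructed from this page alone).
    """
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm                      # queryWeight
    field_weight = classic_tf(freq) * idf * field_norm   # fieldWeight
    score = query_weight * field_weight
    for coord in coords:                                 # coord(1/2), twice
        score *= coord
    return score


# Hit 1 (doc=402, term "22") and hit 3 (doc=5769, term "x") from the listing.
first_hit = explain_score(freq=2.0, doc_freq=3622, max_docs=44218,
                          field_norm=0.125, query_norm=0.045271195)
third_hit = explain_score(freq=4.0, doc_freq=1761, max_docs=44218,
                          field_norm=0.0390625, query_norm=0.045271195)
print(f"hit 1: {first_hit:.6f}")
print(f"hit 3: {third_hit:.6f}")
```

The recomputed values should agree with the listed scores (0.024534488 and 0.015766267) up to single-precision rounding. Hits matched on the same term share tf, idf, and queryNorm, so their ranking differs only through fieldNorm, which is smaller for longer fields.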