Search (125 results, page 1 of 7)

  • × theme_ss:"Automatisches Indexieren"
  1. Salton, G.; Allen, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine-readable data (1994) 0.05
    0.051122274 = product of:
      0.10224455 = sum of:
        0.10224455 = product of:
          0.15336682 = sum of:
            0.070401184 = weight(_text_:j in 1168) [ClassicSimilarity], result of:
              0.070401184 = score(doc=1168,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.4914939 = fieldWeight in 1168, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1168)
            0.082965635 = weight(_text_:c in 1168) [ClassicSimilarity], result of:
              0.082965635 = score(doc=1168,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.5335526 = fieldWeight in 1168, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1168)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  2. Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.04
    0.037120916 = product of:
      0.07424183 = sum of:
        0.07424183 = product of:
          0.11136274 = sum of:
            0.05028656 = weight(_text_:j in 4157) [ClassicSimilarity], result of:
              0.05028656 = score(doc=4157,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.35106707 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
            0.061076175 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
              0.061076175 = score(doc=4157,freq=2.0), product of:
                0.15785989 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04507926 = queryNorm
                0.38690117 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill
  3. Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.04
    0.037120916 = product of:
      0.07424183 = sum of:
        0.07424183 = product of:
          0.11136274 = sum of:
            0.05028656 = weight(_text_:j in 2759) [ClassicSimilarity], result of:
              0.05028656 = score(doc=2759,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.35106707 = fieldWeight in 2759, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2759)
            0.061076175 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
              0.061076175 = score(doc=2759,freq=2.0), product of:
                0.15785989 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04507926 = queryNorm
                0.38690117 = fieldWeight in 2759, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2759)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
    Source
    Semantic keyword-based search on structured data sources: First COST Action IC1302 International KEYSTONE Conference, IKC 2015, Coimbra, Portugal, September 8-9, 2015. Revised Selected Papers. Eds.: J. Cardoso et al
  4. Salton, G.; Allan, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine readable texts (1994) 0.04
    0.036515914 = product of:
      0.07303183 = sum of:
        0.07303183 = product of:
          0.109547734 = sum of:
            0.05028656 = weight(_text_:j in 1949) [ClassicSimilarity], result of:
              0.05028656 = score(doc=1949,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.35106707 = fieldWeight in 1949, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1949)
            0.05926117 = weight(_text_:c in 1949) [ClassicSimilarity], result of:
              0.05926117 = score(doc=1949,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.381109 = fieldWeight in 1949, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1949)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  5. Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.03
    0.02969673 = product of:
      0.05939346 = sum of:
        0.05939346 = product of:
          0.08909019 = sum of:
            0.04022925 = weight(_text_:j in 4709) [ClassicSimilarity], result of:
              0.04022925 = score(doc=4709,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.28085366 = fieldWeight in 4709, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4709)
            0.04886094 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
              0.04886094 = score(doc=4709,freq=2.0), product of:
                0.15785989 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04507926 = queryNorm
                0.30952093 = fieldWeight in 4709, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4709)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    31. 7.1996 9:22:19
  6. Lepsky, K.; Vorhauer, J.: Lingo - ein open source System für die Automatische Indexierung deutschsprachiger Dokumente (2006) 0.03
    0.02969673 = product of:
      0.05939346 = sum of:
        0.05939346 = product of:
          0.08909019 = sum of:
            0.04022925 = weight(_text_:j in 3581) [ClassicSimilarity], result of:
              0.04022925 = score(doc=3581,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.28085366 = fieldWeight in 3581, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3581)
            0.04886094 = weight(_text_:22 in 3581) [ClassicSimilarity], result of:
              0.04886094 = score(doc=3581,freq=2.0), product of:
                0.15785989 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04507926 = queryNorm
                0.30952093 = fieldWeight in 3581, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3581)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    24. 3.2006 12:22:02
  7. Probst, M.; Mittelbach, J.: Maschinelle Indexierung in der Sacherschließung wissenschaftlicher Bibliotheken (2006) 0.03
    0.02969673 = product of:
      0.05939346 = sum of:
        0.05939346 = product of:
          0.08909019 = sum of:
            0.04022925 = weight(_text_:j in 1755) [ClassicSimilarity], result of:
              0.04022925 = score(doc=1755,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.28085366 = fieldWeight in 1755, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1755)
            0.04886094 = weight(_text_:22 in 1755) [ClassicSimilarity], result of:
              0.04886094 = score(doc=1755,freq=2.0), product of:
                0.15785989 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04507926 = queryNorm
                0.30952093 = fieldWeight in 1755, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1755)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    22. 3.2008 12:35:19
  8. Salton, G.; Buckley, C.; Allan, J.: Automatic structuring of text files (1992) 0.03
    0.029212728 = product of:
      0.058425456 = sum of:
        0.058425456 = product of:
          0.087638184 = sum of:
            0.04022925 = weight(_text_:j in 6507) [ClassicSimilarity], result of:
              0.04022925 = score(doc=6507,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.28085366 = fieldWeight in 6507, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6507)
            0.04740894 = weight(_text_:c in 6507) [ClassicSimilarity], result of:
              0.04740894 = score(doc=6507,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.3048872 = fieldWeight in 6507, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6507)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  9. Krause, J.; Womser-Hacker, C.: PADOK-II : Retrievaltests zur Bewertung von Volltextindexierungsvarianten für das deutsche Patentinformationssystem (1990) 0.03
    0.029212728 = product of:
      0.058425456 = sum of:
        0.058425456 = product of:
          0.087638184 = sum of:
            0.04022925 = weight(_text_:j in 2653) [ClassicSimilarity], result of:
              0.04022925 = score(doc=2653,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.28085366 = fieldWeight in 2653, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2653)
            0.04740894 = weight(_text_:c in 2653) [ClassicSimilarity], result of:
              0.04740894 = score(doc=2653,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.3048872 = fieldWeight in 2653, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2653)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  10. Böhm, A.; Seifert, C.; Schlötterer, J.; Granitzer, M.: Identifying tweets from the economic domain (2017) 0.03
    0.025561137 = product of:
      0.051122274 = sum of:
        0.051122274 = product of:
          0.07668341 = sum of:
            0.035200592 = weight(_text_:j in 3495) [ClassicSimilarity], result of:
              0.035200592 = score(doc=3495,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.24574696 = fieldWeight in 3495, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3495)
            0.041482817 = weight(_text_:c in 3495) [ClassicSimilarity], result of:
              0.041482817 = score(doc=3495,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.2667763 = fieldWeight in 3495, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3495)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  11. Goller, C.; Löning, J.; Will, T.; Wolff, W.: Automatic document classification : a thourough evaluation of various methods (2000) 0.02
    0.021909548 = product of:
      0.043819096 = sum of:
        0.043819096 = product of:
          0.06572864 = sum of:
            0.030171938 = weight(_text_:j in 5480) [ClassicSimilarity], result of:
              0.030171938 = score(doc=5480,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.21064025 = fieldWeight in 5480, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5480)
            0.035556704 = weight(_text_:c in 5480) [ClassicSimilarity], result of:
              0.035556704 = score(doc=5480,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.22866541 = fieldWeight in 5480, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5480)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  12. Buckley, C.; Allan, J.; Salton, G.: Automatic routing and retrieval using Smart : TREC-2 (1995) 0.02
    0.021909548 = product of:
      0.043819096 = sum of:
        0.043819096 = product of:
          0.06572864 = sum of:
            0.030171938 = weight(_text_:j in 5699) [ClassicSimilarity], result of:
              0.030171938 = score(doc=5699,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.21064025 = fieldWeight in 5699, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5699)
            0.035556704 = weight(_text_:c in 5699) [ClassicSimilarity], result of:
              0.035556704 = score(doc=5699,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.22866541 = fieldWeight in 5699, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5699)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  13. Schöneberg, U.; Gödert, W.: Erschließung mathematischer Publikationen mittels linguistischer Verfahren (2012) 0.02
    0.021909548 = product of:
      0.043819096 = sum of:
        0.043819096 = product of:
          0.06572864 = sum of:
            0.030171938 = weight(_text_:j in 1055) [ClassicSimilarity], result of:
              0.030171938 = score(doc=1055,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.21064025 = fieldWeight in 1055, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1055)
            0.035556704 = weight(_text_:c in 1055) [ClassicSimilarity], result of:
              0.035556704 = score(doc=1055,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.22866541 = fieldWeight in 1055, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1055)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    http://at.yorku.ca/c/b/f/j/99.htm
  14. Plaunt, C.; Norgard, B.A.: ¬An association-based method for automatic indexing with a controlled vocabulary (1998) 0.02
    0.020056225 = product of:
      0.04011245 = sum of:
        0.04011245 = product of:
          0.060168672 = sum of:
            0.029630585 = weight(_text_:c in 1794) [ClassicSimilarity], result of:
              0.029630585 = score(doc=1794,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.1905545 = fieldWeight in 1794, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1794)
            0.030538088 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
              0.030538088 = score(doc=1794,freq=2.0), product of:
                0.15785989 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04507926 = queryNorm
                0.19345059 = fieldWeight in 1794, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1794)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    11. 9.2000 19:53:22
  15. Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: ¬The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.02
    0.019317886 = product of:
      0.03863577 = sum of:
        0.03863577 = product of:
          0.057953656 = sum of:
            0.033523183 = weight(_text_:c in 1441) [ClassicSimilarity], result of:
              0.033523183 = score(doc=1441,freq=4.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.21558782 = fieldWeight in 1441, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1441)
            0.02443047 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
              0.02443047 = score(doc=1441,freq=2.0), product of:
                0.15785989 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04507926 = queryNorm
                0.15476047 = fieldWeight in 1441, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1441)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    This paper presents a research on syntactic structures known as noun phrases (NP) being applied to increase the effectiveness and efficiency of the mechanisms for the document's classification. Our hypothesis is the fact that the NP can be used instead of single words as a semantic aggregator to reduce the number of words that will be used for the classification system without losing its semantic coverage, increasing its efficiency. The experiment divided the documents classification process in three phases: a) NP preprocessing b) system training; and c) classification experiments. In the first step, a corpus of digitalized texts was submitted to a natural language processing platform1 in which the part-of-speech tagging was done, and them PERL scripts pertaining to the PALAVRAS package were used to extract the Noun Phrases. The preprocessing also involved the tasks of a) removing NP low meaning pre-modifiers, as quantifiers; b) identification of synonyms and corresponding substitution for common hyperonyms; and c) stemming of the relevant words contained in the NP, for similitude checking with other NPs. The first tests with the resulting documents have demonstrated its effectiveness. We have compared the structural similarity of the documents before and after the whole pre-processing steps of phase one. The texts maintained the consistency with the original and have kept the readability. The second phase involves submitting the modified documents to a SVM algorithm to identify clusters and classify the documents. The classification rules are to be established using a machine learning approach. Finally, tests will be conducted to check the effectiveness of the whole process.
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  16. SIGIR'92 : Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (1992) 0.02
    0.018647334 = product of:
      0.037294667 = sum of:
        0.037294667 = product of:
          0.055942 = sum of:
            0.035200592 = weight(_text_:j in 6671) [ClassicSimilarity], result of:
              0.035200592 = score(doc=6671,freq=8.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.24574696 = fieldWeight in 6671, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=6671)
            0.020741409 = weight(_text_:c in 6671) [ClassicSimilarity], result of:
              0.020741409 = score(doc=6671,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.13338815 = fieldWeight in 6671, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=6671)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Content
    HARMAN, D.: Relevance feedback revisited; AALBERSBERG, I.J.: Incremental relevance feedback; TAGUE-SUTCLIFFE, J.: Measuring the informativeness of a retrieval process; LEWIS, D.D.: An evaluation of phrasal and clustered representations on a text categorization task; BLOSSEVILLE, M.J., G. HÉBRAIL, M.G. MONTEIL u. N. PÉNOT: Automatic document classification: natural language processing, statistical analysis, and expert system techniques used together; MASAND, B., G. LINOFF u. D. WALTZ: Classifying news stories using memory based reasoning; KEEN, E.M.: Term position ranking: some new test results; CROUCH, C.J. u. B. YANG: Experiments in automatic statistical thesaurus construction; GREFENSTETTE, G.: Use of syntactic context to produce term association lists for text retrieval; ANICK, P.G. u. R.A. FLYNN: Versioning of full-text information retrieval system; BURKOWSKI, F.J.: Retrieval activities in a database consisting of heterogeneous collections; DEERWESTER, S.C., K. WACLENA u. M. LaMAR: A textual object management system; NIE, J.-Y.:Towards a probabilistic modal logic for semantic-based information retrieval; WANG, A.W., S.K.M. WONG u. Y.Y. YAO: An analysis of vector space models based on computational geometry; BARTELL, B.T., G.W. COTTRELL u. R.K. BELEW: Latent semantic indexing is an optimal special case of multidimensional scaling; GLAVITSCH, U. u. P. SCHÄUBLE: A system for retrieving speech documents; MARGULIS, E.L.: N-Poisson document modelling; HESS, M.: An incrementally extensible document retrieval system based on linguistics and logical principles; COOPER, W.S., F.C. GEY u. D.P. DABNEY: Probabilistic retrieval based on staged logistic regression; FUHR, N.: Integration of probabilistic fact and text retrieval; CROFT, B., L.A. SMITH u. H. TURTLE: A loosely-coupled integration of a text retrieval system and an object-oriented database system; DUMAIS, S.T. u. J. NIELSEN: Automating the assignement of submitted manuscripts to reviewers; GOST, M.A. u. M. MASOTTI: Design of an OPAC database to permit different subject searching accesses; ROBERTSON, A.M. u. P. WILLETT: Searching for historical word forms in a database of 17th century English text using spelling correction methods; FAX, E.A., Q.F. CHEN u. L.S. HEATH: A faster algorithm for constructing minimal perfect hash functions; MOFFAT, A. u. J. ZOBEL: Parameterised compression for sparse bitmaps; GRANDI, F., P. TIBERIO u. P. Zezula: Frame-sliced patitioned parallel signature files; ALLEN, B.: Cognitive differences in end user searching of a CD-ROM index; SONNENWALD, D.H.: Developing a theory to guide the process of designing information retrieval systems; CUTTING, D.R., J.O. PEDERSEN, D. KARGER, u. J.W. TUKEY: Scatter/ Gather: a cluster-based approach to browsing large document collections; CHALMERS, M. u. P. CHITSON: Bead: Explorations in information visualization; WILLIAMSON, C. u. B. SHNEIDERMAN: The dynamic HomeFinder: evaluating dynamic queries in a real-estate information exploring system
  17. Tsai, C.-F.; McGarry, K.; Tait, J.: Qualitative evaluation of automatic assignment of keywords to images (2006) 0.02
    0.018257957 = product of:
      0.036515914 = sum of:
        0.036515914 = product of:
          0.054773867 = sum of:
            0.02514328 = weight(_text_:j in 963) [ClassicSimilarity], result of:
              0.02514328 = score(doc=963,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.17553353 = fieldWeight in 963, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=963)
            0.029630585 = weight(_text_:c in 963) [ClassicSimilarity], result of:
              0.029630585 = score(doc=963,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.1905545 = fieldWeight in 963, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=963)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  18. Li, X.; Zhang, A.; Li, C.; Ouyang, J.; Cai, Y.: Exploring coherent topics by topic modeling with term weighting (2018) 0.02
    0.018257957 = product of:
      0.036515914 = sum of:
        0.036515914 = product of:
          0.054773867 = sum of:
            0.02514328 = weight(_text_:j in 5045) [ClassicSimilarity], result of:
              0.02514328 = score(doc=5045,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.17553353 = fieldWeight in 5045, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5045)
            0.029630585 = weight(_text_:c in 5045) [ClassicSimilarity], result of:
              0.029630585 = score(doc=5045,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.1905545 = fieldWeight in 5045, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5045)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  19. Zhang, Y.; Zhang, C.; Li, J.: Joint modeling of characters, words, and conversation contexts for microblog keyphrase extraction (2020) 0.02
    0.018257957 = product of:
      0.036515914 = sum of:
        0.036515914 = product of:
          0.054773867 = sum of:
            0.02514328 = weight(_text_:j in 5816) [ClassicSimilarity], result of:
              0.02514328 = score(doc=5816,freq=2.0), product of:
                0.14323919 = queryWeight, product of:
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.04507926 = queryNorm
                0.17553353 = fieldWeight in 5816, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1774964 = idf(docFreq=5010, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5816)
            0.029630585 = weight(_text_:c in 5816) [ClassicSimilarity], result of:
              0.029630585 = score(doc=5816,freq=2.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.1905545 = fieldWeight in 5816, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5816)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  20. Schneider, C.; Womser-Hacker, C.: Inhaltserschließungssysteme für Patenttexte : Test und Systemvergleich im Projekt PADOK (1986) 0.02
    0.016761592 = product of:
      0.033523183 = sum of:
        0.033523183 = product of:
          0.100569546 = sum of:
            0.100569546 = weight(_text_:c in 2648) [ClassicSimilarity], result of:
              0.100569546 = score(doc=2648,freq=4.0), product of:
                0.15549664 = queryWeight, product of:
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.04507926 = queryNorm
                0.64676344 = fieldWeight in 2648, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.4494052 = idf(docFreq=3817, maxDocs=44218)
                  0.09375 = fieldNorm(doc=2648)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    

Years

Languages

  • e 74
  • d 46
  • f 2
  • a 1
  • m 1
  • ru 1
  • More… Less…

Types

  • a 105
  • el 9
  • x 8
  • s 5
  • m 3
  • p 2
  • More… Less…