Document (#36839)

Author
Rocha, R.
Cobo, A.
Title
Automatización de procesos de categorización jerárquica documental en las organizaciones
Source
Rev. Aporte Santiaguino. 3(2010) no.1, S.93-100
Year
2010
Abstract
In a global context characterized by the massive use of information technology and communications any organization needs to optimize the search and document management processes. In this paper an analysis of modern document management techniques and computational strategies with specialized language resources is presented and a model that can be used in automatic text categorization in the context of organizations is proposed.As a particular case we describe a classification system according to the taxonomy JEL (Journal of Economic Literature) and that makes use of multilingual glossaries for hierarchical classifications of scientific and technical documents related to the business functional areas.
Content
Vgl.: http://revistas.concytec.gob.pe/scielo.php?script=sci_arttext&pid=S2070-836X2010000100013&lng=pt&nrm=iso&tlng=es.
Theme
Klassifikationssysteme im Online-Retrieval
Field
Wirtschaftswissenschaften

Similar documents (author)

  1. Serrano, S. Cobo-=> Cobo-Serrano, S.: 1.93
    1.9331049 = sum of:
      1.9331049 = product of:
        3.8662097 = sum of:
          3.8662097 = weight(author_txt:cobo in 4874) [ClassicSimilarity], result of:
            3.8662097 = score(doc=4874,freq=2.0), product of:
              0.73600215 = queryWeight, product of:
                1.042682 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.07126349 = queryNorm
              5.252987 = fieldWeight in 4874, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.375 = fieldNorm(doc=4874)
        0.5 = coord(1/2)
    
  2. Souza, R. Rocha => Rocha Souza, R.: 1.71
    1.7052958 = sum of:
      1.7052958 = product of:
        3.4105916 = sum of:
          3.4105916 = weight(author_txt:rocha in 4708) [ClassicSimilarity], result of:
            3.4105916 = score(doc=4708,freq=2.0), product of:
              0.6769791 = queryWeight, product of:
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.07126349 = queryNorm
              5.0379567 = fieldWeight in 4708, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.375 = fieldNorm(doc=4708)
        0.5 = coord(1/2)
    
  3. Silva, N.; Rocha, J.: Merging ontologies using a bottom-up lexical and structural approach (2003) 1.61
    1.6077683 = sum of:
      1.6077683 = product of:
        3.2155366 = sum of:
          3.2155366 = weight(author_txt:rocha in 2685) [ClassicSimilarity], result of:
            3.2155366 = score(doc=2685,freq=1.0), product of:
              0.6769791 = queryWeight, product of:
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.07126349 = queryNorm
              4.749831 = fieldWeight in 2685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.5 = fieldNorm(doc=2685)
        0.5 = coord(1/2)
    
  4. Rocha Souza, R. = > Souza, R.R.: 1.41
    1.4067972 = sum of:
      1.4067972 = product of:
        2.8135943 = sum of:
          2.8135943 = weight(author_txt:rocha in 2439) [ClassicSimilarity], result of:
            2.8135943 = score(doc=2439,freq=1.0), product of:
              0.6769791 = queryWeight, product of:
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.07126349 = queryNorm
              4.156102 = fieldWeight in 2439, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.4375 = fieldNorm(doc=2439)
        0.5 = coord(1/2)
    
  5. Rocha Souza, R. -> Souza, R.R.: 1.41
    1.4067972 = sum of:
      1.4067972 = product of:
        2.8135943 = sum of:
          2.8135943 = weight(author_txt:rocha in 879) [ClassicSimilarity], result of:
            2.8135943 = score(doc=879,freq=1.0), product of:
              0.6769791 = queryWeight, product of:
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.07126349 = queryNorm
              4.156102 = fieldWeight in 879, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.4375 = fieldNorm(doc=879)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Garcia Marco, F.J.: Contexto y determinantes funcionales de la clasificacion documental (1996) 0.23
    0.23347455 = sum of:
      0.23347455 = product of:
        1.459216 = sum of:
          0.09834396 = weight(abstract_txt:classifications in 380) [ClassicSimilarity], result of:
            0.09834396 = score(doc=380,freq=1.0), product of:
              0.17127314 = queryWeight, product of:
                1.1788312 = boost
                6.124733 = idf(docFreq=262, maxDocs=44218)
                0.023721954 = queryNorm
              0.5741937 = fieldWeight in 380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.124733 = idf(docFreq=262, maxDocs=44218)
                0.09375 = fieldNorm(doc=380)
          0.067714244 = weight(abstract_txt:document in 380) [ClassicSimilarity], result of:
            0.067714244 = score(doc=380,freq=1.0), product of:
              0.16826256 = queryWeight, product of:
                1.6524022 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.023721954 = queryNorm
              0.40243202 = fieldWeight in 380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.09375 = fieldNorm(doc=380)
          0.06998035 = weight(abstract_txt:context in 380) [ClassicSimilarity], result of:
            0.06998035 = score(doc=380,freq=1.0), product of:
              0.17199595 = queryWeight, product of:
                1.6706333 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.023721954 = queryNorm
              0.4068721 = fieldWeight in 380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.09375 = fieldNorm(doc=380)
          1.2231774 = weight(title_txt:documental in 380) [ClassicSimilarity], result of:
            1.2231774 = score(doc=380,freq=1.0), product of:
              0.4120323 = queryWeight, product of:
                1.8284061 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.023721954 = queryNorm
              2.9686446 = fieldWeight in 380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.3125 = fieldNorm(doc=380)
        0.16 = coord(4/25)
    
  2. Esteban Navarro, M.A.: Fundamentos epistemologicos de la classificacion documental (1995) 0.12
    0.12374503 = sum of:
      0.12374503 = product of:
        1.5468129 = sum of:
          0.07899995 = weight(abstract_txt:document in 5547) [ClassicSimilarity], result of:
            0.07899995 = score(doc=5547,freq=1.0), product of:
              0.16826256 = queryWeight, product of:
                1.6524022 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.023721954 = queryNorm
              0.46950403 = fieldWeight in 5547, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.109375 = fieldNorm(doc=5547)
          1.4678129 = weight(title_txt:documental in 5547) [ClassicSimilarity], result of:
            1.4678129 = score(doc=5547,freq=1.0), product of:
              0.4120323 = queryWeight, product of:
                1.8284061 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.023721954 = queryNorm
              3.5623734 = fieldWeight in 5547, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.375 = fieldNorm(doc=5547)
        0.08 = coord(2/25)
    
  3. Gonzalez, A.C.: Analisis y diseno de sistemas de gestion electronica de documentacion en grandes entidades (1997) 0.12
    0.12049963 = sum of:
      0.12049963 = product of:
        0.6024982 = sum of:
          0.0867068 = weight(abstract_txt:global in 2923) [ClassicSimilarity], result of:
            0.0867068 = score(doc=2923,freq=1.0), product of:
              0.14210032 = queryWeight, product of:
                1.0737534 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.023721954 = queryNorm
              0.6101802 = fieldWeight in 2923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.109375 = fieldNorm(doc=2923)
          0.13977036 = weight(abstract_txt:functional in 2923) [ClassicSimilarity], result of:
            0.13977036 = score(doc=2923,freq=2.0), product of:
              0.15505758 = queryWeight, product of:
                1.1216401 = boost
                5.8275905 = idf(docFreq=353, maxDocs=44218)
                0.023721954 = queryNorm
              0.9014094 = fieldWeight in 2923, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8275905 = idf(docFreq=353, maxDocs=44218)
                0.109375 = fieldNorm(doc=2923)
          0.110278815 = weight(abstract_txt:economic in 2923) [ClassicSimilarity], result of:
            0.110278815 = score(doc=2923,freq=1.0), product of:
              0.16680959 = queryWeight, product of:
                1.163369 = boost
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.023721954 = queryNorm
              0.661106 = fieldWeight in 2923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.109375 = fieldNorm(doc=2923)
          0.107742265 = weight(abstract_txt:management in 2923) [ClassicSimilarity], result of:
            0.107742265 = score(doc=2923,freq=2.0), product of:
              0.1642418 = queryWeight, product of:
                1.6325401 = boost
                4.2410107 = idf(docFreq=1729, maxDocs=44218)
                0.023721954 = queryNorm
              0.6559978 = fieldWeight in 2923, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2410107 = idf(docFreq=1729, maxDocs=44218)
                0.109375 = fieldNorm(doc=2923)
          0.1579999 = weight(abstract_txt:document in 2923) [ClassicSimilarity], result of:
            0.1579999 = score(doc=2923,freq=4.0), product of:
              0.16826256 = queryWeight, product of:
                1.6524022 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.023721954 = queryNorm
              0.93900806 = fieldWeight in 2923, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.109375 = fieldNorm(doc=2923)
        0.2 = coord(5/25)
    
  4. Azagury, A.; Factor, M.E.; Maarek, Y.S.; Mandler, B.: ¬A novel navigation paradigm for XML repositories (2002) 0.12
    0.119824916 = sum of:
      0.119824916 = product of:
        0.42794612 = sum of:
          0.041208163 = weight(abstract_txt:according in 463) [ClassicSimilarity], result of:
            0.041208163 = score(doc=463,freq=1.0), product of:
              0.12567256 = queryWeight, product of:
                1.0097811 = boost
                5.2464166 = idf(docFreq=632, maxDocs=44218)
                0.023721954 = queryNorm
              0.32790104 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2464166 = idf(docFreq=632, maxDocs=44218)
                0.0625 = fieldNorm(doc=463)
          0.043011624 = weight(abstract_txt:business in 463) [ClassicSimilarity], result of:
            0.043011624 = score(doc=463,freq=1.0), product of:
              0.129313 = queryWeight, product of:
                1.0243022 = boost
                5.321862 = idf(docFreq=586, maxDocs=44218)
                0.023721954 = queryNorm
              0.3326164 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.321862 = idf(docFreq=586, maxDocs=44218)
                0.0625 = fieldNorm(doc=463)
          0.053706545 = weight(abstract_txt:hierarchical in 463) [ClassicSimilarity], result of:
            0.053706545 = score(doc=463,freq=1.0), product of:
              0.14994654 = queryWeight, product of:
                1.1029993 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.023721954 = queryNorm
              0.35817128 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0625 = fieldNorm(doc=463)
          0.06996324 = weight(abstract_txt:communications in 463) [ClassicSimilarity], result of:
            0.06996324 = score(doc=463,freq=1.0), product of:
              0.17885382 = queryWeight, product of:
                1.2046368 = boost
                6.258808 = idf(docFreq=229, maxDocs=44218)
                0.023721954 = queryNorm
              0.3911755 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.258808 = idf(docFreq=229, maxDocs=44218)
                0.0625 = fieldNorm(doc=463)
          0.108935624 = weight(abstract_txt:massive in 463) [ClassicSimilarity], result of:
            0.108935624 = score(doc=463,freq=1.0), product of:
              0.24026927 = queryWeight, product of:
                1.3962274 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.023721954 = queryNorm
              0.45338973 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.0625 = fieldNorm(doc=463)
          0.045142826 = weight(abstract_txt:document in 463) [ClassicSimilarity], result of:
            0.045142826 = score(doc=463,freq=1.0), product of:
              0.16826256 = queryWeight, product of:
                1.6524022 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.023721954 = queryNorm
              0.26828802 = fieldWeight in 463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=463)
          0.06597811 = weight(abstract_txt:context in 463) [ClassicSimilarity], result of:
            0.06597811 = score(doc=463,freq=2.0), product of:
              0.17199595 = queryWeight, product of:
                1.6706333 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.023721954 = queryNorm
              0.3836027 = fieldWeight in 463, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.0625 = fieldNorm(doc=463)
        0.28 = coord(7/25)
    
  5. Hepp, M.; Bruijn, J. de: GenTax : a generic methodology for deriving OWL and RDF-S ontologies from hierarchical classifications, thesauri, and inconsistent taxonomies (2007) 0.12
    0.11884218 = sum of:
      0.11884218 = product of:
        0.49517575 = sum of:
          0.043011624 = weight(abstract_txt:business in 4692) [ClassicSimilarity], result of:
            0.043011624 = score(doc=4692,freq=1.0), product of:
              0.129313 = queryWeight, product of:
                1.0243022 = boost
                5.321862 = idf(docFreq=586, maxDocs=44218)
                0.023721954 = queryNorm
              0.3326164 = fieldWeight in 4692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.321862 = idf(docFreq=586, maxDocs=44218)
                0.0625 = fieldNorm(doc=4692)
          0.07595253 = weight(abstract_txt:hierarchical in 4692) [ClassicSimilarity], result of:
            0.07595253 = score(doc=4692,freq=2.0), product of:
              0.14994654 = queryWeight, product of:
                1.1029993 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.023721954 = queryNorm
              0.5065307 = fieldWeight in 4692, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0625 = fieldNorm(doc=4692)
          0.06556264 = weight(abstract_txt:classifications in 4692) [ClassicSimilarity], result of:
            0.06556264 = score(doc=4692,freq=1.0), product of:
              0.17127314 = queryWeight, product of:
                1.1788312 = boost
                6.124733 = idf(docFreq=262, maxDocs=44218)
                0.023721954 = queryNorm
              0.3827958 = fieldWeight in 4692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.124733 = idf(docFreq=262, maxDocs=44218)
                0.0625 = fieldNorm(doc=4692)
          0.07582826 = weight(abstract_txt:taxonomy in 4692) [ClassicSimilarity], result of:
            0.07582826 = score(doc=4692,freq=1.0), product of:
              0.18871468 = queryWeight, product of:
                1.2373993 = boost
                6.429029 = idf(docFreq=193, maxDocs=44218)
                0.023721954 = queryNorm
              0.4018143 = fieldWeight in 4692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.429029 = idf(docFreq=193, maxDocs=44218)
                0.0625 = fieldNorm(doc=4692)
          0.14151356 = weight(abstract_txt:categorization in 4692) [ClassicSimilarity], result of:
            0.14151356 = score(doc=4692,freq=3.0), product of:
              0.19833982 = queryWeight, product of:
                1.2685628 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.023721954 = queryNorm
              0.71349037 = fieldWeight in 4692, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=4692)
          0.093307145 = weight(abstract_txt:context in 4692) [ClassicSimilarity], result of:
            0.093307145 = score(doc=4692,freq=4.0), product of:
              0.17199595 = queryWeight, product of:
                1.6706333 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.023721954 = queryNorm
              0.54249614 = fieldWeight in 4692, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.0625 = fieldNorm(doc=4692)
        0.24 = coord(6/25)