Document (#36837)

Author
Rocha, R.
Cobo, A.
Title
Automatización de procesos de categorización jerárquica documental en las organizaciones
Source
Rev. Aporte Santiaguino. 3(2010) no.1, S.93-100
Year
2010
Abstract
In a global context characterized by the massive use of information technology and communications any organization needs to optimize the search and document management processes. In this paper an analysis of modern document management techniques and computational strategies with specialized language resources is presented and a model that can be used in automatic text categorization in the context of organizations is proposed.As a particular case we describe a classification system according to the taxonomy JEL (Journal of Economic Literature) and that makes use of multilingual glossaries for hierarchical classifications of scientific and technical documents related to the business functional areas.
Content
Vgl.: http://revistas.concytec.gob.pe/scielo.php?script=sci_arttext&pid=S2070-836X2010000100013&lng=pt&nrm=iso&tlng=es.
Theme
Klassifikationssysteme im Online-Retrieval
Field
Wirtschaftswissenschaften

Similar documents (author)

  1. Serrano, S. Cobo-=> Cobo-Serrano, S.: 1.93
    1.9302763 = sum of:
      1.9302763 = product of:
        3.8605525 = sum of:
          3.8605525 = weight(author_txt:cobo in 1160) [ClassicSimilarity], result of:
            3.8605525 = score(doc=1160,freq=2.0), product of:
              0.7360461 = queryWeight, product of:
                1.0427499 = boost
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.0713718 = queryNorm
              5.2449875 = fieldWeight in 1160, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.375 = fieldNorm(doc=1160)
        0.5 = coord(1/2)
    
  2. Souza, R. Rocha => Rocha Souza, R.: 1.70
    1.7024679 = sum of:
      1.7024679 = product of:
        3.4049358 = sum of:
          3.4049358 = weight(author_txt:rocha in 994) [ClassicSimilarity], result of:
            3.4049358 = score(doc=994,freq=2.0), product of:
              0.67693144 = queryWeight, product of:
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.0713718 = queryNorm
              5.029957 = fieldWeight in 994, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.375 = fieldNorm(doc=994)
        0.5 = coord(1/2)
    
  3. Silva, N.; Rocha, J.: Merging ontologies using a bottom-up lexical and structural approach (2003) 1.61
    1.6051023 = sum of:
      1.6051023 = product of:
        3.2102046 = sum of:
          3.2102046 = weight(author_txt:rocha in 3683) [ClassicSimilarity], result of:
            3.2102046 = score(doc=3683,freq=1.0), product of:
              0.67693144 = queryWeight, product of:
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.0713718 = queryNorm
              4.742289 = fieldWeight in 3683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.5 = fieldNorm(doc=3683)
        0.5 = coord(1/2)
    
  4. Rocha Souza, R. = > Souza, R.R.: 1.40
    1.4044645 = sum of:
      1.4044645 = product of:
        2.808929 = sum of:
          2.808929 = weight(author_txt:rocha in 2439) [ClassicSimilarity], result of:
            2.808929 = score(doc=2439,freq=1.0), product of:
              0.67693144 = queryWeight, product of:
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.0713718 = queryNorm
              4.1495028 = fieldWeight in 2439, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.4375 = fieldNorm(doc=2439)
        0.5 = coord(1/2)
    
  5. Rocha Souza, R. -> Souza, R.R.: 1.40
    1.4044645 = sum of:
      1.4044645 = product of:
        2.808929 = sum of:
          2.808929 = weight(author_txt:rocha in 2877) [ClassicSimilarity], result of:
            2.808929 = score(doc=2877,freq=1.0), product of:
              0.67693144 = queryWeight, product of:
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.0713718 = queryNorm
              4.1495028 = fieldWeight in 2877, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.4375 = fieldNorm(doc=2877)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Garcia Marco, F.J.: Contexto y determinantes funcionales de la clasificacion documental (1996) 0.23
    0.23197326 = sum of:
      0.23197326 = product of:
        1.4498329 = sum of:
          0.099487126 = weight(abstract_txt:classifications in 1378) [ClassicSimilarity], result of:
            0.099487126 = score(doc=1378,freq=1.0), product of:
              0.17237422 = queryWeight, product of:
                1.1849065 = boost
                6.1563497 = idf(docFreq=250, maxDocs=43556)
                0.023630068 = queryNorm
              0.5771578 = fieldWeight in 1378, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1563497 = idf(docFreq=250, maxDocs=43556)
                0.09375 = fieldNorm(doc=1378)
          0.067259766 = weight(abstract_txt:document in 1378) [ClassicSimilarity], result of:
            0.067259766 = score(doc=1378,freq=1.0), product of:
              0.16729179 = queryWeight, product of:
                1.6508219 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.023630068 = queryNorm
              0.4020506 = fieldWeight in 1378, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.09375 = fieldNorm(doc=1378)
          0.07045184 = weight(abstract_txt:context in 1378) [ClassicSimilarity], result of:
            0.07045184 = score(doc=1378,freq=1.0), product of:
              0.17254378 = queryWeight, product of:
                1.6765348 = boost
                4.355337 = idf(docFreq=1519, maxDocs=43556)
                0.023630068 = queryNorm
              0.40831286 = fieldWeight in 1378, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.355337 = idf(docFreq=1519, maxDocs=43556)
                0.09375 = fieldNorm(doc=1378)
          1.2126342 = weight(title_txt:documental in 1378) [ClassicSimilarity], result of:
            1.2126342 = score(doc=1378,freq=1.0), product of:
              0.40913042 = queryWeight, product of:
                1.8254873 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.023630068 = queryNorm
              2.9639306 = fieldWeight in 1378, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.3125 = fieldNorm(doc=1378)
        0.16 = coord(4/25)
    
  2. Esteban Navarro, M.A.: Fundamentos epistemologicos de la classificacion documental (1995) 0.12
    0.12269046 = sum of:
      0.12269046 = product of:
        1.5336308 = sum of:
          0.07846973 = weight(abstract_txt:document in 5613) [ClassicSimilarity], result of:
            0.07846973 = score(doc=5613,freq=1.0), product of:
              0.16729179 = queryWeight, product of:
                1.6508219 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.023630068 = queryNorm
              0.46905905 = fieldWeight in 5613, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.109375 = fieldNorm(doc=5613)
          1.4551611 = weight(title_txt:documental in 5613) [ClassicSimilarity], result of:
            1.4551611 = score(doc=5613,freq=1.0), product of:
              0.40913042 = queryWeight, product of:
                1.8254873 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.023630068 = queryNorm
              3.556717 = fieldWeight in 5613, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.375 = fieldNorm(doc=5613)
        0.08 = coord(2/25)
    
  3. Gonzalez, A.C.: Analisis y diseno de sistemas de gestion electronica de documentacion en grandes entidades (1997) 0.12
    0.12001536 = sum of:
      0.12001536 = product of:
        0.6000768 = sum of:
          0.0872329 = weight(abstract_txt:global in 3921) [ClassicSimilarity], result of:
            0.0872329 = score(doc=3921,freq=1.0), product of:
              0.14248969 = queryWeight, product of:
                1.0773073 = boost
                5.5973034 = idf(docFreq=438, maxDocs=43556)
                0.023630068 = queryNorm
              0.612205 = fieldWeight in 3921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5973034 = idf(docFreq=438, maxDocs=43556)
                0.109375 = fieldNorm(doc=3921)
          0.13855377 = weight(abstract_txt:functional in 3921) [ClassicSimilarity], result of:
            0.13855377 = score(doc=3921,freq=2.0), product of:
              0.15395676 = queryWeight, product of:
                1.1198176 = boost
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.023630068 = queryNorm
              0.8999525 = fieldWeight in 3921, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.109375 = fieldNorm(doc=3921)
          0.109798945 = weight(abstract_txt:economic in 3921) [ClassicSimilarity], result of:
            0.109798945 = score(doc=3921,freq=1.0), product of:
              0.16610983 = queryWeight, product of:
                1.1631764 = boost
                6.043448 = idf(docFreq=280, maxDocs=43556)
                0.023630068 = queryNorm
              0.6610021 = fieldWeight in 3921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.043448 = idf(docFreq=280, maxDocs=43556)
                0.109375 = fieldNorm(doc=3921)
          0.107551694 = weight(abstract_txt:management in 3921) [ClassicSimilarity], result of:
            0.107551694 = score(doc=3921,freq=2.0), product of:
              0.16383551 = queryWeight, product of:
                1.6336797 = boost
                4.2440076 = idf(docFreq=1698, maxDocs=43556)
                0.023630068 = queryNorm
              0.6564614 = fieldWeight in 3921, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2440076 = idf(docFreq=1698, maxDocs=43556)
                0.109375 = fieldNorm(doc=3921)
          0.15693946 = weight(abstract_txt:document in 3921) [ClassicSimilarity], result of:
            0.15693946 = score(doc=3921,freq=4.0), product of:
              0.16729179 = queryWeight, product of:
                1.6508219 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.023630068 = queryNorm
              0.9381181 = fieldWeight in 3921, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.109375 = fieldNorm(doc=3921)
        0.2 = coord(5/25)
    
  4. Azagury, A.; Factor, M.E.; Maarek, Y.S.; Mandler, B.: ¬A novel navigation paradigm for XML repositories (2002) 0.12
    0.11965167 = sum of:
      0.11965167 = product of:
        0.4273274 = sum of:
          0.04129555 = weight(abstract_txt:according in 1461) [ClassicSimilarity], result of:
            0.04129555 = score(doc=1461,freq=1.0), product of:
              0.1256871 = queryWeight, product of:
                1.0117967 = boost
                5.2569337 = idf(docFreq=616, maxDocs=43556)
                0.023630068 = queryNorm
              0.32855836 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2569337 = idf(docFreq=616, maxDocs=43556)
                0.0625 = fieldNorm(doc=1461)
          0.04272858 = weight(abstract_txt:business in 1461) [ClassicSimilarity], result of:
            0.04272858 = score(doc=1461,freq=1.0), product of:
              0.12857826 = queryWeight, product of:
                1.0233676 = boost
                5.317052 = idf(docFreq=580, maxDocs=43556)
                0.023630068 = queryNorm
              0.33231574 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.317052 = idf(docFreq=580, maxDocs=43556)
                0.0625 = fieldNorm(doc=1461)
          0.053656206 = weight(abstract_txt:hierarchical in 1461) [ClassicSimilarity], result of:
            0.053656206 = score(doc=1461,freq=1.0), product of:
              0.14965867 = queryWeight, product of:
                1.1040757 = boost
                5.736382 = idf(docFreq=381, maxDocs=43556)
                0.023630068 = queryNorm
              0.35852388 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.736382 = idf(docFreq=381, maxDocs=43556)
                0.0625 = fieldNorm(doc=1461)
          0.069479704 = weight(abstract_txt:communications in 1461) [ClassicSimilarity], result of:
            0.069479704 = score(doc=1461,freq=1.0), product of:
              0.17779814 = queryWeight, product of:
                1.2034042 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.023630068 = queryNorm
              0.39077857 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.0625 = fieldNorm(doc=1461)
          0.10890484 = weight(abstract_txt:massive in 1461) [ClassicSimilarity], result of:
            0.10890484 = score(doc=1461,freq=1.0), product of:
              0.23991276 = queryWeight, product of:
                1.3978951 = boost
                7.2629623 = idf(docFreq=82, maxDocs=43556)
                0.023630068 = queryNorm
              0.45393515 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2629623 = idf(docFreq=82, maxDocs=43556)
                0.0625 = fieldNorm(doc=1461)
          0.044839844 = weight(abstract_txt:document in 1461) [ClassicSimilarity], result of:
            0.044839844 = score(doc=1461,freq=1.0), product of:
              0.16729179 = queryWeight, product of:
                1.6508219 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.023630068 = queryNorm
              0.26803374 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.0625 = fieldNorm(doc=1461)
          0.066422634 = weight(abstract_txt:context in 1461) [ClassicSimilarity], result of:
            0.066422634 = score(doc=1461,freq=2.0), product of:
              0.17254378 = queryWeight, product of:
                1.6765348 = boost
                4.355337 = idf(docFreq=1519, maxDocs=43556)
                0.023630068 = queryNorm
              0.38496104 = fieldWeight in 1461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.355337 = idf(docFreq=1519, maxDocs=43556)
                0.0625 = fieldNorm(doc=1461)
        0.28 = coord(7/25)
    
  5. Hepp, M.; Bruijn, J. de: GenTax : a generic methodology for deriving OWL and RDF-S ontologies from hierarchical classifications, thesauri, and inconsistent taxonomies (2007) 0.12
    0.119472966 = sum of:
      0.119472966 = product of:
        0.49780405 = sum of:
          0.04272858 = weight(abstract_txt:business in 1690) [ClassicSimilarity], result of:
            0.04272858 = score(doc=1690,freq=1.0), product of:
              0.12857826 = queryWeight, product of:
                1.0233676 = boost
                5.317052 = idf(docFreq=580, maxDocs=43556)
                0.023630068 = queryNorm
              0.33231574 = fieldWeight in 1690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.317052 = idf(docFreq=580, maxDocs=43556)
                0.0625 = fieldNorm(doc=1690)
          0.075881325 = weight(abstract_txt:hierarchical in 1690) [ClassicSimilarity], result of:
            0.075881325 = score(doc=1690,freq=2.0), product of:
              0.14965867 = queryWeight, product of:
                1.1040757 = boost
                5.736382 = idf(docFreq=381, maxDocs=43556)
                0.023630068 = queryNorm
              0.5070293 = fieldWeight in 1690, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.736382 = idf(docFreq=381, maxDocs=43556)
                0.0625 = fieldNorm(doc=1690)
          0.06632475 = weight(abstract_txt:classifications in 1690) [ClassicSimilarity], result of:
            0.06632475 = score(doc=1690,freq=1.0), product of:
              0.17237422 = queryWeight, product of:
                1.1849065 = boost
                6.1563497 = idf(docFreq=250, maxDocs=43556)
                0.023630068 = queryNorm
              0.38477185 = fieldWeight in 1690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1563497 = idf(docFreq=250, maxDocs=43556)
                0.0625 = fieldNorm(doc=1690)
          0.07573674 = weight(abstract_txt:taxonomy in 1690) [ClassicSimilarity], result of:
            0.07573674 = score(doc=1690,freq=1.0), product of:
              0.18831849 = queryWeight, product of:
                1.2384953 = boost
                6.4347787 = idf(docFreq=189, maxDocs=43556)
                0.023630068 = queryNorm
              0.40217367 = fieldWeight in 1690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4347787 = idf(docFreq=189, maxDocs=43556)
                0.0625 = fieldNorm(doc=1690)
          0.14319688 = weight(abstract_txt:categorization in 1690) [ClassicSimilarity], result of:
            0.14319688 = score(doc=1690,freq=3.0), product of:
              0.19965056 = queryWeight, product of:
                1.2752143 = boost
                6.625557 = idf(docFreq=156, maxDocs=43556)
                0.023630068 = queryNorm
              0.7172376 = fieldWeight in 1690, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.625557 = idf(docFreq=156, maxDocs=43556)
                0.0625 = fieldNorm(doc=1690)
          0.09393579 = weight(abstract_txt:context in 1690) [ClassicSimilarity], result of:
            0.09393579 = score(doc=1690,freq=4.0), product of:
              0.17254378 = queryWeight, product of:
                1.6765348 = boost
                4.355337 = idf(docFreq=1519, maxDocs=43556)
                0.023630068 = queryNorm
              0.54441714 = fieldWeight in 1690, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.355337 = idf(docFreq=1519, maxDocs=43556)
                0.0625 = fieldNorm(doc=1690)
        0.24 = coord(6/25)