Search (7 results, page 1 of 1)

  • author_ss:"Souza, R.R."
  • year_i:[2010 TO 2020}
  1. Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.02
    0.01709765 = product of:
      0.042744122 = sum of:
        0.031770762 = weight(_text_:b in 1441) [ClassicSimilarity], result of:
          0.031770762 = score(doc=1441,freq=4.0), product of:
            0.1434766 = queryWeight, product of:
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.04049623 = queryNorm
            0.22143513 = fieldWeight in 1441, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.03125 = fieldNorm(doc=1441)
        0.01097336 = product of:
          0.02194672 = sum of:
            0.02194672 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
              0.02194672 = score(doc=1441,freq=2.0), product of:
                0.1418109 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04049623 = queryNorm
                0.15476047 = fieldWeight in 1441, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1441)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
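
    The score tree above can be re-derived by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); queryNorm, fieldNorm and the coord factors are copied from the listing rather than computed, and the function names are illustrative only.

```python
import math

# Values copied from the explain output for doc 1441.
MAX_DOCS = 44218
QUERY_NORM = 0.04049623


def idf(doc_freq, max_docs=MAX_DOCS):
    """ClassicSimilarity inverse document frequency."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    """Score of one term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                        # tf(freq)
    term_idf = idf(doc_freq)
    query_weight = term_idf * query_norm        # idf * queryNorm
    field_weight = tf * term_idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight


# Term "b": freq=4.0, docFreq=3476, fieldNorm=0.03125
score_b = term_score(freq=4.0, doc_freq=3476, field_norm=0.03125)

# Term "22": freq=2.0, docFreq=3622, then the inner coord(1/2)
score_22 = term_score(freq=2.0, doc_freq=3622, field_norm=0.03125) * 0.5

# Outer sum of both clauses, then coord(2/5)
total = (score_b + score_22) * 0.4

print(f"b: {score_b:.9f}  22: {score_22:.9f}  total: {total:.9f}")
```

    Lucene computes these values in 32-bit floats, so the last digits may differ slightly from the double-precision result printed here.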
    
    Abstract
    This paper presents research on the syntactic structures known as noun phrases (NPs), applied to increase the effectiveness and efficiency of document classification mechanisms. Our hypothesis is that NPs can be used instead of single words as semantic aggregators, reducing the number of words used by the classification system without losing semantic coverage and thereby increasing its efficiency. The experiment divided the document classification process into three phases: a) NP preprocessing; b) system training; and c) classification experiments. In the first phase, a corpus of digitized texts was submitted to a natural language processing platform in which part-of-speech tagging was performed, and then Perl scripts from the PALAVRAS package were used to extract the noun phrases. The preprocessing also involved a) removing low-meaning NP pre-modifiers, such as quantifiers; b) identifying synonyms and replacing them with common hyperonyms; and c) stemming the relevant words contained in each NP, for similarity checking against other NPs. The first tests with the resulting documents demonstrated the effectiveness of this preprocessing. We compared the structural similarity of the documents before and after all the preprocessing steps of phase one. The texts remained consistent with the originals and kept their readability. The second phase involves submitting the modified documents to an SVM algorithm to identify clusters and classify the documents. The classification rules are to be established using a machine learning approach. Finally, tests will be conducted to check the effectiveness of the whole process.
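
    The three preprocessing steps named in the abstract (dropping low-meaning pre-modifiers, substituting hyperonyms, stemming) can be illustrated with a toy sketch. This is not the authors' pipeline: the paper used the PALAVRAS package, whereas the word lists, the suffix-stripping stemmer and the function names below are hypothetical English stand-ins chosen only to show how two different NPs can collapse to the same normalized form.

```python
import re

# Hypothetical resources; the paper does not give its actual lists.
QUANTIFIERS = {"some", "many", "several", "few", "all"}
HYPERONYMS = {
    "car": "vehicle", "cars": "vehicle",
    "truck": "vehicle", "trucks": "vehicle",
}


def crude_stem(word):
    """Very rough suffix stripping, standing in for a real stemmer."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[:-len(suffix)]
    return word


def normalize_np(noun_phrase):
    """Apply steps a), b) and c) to a single noun phrase."""
    tokens = re.findall(r"[a-z]+", noun_phrase.lower())
    # a) remove leading low-meaning pre-modifiers (quantifiers)
    while tokens and tokens[0] in QUANTIFIERS:
        tokens.pop(0)
    # b) replace known synonyms with a common hyperonym
    tokens = [HYPERONYMS.get(token, token) for token in tokens]
    # c) stem the remaining words for similarity checking
    return " ".join(crude_stem(token) for token in tokens)


first = normalize_np("Several red cars")
second = normalize_np("many red trucks")
print(first, "|", second, "|", first == second)
```

    Once normalized this way, NPs that differ only in quantifier, synonym choice or inflection compare as equal, which is the kind of similarity check the abstract describes before the SVM phase.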
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  2. Souza, R.R.; Coelho, F.C.; Higuchi, S.; Silva, D.L. da: The CPDOC semantic portal : applying semantic and knowledge organization systems to the Brazilian contemporary history domain (2012) 0.01
    0.007675644 = product of:
      0.03837822 = sum of:
        0.03837822 = weight(_text_:u in 859) [ClassicSimilarity], result of:
          0.03837822 = score(doc=859,freq=2.0), product of:
            0.13260265 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.04049623 = queryNorm
            0.28942272 = fieldWeight in 859, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.0625 = fieldNorm(doc=859)
      0.2 = coord(1/5)
    
    Source
    Categories, contexts and relations in knowledge organization: Proceedings of the Twelfth International ISKO Conference 6-9 August 2012, Mysore, India. Eds.: Neelameghan, A. u. K.S. Raghavan
  3. Coelho, F.C.; Souza, R.R.; Chada, D.M.; Cerdeira, P. de Camargo: Information mining and visualization of data from the Brazilian Supreme Court (STF) : a case study (2012) 0.01
    0.006716188 = product of:
      0.03358094 = sum of:
        0.03358094 = weight(_text_:u in 867) [ClassicSimilarity], result of:
          0.03358094 = score(doc=867,freq=2.0), product of:
            0.13260265 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.04049623 = queryNorm
            0.25324488 = fieldWeight in 867, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.0546875 = fieldNorm(doc=867)
      0.2 = coord(1/5)
    
    Source
    Categories, contexts and relations in knowledge organization: Proceedings of the Twelfth International ISKO Conference 6-9 August 2012, Mysore, India. Eds.: Neelameghan, A. u. K.S. Raghavan
  4. Simões, M. da Graça; Machado, L.M.; Souza, R.R.; Almeida, M.B.; Tavares Lopes, A.: Automatic indexing and ontologies : the consistency of research chronology and authoring in the context of Information Science (2018) 0.01
    0.006716188 = product of:
      0.03358094 = sum of:
        0.03358094 = weight(_text_:u in 5909) [ClassicSimilarity], result of:
          0.03358094 = score(doc=5909,freq=2.0), product of:
            0.13260265 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.04049623 = queryNorm
            0.25324488 = fieldWeight in 5909, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5909)
      0.2 = coord(1/5)
    
    Source
    Challenges and opportunities for knowledge organization in the digital age: proceedings of the Fifteenth International ISKO Conference, 9-11 July 2018, Porto, Portugal / organized by: International Society for Knowledge Organization (ISKO), ISKO Spain and Portugal Chapter, University of Porto - Faculty of Arts and Humanities, Research Centre in Communication, Information and Digital Culture (CIC.digital) - Porto. Eds.: F. Ribeiro u. M.E. Cerveira
  5. Coelho, F.C.; Souza, R.R.; Codeço, C.T.: Towards an ontology for mathematical modeling with application to epidemiology (2012) 0.01
    0.0057567325 = product of:
      0.028783662 = sum of:
        0.028783662 = weight(_text_:u in 838) [ClassicSimilarity], result of:
          0.028783662 = score(doc=838,freq=2.0), product of:
            0.13260265 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.04049623 = queryNorm
            0.21706703 = fieldWeight in 838, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.046875 = fieldNorm(doc=838)
      0.2 = coord(1/5)
    
    Source
    Categories, contexts and relations in knowledge organization: Proceedings of the Twelfth International ISKO Conference 6-9 August 2012, Mysore, India. Eds.: Neelameghan, A. u. K.S. Raghavan
  6. Almeida, M.B.; Souza, R.R.; Porto, R.B.: Looking for the identity of information science in the age of big data, computing clouds and social networks (2015) 0.01
    0.0057567325 = product of:
      0.028783662 = sum of:
        0.028783662 = weight(_text_:u in 3453) [ClassicSimilarity], result of:
          0.028783662 = score(doc=3453,freq=2.0), product of:
            0.13260265 = queryWeight, product of:
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.04049623 = queryNorm
            0.21706703 = fieldWeight in 3453, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2744443 = idf(docFreq=4547, maxDocs=44218)
              0.046875 = fieldNorm(doc=3453)
      0.2 = coord(1/5)
    
    Source
    Re:inventing information science in the networked society: Proceedings of the 14th International Symposium on Information Science (ISI 2015), Zadar, Croatia, 19th-21st May 2015. Eds.: Franjo Pehar, Christian Schlögl u. Christian Wolff
  7. Mesquita, L.A.P.; Souza, R.R.; Baracho Porto, R.M.A.: Noun phrases in automatic indexing : a structural analysis of the distribution of relevant terms in doctoral theses (2014) 0.00
    0.0021946721 = product of:
      0.01097336 = sum of:
        0.01097336 = product of:
          0.02194672 = sum of:
            0.02194672 = weight(_text_:22 in 1442) [ClassicSimilarity], result of:
              0.02194672 = score(doc=1442,freq=2.0), product of:
                0.1418109 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04049623 = queryNorm
                0.15476047 = fieldWeight in 1442, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1442)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik