Search (5 results, page 1 of 1)

  • × author_ss:"Souza, R.R."
  • × year_i:[2010 TO 2020}
  1. Mesquita, L.A.P.; Souza, R.R.; Baracho Porto, R.M.A.: Noun phrases in automatic indexing: : a structural analysis of the distribution of relevant terms in doctoral theses (2014) 0.03
    0.034468703 = product of:
      0.068937406 = sum of:
        0.056384586 = weight(_text_:social in 1442) [ClassicSimilarity], result of:
          0.056384586 = score(doc=1442,freq=6.0), product of:
            0.1847249 = queryWeight, product of:
              3.9875789 = idf(docFreq=2228, maxDocs=44218)
              0.046325076 = queryNorm
            0.30523545 = fieldWeight in 1442, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.9875789 = idf(docFreq=2228, maxDocs=44218)
              0.03125 = fieldNorm(doc=1442)
        0.012552816 = product of:
          0.025105633 = sum of:
            0.025105633 = weight(_text_:22 in 1442) [ClassicSimilarity], result of:
              0.025105633 = score(doc=1442,freq=2.0), product of:
                0.16222252 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046325076 = queryNorm
                0.15476047 = fieldWeight in 1442, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1442)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Abstract
    The main objective of this research was to analyze whether there was a characteristic distribution behavior of relevant terms over a scientific text that could contribute as a criterion for their process of automatic indexing. The terms considered in this study were only full noun phrases contained in the texts themselves. The texts were considered a total of 98 doctoral theses of the eight areas of knowledge in a same university. Initially, 20 full noun phrases were automatically extracted from each text as candidates to be the most relevant terms, and each author of each text assigned a relevance value 0-6 (not relevant and highly relevant, respectively) for each of the 20 noun phrases sent. Only, 22.1 % of noun phrases were considered not relevant. A relevance values of the terms assigned by the authors were associated with their positions in the text. Each full noun phrases found in the text was considered as a valid linear position. The results that were obtained showed values resulting from this distribution by considering two types of position: linear, with values consolidated into ten equal consecutive parts; and structural, considering parts of the text (such as introduction, development and conclusion). As a result of considerable importance, all areas of knowledge related to the Natural Sciences showed a characteristic behavior in the distribution of relevant terms, as well as all areas of knowledge related to Social Sciences showed the same characteristic behavior of distribution, but distinct from the Natural Sciences. The difference of the distribution behavior between the Natural and Social Sciences can be clearly visualized through graphs. All behaviors, including the general behavior of all areas of knowledge together, were characterized in polynomial equations and can be applied in future as criteria for automatic indexing. Until the present date this work has become inedited of for two reasons: to present a method for characterizing the distribution of relevant terms in a scientific text, and also, through this method, pointing out a quantitative trait difference between the Natural and Social Sciences.
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  2. Almeida, M.B.; Souza, R.R.; Porto, R.B.: Looking for the identity of information science in the age of big data, computing clouds and social networks (2015) 0.02
    0.017264182 = product of:
      0.06905673 = sum of:
        0.06905673 = weight(_text_:social in 3453) [ClassicSimilarity], result of:
          0.06905673 = score(doc=3453,freq=4.0), product of:
            0.1847249 = queryWeight, product of:
              3.9875789 = idf(docFreq=2228, maxDocs=44218)
              0.046325076 = queryNorm
            0.3738355 = fieldWeight in 3453, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.9875789 = idf(docFreq=2228, maxDocs=44218)
              0.046875 = fieldNorm(doc=3453)
      0.25 = coord(1/4)
    
    Abstract
    In this paper we discuss, under a critical point of view, the current Information Science landscape and some future prospects regarding contemporary information phenomena. We present thoughts about the process of thematic deflation of Information Science, through the analysis of the research objects currently under development in this field. In addition to this, we look at the process of absorption of these and other relevant objects in distinguished knowledge fields. We seek to challenge the emphasis and the volume of interdisciplinary research within the field, and present some comments about what might be the results of such processes for the future of Information Science. Subsequently, we analyze the impact in the Information Science field due to phenomena like information boom, the consolidation of the social networks as interactive spaces, cloud computing, as well as other key elements.
  3. Souza, R.R.; Coelho, F.C.; Higuchi, S.; Silva, D.L da: ¬The CPDOC semantic portal : applying semantic and knowledge organization systems to the Brazilian contemporary history domain (2012) 0.02
    0.016276827 = product of:
      0.06510731 = sum of:
        0.06510731 = weight(_text_:social in 859) [ClassicSimilarity], result of:
          0.06510731 = score(doc=859,freq=2.0), product of:
            0.1847249 = queryWeight, product of:
              3.9875789 = idf(docFreq=2228, maxDocs=44218)
              0.046325076 = queryNorm
            0.3524555 = fieldWeight in 859, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.9875789 = idf(docFreq=2228, maxDocs=44218)
              0.0625 = fieldNorm(doc=859)
      0.25 = coord(1/4)
    
    Abstract
    Presents the semantic portal project of the center for teaching and research in the Social Sciences and Contemporary History (CPDOC) of the Fundação Getúlio Vargas, Rio de Janeiro. This project involves the use of semantic and visualization technologies and natural language processing techniques to allow enhanced ways to access the CPDOC collections.
  4. Café, L.M.A.; Souza, R.R.: Sentiment analysis and knowledge organization : an overview of the international literature (2017) 0.01
    0.0065351077 = product of:
      0.026140431 = sum of:
        0.026140431 = product of:
          0.052280862 = sum of:
            0.052280862 = weight(_text_:aspects in 3625) [ClassicSimilarity], result of:
              0.052280862 = score(doc=3625,freq=2.0), product of:
                0.20938325 = queryWeight, product of:
                  4.5198684 = idf(docFreq=1308, maxDocs=44218)
                  0.046325076 = queryNorm
                0.2496898 = fieldWeight in 3625, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.5198684 = idf(docFreq=1308, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3625)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Abstract
    Knowledge organization (KO) as an activity is, among other meanings, a process for conceptual modeling of knowledge domains that produces a consensual abstraction model of this domain with a particular purpose. It adopts a myriad of techniques to analyze and build efficient knowledge organization systems, and one of these techniques is called sentiment analysis (SA) or opinion mining, which is emerging as promising and useful in a variety of ways. It is based in NLP and AI algorithms, and aims at identifying opinions and emotions toward any person, organization or subject; evaluating them as positive or negative, in both binary and graded fashions. This study sought to show various aspects of the implementation of SA for knowledge organization tasks as register ed in the scientific literature. We began with exploratory bibliographic research and built a corpus of 91 scientific papers, written in English, selected in the LISA Database, between 2000 to 2016. We analyzed these papers and extracted title, year of publication, author(s) and institution(s), title of the journal where they were published, keywords, the LISA classification code, methods/techniques adopted and its application areas. Our main findings are that theoretical papers still prevail, which may indicate a field in the early stages. We found many institutions and authors from Asia, which points to a new shift in world expertise. We concluded that SA is still a novelty in the KO field, being slowly adopted as an aid to the main tasks, as document classification.
  5. Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: ¬The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.00
    0.003138204 = product of:
      0.012552816 = sum of:
        0.012552816 = product of:
          0.025105633 = sum of:
            0.025105633 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
              0.025105633 = score(doc=1441,freq=2.0), product of:
                0.16222252 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046325076 = queryNorm
                0.15476047 = fieldWeight in 1441, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1441)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik