Document (#20639)

Author
Gil-Leiva, I.
Munoz, J.V.R.
Title
Analisis de los descriptores de diferentes areas del conocimiento indizades en bases de datos del CSIC : Aplicacion a la indizacion automatica
Source
Revista Española de Documentaçion Cientifica. 20(1997) no.2, S.150-160
Year
1997
Abstract
Studies the value of scientific articles' titles and abstracts as sources of terms for document indexing in relation to 6 areas of knowledge: library and information science, medicine, chemistry, biology, psychology and physics, indexed in the databases ISOC, IME and ICYT of the CSIC. Also examines the syntagmatic structures of the indexing terms found in the field 'descriptors'. as well as the relationship between length of document and number of descriptors. Concludes that if the abstracts are not well made and the titles are not precise, they are not definitive sources for the extractions of concepts; the most common syntactic structure is the noun phrase, followed by noun+adjective and noun+noun; and no significant relationship was found between length of document and number of descriptors assigned to it
Footnote
Übers. d. Titels: Descriptors analysis on different knowledge ares in CSIC databases: application on automatic indexing
Theme
Automatisches Indexieren
Field
Physik
Chemie
Medizin
Psychologie
Biologie
Informationswissenschaft
Bibliothekswesen

Similar documents (author)

  1. Gil-Leiva, I.; Munoz, V.R.: ¬Los origines del almacenamiento y recuperacion de informacion (1996) 5.70
    5.698863 = sum of:
      5.698863 = sum of:
        2.7277508 = weight(author_txt:leiva in 5586) [ClassicSimilarity], result of:
          2.7277508 = score(doc=5586,freq=1.0), product of:
            0.6866909 = queryWeight, product of:
              9.079571 = idf(docFreq=12, maxDocs=41962)
              0.07563033 = queryNorm
            3.9723122 = fieldWeight in 5586, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.079571 = idf(docFreq=12, maxDocs=41962)
              0.4375 = fieldNorm(doc=5586)
        2.9711125 = weight(author_txt:munoz in 5586) [ClassicSimilarity], result of:
          2.9711125 = score(doc=5586,freq=1.0), product of:
            0.7269495 = queryWeight, product of:
              1.028896 = boost
              9.341934 = idf(docFreq=9, maxDocs=41962)
              0.07563033 = queryNorm
            4.087096 = fieldWeight in 5586, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.341934 = idf(docFreq=9, maxDocs=41962)
              0.4375 = fieldNorm(doc=5586)
    
  2. Munoz, A.M.; Munoz, F.A.: Nuevas areas de conocimiento y la problematica documental : la prospectiva de la paz en la Universidad de Granada (1997) 2.40
    2.4010215 = sum of:
      2.4010215 = product of:
        4.802043 = sum of:
          4.802043 = weight(author_txt:munoz in 1341) [ClassicSimilarity], result of:
            4.802043 = score(doc=1341,freq=2.0), product of:
              0.7269495 = queryWeight, product of:
                1.028896 = boost
                9.341934 = idf(docFreq=9, maxDocs=41962)
                0.07563033 = queryNorm
              6.605745 = fieldWeight in 1341, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.341934 = idf(docFreq=9, maxDocs=41962)
                0.5 = fieldNorm(doc=1341)
        0.5 = coord(1/2)
    
  3. Munoz, J.V.R.: Documentos electronicos y normalizacion : informacion y conocimiento (1997) 2.12
    2.1222234 = sum of:
      2.1222234 = product of:
        4.2444468 = sum of:
          4.2444468 = weight(author_txt:munoz in 4227) [ClassicSimilarity], result of:
            4.2444468 = score(doc=4227,freq=1.0), product of:
              0.7269495 = queryWeight, product of:
                1.028896 = boost
                9.341934 = idf(docFreq=9, maxDocs=41962)
                0.07563033 = queryNorm
              5.838709 = fieldWeight in 4227, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.341934 = idf(docFreq=9, maxDocs=41962)
                0.625 = fieldNorm(doc=4227)
        0.5 = coord(1/2)
    
  4. Leiva, I.G. -> Gil-Leiva, I.: 1.93
    1.9288111 = sum of:
      1.9288111 = product of:
        3.8576221 = sum of:
          3.8576221 = weight(author_txt:leiva in 98) [ClassicSimilarity], result of:
            3.8576221 = score(doc=98,freq=2.0), product of:
              0.6866909 = queryWeight, product of:
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.07563033 = queryNorm
              5.6176977 = fieldWeight in 98, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.4375 = fieldNorm(doc=98)
        0.5 = coord(1/2)
    
  5. Fernández, F.J. Munoz- -> Munoz-Fernández, F.J.: 1.80
    1.8007661 = sum of:
      1.8007661 = product of:
        3.6015322 = sum of:
          3.6015322 = weight(author_txt:munoz in 3708) [ClassicSimilarity], result of:
            3.6015322 = score(doc=3708,freq=2.0), product of:
              0.7269495 = queryWeight, product of:
                1.028896 = boost
                9.341934 = idf(docFreq=9, maxDocs=41962)
                0.07563033 = queryNorm
              4.9543085 = fieldWeight in 3708, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.341934 = idf(docFreq=9, maxDocs=41962)
                0.375 = fieldNorm(doc=3708)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Mesquita, L.A.P.; Souza, R.R.; Baracho Porto, R.M.A.: Noun phrases in automatic indexing: : a structural analysis of the distribution of relevant terms in doctoral theses (2014) 0.18
    0.18073748 = sum of:
      0.18073748 = product of:
        0.645491 = sum of:
          0.013844697 = weight(abstract_txt:between in 3443) [ClassicSimilarity], result of:
            0.013844697 = score(doc=3443,freq=2.0), product of:
              0.05953929 = queryWeight, product of:
                3.5077088 = idf(docFreq=3417, maxDocs=41962)
                0.01697384 = queryNorm
              0.23253044 = fieldWeight in 3443, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5077088 = idf(docFreq=3417, maxDocs=41962)
                0.046875 = fieldNorm(doc=3443)
          0.013990443 = weight(abstract_txt:well in 3443) [ClassicSimilarity], result of:
            0.013990443 = score(doc=3443,freq=1.0), product of:
              0.07554035 = queryWeight, product of:
                1.1263871 = boost
                3.9510381 = idf(docFreq=2193, maxDocs=41962)
                0.01697384 = queryNorm
              0.18520491 = fieldWeight in 3443, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9510381 = idf(docFreq=2193, maxDocs=41962)
                0.046875 = fieldNorm(doc=3443)
          0.037218582 = weight(abstract_txt:terms in 3443) [ClassicSimilarity], result of:
            0.037218582 = score(doc=3443,freq=6.0), product of:
              0.079814315 = queryWeight, product of:
                1.1578134 = boost
                4.061272 = idf(docFreq=1964, maxDocs=41962)
                0.01697384 = queryNorm
              0.4663146 = fieldWeight in 3443, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.061272 = idf(docFreq=1964, maxDocs=41962)
                0.046875 = fieldNorm(doc=3443)
          0.026113823 = weight(abstract_txt:indexing in 3443) [ClassicSimilarity], result of:
            0.026113823 = score(doc=3443,freq=2.0), product of:
              0.09089255 = queryWeight, product of:
                1.2355556 = boost
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.01697384 = queryNorm
              0.2873043 = fieldWeight in 3443, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.046875 = fieldNorm(doc=3443)
          0.020969575 = weight(abstract_txt:found in 3443) [ClassicSimilarity], result of:
            0.020969575 = score(doc=3443,freq=1.0), product of:
              0.09893526 = queryWeight, product of:
                1.2890618 = boost
                4.521653 = idf(docFreq=1239, maxDocs=41962)
                0.01697384 = queryNorm
              0.2119525 = fieldWeight in 3443, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.521653 = idf(docFreq=1239, maxDocs=41962)
                0.046875 = fieldNorm(doc=3443)
          0.053005584 = weight(abstract_txt:areas in 3443) [ClassicSimilarity], result of:
            0.053005584 = score(doc=3443,freq=4.0), product of:
              0.11565181 = queryWeight, product of:
                1.3937163 = boost
                4.888751 = idf(docFreq=858, maxDocs=41962)
                0.01697384 = queryNorm
              0.4583204 = fieldWeight in 3443, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.888751 = idf(docFreq=858, maxDocs=41962)
                0.046875 = fieldNorm(doc=3443)
          0.4803483 = weight(abstract_txt:noun in 3443) [ClassicSimilarity], result of:
            0.4803483 = score(doc=3443,freq=5.0), product of:
              0.58796144 = queryWeight, product of:
                4.4441385 = boost
                7.794372 = idf(docFreq=46, maxDocs=41962)
                0.01697384 = queryNorm
              0.81697243 = fieldWeight in 3443, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.794372 = idf(docFreq=46, maxDocs=41962)
                0.046875 = fieldNorm(doc=3443)
        0.28 = coord(7/25)
    
  2. Souza, R.R.; Raghavan, K.S.: ¬A methodology for noun phrase-based automatic indexing (2006) 0.18
    0.17755938 = sum of:
      0.17755938 = product of:
        0.8877969 = sum of:
          0.025324035 = weight(abstract_txt:terms in 1299) [ClassicSimilarity], result of:
            0.025324035 = score(doc=1299,freq=1.0), product of:
              0.079814315 = queryWeight, product of:
                1.1578134 = boost
                4.061272 = idf(docFreq=1964, maxDocs=41962)
                0.01697384 = queryNorm
              0.31728688 = fieldWeight in 1299, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.061272 = idf(docFreq=1964, maxDocs=41962)
                0.078125 = fieldNorm(doc=1299)
          0.030775433 = weight(abstract_txt:indexing in 1299) [ClassicSimilarity], result of:
            0.030775433 = score(doc=1299,freq=1.0), product of:
              0.09089255 = queryWeight, product of:
                1.2355556 = boost
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.01697384 = queryNorm
              0.33859137 = fieldWeight in 1299, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.078125 = fieldNorm(doc=1299)
          0.044498626 = weight(abstract_txt:document in 1299) [ClassicSimilarity], result of:
            0.044498626 = score(doc=1299,freq=1.0), product of:
              0.13304146 = queryWeight, product of:
                1.8307848 = boost
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.01697384 = queryNorm
              0.33447188 = fieldWeight in 1299, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.078125 = fieldNorm(doc=1299)
          0.16707186 = weight(abstract_txt:descriptors in 1299) [ClassicSimilarity], result of:
            0.16707186 = score(doc=1299,freq=1.0), product of:
              0.3213844 = queryWeight, product of:
                2.8454845 = boost
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.01697384 = queryNorm
              0.51985055 = fieldWeight in 1299, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.078125 = fieldNorm(doc=1299)
          0.6201269 = weight(abstract_txt:noun in 1299) [ClassicSimilarity], result of:
            0.6201269 = score(doc=1299,freq=3.0), product of:
              0.58796144 = queryWeight, product of:
                4.4441385 = boost
                7.794372 = idf(docFreq=46, maxDocs=41962)
                0.01697384 = queryNorm
              1.0547068 = fieldWeight in 1299, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.794372 = idf(docFreq=46, maxDocs=41962)
                0.078125 = fieldNorm(doc=1299)
        0.2 = coord(5/25)
    
  3. Rodriguez Bravo, B.: ¬The visibility of women in indexing languages (2006) 0.16
    0.15675345 = sum of:
      0.15675345 = product of:
        0.7837672 = sum of:
          0.030775433 = weight(abstract_txt:indexing in 2264) [ClassicSimilarity], result of:
            0.030775433 = score(doc=2264,freq=1.0), product of:
              0.09089255 = queryWeight, product of:
                1.2355556 = boost
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.01697384 = queryNorm
              0.33859137 = fieldWeight in 2264, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.078125 = fieldNorm(doc=2264)
          0.18079664 = weight(abstract_txt:adjective in 2264) [ClassicSimilarity], result of:
            0.18079664 = score(doc=2264,freq=1.0), product of:
              0.23487803 = queryWeight, product of:
                1.4044439 = boost
                9.85276 = idf(docFreq=5, maxDocs=41962)
                0.01697384 = queryNorm
              0.7697469 = fieldWeight in 2264, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.85276 = idf(docFreq=5, maxDocs=41962)
                0.078125 = fieldNorm(doc=2264)
          0.04709285 = weight(abstract_txt:relationship in 2264) [ClassicSimilarity], result of:
            0.04709285 = score(doc=2264,freq=1.0), product of:
              0.12069673 = queryWeight, product of:
                1.42379 = boost
                4.9942408 = idf(docFreq=772, maxDocs=41962)
                0.01697384 = queryNorm
              0.39017504 = fieldWeight in 2264, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9942408 = idf(docFreq=772, maxDocs=41962)
                0.078125 = fieldNorm(doc=2264)
          0.16707186 = weight(abstract_txt:descriptors in 2264) [ClassicSimilarity], result of:
            0.16707186 = score(doc=2264,freq=1.0), product of:
              0.3213844 = queryWeight, product of:
                2.8454845 = boost
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.01697384 = queryNorm
              0.51985055 = fieldWeight in 2264, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.078125 = fieldNorm(doc=2264)
          0.35803047 = weight(abstract_txt:noun in 2264) [ClassicSimilarity], result of:
            0.35803047 = score(doc=2264,freq=1.0), product of:
              0.58796144 = queryWeight, product of:
                4.4441385 = boost
                7.794372 = idf(docFreq=46, maxDocs=41962)
                0.01697384 = queryNorm
              0.6089353 = fieldWeight in 2264, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.794372 = idf(docFreq=46, maxDocs=41962)
                0.078125 = fieldNorm(doc=2264)
        0.2 = coord(5/25)
    
  4. Lopez-Ostenero, F.; Gonzalo, J.; Verdejo, F.: Noun phrases as building blocks for cross-language search assistance (2005) 0.14
    0.1422735 = sum of:
      0.1422735 = product of:
        0.7113675 = sum of:
          0.06768874 = weight(abstract_txt:phrase in 3022) [ClassicSimilarity], result of:
            0.06768874 = score(doc=3022,freq=1.0), product of:
              0.12200935 = queryWeight, product of:
                1.0122312 = boost
                7.101225 = idf(docFreq=93, maxDocs=41962)
                0.01697384 = queryNorm
              0.5547832 = fieldWeight in 3022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.101225 = idf(docFreq=93, maxDocs=41962)
                0.078125 = fieldNorm(doc=3022)
          0.025324035 = weight(abstract_txt:terms in 3022) [ClassicSimilarity], result of:
            0.025324035 = score(doc=3022,freq=1.0), product of:
              0.079814315 = queryWeight, product of:
                1.1578134 = boost
                4.061272 = idf(docFreq=1964, maxDocs=41962)
                0.01697384 = queryNorm
              0.31728688 = fieldWeight in 3022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.061272 = idf(docFreq=1964, maxDocs=41962)
                0.078125 = fieldNorm(doc=3022)
          0.03494929 = weight(abstract_txt:found in 3022) [ClassicSimilarity], result of:
            0.03494929 = score(doc=3022,freq=1.0), product of:
              0.09893526 = queryWeight, product of:
                1.2890618 = boost
                4.521653 = idf(docFreq=1239, maxDocs=41962)
                0.01697384 = queryNorm
              0.35325414 = fieldWeight in 3022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.521653 = idf(docFreq=1239, maxDocs=41962)
                0.078125 = fieldNorm(doc=3022)
          0.07707388 = weight(abstract_txt:document in 3022) [ClassicSimilarity], result of:
            0.07707388 = score(doc=3022,freq=3.0), product of:
              0.13304146 = queryWeight, product of:
                1.8307848 = boost
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.01697384 = queryNorm
              0.5793223 = fieldWeight in 3022, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.28124 = idf(docFreq=1576, maxDocs=41962)
                0.078125 = fieldNorm(doc=3022)
          0.50633156 = weight(abstract_txt:noun in 3022) [ClassicSimilarity], result of:
            0.50633156 = score(doc=3022,freq=2.0), product of:
              0.58796144 = queryWeight, product of:
                4.4441385 = boost
                7.794372 = idf(docFreq=46, maxDocs=41962)
                0.01697384 = queryNorm
              0.86116457 = fieldWeight in 3022, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.794372 = idf(docFreq=46, maxDocs=41962)
                0.078125 = fieldNorm(doc=3022)
        0.2 = coord(5/25)
    
  5. Larouk, O.: Modelling users need : schemas of interrogation and filtering of answers from the WEB in co-operative mode (1998) 0.14
    0.13561119 = sum of:
      0.13561119 = product of:
        0.56504667 = sum of:
          0.017726826 = weight(abstract_txt:terms in 1061) [ClassicSimilarity], result of:
            0.017726826 = score(doc=1061,freq=1.0), product of:
              0.079814315 = queryWeight, product of:
                1.1578134 = boost
                4.061272 = idf(docFreq=1964, maxDocs=41962)
                0.01697384 = queryNorm
              0.22210082 = fieldWeight in 1061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.061272 = idf(docFreq=1964, maxDocs=41962)
                0.0546875 = fieldNorm(doc=1061)
          0.030466126 = weight(abstract_txt:indexing in 1061) [ClassicSimilarity], result of:
            0.030466126 = score(doc=1061,freq=2.0), product of:
              0.09089255 = queryWeight, product of:
                1.2355556 = boost
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.01697384 = queryNorm
              0.33518836 = fieldWeight in 1061, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3339696 = idf(docFreq=1495, maxDocs=41962)
                0.0546875 = fieldNorm(doc=1061)
          0.07014836 = weight(abstract_txt:titles in 1061) [ClassicSimilarity], result of:
            0.07014836 = score(doc=1061,freq=2.0), product of:
              0.15848756 = queryWeight, product of:
                1.6315327 = boost
                5.7229414 = idf(docFreq=372, maxDocs=41962)
                0.01697384 = queryNorm
              0.44261116 = fieldWeight in 1061, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7229414 = idf(docFreq=372, maxDocs=41962)
                0.0546875 = fieldNorm(doc=1061)
          0.079133704 = weight(abstract_txt:abstracts in 1061) [ClassicSimilarity], result of:
            0.079133704 = score(doc=1061,freq=2.0), product of:
              0.17174779 = queryWeight, product of:
                1.6984148 = boost
                5.9575443 = idf(docFreq=294, maxDocs=41962)
                0.01697384 = queryNorm
              0.4607553 = fieldWeight in 1061, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9575443 = idf(docFreq=294, maxDocs=41962)
                0.0546875 = fieldNorm(doc=1061)
          0.1169503 = weight(abstract_txt:descriptors in 1061) [ClassicSimilarity], result of:
            0.1169503 = score(doc=1061,freq=1.0), product of:
              0.3213844 = queryWeight, product of:
                2.8454845 = boost
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.01697384 = queryNorm
              0.3638954 = fieldWeight in 1061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.654087 = idf(docFreq=146, maxDocs=41962)
                0.0546875 = fieldNorm(doc=1061)
          0.25062135 = weight(abstract_txt:noun in 1061) [ClassicSimilarity], result of:
            0.25062135 = score(doc=1061,freq=1.0), product of:
              0.58796144 = queryWeight, product of:
                4.4441385 = boost
                7.794372 = idf(docFreq=46, maxDocs=41962)
                0.01697384 = queryNorm
              0.42625472 = fieldWeight in 1061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.794372 = idf(docFreq=46, maxDocs=41962)
                0.0546875 = fieldNorm(doc=1061)
        0.24 = coord(6/25)