Document (#686)

Author
Faraj, N.
Title
Analyse d'une méthode d'indexation automatique basée sur une analyse syntaxique de texte
Source
Canadian journal of information and library science. 21(1996) no.1, p.1-21
Year
1996
Abstract
Evaluates an automatic indexing method that combines syntactic text analysis with statistical analysis. Tests many combinations of term categories and weighting methods. The experiment, conducted on a software engineering corpus, shows that using syntactic term phrases as index terms yields a systematic improvement over using only individual words.
Footnote
Translated title: Analysis of an automatic indexing method based on syntactic analysis of text
Theme
Automatic indexing

Similar documents (content)

  1. Coret, A.; Menon, B.; Schibler, D.; Terrasse, C.: Un système d'indexation structurée à l'INIST : bilan d'une étude préalable (1994) 0.18
    0.18010074 = sum of:
      0.18010074 = product of:
        2.2512593 = sum of:
          1.1047933 = weight(title_txt:d'indexation in 754) [ClassicSimilarity], result of:
            1.1047933 = score(doc=754,freq=1.0), product of:
              0.372746 = queryWeight, product of:
                1.8270918 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.021509713 = queryNorm
              2.9639306 = fieldWeight in 754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.3125 = fieldNorm(doc=754)
          1.146466 = weight(title_txt:d'une in 754) [ClassicSimilarity], result of:
            1.146466 = score(doc=754,freq=1.0), product of:
              0.38206133 = queryWeight, product of:
                1.8497814 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.021509713 = queryNorm
              3.0007381 = fieldWeight in 754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.3125 = fieldNorm(doc=754)
        0.08 = coord(2/25)
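
The tree above can be recomputed by hand. The following is a minimal sketch, assuming the classic Lucene TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the figures shown here. The function names are illustrative and are not part of Lucene's API; boost, queryNorm and fieldNorm are copied from the tree rather than derived.

```python
import math

MAX_DOCS = 43556          # maxDocs, copied from the explain output
QUERY_NORM = 0.021509713  # queryNorm, copied from the explain output

def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed classic formula; it matches idf(docFreq=8) = 9.484578 above.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, boost, field_norm):
    # score = queryWeight * fieldWeight, as in the tree.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

# Document 754: two matching title terms out of 25 query terms.
s_indexation = term_score(freq=1.0, doc_freq=8, boost=1.8270918, field_norm=0.3125)
s_dune = term_score(freq=1.0, doc_freq=7, boost=1.8497814, field_norm=0.3125)
total_754 = (s_indexation + s_dune) * (2 / 25)  # coord(2/25)

print(s_indexation, s_dune, total_754)
```

Small differences in the last digits are expected, since the explain output is printed from single-precision values.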
    
  2. Lavallee, C.: Indexation manuelle et indexation assistée par ordinateur : comparaison de la performance de deux index d'une monographie (1996) 0.14
    0.14036514 = sum of:
      0.14036514 = product of:
        1.1697096 = sum of:
          0.0944407 = weight(abstract_txt:experiment in 738) [ClassicSimilarity], result of:
            0.0944407 = score(doc=738,freq=1.0), product of:
              0.13323708 = queryWeight, product of:
                1.0923616 = boost
                5.6705356 = idf(docFreq=407, maxDocs=43556)
                0.021509713 = queryNorm
              0.70881695 = fieldWeight in 738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6705356 = idf(docFreq=407, maxDocs=43556)
                0.125 = fieldNorm(doc=738)
          0.15809608 = weight(abstract_txt:evaluates in 738) [ClassicSimilarity], result of:
            0.15809608 = score(doc=738,freq=1.0), product of:
              0.18784504 = queryWeight, product of:
                1.297041 = boost
                6.7330427 = idf(docFreq=140, maxDocs=43556)
                0.021509713 = queryNorm
              0.84163034 = fieldWeight in 738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7330427 = idf(docFreq=140, maxDocs=43556)
                0.125 = fieldNorm(doc=738)
          0.9171728 = weight(title_txt:d'une in 738) [ClassicSimilarity], result of:
            0.9171728 = score(doc=738,freq=1.0), product of:
              0.38206133 = queryWeight, product of:
                1.8497814 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.021509713 = queryNorm
              2.4005904 = fieldWeight in 738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.25 = fieldNorm(doc=738)
        0.12 = coord(3/25)
    
  3. Grefenstette, G.: Explorations in automatic thesaurus discovery (1994) 0.14
    0.13859543 = sum of:
      0.13859543 = product of:
        0.577481 = sum of:
          0.054483607 = weight(abstract_txt:automatic in 1168) [ClassicSimilarity], result of:
            0.054483607 = score(doc=1168,freq=1.0), product of:
              0.11185499 = queryWeight, product of:
                1.0008789 = boost
                5.195642 = idf(docFreq=655, maxDocs=43556)
                0.021509713 = queryNorm
              0.48709142 = fieldWeight in 1168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.195642 = idf(docFreq=655, maxDocs=43556)
                0.09375 = fieldNorm(doc=1168)
          0.059614338 = weight(abstract_txt:words in 1168) [ClassicSimilarity], result of:
            0.059614338 = score(doc=1168,freq=1.0), product of:
              0.11877142 = queryWeight, product of:
                1.031359 = boost
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.021509713 = queryNorm
              0.50192493 = fieldWeight in 1168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.09375 = fieldNorm(doc=1168)
          0.18672417 = weight(abstract_txt:syntactic in 1168) [ClassicSimilarity], result of:
            0.18672417 = score(doc=1168,freq=3.0), product of:
              0.17629422 = queryWeight, product of:
                1.2565302 = boost
                6.5227475 = idf(docFreq=173, maxDocs=43556)
                0.021509713 = queryNorm
              1.0591621 = fieldWeight in 1168, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5227475 = idf(docFreq=173, maxDocs=43556)
                0.09375 = fieldNorm(doc=1168)
          0.03835566 = weight(abstract_txt:analysis in 1168) [ClassicSimilarity], result of:
            0.03835566 = score(doc=1168,freq=1.0), product of:
              0.11152557 = queryWeight, product of:
                1.4133707 = boost
                3.6684597 = idf(docFreq=3020, maxDocs=43556)
                0.021509713 = queryNorm
              0.34391809 = fieldWeight in 1168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6684597 = idf(docFreq=3020, maxDocs=43556)
                0.09375 = fieldNorm(doc=1168)
          0.086835235 = weight(abstract_txt:term in 1168) [ClassicSimilarity], result of:
            0.086835235 = score(doc=1168,freq=1.0), product of:
              0.19228798 = queryWeight, product of:
                1.8558588 = boost
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.021509713 = queryNorm
              0.45158952 = fieldWeight in 1168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.09375 = fieldNorm(doc=1168)
          0.15146796 = weight(abstract_txt:analyse in 1168) [ClassicSimilarity], result of:
            0.15146796 = score(doc=1168,freq=1.0), product of:
              0.2786348 = queryWeight, product of:
                2.2340174 = boost
                5.7984805 = idf(docFreq=358, maxDocs=43556)
                0.021509713 = queryNorm
              0.54360753 = fieldWeight in 1168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7984805 = idf(docFreq=358, maxDocs=43556)
                0.09375 = fieldNorm(doc=1168)
        0.24 = coord(6/25)
    
  4. Lioma, C.; Ounis, I.: A syntactically-based query reformulation technique for information retrieval (2008) 0.13
    0.1318421 = sum of:
      0.1318421 = product of:
        0.47086465 = sum of:
          0.071066946 = weight(abstract_txt:automatic in 4029) [ClassicSimilarity], result of:
            0.071066946 = score(doc=4029,freq=5.0), product of:
              0.11185499 = queryWeight, product of:
                1.0008789 = boost
                5.195642 = idf(docFreq=655, maxDocs=43556)
                0.021509713 = queryNorm
              0.6353489 = fieldWeight in 4029, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.195642 = idf(docFreq=655, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4029)
          0.03477503 = weight(abstract_txt:words in 4029) [ClassicSimilarity], result of:
            0.03477503 = score(doc=4029,freq=1.0), product of:
              0.11877142 = queryWeight, product of:
                1.031359 = boost
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.021509713 = queryNorm
              0.29278955 = fieldWeight in 4029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4029)
          0.038659886 = weight(abstract_txt:statistical in 4029) [ClassicSimilarity], result of:
            0.038659886 = score(doc=4029,freq=1.0), product of:
              0.12746002 = queryWeight, product of:
                1.0684172 = boost
                5.546238 = idf(docFreq=461, maxDocs=43556)
                0.021509713 = queryNorm
              0.3033099 = fieldWeight in 4029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.546238 = idf(docFreq=461, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4029)
          0.041317806 = weight(abstract_txt:experiment in 4029) [ClassicSimilarity], result of:
            0.041317806 = score(doc=4029,freq=1.0), product of:
              0.13323708 = queryWeight, product of:
                1.0923616 = boost
                5.6705356 = idf(docFreq=407, maxDocs=43556)
                0.021509713 = queryNorm
              0.3101074 = fieldWeight in 4029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6705356 = idf(docFreq=407, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4029)
          0.1257728 = weight(abstract_txt:syntactic in 4029) [ClassicSimilarity], result of:
            0.1257728 = score(doc=4029,freq=4.0), product of:
              0.17629422 = queryWeight, product of:
                1.2565302 = boost
                6.5227475 = idf(docFreq=173, maxDocs=43556)
                0.021509713 = queryNorm
              0.7134255 = fieldWeight in 4029, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5227475 = idf(docFreq=173, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4029)
          0.10861831 = weight(abstract_txt:weighting in 4029) [ClassicSimilarity], result of:
            0.10861831 = score(doc=4029,freq=2.0), product of:
              0.2014307 = queryWeight, product of:
                1.3431258 = boost
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.021509713 = queryNorm
              0.53923416 = fieldWeight in 4029, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4029)
          0.05065389 = weight(abstract_txt:term in 4029) [ClassicSimilarity], result of:
            0.05065389 = score(doc=4029,freq=1.0), product of:
              0.19228798 = queryWeight, product of:
                1.8558588 = boost
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.021509713 = queryNorm
              0.26342723 = fieldWeight in 4029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.0546875 = fieldNorm(doc=4029)
        0.28 = coord(7/25)
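
The term "syntactic" occurs more often in document 4029 (freq=4.0) than in document 1168 (freq=3.0), yet its fieldWeight is lower there. A minimal sketch of that comparison follows, assuming fieldWeight = sqrt(freq) * idf * fieldNorm as the trees indicate. The helper name is hypothetical, and the idf and fieldNorm values are copied from the output above; fieldNorm is taken as given here (in ClassicSimilarity it generally shrinks as the field gets longer).

```python
import math

IDF_SYNTACTIC = 6.5227475  # idf(docFreq=173, maxDocs=43556), copied from the trees

def field_weight(freq, idf, field_norm):
    # tf grows only with the square root of the raw term frequency.
    return math.sqrt(freq) * idf * field_norm

fw_1168 = field_weight(3.0, IDF_SYNTACTIC, 0.09375)    # document 1168
fw_4029 = field_weight(4.0, IDF_SYNTACTIC, 0.0546875)  # document 4029

print(fw_1168, fw_4029)
```

The smaller fieldNorm of document 4029 outweighs its higher term frequency, so the same term contributes less to that document's score.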
    
  5. Menillet, D.: Grilles d'indexation et de préindexation : l'exemple de PASCAL (1992) 0.09
    0.09417248 = sum of:
      0.09417248 = product of:
        1.177156 = sum of:
          1.1047933 = weight(title_txt:d'indexation in 5803) [ClassicSimilarity], result of:
            1.1047933 = score(doc=5803,freq=1.0), product of:
              0.372746 = queryWeight, product of:
                1.8270918 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.021509713 = queryNorm
              2.9639306 = fieldWeight in 5803, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.3125 = fieldNorm(doc=5803)
          0.0723627 = weight(abstract_txt:term in 5803) [ClassicSimilarity], result of:
            0.0723627 = score(doc=5803,freq=1.0), product of:
              0.19228798 = queryWeight, product of:
                1.8558588 = boost
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.021509713 = queryNorm
              0.37632462 = fieldWeight in 5803, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.078125 = fieldNorm(doc=5803)
        0.08 = coord(2/25)
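
The final score of each similar document is the sum of its matching term scores multiplied by coord, the fraction of the 25 query terms that matched. As a rough sketch, with the sums and match counts copied from the five trees above and the names chosen only for illustration, the listed ranking can be recomputed like this:

```python
QUERY_TERMS = 25  # denominator of every coord(n/25) above

# doc id -> (sum of matching term scores, number of matching query terms)
results = {
    754: (2.2512593, 2),
    738: (1.1697096, 3),
    1168: (0.577481, 6),
    4029: (0.47086465, 7),
    5803: (1.177156, 2),
}

def final_score(term_sum, matched, total=QUERY_TERMS):
    # coord rewards documents that match a larger share of the query terms.
    return term_sum * (matched / total)

ranking = sorted(results, key=lambda doc: final_score(*results[doc]), reverse=True)

for doc in ranking:
    print(doc, final_score(*results[doc]))
```

Documents 754 and 5803 match only two query terms each, but rare title terms with a high idf and a large fieldNorm give them large term scores; documents 1168 and 4029 match more terms with much smaller individual scores.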