Document (#686)

Author
Faraj, N.
Title
Analyse d'une methode d'indexation automatique basée sur une analyse syntaxique de texte
Source
Canadian journal of information and library science. 21(1996) no.1, pp. 1-21
Year
1996
Abstract
Evaluates an automatic indexing method that combines syntactic text analysis with statistical analysis. Tests many combinations of term categories and weighting methods. The experiment, conducted on a software engineering corpus, shows a systematic improvement when syntactic term phrases are used as index terms, compared with using individual words alone.
Footnote
Translation of title: Analysis of an automatic indexing method based on syntactic analysis of text
Theme
Automatic indexing

Similar documents (content)

  1. Coret, A.; Menon, B.; Schibler, D.; Terrasse, C.: Un système d'indexation structurée à l'INIST : bilan d'une étude préalable (1994) 0.18
    0.18110363 = sum of:
      0.18110363 = product of:
        2.2637954 = sum of:
          1.1109784 = weight(title_txt:d'indexation in 8757) [ClassicSimilarity], result of:
            1.1109784 = score(doc=8757,freq=1.0), product of:
              0.37423757 = queryWeight, product of:
                1.8347121 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.02147194 = queryNorm
              2.9686446 = fieldWeight in 8757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.3125 = fieldNorm(doc=8757)
          1.1528169 = weight(title_txt:d'une in 8757) [ClassicSimilarity], result of:
            1.1528169 = score(doc=8757,freq=1.0), product of:
              0.38357523 = queryWeight, product of:
                1.8574601 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.02147194 = queryNorm
              3.005452 = fieldWeight in 8757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=8757)
        0.08 = coord(2/25)
    
  2. Lavallee, C.: Indexation manuelle et indexation assistee par ordinateur : comparison de la performance de deux index d'une monographie (1996) 0.14
    0.1408944 = sum of:
      0.1408944 = product of:
        1.1741201 = sum of:
          0.09406193 = weight(abstract_txt:experiment in 740) [ClassicSimilarity], result of:
            0.09406193 = score(doc=740,freq=1.0), product of:
              0.13291672 = queryWeight, product of:
                1.0934125 = boost
                5.6614056 = idf(docFreq=417, maxDocs=44218)
                0.02147194 = queryNorm
              0.7076757 = fieldWeight in 740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6614056 = idf(docFreq=417, maxDocs=44218)
                0.125 = fieldNorm(doc=740)
          0.15780461 = weight(abstract_txt:evaluates in 740) [ClassicSimilarity], result of:
            0.15780461 = score(doc=740,freq=1.0), product of:
              0.18766508 = queryWeight, product of:
                1.2992297 = boost
                6.727074 = idf(docFreq=143, maxDocs=44218)
                0.02147194 = queryNorm
              0.84088427 = fieldWeight in 740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.727074 = idf(docFreq=143, maxDocs=44218)
                0.125 = fieldNorm(doc=740)
          0.9222535 = weight(title_txt:d'une in 740) [ClassicSimilarity], result of:
            0.9222535 = score(doc=740,freq=1.0), product of:
              0.38357523 = queryWeight, product of:
                1.8574601 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.02147194 = queryNorm
              2.4043615 = fieldWeight in 740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.25 = fieldNorm(doc=740)
        0.12 = coord(3/25)
    
  3. Grefenstette, G.: Explorations in automatic thesaurus discovery (1994) 0.14
    0.13812692 = sum of:
      0.13812692 = product of:
        0.57552886 = sum of:
          0.054526657 = weight(abstract_txt:automatic in 170) [ClassicSimilarity], result of:
            0.054526657 = score(doc=170,freq=1.0), product of:
              0.111944325 = queryWeight, product of:
                1.003449 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.02147194 = queryNorm
              0.48708728 = fieldWeight in 170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=170)
          0.059634242 = weight(abstract_txt:words in 170) [ClassicSimilarity], result of:
            0.059634242 = score(doc=170,freq=1.0), product of:
              0.118830144 = queryWeight, product of:
                1.0338501 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.02147194 = queryNorm
              0.5018444 = fieldWeight in 170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.09375 = fieldNorm(doc=170)
          0.18719086 = weight(abstract_txt:syntactic in 170) [ClassicSimilarity], result of:
            0.18719086 = score(doc=170,freq=3.0), product of:
              0.17663585 = queryWeight, product of:
                1.2604734 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.02147194 = queryNorm
              1.0597558 = fieldWeight in 170, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.09375 = fieldNorm(doc=170)
          0.037920646 = weight(abstract_txt:analysis in 170) [ClassicSimilarity], result of:
            0.037920646 = score(doc=170,freq=1.0), product of:
              0.1107108 = queryWeight, product of:
                1.4112508 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.02147194 = queryNorm
              0.34251985 = fieldWeight in 170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.09375 = fieldNorm(doc=170)
          0.08605636 = weight(abstract_txt:term in 170) [ClassicSimilarity], result of:
            0.08605636 = score(doc=170,freq=1.0), product of:
              0.19118837 = queryWeight, product of:
                1.8545561 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.02147194 = queryNorm
              0.45011294 = fieldWeight in 170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.09375 = fieldNorm(doc=170)
          0.1502001 = weight(abstract_txt:analyse in 170) [ClassicSimilarity], result of:
            0.1502001 = score(doc=170,freq=1.0), product of:
              0.27715304 = queryWeight, product of:
                2.232899 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.02147194 = queryNorm
              0.5419392 = fieldWeight in 170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.09375 = fieldNorm(doc=170)
        0.24 = coord(6/25)
    
  4. Lioma, C.; Ounis, I.: A syntactically-based query reformulation technique for information retrieval (2008) 0.13
    0.13188957 = sum of:
      0.13188957 = product of:
        0.47103414 = sum of:
          0.07112311 = weight(abstract_txt:automatic in 2031) [ClassicSimilarity], result of:
            0.07112311 = score(doc=2031,freq=5.0), product of:
              0.111944325 = queryWeight, product of:
                1.003449 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.02147194 = queryNorm
              0.63534355 = fieldWeight in 2031, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2031)
          0.034786638 = weight(abstract_txt:words in 2031) [ClassicSimilarity], result of:
            0.034786638 = score(doc=2031,freq=1.0), product of:
              0.118830144 = queryWeight, product of:
                1.0338501 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.02147194 = queryNorm
              0.29274255 = fieldWeight in 2031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2031)
          0.038692396 = weight(abstract_txt:statistical in 2031) [ClassicSimilarity], result of:
            0.038692396 = score(doc=2031,freq=1.0), product of:
              0.12756613 = queryWeight, product of:
                1.0711787 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.02147194 = queryNorm
              0.30331245 = fieldWeight in 2031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2031)
          0.041152094 = weight(abstract_txt:experiment in 2031) [ClassicSimilarity], result of:
            0.041152094 = score(doc=2031,freq=1.0), product of:
              0.13291672 = queryWeight, product of:
                1.0934125 = boost
                5.6614056 = idf(docFreq=417, maxDocs=44218)
                0.02147194 = queryNorm
              0.3096081 = fieldWeight in 2031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6614056 = idf(docFreq=417, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2031)
          0.12608714 = weight(abstract_txt:syntactic in 2031) [ClassicSimilarity], result of:
            0.12608714 = score(doc=2031,freq=4.0), product of:
              0.17663585 = queryWeight, product of:
                1.2604734 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.02147194 = queryNorm
              0.71382535 = fieldWeight in 2031, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2031)
          0.108993225 = weight(abstract_txt:weighting in 2031) [ClassicSimilarity], result of:
            0.108993225 = score(doc=2031,freq=2.0), product of:
              0.2019488 = queryWeight, product of:
                1.3477672 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.02147194 = queryNorm
              0.5397072 = fieldWeight in 2031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2031)
          0.050199542 = weight(abstract_txt:term in 2031) [ClassicSimilarity], result of:
            0.050199542 = score(doc=2031,freq=1.0), product of:
              0.19118837 = queryWeight, product of:
                1.8545561 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.02147194 = queryNorm
              0.26256588 = fieldWeight in 2031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2031)
        0.28 = coord(7/25)
    
  5. Menillet, D.: Grilles d'indexation et de préindexation : l'exemple de PASCAL (1992) 0.09
    0.09461536 = sum of:
      0.09461536 = product of:
        1.182692 = sum of:
          1.1109784 = weight(title_txt:d'indexation in 5806) [ClassicSimilarity], result of:
            1.1109784 = score(doc=5806,freq=1.0), product of:
              0.37423757 = queryWeight, product of:
                1.8347121 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.02147194 = queryNorm
              2.9686446 = fieldWeight in 5806, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.3125 = fieldNorm(doc=5806)
          0.071713634 = weight(abstract_txt:term in 5806) [ClassicSimilarity], result of:
            0.071713634 = score(doc=5806,freq=1.0), product of:
              0.19118837 = queryWeight, product of:
                1.8545561 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.02147194 = queryNorm
              0.37509412 = fieldWeight in 5806, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.078125 = fieldNorm(doc=5806)
        0.08 = coord(2/25)
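
The score trees above can be reproduced by hand. The following is a minimal Python sketch, not the search engine's own code. It assumes the formulas that the printed values appear to follow: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final multiplication by coord(matched/total). The function names (idf, tf, term_score, doc_score) are hypothetical and chosen for illustration only. All input numbers are copied from the tree for result 1 (doc 8757).

```python
import math

# Assumed formulas, inferred from the values printed in the explain trees.

def idf(doc_freq, max_docs):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq):
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)

def term_score(boost, doc_freq, max_docs, query_norm, freq, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    i = idf(doc_freq, max_docs)
    query_weight = boost * i * query_norm
    field_weight = tf(freq) * i * field_norm
    return query_weight * field_weight

def doc_score(terms, matched, total_clauses):
    """Sum of the term scores, scaled by coord(matched/total_clauses)."""
    return sum(term_score(**t) for t in terms) * (matched / total_clauses)

MAX_DOCS = 44218
QUERY_NORM = 0.02147194

# The two matching title terms listed for result 1 (doc 8757).
coret_terms = [
    dict(boost=1.8347121, doc_freq=8, max_docs=MAX_DOCS,
         query_norm=QUERY_NORM, freq=1.0, field_norm=0.3125),  # d'indexation
    dict(boost=1.8574601, doc_freq=7, max_docs=MAX_DOCS,
         query_norm=QUERY_NORM, freq=1.0, field_norm=0.3125),  # d'une
]

score = doc_score(coret_terms, matched=2, total_clauses=25)
print(f"{score:.4f}")
```

Under these assumptions the result agrees with the listed 0.18110363 to about four decimal places. Small differences in later digits are expected, because the sketch uses double precision while the listing shows single-precision values. The coord factor explains why result 1 outranks result 4 despite matching fewer terms: its two rare title terms (docFreq 7 and 8) carry far higher idf than the seven more common abstract terms matched by result 4.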