Document (#26852)

Author
Ferret, O.
Grau, B.
Hurault-Plantet, M.
Illouz, G.
Jacquemin, C.
Monceaux, L.
Robba, I.
Vilnat, A.
Title
How NLP can improve question answering
Source
Knowledge organization. 29(2002) nos.3/4, S.135-155
Year
2002
Abstract
Answering open-domain factual questions requires Natural Language processing for refining document selection and answer identification. With our system QALC, we have participated in the Question Answering track of the TREC8, TREC9 and TREC10 evaluations. QALC performs an analysis of documents relying an multiword term searches and their linguistic variation both to minimize the number of documents selected and to provide additional clues when comparing question and sentence representations. This comparison process also makes use of the results of a syntactic parsing of the questions and Named Entity recognition functionalities. Answer extraction relies an the application of syntactic patterns chosen according to the kind of information that is sought, and categorized depending an the syntactic form of the question. These patterns allow QALC to handle nicely linguistic variations at the answer level.
Theme
Computerlinguistik
Retrievalstudien
Sprachretrieval
Object
TREC

Similar documents (author)

  1. Grau, O.: Infos lokal gewoben : die WWW-Sprache HTML und die passende Software (1994) 6.00
    5.9971275 = sum of:
      5.9971275 = weight(author_txt:grau in 566) [ClassicSimilarity], result of:
        5.9971275 = fieldWeight in 566, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.625 = fieldNorm(doc=566)
    
  2. Grau, O.: Alles integriert : Informationssurfen im World Wide Web (1994) 6.00
    5.9971275 = sum of:
      5.9971275 = weight(author_txt:grau in 613) [ClassicSimilarity], result of:
        5.9971275 = fieldWeight in 613, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.625 = fieldNorm(doc=613)
    
  3. Grau, B.: Finding answers to questions, in text collections or Web, in open domain or specialty domains (2012) 6.00
    5.9971275 = sum of:
      5.9971275 = weight(author_txt:grau in 1572) [ClassicSimilarity], result of:
        5.9971275 = fieldWeight in 1572, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.625 = fieldNorm(doc=1572)
    
  4. Grau, J.E.; Mehrotra, R.: Similar shape retrieval using a structural feature index (1993) 4.80
    4.797702 = sum of:
      4.797702 = weight(author_txt:grau in 332) [ClassicSimilarity], result of:
        4.797702 = fieldWeight in 332, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.5 = fieldNorm(doc=332)
    
  5. Ferret, O.; Grau, B.; Masson, N.: Utilisation d'un réseau de cooccurences lexikales pour a méliorer une analyse thématique fondée sur la distribution des mots (1999) 3.60
    3.5982764 = sum of:
      3.5982764 = weight(author_txt:grau in 1296) [ClassicSimilarity], result of:
        3.5982764 = fieldWeight in 1296, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.375 = fieldNorm(doc=1296)
    

Similar documents (content)

  1. Grau, B.: Finding answers to questions, in text collections or Web, in open domain or specialty domains (2012) 0.29
    0.2875945 = sum of:
      0.2875945 = product of:
        0.8987328 = sum of:
          0.10555706 = weight(abstract_txt:factual in 1572) [ClassicSimilarity], result of:
            0.10555706 = score(doc=1572,freq=2.0), product of:
              0.15788557 = queryWeight, product of:
                1.1047888 = boost
                7.563971 = idf(docFreq=60, maxDocs=43254)
                0.01889354 = queryNorm
              0.6685669 = fieldWeight in 1572, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.563971 = idf(docFreq=60, maxDocs=43254)
                0.0625 = fieldNorm(doc=1572)
          0.093315385 = weight(abstract_txt:clues in 1572) [ClassicSimilarity], result of:
            0.093315385 = score(doc=1572,freq=1.0), product of:
              0.18322992 = queryWeight, product of:
                1.1901624 = boost
                8.148484 = idf(docFreq=33, maxDocs=43254)
                0.01889354 = queryNorm
              0.50928026 = fieldWeight in 1572, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.148484 = idf(docFreq=33, maxDocs=43254)
                0.0625 = fieldNorm(doc=1572)
          0.024059286 = weight(abstract_txt:documents in 1572) [ClassicSimilarity], result of:
            0.024059286 = score(doc=1572,freq=1.0), product of:
              0.09351747 = queryWeight, product of:
                1.2024566 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.01889354 = queryNorm
              0.25727051 = fieldWeight in 1572, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0625 = fieldNorm(doc=1572)
          0.0577786 = weight(abstract_txt:questions in 1572) [ClassicSimilarity], result of:
            0.0577786 = score(doc=1572,freq=2.0), product of:
              0.13310844 = queryWeight, product of:
                1.4345834 = boost
                4.91096 = idf(docFreq=865, maxDocs=43254)
                0.01889354 = queryNorm
              0.43407166 = fieldWeight in 1572, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.91096 = idf(docFreq=865, maxDocs=43254)
                0.0625 = fieldNorm(doc=1572)
          0.068706684 = weight(abstract_txt:linguistic in 1572) [ClassicSimilarity], result of:
            0.068706684 = score(doc=1572,freq=1.0), product of:
              0.18823639 = queryWeight, product of:
                1.7059834 = boost
                5.8400345 = idf(docFreq=341, maxDocs=43254)
                0.01889354 = queryNorm
              0.36500216 = fieldWeight in 1572, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8400345 = idf(docFreq=341, maxDocs=43254)
                0.0625 = fieldNorm(doc=1572)
          0.107475415 = weight(abstract_txt:answer in 1572) [ClassicSimilarity], result of:
            0.107475415 = score(doc=1572,freq=1.0), product of:
              0.29036266 = queryWeight, product of:
                2.59501 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.01889354 = queryNorm
              0.370142 = fieldWeight in 1572, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.0625 = fieldNorm(doc=1572)
          0.22126612 = weight(abstract_txt:answering in 1572) [ClassicSimilarity], result of:
            0.22126612 = score(doc=1572,freq=2.0), product of:
              0.3729649 = queryWeight, product of:
                2.9410515 = boost
                6.7120004 = idf(docFreq=142, maxDocs=43254)
                0.01889354 = queryNorm
              0.5932626 = fieldWeight in 1572, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7120004 = idf(docFreq=142, maxDocs=43254)
                0.0625 = fieldNorm(doc=1572)
          0.22057426 = weight(abstract_txt:question in 1572) [ClassicSimilarity], result of:
            0.22057426 = score(doc=1572,freq=5.0), product of:
              0.30182886 = queryWeight, product of:
                3.0550506 = boost
                5.229125 = idf(docFreq=629, maxDocs=43254)
                0.01889354 = queryNorm
              0.73079246 = fieldWeight in 1572, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.229125 = idf(docFreq=629, maxDocs=43254)
                0.0625 = fieldNorm(doc=1572)
        0.32 = coord(8/25)
    
  2. Lin, J.; Katz, B.: Building a reusable test collection for question answering (2006) 0.19
    0.18618149 = sum of:
      0.18618149 = product of:
        0.9309074 = sum of:
          0.052089885 = weight(abstract_txt:documents in 46) [ClassicSimilarity], result of:
            0.052089885 = score(doc=46,freq=3.0), product of:
              0.09351747 = queryWeight, product of:
                1.2024566 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.01889354 = queryNorm
              0.557007 = fieldWeight in 46, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.078125 = fieldNorm(doc=46)
          0.05106955 = weight(abstract_txt:questions in 46) [ClassicSimilarity], result of:
            0.05106955 = score(doc=46,freq=1.0), product of:
              0.13310844 = queryWeight, product of:
                1.4345834 = boost
                4.91096 = idf(docFreq=865, maxDocs=43254)
                0.01889354 = queryNorm
              0.38366878 = fieldWeight in 46, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.91096 = idf(docFreq=865, maxDocs=43254)
                0.078125 = fieldNorm(doc=46)
          0.18999149 = weight(abstract_txt:answer in 46) [ClassicSimilarity], result of:
            0.18999149 = score(doc=46,freq=2.0), product of:
              0.29036266 = queryWeight, product of:
                2.59501 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.01889354 = queryNorm
              0.6543248 = fieldWeight in 46, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.078125 = fieldNorm(doc=46)
          0.39114696 = weight(abstract_txt:answering in 46) [ClassicSimilarity], result of:
            0.39114696 = score(doc=46,freq=4.0), product of:
              0.3729649 = queryWeight, product of:
                2.9410515 = boost
                6.7120004 = idf(docFreq=142, maxDocs=43254)
                0.01889354 = queryNorm
              1.04875 = fieldWeight in 46, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.7120004 = idf(docFreq=142, maxDocs=43254)
                0.078125 = fieldNorm(doc=46)
          0.24660952 = weight(abstract_txt:question in 46) [ClassicSimilarity], result of:
            0.24660952 = score(doc=46,freq=4.0), product of:
              0.30182886 = queryWeight, product of:
                3.0550506 = boost
                5.229125 = idf(docFreq=629, maxDocs=43254)
                0.01889354 = queryNorm
              0.8170508 = fieldWeight in 46, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.229125 = idf(docFreq=629, maxDocs=43254)
                0.078125 = fieldNorm(doc=46)
        0.2 = coord(5/25)
    
  3. Saint-Dizier, P.; Moens, M.-F.: Knowledge and reasoning for question answering : research perspectives (2011) 0.18
    0.18243003 = sum of:
      0.18243003 = product of:
        1.1401877 = sum of:
          0.11196017 = weight(abstract_txt:factual in 4211) [ClassicSimilarity], result of:
            0.11196017 = score(doc=4211,freq=1.0), product of:
              0.15788557 = queryWeight, product of:
                1.1047888 = boost
                7.563971 = idf(docFreq=60, maxDocs=43254)
                0.01889354 = queryNorm
              0.7091223 = fieldWeight in 4211, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.563971 = idf(docFreq=60, maxDocs=43254)
                0.09375 = fieldNorm(doc=4211)
          0.22798978 = weight(abstract_txt:answer in 4211) [ClassicSimilarity], result of:
            0.22798978 = score(doc=4211,freq=2.0), product of:
              0.29036266 = queryWeight, product of:
                2.59501 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.01889354 = queryNorm
              0.78518975 = fieldWeight in 4211, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.09375 = fieldNorm(doc=4211)
          0.46937636 = weight(abstract_txt:answering in 4211) [ClassicSimilarity], result of:
            0.46937636 = score(doc=4211,freq=4.0), product of:
              0.3729649 = queryWeight, product of:
                2.9410515 = boost
                6.7120004 = idf(docFreq=142, maxDocs=43254)
                0.01889354 = queryNorm
              1.2585001 = fieldWeight in 4211, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.7120004 = idf(docFreq=142, maxDocs=43254)
                0.09375 = fieldNorm(doc=4211)
          0.3308614 = weight(abstract_txt:question in 4211) [ClassicSimilarity], result of:
            0.3308614 = score(doc=4211,freq=5.0), product of:
              0.30182886 = queryWeight, product of:
                3.0550506 = boost
                5.229125 = idf(docFreq=629, maxDocs=43254)
                0.01889354 = queryNorm
              1.0961887 = fieldWeight in 4211, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.229125 = idf(docFreq=629, maxDocs=43254)
                0.09375 = fieldNorm(doc=4211)
        0.16 = coord(4/25)
    
  4. Liu, Z.; Jansen, B.J.: ASK: A taxonomy of accuracy, social, and knowledge information seeking posts in social question and answering (2017) 0.18
    0.18057887 = sum of:
      0.18057887 = product of:
        0.9028944 = sum of:
          0.08171128 = weight(abstract_txt:questions in 4810) [ClassicSimilarity], result of:
            0.08171128 = score(doc=4810,freq=4.0), product of:
              0.13310844 = queryWeight, product of:
                1.4345834 = boost
                4.91096 = idf(docFreq=865, maxDocs=43254)
                0.01889354 = queryNorm
              0.61387 = fieldWeight in 4810, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.91096 = idf(docFreq=865, maxDocs=43254)
                0.0625 = fieldNorm(doc=4810)
          0.107475415 = weight(abstract_txt:answer in 4810) [ClassicSimilarity], result of:
            0.107475415 = score(doc=4810,freq=1.0), product of:
              0.29036266 = queryWeight, product of:
                2.59501 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.01889354 = queryNorm
              0.370142 = fieldWeight in 4810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.0625 = fieldNorm(doc=4810)
          0.2035025 = weight(abstract_txt:syntactic in 4810) [ClassicSimilarity], result of:
            0.2035025 = score(doc=4810,freq=2.0), product of:
              0.35272628 = queryWeight, product of:
                2.8601418 = boost
                6.5273504 = idf(docFreq=171, maxDocs=43254)
                0.01889354 = queryNorm
              0.5769417 = fieldWeight in 4810, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5273504 = idf(docFreq=171, maxDocs=43254)
                0.0625 = fieldNorm(doc=4810)
          0.31291756 = weight(abstract_txt:answering in 4810) [ClassicSimilarity], result of:
            0.31291756 = score(doc=4810,freq=4.0), product of:
              0.3729649 = queryWeight, product of:
                2.9410515 = boost
                6.7120004 = idf(docFreq=142, maxDocs=43254)
                0.01889354 = queryNorm
              0.83900005 = fieldWeight in 4810, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.7120004 = idf(docFreq=142, maxDocs=43254)
                0.0625 = fieldNorm(doc=4810)
          0.1972876 = weight(abstract_txt:question in 4810) [ClassicSimilarity], result of:
            0.1972876 = score(doc=4810,freq=4.0), product of:
              0.30182886 = queryWeight, product of:
                3.0550506 = boost
                5.229125 = idf(docFreq=629, maxDocs=43254)
                0.01889354 = queryNorm
              0.6536406 = fieldWeight in 4810, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.229125 = idf(docFreq=629, maxDocs=43254)
                0.0625 = fieldNorm(doc=4810)
        0.2 = coord(5/25)
    
  5. Moreda, P.; Llorens, H.; Saquete, E.; Palomar, M.: Combining semantic information in question answering systems (2011) 0.16
    0.16382818 = sum of:
      0.16382818 = product of:
        0.6826174 = sum of:
          0.055352155 = weight(abstract_txt:named in 4214) [ClassicSimilarity], result of:
            0.055352155 = score(doc=4214,freq=1.0), product of:
              0.1293552 = queryWeight, product of:
                6.8465314 = idf(docFreq=124, maxDocs=43254)
                0.01889354 = queryNorm
              0.4279082 = fieldWeight in 4214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8465314 = idf(docFreq=124, maxDocs=43254)
                0.0625 = fieldNorm(doc=4214)
          0.06400622 = weight(abstract_txt:performs in 4214) [ClassicSimilarity], result of:
            0.06400622 = score(doc=4214,freq=1.0), product of:
              0.14250901 = queryWeight, product of:
                1.049613 = boost
                7.1862087 = idf(docFreq=88, maxDocs=43254)
                0.01889354 = queryNorm
              0.44913805 = fieldWeight in 4214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1862087 = idf(docFreq=88, maxDocs=43254)
                0.0625 = fieldNorm(doc=4214)
          0.091355994 = weight(abstract_txt:questions in 4214) [ClassicSimilarity], result of:
            0.091355994 = score(doc=4214,freq=5.0), product of:
              0.13310844 = queryWeight, product of:
                1.4345834 = boost
                4.91096 = idf(docFreq=865, maxDocs=43254)
                0.01889354 = queryNorm
              0.6863276 = fieldWeight in 4214, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.91096 = idf(docFreq=865, maxDocs=43254)
                0.0625 = fieldNorm(doc=4214)
          0.1519932 = weight(abstract_txt:answer in 4214) [ClassicSimilarity], result of:
            0.1519932 = score(doc=4214,freq=2.0), product of:
              0.29036266 = queryWeight, product of:
                2.59501 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.01889354 = queryNorm
              0.52345985 = fieldWeight in 4214, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.0625 = fieldNorm(doc=4214)
          0.22126612 = weight(abstract_txt:answering in 4214) [ClassicSimilarity], result of:
            0.22126612 = score(doc=4214,freq=2.0), product of:
              0.3729649 = queryWeight, product of:
                2.9410515 = boost
                6.7120004 = idf(docFreq=142, maxDocs=43254)
                0.01889354 = queryNorm
              0.5932626 = fieldWeight in 4214, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7120004 = idf(docFreq=142, maxDocs=43254)
                0.0625 = fieldNorm(doc=4214)
          0.0986438 = weight(abstract_txt:question in 4214) [ClassicSimilarity], result of:
            0.0986438 = score(doc=4214,freq=1.0), product of:
              0.30182886 = queryWeight, product of:
                3.0550506 = boost
                5.229125 = idf(docFreq=629, maxDocs=43254)
                0.01889354 = queryNorm
              0.3268203 = fieldWeight in 4214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.229125 = idf(docFreq=629, maxDocs=43254)
                0.0625 = fieldNorm(doc=4214)
        0.24 = coord(6/25)