Document (#36243)

Author
Girju, R.
Beamer, B.
Rozovskaya, A.
Fister, A.
Bhat, S.
Title
A knowledge-rich approach to identifying semantic relations between nominals
Source
Information processing and management. 46(2010) no.5, pp.589-610
Year
2010
Abstract
This paper describes a state-of-the-art supervised, knowledge-intensive approach to the automatic identification of semantic relations between nominals in English sentences. The system combines rich and varied sets of new and previously used lexical, syntactic, and semantic features extracted from knowledge sources such as WordNet and additional annotated corpora. The system ranked first in the third most popular SemEval 2007 task, Classification of Semantic Relations between Nominals, achieving an F-measure of 72.4% and an accuracy of 76.3%. We also show that some semantic relations are better suited to WordNet-based models than others. Additionally, we distinguish between out-of-context (regular) examples and those that require sentence context for relation identification, and show that contextual data are important for the performance of a noun-noun semantic parser. Finally, learning curves show that the task difficulty varies across relations and that our learned WordNet-based representation is highly accurate, so the performance results suggest an upper bound on what this representation can do.
Theme
Knowledge representation

Similar documents (content)

  1. Leroy, G.; Chen, H.: Genescene: an ontology-enhanced integration of linguistic and co-occurrence based relations in biomedical texts (2005) 0.36
    0.3587996 = sum of:
      0.3587996 = product of:
        0.99666554 = sum of:
          0.1565846 = weight(abstract_txt:parser in 5259) [ClassicSimilarity], result of:
            0.1565846 = score(doc=5259,freq=3.0), product of:
              0.17436364 = queryWeight, product of:
                1.1794627 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.017820474 = queryNorm
              0.89803475 = fieldWeight in 5259, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.0625 = fieldNorm(doc=5259)
          0.021303963 = weight(abstract_txt:knowledge in 5259) [ClassicSimilarity], result of:
            0.021303963 = score(doc=5259,freq=1.0), product of:
              0.09594249 = queryWeight, product of:
                1.5153828 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.017820474 = queryNorm
              0.2220493 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0625 = fieldNorm(doc=5259)
          0.018243914 = weight(abstract_txt:that in 5259) [ClassicSimilarity], result of:
            0.018243914 = score(doc=5259,freq=3.0), product of:
              0.071125485 = queryWeight, product of:
                1.6844335 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017820474 = queryNorm
              0.2565032 = fieldWeight in 5259, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=5259)
          0.071314365 = weight(abstract_txt:rich in 5259) [ClassicSimilarity], result of:
            0.071314365 = score(doc=5259,freq=1.0), product of:
              0.18755342 = queryWeight, product of:
                1.7299507 = boost
                6.0837593 = idf(docFreq=273, maxDocs=44218)
                0.017820474 = queryNorm
              0.38023496 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0837593 = idf(docFreq=273, maxDocs=44218)
                0.0625 = fieldNorm(doc=5259)
          0.040631212 = weight(abstract_txt:show in 5259) [ClassicSimilarity], result of:
            0.040631212 = score(doc=5259,freq=1.0), product of:
              0.1475516 = queryWeight, product of:
                1.8792684 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.017820474 = queryNorm
              0.27536952 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.0625 = fieldNorm(doc=5259)
          0.026314367 = weight(abstract_txt:between in 5259) [ClassicSimilarity], result of:
            0.026314367 = score(doc=5259,freq=1.0), product of:
              0.121566035 = queryWeight, product of:
                1.9696649 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.017820474 = queryNorm
              0.21646151 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=5259)
          0.14828447 = weight(abstract_txt:noun in 5259) [ClassicSimilarity], result of:
            0.14828447 = score(doc=5259,freq=1.0), product of:
              0.30554187 = queryWeight, product of:
                2.2080383 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.017820474 = queryNorm
              0.48531634 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=5259)
          0.0851067 = weight(abstract_txt:semantic in 5259) [ClassicSimilarity], result of:
            0.0851067 = score(doc=5259,freq=1.0), product of:
              0.30433828 = queryWeight, product of:
                3.8168943 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.017820474 = queryNorm
              0.2796451 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=5259)
          0.42888194 = weight(abstract_txt:relations in 5259) [ClassicSimilarity], result of:
            0.42888194 = score(doc=5259,freq=7.0), product of:
              0.46763453 = queryWeight, product of:
                4.731351 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.017820474 = queryNorm
              0.9171306 = fieldWeight in 5259, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.0625 = fieldNorm(doc=5259)
        0.36 = coord(9/25)
    
  2. Jouis, C.: System of types + inter-concept relations properties : towards validation of constructed terminologies (1998) 0.29
    0.2860933 = sum of:
      0.2860933 = product of:
        0.89404154 = sum of:
          0.046898644 = weight(abstract_txt:task in 56) [ClassicSimilarity], result of:
            0.046898644 = score(doc=56,freq=1.0), product of:
              0.122228876 = queryWeight, product of:
                1.3965553 = boost
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.017820474 = queryNorm
              0.3836953 = fieldWeight in 56, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.078125 = fieldNorm(doc=56)
          0.06692603 = weight(abstract_txt:representation in 56) [ClassicSimilarity], result of:
            0.06692603 = score(doc=56,freq=2.0), product of:
              0.122966565 = queryWeight, product of:
                1.4007633 = boost
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.017820474 = queryNorm
              0.54426205 = fieldWeight in 56, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.078125 = fieldNorm(doc=56)
          0.046124432 = weight(abstract_txt:knowledge in 56) [ClassicSimilarity], result of:
            0.046124432 = score(doc=56,freq=3.0), product of:
              0.09594249 = queryWeight, product of:
                1.5153828 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.017820474 = queryNorm
              0.48075083 = fieldWeight in 56, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.078125 = fieldNorm(doc=56)
          0.079041325 = weight(abstract_txt:identification in 56) [ClassicSimilarity], result of:
            0.079041325 = score(doc=56,freq=1.0), product of:
              0.17310241 = queryWeight, product of:
                1.6619685 = boost
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.017820474 = queryNorm
              0.45661598 = fieldWeight in 56, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.078125 = fieldNorm(doc=56)
          0.018620117 = weight(abstract_txt:that in 56) [ClassicSimilarity], result of:
            0.018620117 = score(doc=56,freq=2.0), product of:
              0.071125485 = queryWeight, product of:
                1.6844335 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017820474 = queryNorm
              0.26179248 = fieldWeight in 56, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=56)
          0.03289296 = weight(abstract_txt:between in 56) [ClassicSimilarity], result of:
            0.03289296 = score(doc=56,freq=1.0), product of:
              0.121566035 = queryWeight, product of:
                1.9696649 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.017820474 = queryNorm
              0.2705769 = fieldWeight in 56, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.078125 = fieldNorm(doc=56)
          0.15044881 = weight(abstract_txt:semantic in 56) [ClassicSimilarity], result of:
            0.15044881 = score(doc=56,freq=2.0), product of:
              0.30433828 = queryWeight, product of:
                3.8168943 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.017820474 = queryNorm
              0.49434733 = fieldWeight in 56, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=56)
          0.45308927 = weight(abstract_txt:relations in 56) [ClassicSimilarity], result of:
            0.45308927 = score(doc=56,freq=5.0), product of:
              0.46763453 = queryWeight, product of:
                4.731351 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.017820474 = queryNorm
              0.9688961 = fieldWeight in 56, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.078125 = fieldNorm(doc=56)
        0.32 = coord(8/25)
    
  3. Blanco, E.; Cankaya, H.C.; Moldovan, D.: Composition of semantic relations : model and applications (2010) 0.25
    0.2522923 = sum of:
      0.2522923 = product of:
        1.2614615 = sum of:
          0.1582073 = weight(abstract_txt:parser in 4761) [ClassicSimilarity], result of:
            0.1582073 = score(doc=4761,freq=1.0), product of:
              0.17436364 = queryWeight, product of:
                1.1794627 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.017820474 = queryNorm
              0.90734106 = fieldWeight in 4761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.109375 = fieldNorm(doc=4761)
          0.018432977 = weight(abstract_txt:that in 4761) [ClassicSimilarity], result of:
            0.018432977 = score(doc=4761,freq=1.0), product of:
              0.071125485 = queryWeight, product of:
                1.6844335 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017820474 = queryNorm
              0.25916135 = fieldWeight in 4761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.109375 = fieldNorm(doc=4761)
          0.25949782 = weight(abstract_txt:noun in 4761) [ClassicSimilarity], result of:
            0.25949782 = score(doc=4761,freq=1.0), product of:
              0.30554187 = queryWeight, product of:
                2.2080383 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.017820474 = queryNorm
              0.8493036 = fieldWeight in 4761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.109375 = fieldNorm(doc=4761)
          0.25796598 = weight(abstract_txt:semantic in 4761) [ClassicSimilarity], result of:
            0.25796598 = score(doc=4761,freq=3.0), product of:
              0.30433828 = queryWeight, product of:
                3.8168943 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.017820474 = queryNorm
              0.8476291 = fieldWeight in 4761, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.109375 = fieldNorm(doc=4761)
          0.5673575 = weight(abstract_txt:relations in 4761) [ClassicSimilarity], result of:
            0.5673575 = score(doc=4761,freq=4.0), product of:
              0.46763453 = queryWeight, product of:
                4.731351 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.017820474 = queryNorm
              1.2132498 = fieldWeight in 4761, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.109375 = fieldNorm(doc=4761)
        0.2 = coord(5/25)
    
  4. Khoo, S.G.; Na, J.-C.: Semantic relations in information science (2006) 0.23
    0.22980222 = sum of:
      0.22980222 = product of:
        0.71813196 = sum of:
          0.029935898 = weight(abstract_txt:regular in 1978) [ClassicSimilarity], result of:
            0.029935898 = score(doc=1978,freq=1.0), product of:
              0.13247843 = queryWeight, product of:
                1.0280845 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.017820474 = queryNorm
              0.2259681 = fieldWeight in 1978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.03125 = fieldNorm(doc=1978)
          0.018929541 = weight(abstract_txt:representation in 1978) [ClassicSimilarity], result of:
            0.018929541 = score(doc=1978,freq=1.0), product of:
              0.122966565 = queryWeight, product of:
                1.4007633 = boost
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.017820474 = queryNorm
              0.15394056 = fieldWeight in 1978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.03125 = fieldNorm(doc=1978)
          0.028182494 = weight(abstract_txt:knowledge in 1978) [ClassicSimilarity], result of:
            0.028182494 = score(doc=1978,freq=7.0), product of:
              0.09594249 = queryWeight, product of:
                1.5153828 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.017820474 = queryNorm
              0.2937436 = fieldWeight in 1978, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.03125 = fieldNorm(doc=1978)
          0.03161653 = weight(abstract_txt:identification in 1978) [ClassicSimilarity], result of:
            0.03161653 = score(doc=1978,freq=1.0), product of:
              0.17310241 = queryWeight, product of:
                1.6619685 = boost
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.017820474 = queryNorm
              0.1826464 = fieldWeight in 1978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.03125 = fieldNorm(doc=1978)
          0.019705681 = weight(abstract_txt:that in 1978) [ClassicSimilarity], result of:
            0.019705681 = score(doc=1978,freq=14.0), product of:
              0.071125485 = queryWeight, product of:
                1.6844335 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017820474 = queryNorm
              0.27705514 = fieldWeight in 1978, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.03125 = fieldNorm(doc=1978)
          0.026314367 = weight(abstract_txt:between in 1978) [ClassicSimilarity], result of:
            0.026314367 = score(doc=1978,freq=4.0), product of:
              0.121566035 = queryWeight, product of:
                1.9696649 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.017820474 = queryNorm
              0.21646151 = fieldWeight in 1978, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.03125 = fieldNorm(doc=1978)
          0.13456552 = weight(abstract_txt:semantic in 1978) [ClassicSimilarity], result of:
            0.13456552 = score(doc=1978,freq=10.0), product of:
              0.30433828 = queryWeight, product of:
                3.8168943 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.017820474 = queryNorm
              0.44215772 = fieldWeight in 1978, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.03125 = fieldNorm(doc=1978)
          0.42888194 = weight(abstract_txt:relations in 1978) [ClassicSimilarity], result of:
            0.42888194 = score(doc=1978,freq=28.0), product of:
              0.46763453 = queryWeight, product of:
                4.731351 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.017820474 = queryNorm
              0.9171306 = fieldWeight in 1978, product of:
                5.2915025 = tf(freq=28.0), with freq of:
                  28.0 = termFreq=28.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.03125 = fieldNorm(doc=1978)
        0.32 = coord(8/25)
    
  5. Murphy, M.L.: Semantic relations and the lexicon : antonymy, synonymy and other paradigms (2008) 0.23
    0.22872627 = sum of:
      0.22872627 = product of:
        0.8168795 = sum of:
          0.029414097 = weight(abstract_txt:approach in 997) [ClassicSimilarity], result of:
            0.029414097 = score(doc=997,freq=2.0), product of:
              0.07108217 = queryWeight, product of:
                1.0650048 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.017820474 = queryNorm
              0.41380417 = fieldWeight in 997, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=997)
          0.047323853 = weight(abstract_txt:representation in 997) [ClassicSimilarity], result of:
            0.047323853 = score(doc=997,freq=1.0), product of:
              0.122966565 = queryWeight, product of:
                1.4007633 = boost
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.017820474 = queryNorm
              0.3848514 = fieldWeight in 997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.078125 = fieldNorm(doc=997)
          0.037660442 = weight(abstract_txt:knowledge in 997) [ClassicSimilarity], result of:
            0.037660442 = score(doc=997,freq=2.0), product of:
              0.09594249 = queryWeight, product of:
                1.5153828 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.017820474 = queryNorm
              0.39253142 = fieldWeight in 997, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.078125 = fieldNorm(doc=997)
          0.022804894 = weight(abstract_txt:that in 997) [ClassicSimilarity], result of:
            0.022804894 = score(doc=997,freq=3.0), product of:
              0.071125485 = queryWeight, product of:
                1.6844335 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017820474 = queryNorm
              0.320629 = fieldWeight in 997, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=997)
          0.03289296 = weight(abstract_txt:between in 997) [ClassicSimilarity], result of:
            0.03289296 = score(doc=997,freq=1.0), product of:
              0.121566035 = queryWeight, product of:
                1.9696649 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.017820474 = queryNorm
              0.2705769 = fieldWeight in 997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.078125 = fieldNorm(doc=997)
          0.15044881 = weight(abstract_txt:semantic in 997) [ClassicSimilarity], result of:
            0.15044881 = score(doc=997,freq=2.0), product of:
              0.30433828 = queryWeight, product of:
                3.8168943 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.017820474 = queryNorm
              0.49434733 = fieldWeight in 997, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=997)
          0.49633443 = weight(abstract_txt:relations in 997) [ClassicSimilarity], result of:
            0.49633443 = score(doc=997,freq=6.0), product of:
              0.46763453 = queryWeight, product of:
                4.731351 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.017820474 = queryNorm
              1.0613725 = fieldWeight in 997, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.078125 = fieldNorm(doc=997)
        0.28 = coord(7/25)
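
Note on the score breakdowns: the trees above follow the pattern of Lucene's ClassicSimilarity (TF-IDF) explain output. The arithmetic can be retraced with a minimal sketch, assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); both assumptions reproduce the values listed above. The function names (idf, term_score, document_score) are hypothetical and chosen only for illustration; they are not part of any library.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in each "weight(...)" node above.
    tf = math.sqrt(freq)  # assumed term-frequency factor
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, matched_terms, query_terms):
    # Final score = sum of term scores * coord(matched / total query terms).
    return sum(term_scores) * (matched_terms / query_terms)


# Term "parser" in doc 5259 (result 1), values copied from the listing.
parser_score = term_score(
    freq=3.0,
    doc_freq=29,
    max_docs=44218,
    boost=1.1794627,
    query_norm=0.017820474,
    field_norm=0.0625,
)

# Result 3 (doc 4761): five matching terms out of 25 query terms.
doc_4761_terms = [0.1582073, 0.018432977, 0.25949782, 0.25796598, 0.5673575]
doc_4761_score = document_score(doc_4761_terms, matched_terms=5, query_terms=25)

print(round(parser_score, 6), round(doc_4761_score, 6))
```

Result 3 has the largest raw sum of term scores (1.2614615) but matches only 5 of the 25 query terms, so its coord factor of 0.2 places it below results 1 and 2.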