Document (#35029)

Author
Liu, W.
Weichselbraun, A.
Scharl, A.
Chang, E.
Title
Semi-automatic ontology extension using spreading activation
Source
Journal of universal knowledge management. 0(2005) no.1, S.50-58
Year
2005
Abstract
This paper describes a system to semi-automatically extend and refine ontologies by mining textual data from the Web sites of international online media. Expanding a seed ontology creates a semantic network through co-occurrence analysis, trigger phrase analysis, and disambiguation based on the WordNet lexical dictionary. Spreading activation then processes this semantic network to find the most probable candidates for inclusion in an extended ontology. Approaches to identifying hierarchical relationships such as subsumption, head noun analysis and WordNet consultation are used to confirm and classify the found relationships. Using a seed ontology on "climate change" as an example, this paper demonstrates how spreading activation improves the result by naturally integrating the mentioned methods.
Theme
Data Mining

Similar documents (author)

  1. Chang, R.: DBase, relational data models, and MARC records (1992) 4.78
    4.7836475 = sum of:
      4.7836475 = weight(author_txt:chang in 5057) [ClassicSimilarity], result of:
        4.7836475 = fieldWeight in 5057, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.653836 = idf(docFreq=56, maxDocs=44218)
          0.625 = fieldNorm(doc=5057)
    
  2. Chang, R.: ¬The development of indexing technology (1993) 4.78
    4.7836475 = sum of:
      4.7836475 = weight(author_txt:chang in 7024) [ClassicSimilarity], result of:
        4.7836475 = fieldWeight in 7024, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.653836 = idf(docFreq=56, maxDocs=44218)
          0.625 = fieldNorm(doc=7024)
    
  3. Chang, R.: Keyword searching and indexing (1993) 4.78
    4.7836475 = sum of:
      4.7836475 = weight(author_txt:chang in 7223) [ClassicSimilarity], result of:
        4.7836475 = fieldWeight in 7223, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.653836 = idf(docFreq=56, maxDocs=44218)
          0.625 = fieldNorm(doc=7223)
    
  4. Chang, R.H.: To classify or not to classify? : a new look at an old problem (1989) 4.78
    4.7836475 = sum of:
      4.7836475 = weight(author_txt:chang in 2510) [ClassicSimilarity], result of:
        4.7836475 = fieldWeight in 2510, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.653836 = idf(docFreq=56, maxDocs=44218)
          0.625 = fieldNorm(doc=2510)
    
  5. Chang, S.H.: ¬The current state of Web search engines (1999) 4.78
    4.7836475 = sum of:
      4.7836475 = weight(author_txt:chang in 509) [ClassicSimilarity], result of:
        4.7836475 = fieldWeight in 509, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.653836 = idf(docFreq=56, maxDocs=44218)
          0.625 = fieldNorm(doc=509)
    

Similar documents (content)

  1. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.14
    0.13837199 = sum of:
      0.13837199 = product of:
        0.57654995 = sum of:
          0.0048072794 = weight(abstract_txt:this in 175) [ClassicSimilarity], result of:
            0.0048072794 = score(doc=175,freq=1.0), product of:
              0.0364293 = queryWeight, product of:
                1.0207757 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.014789723 = queryNorm
              0.1319619 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.045687065 = weight(abstract_txt:semantic in 175) [ClassicSimilarity], result of:
            0.045687065 = score(doc=175,freq=5.0), product of:
              0.08350125 = queryWeight, product of:
                1.2618443 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.014789723 = queryNorm
              0.54714227 = fieldWeight in 175, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.02254615 = weight(abstract_txt:network in 175) [ClassicSimilarity], result of:
            0.02254615 = score(doc=175,freq=1.0), product of:
              0.08916664 = queryWeight, product of:
                1.3039486 = boost
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.014789723 = queryNorm
              0.25285408 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.03583804 = weight(abstract_txt:relationships in 175) [ClassicSimilarity], result of:
            0.03583804 = score(doc=175,freq=2.0), product of:
              0.09639186 = queryWeight, product of:
                1.3557495 = boost
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.014789723 = queryNorm
              0.37179533 = fieldWeight in 175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.22940692 = weight(abstract_txt:activation in 175) [ClassicSimilarity], result of:
            0.22940692 = score(doc=175,freq=1.0), product of:
              0.47927958 = queryWeight, product of:
                3.7025366 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.014789723 = queryNorm
              0.4786495 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
          0.2382645 = weight(abstract_txt:spreading in 175) [ClassicSimilarity], result of:
            0.2382645 = score(doc=175,freq=1.0), product of:
              0.49153844 = queryWeight, product of:
                3.7495887 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.014789723 = queryNorm
              0.48473218 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0546875 = fieldNorm(doc=175)
        0.24 = coord(6/25)
    
  2. Na, J.-C.; Neoh, H.L.: Effectiveness of UMLS semantic network as a seed ontology for building a medical domain ontology (2008) 0.13
    0.12843452 = sum of:
      0.12843452 = product of:
        0.6421726 = sum of:
          0.009515945 = weight(abstract_txt:this in 1910) [ClassicSimilarity], result of:
            0.009515945 = score(doc=1910,freq=3.0), product of:
              0.0364293 = queryWeight, product of:
                1.0207757 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.014789723 = queryNorm
              0.2612168 = fieldWeight in 1910, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=1910)
          0.07005214 = weight(abstract_txt:semantic in 1910) [ClassicSimilarity], result of:
            0.07005214 = score(doc=1910,freq=9.0), product of:
              0.08350125 = queryWeight, product of:
                1.2618443 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.014789723 = queryNorm
              0.83893526 = fieldWeight in 1910, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=1910)
          0.06311607 = weight(abstract_txt:network in 1910) [ClassicSimilarity], result of:
            0.06311607 = score(doc=1910,freq=6.0), product of:
              0.08916664 = queryWeight, product of:
                1.3039486 = boost
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.014789723 = queryNorm
              0.707844 = fieldWeight in 1910, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.0625 = fieldNorm(doc=1910)
          0.28334305 = weight(abstract_txt:seed in 1910) [ClassicSimilarity], result of:
            0.28334305 = score(doc=1910,freq=3.0), product of:
              0.30572256 = queryWeight, product of:
                2.4144766 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.014789723 = queryNorm
              0.9267979 = fieldWeight in 1910, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.0625 = fieldNorm(doc=1910)
          0.21614534 = weight(abstract_txt:ontology in 1910) [ClassicSimilarity], result of:
            0.21614534 = score(doc=1910,freq=6.0), product of:
              0.25524056 = queryWeight, product of:
                3.1199605 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.014789723 = queryNorm
              0.8468299 = fieldWeight in 1910, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=1910)
        0.2 = coord(5/25)
    
  3. Kulyukin, V.A.; Settle, A.: Ranked retrieval with semantic networks and vector spaces (2001) 0.12
    0.11917595 = sum of:
      0.11917595 = product of:
        0.99313295 = sum of:
          0.05779007 = weight(abstract_txt:semantic in 6934) [ClassicSimilarity], result of:
            0.05779007 = score(doc=6934,freq=2.0), product of:
              0.08350125 = queryWeight, product of:
                1.2618443 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.014789723 = queryNorm
              0.6920863 = fieldWeight in 6934, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.109375 = fieldNorm(doc=6934)
          0.45881385 = weight(abstract_txt:activation in 6934) [ClassicSimilarity], result of:
            0.45881385 = score(doc=6934,freq=1.0), product of:
              0.47927958 = queryWeight, product of:
                3.7025366 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.014789723 = queryNorm
              0.957299 = fieldWeight in 6934, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.109375 = fieldNorm(doc=6934)
          0.476529 = weight(abstract_txt:spreading in 6934) [ClassicSimilarity], result of:
            0.476529 = score(doc=6934,freq=1.0), product of:
              0.49153844 = queryWeight, product of:
                3.7495887 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.014789723 = queryNorm
              0.96946436 = fieldWeight in 6934, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.109375 = fieldNorm(doc=6934)
        0.12 = coord(3/25)
    
  4. Chen, H.; Ng, T.: ¬An algorithmic approach to concept exploration in a large knowledge network (automatic thesaurus consultation) : symbolic branch-and-bound search versus connectionist Hopfield Net Activation (1995) 0.12
    0.118854485 = sum of:
      0.118854485 = product of:
        0.7428405 = sum of:
          0.029188393 = weight(abstract_txt:semantic in 2203) [ClassicSimilarity], result of:
            0.029188393 = score(doc=2203,freq=1.0), product of:
              0.08350125 = queryWeight, product of:
                1.2618443 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.014789723 = queryNorm
              0.34955636 = fieldWeight in 2203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=2203)
          0.0455501 = weight(abstract_txt:network in 2203) [ClassicSimilarity], result of:
            0.0455501 = score(doc=2203,freq=2.0), product of:
              0.08916664 = queryWeight, product of:
                1.3039486 = boost
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.014789723 = queryNorm
              0.5108424 = fieldWeight in 2203, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.078125 = fieldNorm(doc=2203)
          0.3277242 = weight(abstract_txt:activation in 2203) [ClassicSimilarity], result of:
            0.3277242 = score(doc=2203,freq=1.0), product of:
              0.47927958 = queryWeight, product of:
                3.7025366 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.014789723 = queryNorm
              0.683785 = fieldWeight in 2203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.078125 = fieldNorm(doc=2203)
          0.34037787 = weight(abstract_txt:spreading in 2203) [ClassicSimilarity], result of:
            0.34037787 = score(doc=2203,freq=1.0), product of:
              0.49153844 = queryWeight, product of:
                3.7495887 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.014789723 = queryNorm
              0.69247454 = fieldWeight in 2203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.078125 = fieldNorm(doc=2203)
        0.16 = coord(4/25)
    
  5. Vlachidis, A.; Tudhope, D.: ¬A knowledge-based approach to information extraction for semantic interoperability in the archaeology domain (2016) 0.10
    0.104743116 = sum of:
      0.104743116 = product of:
        0.37408257 = sum of:
          0.046686143 = weight(abstract_txt:phrase in 2895) [ClassicSimilarity], result of:
            0.046686143 = score(doc=2895,freq=1.0), product of:
              0.10518202 = queryWeight, product of:
                1.0014172 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.014789723 = queryNorm
              0.44386047 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.07289932 = weight(abstract_txt:disambiguation in 2895) [ClassicSimilarity], result of:
            0.07289932 = score(doc=2895,freq=2.0), product of:
              0.112362616 = queryWeight, product of:
                1.0350354 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.014789723 = queryNorm
              0.64878625 = fieldWeight in 2895, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.06102718 = weight(abstract_txt:noun in 2895) [ClassicSimilarity], result of:
            0.06102718 = score(doc=2895,freq=1.0), product of:
              0.12574722 = queryWeight, product of:
                1.0949479 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.014789723 = queryNorm
              0.48531634 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.057197336 = weight(abstract_txt:semantic in 2895) [ClassicSimilarity], result of:
            0.057197336 = score(doc=2895,freq=6.0), product of:
              0.08350125 = queryWeight, product of:
                1.2618443 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.014789723 = queryNorm
              0.6849878 = fieldWeight in 2895, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.028961511 = weight(abstract_txt:relationships in 2895) [ClassicSimilarity], result of:
            0.028961511 = score(doc=2895,freq=1.0), product of:
              0.09639186 = queryWeight, product of:
                1.3557495 = boost
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.014789723 = queryNorm
              0.300456 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.019070115 = weight(abstract_txt:analysis in 2895) [ClassicSimilarity], result of:
            0.019070115 = score(doc=2895,freq=1.0), product of:
              0.08351391 = queryWeight, product of:
                1.5455544 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.014789723 = queryNorm
              0.22834657 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.08824096 = weight(abstract_txt:ontology in 2895) [ClassicSimilarity], result of:
            0.08824096 = score(doc=2895,freq=1.0), product of:
              0.25524056 = queryWeight, product of:
                3.1199605 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.014789723 = queryNorm
              0.34571683 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
        0.28 = coord(7/25)