Document (#35030)

Author
Liu, W.
Weichselbraun, A.
Scharl, A.
Chang, E.
Title
Semi-automatic ontology extension using spreading activation
Source
Journal of universal knowledge management. 0(2005) no.1, S.50-58
Year
2005
Abstract
This paper describes a system to semi-automatically extend and refine ontologies by mining textual data from the Web sites of international online media. Expanding a seed ontology creates a semantic network through co-occurrence analysis, trigger phrase analysis, and disambiguation based on the WordNet lexical dictionary. Spreading activation then processes this semantic network to find the most probable candidates for inclusion in an extended ontology. Approaches to identifying hierarchical relationships such as subsumption, head noun analysis and WordNet consultation are used to confirm and classify the found relationships. Using a seed ontology on "climate change" as an example, this paper demonstrates how spreading activation improves the result by naturally integrating the mentioned methods.
Theme
Data Mining

Similar documents (author)

  1. Chang, R.: DBase, relational data models, and MARC records (1992) 4.81
    4.8134003 = sum of:
      4.8134003 = weight(author_txt:chang in 5057) [ClassicSimilarity], result of:
        4.8134003 = fieldWeight in 5057, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.7014403 = idf(docFreq=51, maxDocs=42306)
          0.625 = fieldNorm(doc=5057)
    
  2. Chang, R.: ¬The development of indexing technology (1993) 4.81
    4.8134003 = sum of:
      4.8134003 = weight(author_txt:chang in 7024) [ClassicSimilarity], result of:
        4.8134003 = fieldWeight in 7024, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.7014403 = idf(docFreq=51, maxDocs=42306)
          0.625 = fieldNorm(doc=7024)
    
  3. Chang, R.: Keyword searching and indexing (1993) 4.81
    4.8134003 = sum of:
      4.8134003 = weight(author_txt:chang in 7223) [ClassicSimilarity], result of:
        4.8134003 = fieldWeight in 7223, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.7014403 = idf(docFreq=51, maxDocs=42306)
          0.625 = fieldNorm(doc=7223)
    
  4. Chang, R.H.: To classify or not to classify? : a new look at an old problem (1989) 4.81
    4.8134003 = sum of:
      4.8134003 = weight(author_txt:chang in 2579) [ClassicSimilarity], result of:
        4.8134003 = fieldWeight in 2579, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.7014403 = idf(docFreq=51, maxDocs=42306)
          0.625 = fieldNorm(doc=2579)
    
  5. Chang, S.H.: ¬The current state of Web search engines (1999) 4.81
    4.8134003 = sum of:
      4.8134003 = weight(author_txt:chang in 1510) [ClassicSimilarity], result of:
        4.8134003 = fieldWeight in 1510, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.7014403 = idf(docFreq=51, maxDocs=42306)
          0.625 = fieldNorm(doc=1510)
    

Similar documents (content)

  1. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.14
    0.13834617 = sum of:
      0.13834617 = product of:
        0.57644236 = sum of:
          0.005026466 = weight(abstract_txt:this in 2176) [ClassicSimilarity], result of:
            0.005026466 = score(doc=2176,freq=1.0), product of:
              0.03744781 = queryWeight, product of:
                1.0387839 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.01468767 = queryNorm
              0.1342259 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
          0.046144113 = weight(abstract_txt:semantic in 2176) [ClassicSimilarity], result of:
            0.046144113 = score(doc=2176,freq=5.0), product of:
              0.083876766 = queryWeight, product of:
                1.2693671 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.01468767 = queryNorm
              0.55014175 = fieldWeight in 2176, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
          0.022727998 = weight(abstract_txt:network in 2176) [ClassicSimilarity], result of:
            0.022727998 = score(doc=2176,freq=1.0), product of:
              0.089453004 = queryWeight, product of:
                1.3108828 = boost
                4.645989 = idf(docFreq=1103, maxDocs=42306)
                0.01468767 = queryNorm
              0.25407752 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.645989 = idf(docFreq=1103, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
          0.03612554 = weight(abstract_txt:relationships in 2176) [ClassicSimilarity], result of:
            0.03612554 = score(doc=2176,freq=2.0), product of:
              0.09669865 = queryWeight, product of:
                1.3629396 = boost
                4.830487 = idf(docFreq=917, maxDocs=42306)
                0.01468767 = queryNorm
              0.3735889 = fieldWeight in 2176, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.830487 = idf(docFreq=917, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
          0.2332091 = weight(abstract_txt:spreading in 2176) [ClassicSimilarity], result of:
            0.2332091 = score(doc=2176,freq=1.0), product of:
              0.48352054 = queryWeight, product of:
                3.732669 = boost
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.01468767 = queryNorm
              0.4823148 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
          0.2332091 = weight(abstract_txt:activation in 2176) [ClassicSimilarity], result of:
            0.2332091 = score(doc=2176,freq=1.0), product of:
              0.48352054 = queryWeight, product of:
                3.732669 = boost
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.01468767 = queryNorm
              0.4823148 = fieldWeight in 2176, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.0546875 = fieldNorm(doc=2176)
        0.24 = coord(6/25)
    
  2. Na, J.-C.; Neoh, H.L.: Effectiveness of UMLS semantic network as a seed ontology for building a medical domain ontology (2008) 0.13
    0.12866077 = sum of:
      0.12866077 = product of:
        0.6433038 = sum of:
          0.009949822 = weight(abstract_txt:this in 3911) [ClassicSimilarity], result of:
            0.009949822 = score(doc=3911,freq=3.0), product of:
              0.03744781 = queryWeight, product of:
                1.0387839 = boost
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.01468767 = queryNorm
              0.26569837 = fieldWeight in 3911, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4544165 = idf(docFreq=9879, maxDocs=42306)
                0.0625 = fieldNorm(doc=3911)
          0.07075294 = weight(abstract_txt:semantic in 3911) [ClassicSimilarity], result of:
            0.07075294 = score(doc=3911,freq=9.0), product of:
              0.083876766 = queryWeight, product of:
                1.2693671 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.01468767 = queryNorm
              0.84353447 = fieldWeight in 3911, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.0625 = fieldNorm(doc=3911)
          0.06362514 = weight(abstract_txt:network in 3911) [ClassicSimilarity], result of:
            0.06362514 = score(doc=3911,freq=6.0), product of:
              0.089453004 = queryWeight, product of:
                1.3108828 = boost
                4.645989 = idf(docFreq=1103, maxDocs=42306)
                0.01468767 = queryNorm
              0.7112689 = fieldWeight in 3911, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.645989 = idf(docFreq=1103, maxDocs=42306)
                0.0625 = fieldNorm(doc=3911)
          0.27718404 = weight(abstract_txt:seed in 3911) [ClassicSimilarity], result of:
            0.27718404 = score(doc=3911,freq=3.0), product of:
              0.30062926 = queryWeight, product of:
                2.4031563 = boost
                8.51719 = idf(docFreq=22, maxDocs=42306)
                0.01468767 = queryNorm
              0.92201287 = fieldWeight in 3911, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.51719 = idf(docFreq=22, maxDocs=42306)
                0.0625 = fieldNorm(doc=3911)
          0.22179186 = weight(abstract_txt:ontology in 3911) [ClassicSimilarity], result of:
            0.22179186 = score(doc=3911,freq=6.0), product of:
              0.2591092 = queryWeight, product of:
                3.1551704 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.01468767 = queryNorm
              0.85597837 = fieldWeight in 3911, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.0625 = fieldNorm(doc=3911)
        0.2 = coord(5/25)
    
  3. Kulyukin, V.A.; Settle, A.: Ranked retrieval with semantic networks and vector spaces (2001) 0.12
    0.11894455 = sum of:
      0.11894455 = product of:
        0.9912046 = sum of:
          0.0583682 = weight(abstract_txt:semantic in 935) [ClassicSimilarity], result of:
            0.0583682 = score(doc=935,freq=2.0), product of:
              0.083876766 = queryWeight, product of:
                1.2693671 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.01468767 = queryNorm
              0.6958804 = fieldWeight in 935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.109375 = fieldNorm(doc=935)
          0.4664182 = weight(abstract_txt:spreading in 935) [ClassicSimilarity], result of:
            0.4664182 = score(doc=935,freq=1.0), product of:
              0.48352054 = queryWeight, product of:
                3.732669 = boost
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.01468767 = queryNorm
              0.9646296 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.109375 = fieldNorm(doc=935)
          0.4664182 = weight(abstract_txt:activation in 935) [ClassicSimilarity], result of:
            0.4664182 = score(doc=935,freq=1.0), product of:
              0.48352054 = queryWeight, product of:
                3.732669 = boost
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.01468767 = queryNorm
              0.9646296 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.109375 = fieldNorm(doc=935)
        0.12 = coord(3/25)
    
  4. Chen, H.; Ng, T.: ¬An algorithmic approach to concept exploration in a large knowledge network (automatic thesaurus consultation) : symbolic branch-and-bound search versus connectionist Hopfield Net Activation (1995) 0.12
    0.11867353 = sum of:
      0.11867353 = product of:
        0.7417096 = sum of:
          0.029480392 = weight(abstract_txt:semantic in 2272) [ClassicSimilarity], result of:
            0.029480392 = score(doc=2272,freq=1.0), product of:
              0.083876766 = queryWeight, product of:
                1.2693671 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.01468767 = queryNorm
              0.35147268 = fieldWeight in 2272, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.078125 = fieldNorm(doc=2272)
          0.045917485 = weight(abstract_txt:network in 2272) [ClassicSimilarity], result of:
            0.045917485 = score(doc=2272,freq=2.0), product of:
              0.089453004 = queryWeight, product of:
                1.3108828 = boost
                4.645989 = idf(docFreq=1103, maxDocs=42306)
                0.01468767 = queryNorm
              0.51331407 = fieldWeight in 2272, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.645989 = idf(docFreq=1103, maxDocs=42306)
                0.078125 = fieldNorm(doc=2272)
          0.33315587 = weight(abstract_txt:spreading in 2272) [ClassicSimilarity], result of:
            0.33315587 = score(doc=2272,freq=1.0), product of:
              0.48352054 = queryWeight, product of:
                3.732669 = boost
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.01468767 = queryNorm
              0.6890211 = fieldWeight in 2272, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.078125 = fieldNorm(doc=2272)
          0.33315587 = weight(abstract_txt:activation in 2272) [ClassicSimilarity], result of:
            0.33315587 = score(doc=2272,freq=1.0), product of:
              0.48352054 = queryWeight, product of:
                3.732669 = boost
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.01468767 = queryNorm
              0.6890211 = fieldWeight in 2272, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.078125 = fieldNorm(doc=2272)
        0.16 = coord(4/25)
    
  5. Vlachidis, A.; Tudhope, D.: ¬A knowledge-based approach to information extraction for semantic interoperability in the archaeology domain (2016) 0.11
    0.10616845 = sum of:
      0.10616845 = product of:
        0.37917304 = sum of:
          0.046123423 = weight(abstract_txt:phrase in 4896) [ClassicSimilarity], result of:
            0.046123423 = score(doc=4896,freq=1.0), product of:
              0.10411114 = queryWeight, product of:
                7.088336 = idf(docFreq=95, maxDocs=42306)
                0.01468767 = queryNorm
              0.443021 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.088336 = idf(docFreq=95, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.07434233 = weight(abstract_txt:disambiguation in 4896) [ClassicSimilarity], result of:
            0.07434233 = score(doc=4896,freq=2.0), product of:
              0.11359616 = queryWeight, product of:
                1.0445596 = boost
                7.404189 = idf(docFreq=69, maxDocs=42306)
                0.01468767 = queryNorm
              0.65444404 = fieldWeight in 4896, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.404189 = idf(docFreq=69, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.061517127 = weight(abstract_txt:noun in 4896) [ClassicSimilarity], result of:
            0.061517127 = score(doc=4896,freq=1.0), product of:
              0.12614796 = queryWeight, product of:
                1.1007571 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.01468767 = queryNorm
              0.48765853 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.057769537 = weight(abstract_txt:semantic in 4896) [ClassicSimilarity], result of:
            0.057769537 = score(doc=4896,freq=6.0), product of:
              0.083876766 = queryWeight, product of:
                1.2693671 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.01468767 = queryNorm
              0.688743 = fieldWeight in 4896, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.029193847 = weight(abstract_txt:relationships in 4896) [ClassicSimilarity], result of:
            0.029193847 = score(doc=4896,freq=1.0), product of:
              0.09669865 = queryWeight, product of:
                1.3629396 = boost
                4.830487 = idf(docFreq=917, maxDocs=42306)
                0.01468767 = queryNorm
              0.30190542 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.830487 = idf(docFreq=917, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.019680649 = weight(abstract_txt:analysis in 4896) [ClassicSimilarity], result of:
            0.019680649 = score(doc=4896,freq=1.0), product of:
              0.085103914 = queryWeight, product of:
                1.5659821 = boost
                3.7000692 = idf(docFreq=2842, maxDocs=42306)
                0.01468767 = queryNorm
              0.23125432 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7000692 = idf(docFreq=2842, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
          0.090546146 = weight(abstract_txt:ontology in 4896) [ClassicSimilarity], result of:
            0.090546146 = score(doc=4896,freq=1.0), product of:
              0.2591092 = queryWeight, product of:
                3.1551704 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.01468767 = queryNorm
              0.3494517 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.0625 = fieldNorm(doc=4896)
        0.28 = coord(7/25)