Document (#38036)

Author
Solskinnsbakk, G.
Gulla, J.A.
Haderlein, V.
Myrseth, P.
Cerrato, O.
Title
Quality of hierarchies in ontologies and folksonomies
Source
Data and knowledge engineering. 74(2012) April, S.13-25
Year
2012
Abstract
Ontologies have been a hot research topic for the recent decade and have been used for many applications such as information integration, semantic search, knowledge management, etc. Manual engineering of ontologies is a costly process and automatic ontology engineering lacks in precision. Folksonomies have recently emerged as another hot research topic and several research efforts have been made to extract lightweight ontologies automatically from folksonomy data. Due to the high cost of manual ontology engineering and the lack of precision in automatic ontology engineering it is important that we are able to evaluate the structure of the ontology. Detection of problems with the suggested ontology at an early stage can, especially for manually engineered ontologies, be cost saving. In this paper we present an approach to evaluate the quality of hierarchical relations in ontologies and folksonomy based structures. The approach is based on constructing shallow semantic representations of the ontology concepts and folksonomy tags. We specify four hypotheses regarding the semantic representations and different quality aspects of the hierarchical relations and perform an evaluation on two different data sets. The results of the evaluation confirm our hypotheses.
Theme
Folksonomies
Wissensrepräsentation

Similar documents (content)

  1. Jiang, X.; Tan, A.-H.: CRCTOL: a semantic-based domain ontology learning system (2009) 0.35
    0.3460814 = sum of:
      0.3460814 = product of:
        0.9613372 = sum of:
          0.066576675 = weight(abstract_txt:shallow in 321) [ClassicSimilarity], result of:
            0.066576675 = score(doc=321,freq=1.0), product of:
              0.1456772 = queryWeight, product of:
                1.0710424 = boost
                8.356848 = idf(docFreq=26, maxDocs=42306)
                0.016275803 = queryNorm
              0.4570151 = fieldWeight in 321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.356848 = idf(docFreq=26, maxDocs=42306)
                0.0546875 = fieldNorm(doc=321)
          0.020698922 = weight(abstract_txt:evaluation in 321) [ClassicSimilarity], result of:
            0.020698922 = score(doc=321,freq=1.0), product of:
              0.08423359 = queryWeight, product of:
                1.1517774 = boost
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.016275803 = queryNorm
              0.2457324 = fieldWeight in 321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.0546875 = fieldNorm(doc=321)
          0.016563991 = weight(abstract_txt:been in 321) [ClassicSimilarity], result of:
            0.016563991 = score(doc=321,freq=1.0), product of:
              0.08311139 = queryWeight, product of:
                1.4012053 = boost
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.016275803 = queryNorm
              0.19929868 = fieldWeight in 321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.0546875 = fieldNorm(doc=321)
          0.07955945 = weight(abstract_txt:relations in 321) [ClassicSimilarity], result of:
            0.07955945 = score(doc=321,freq=4.0), product of:
              0.1302051 = queryWeight, product of:
                1.4319897 = boost
                5.586576 = idf(docFreq=430, maxDocs=42306)
                0.016275803 = queryNorm
              0.6110318 = fieldWeight in 321, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.586576 = idf(docFreq=430, maxDocs=42306)
                0.0546875 = fieldNorm(doc=321)
          0.01555456 = weight(abstract_txt:have in 321) [ClassicSimilarity], result of:
            0.01555456 = score(doc=321,freq=1.0), product of:
              0.08772068 = queryWeight, product of:
                1.6622329 = boost
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.016275803 = queryNorm
              0.1773192 = fieldWeight in 321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.0546875 = fieldNorm(doc=321)
          0.053973578 = weight(abstract_txt:semantic in 321) [ClassicSimilarity], result of:
            0.053973578 = score(doc=321,freq=3.0), product of:
              0.12665752 = queryWeight, product of:
                1.7297646 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.016275803 = queryNorm
              0.42613798 = fieldWeight in 321, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.0546875 = fieldNorm(doc=321)
          0.04973193 = weight(abstract_txt:quality in 321) [ClassicSimilarity], result of:
            0.04973193 = score(doc=321,freq=2.0), product of:
              0.13728742 = queryWeight, product of:
                1.8008888 = boost
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.016275803 = queryNorm
              0.36224678 = fieldWeight in 321, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.0546875 = fieldNorm(doc=321)
          0.31653103 = weight(abstract_txt:ontology in 321) [ClassicSimilarity], result of:
            0.31653103 = score(doc=321,freq=7.0), product of:
              0.391266 = queryWeight, product of:
                4.299546 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.016275803 = queryNorm
              0.8089919 = fieldWeight in 321, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.0546875 = fieldNorm(doc=321)
          0.34214705 = weight(abstract_txt:ontologies in 321) [ClassicSimilarity], result of:
            0.34214705 = score(doc=321,freq=6.0), product of:
              0.4338291 = queryWeight, product of:
                4.5273685 = boost
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.016275803 = queryNorm
              0.7886678 = fieldWeight in 321, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.0546875 = fieldNorm(doc=321)
        0.36 = coord(9/25)
    
  2. Mustafa El Hadi, W.: Terminologies, ontologies and information access (2006) 0.31
    0.31430525 = sum of:
      0.31430525 = product of:
        1.1225188 = sum of:
          0.023023145 = weight(abstract_txt:research in 3489) [ClassicSimilarity], result of:
            0.023023145 = score(doc=3489,freq=1.0), product of:
              0.06520892 = queryWeight, product of:
                1.2411522 = boost
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.016275803 = queryNorm
              0.35306743 = fieldWeight in 3489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.109375 = fieldNorm(doc=3489)
          0.06400895 = weight(abstract_txt:automatic in 3489) [ClassicSimilarity], result of:
            0.06400895 = score(doc=3489,freq=1.0), product of:
              0.11263169 = queryWeight, product of:
                1.3318527 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.016275803 = queryNorm
              0.56830317 = fieldWeight in 3489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.109375 = fieldNorm(doc=3489)
          0.08613737 = weight(abstract_txt:cost in 3489) [ClassicSimilarity], result of:
            0.08613737 = score(doc=3489,freq=1.0), product of:
              0.13728651 = queryWeight, product of:
                1.4704146 = boost
                5.736482 = idf(docFreq=370, maxDocs=42306)
                0.016275803 = queryNorm
              0.62742776 = fieldWeight in 3489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.736482 = idf(docFreq=370, maxDocs=42306)
                0.109375 = fieldNorm(doc=3489)
          0.03110912 = weight(abstract_txt:have in 3489) [ClassicSimilarity], result of:
            0.03110912 = score(doc=3489,freq=1.0), product of:
              0.08772068 = queryWeight, product of:
                1.6622329 = boost
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.016275803 = queryNorm
              0.3546384 = fieldWeight in 3489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.109375 = fieldNorm(doc=3489)
          0.19509625 = weight(abstract_txt:engineering in 3489) [ClassicSimilarity], result of:
            0.19509625 = score(doc=3489,freq=1.0), product of:
              0.2983157 = queryWeight, product of:
                3.0653422 = boost
                5.979361 = idf(docFreq=290, maxDocs=42306)
                0.016275803 = queryNorm
              0.6539926 = fieldWeight in 3489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.979361 = idf(docFreq=290, maxDocs=42306)
                0.109375 = fieldNorm(doc=3489)
          0.23927498 = weight(abstract_txt:ontology in 3489) [ClassicSimilarity], result of:
            0.23927498 = score(doc=3489,freq=1.0), product of:
              0.391266 = queryWeight, product of:
                4.299546 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.016275803 = queryNorm
              0.61154044 = fieldWeight in 3489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.109375 = fieldNorm(doc=3489)
          0.48386902 = weight(abstract_txt:ontologies in 3489) [ClassicSimilarity], result of:
            0.48386902 = score(doc=3489,freq=3.0), product of:
              0.4338291 = queryWeight, product of:
                4.5273685 = boost
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.016275803 = queryNorm
              1.1153448 = fieldWeight in 3489, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.109375 = fieldNorm(doc=3489)
        0.28 = coord(7/25)
    
  3. Mao, M.: Ontology mapping : towards semantic interoperability in distributed and heterogeneous environments (2008) 0.26
    0.25781617 = sum of:
      0.25781617 = product of:
        0.920772 = sum of:
          0.016279822 = weight(abstract_txt:research in 1660) [ClassicSimilarity], result of:
            0.016279822 = score(doc=1660,freq=2.0), product of:
              0.06520892 = queryWeight, product of:
                1.2411522 = boost
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.016275803 = queryNorm
              0.24965636 = fieldWeight in 1660, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1660)
          0.023425018 = weight(abstract_txt:been in 1660) [ClassicSimilarity], result of:
            0.023425018 = score(doc=1660,freq=2.0), product of:
              0.08311139 = queryWeight, product of:
                1.4012053 = boost
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.016275803 = queryNorm
              0.28185087 = fieldWeight in 1660, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1660)
          0.048858356 = weight(abstract_txt:manual in 1660) [ClassicSimilarity], result of:
            0.048858356 = score(doc=1660,freq=1.0), product of:
              0.14932965 = queryWeight, product of:
                1.5335535 = boost
                5.9828033 = idf(docFreq=289, maxDocs=42306)
                0.016275803 = queryNorm
              0.32718456 = fieldWeight in 1660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9828033 = idf(docFreq=289, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1660)
          0.02199747 = weight(abstract_txt:have in 1660) [ClassicSimilarity], result of:
            0.02199747 = score(doc=1660,freq=2.0), product of:
              0.08772068 = queryWeight, product of:
                1.6622329 = boost
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.016275803 = queryNorm
              0.25076723 = fieldWeight in 1660, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1660)
          0.06232332 = weight(abstract_txt:semantic in 1660) [ClassicSimilarity], result of:
            0.06232332 = score(doc=1660,freq=4.0), product of:
              0.12665752 = queryWeight, product of:
                1.7297646 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.016275803 = queryNorm
              0.49206176 = fieldWeight in 1660, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1660)
          0.37832698 = weight(abstract_txt:ontology in 1660) [ClassicSimilarity], result of:
            0.37832698 = score(doc=1660,freq=10.0), product of:
              0.391266 = queryWeight, product of:
                4.299546 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.016275803 = queryNorm
              0.9669304 = fieldWeight in 1660, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1660)
          0.36956102 = weight(abstract_txt:ontologies in 1660) [ClassicSimilarity], result of:
            0.36956102 = score(doc=1660,freq=7.0), product of:
              0.4338291 = queryWeight, product of:
                4.5273685 = boost
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.016275803 = queryNorm
              0.85185856 = fieldWeight in 1660, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1660)
        0.28 = coord(7/25)
    
  4. Urs, S.R.; Angrosh, M.A.: Ontology-based knowledge organization systems in digital libraries : a comparison of experiments in OWL and KAON ontologies (2006 (?)) 0.25
    0.25005403 = sum of:
      0.25005403 = product of:
        0.89305013 = sum of:
          0.01993863 = weight(abstract_txt:research in 619) [ClassicSimilarity], result of:
            0.01993863 = score(doc=619,freq=3.0), product of:
              0.06520892 = queryWeight, product of:
                1.2411522 = boost
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.016275803 = queryNorm
              0.30576536 = fieldWeight in 619, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.0546875 = fieldNorm(doc=619)
          0.016563991 = weight(abstract_txt:been in 619) [ClassicSimilarity], result of:
            0.016563991 = score(doc=619,freq=1.0), product of:
              0.08311139 = queryWeight, product of:
                1.4012053 = boost
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.016275803 = queryNorm
              0.19929868 = fieldWeight in 619, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.0546875 = fieldNorm(doc=619)
          0.050353333 = weight(abstract_txt:representations in 619) [ClassicSimilarity], result of:
            0.050353333 = score(doc=619,freq=1.0), product of:
              0.15236047 = queryWeight, product of:
                1.5490379 = boost
                6.0432124 = idf(docFreq=272, maxDocs=42306)
                0.016275803 = queryNorm
              0.33048818 = fieldWeight in 619, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0432124 = idf(docFreq=272, maxDocs=42306)
                0.0546875 = fieldNorm(doc=619)
          0.03116166 = weight(abstract_txt:semantic in 619) [ClassicSimilarity], result of:
            0.03116166 = score(doc=619,freq=1.0), product of:
              0.12665752 = queryWeight, product of:
                1.7297646 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.016275803 = queryNorm
              0.24603088 = fieldWeight in 619, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.0546875 = fieldNorm(doc=619)
          0.1379539 = weight(abstract_txt:engineering in 619) [ClassicSimilarity], result of:
            0.1379539 = score(doc=619,freq=2.0), product of:
              0.2983157 = queryWeight, product of:
                3.0653422 = boost
                5.979361 = idf(docFreq=290, maxDocs=42306)
                0.016275803 = queryNorm
              0.46244264 = fieldWeight in 619, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.979361 = idf(docFreq=290, maxDocs=42306)
                0.0546875 = fieldNorm(doc=619)
          0.26751757 = weight(abstract_txt:ontology in 619) [ClassicSimilarity], result of:
            0.26751757 = score(doc=619,freq=5.0), product of:
              0.391266 = queryWeight, product of:
                4.299546 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.016275803 = queryNorm
              0.68372303 = fieldWeight in 619, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.0546875 = fieldNorm(doc=619)
          0.36956102 = weight(abstract_txt:ontologies in 619) [ClassicSimilarity], result of:
            0.36956102 = score(doc=619,freq=7.0), product of:
              0.4338291 = queryWeight, product of:
                4.5273685 = boost
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.016275803 = queryNorm
              0.85185856 = fieldWeight in 619, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.0546875 = fieldNorm(doc=619)
        0.28 = coord(7/25)
    
  5. Ma, N.; Zheng, H.T.; Xiao, X.: ¬An ontology-based latent semantic indexing approach using long short-term memory networks (2017) 0.25
    0.24520667 = sum of:
      0.24520667 = product of:
        0.8757381 = sum of:
          0.043348826 = weight(abstract_txt:topic in 729) [ClassicSimilarity], result of:
            0.043348826 = score(doc=729,freq=1.0), product of:
              0.108701885 = queryWeight, product of:
                1.3084117 = boost
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.016275803 = queryNorm
              0.39878634 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.078125 = fieldNorm(doc=729)
          0.023662841 = weight(abstract_txt:been in 729) [ClassicSimilarity], result of:
            0.023662841 = score(doc=729,freq=1.0), product of:
              0.08311139 = queryWeight, product of:
                1.4012053 = boost
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.016275803 = queryNorm
              0.28471237 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6443186 = idf(docFreq=3005, maxDocs=42306)
                0.078125 = fieldNorm(doc=729)
          0.08036717 = weight(abstract_txt:relations in 729) [ClassicSimilarity], result of:
            0.08036717 = score(doc=729,freq=2.0), product of:
              0.1302051 = queryWeight, product of:
                1.4319897 = boost
                5.586576 = idf(docFreq=430, maxDocs=42306)
                0.016275803 = queryNorm
              0.61723524 = fieldWeight in 729, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.586576 = idf(docFreq=430, maxDocs=42306)
                0.078125 = fieldNorm(doc=729)
          0.022220802 = weight(abstract_txt:have in 729) [ClassicSimilarity], result of:
            0.022220802 = score(doc=729,freq=1.0), product of:
              0.08772068 = queryWeight, product of:
                1.6622329 = boost
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.016275803 = queryNorm
              0.25331315 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2424083 = idf(docFreq=4492, maxDocs=42306)
                0.078125 = fieldNorm(doc=729)
          0.08903331 = weight(abstract_txt:semantic in 729) [ClassicSimilarity], result of:
            0.08903331 = score(doc=729,freq=4.0), product of:
              0.12665752 = queryWeight, product of:
                1.7297646 = boost
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.016275803 = queryNorm
              0.70294535 = fieldWeight in 729, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4988503 = idf(docFreq=1278, maxDocs=42306)
                0.078125 = fieldNorm(doc=729)
          0.1709107 = weight(abstract_txt:ontology in 729) [ClassicSimilarity], result of:
            0.1709107 = score(doc=729,freq=1.0), product of:
              0.391266 = queryWeight, product of:
                4.299546 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.016275803 = queryNorm
              0.4368146 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.078125 = fieldNorm(doc=729)
          0.4461944 = weight(abstract_txt:ontologies in 729) [ClassicSimilarity], result of:
            0.4461944 = score(doc=729,freq=5.0), product of:
              0.4338291 = queryWeight, product of:
                4.5273685 = boost
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.016275803 = queryNorm
              1.0285027 = fieldWeight in 729, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.078125 = fieldNorm(doc=729)
        0.28 = coord(7/25)