Document (#36174)

Author
Fripp, D.
Title
Using linked data to classify web documents
Source
Aslib proceedings. 62(2010) no.6, S.585 - 595
Year
2010
Abstract
Purpose - The purpose of this paper is to find a relationship between traditional faceted classification schemes and semantic web document annotators, particularly in the linked data environment. Design/methodology/approach - A consideration of the conceptual ideas behind faceted classification and linked data architecture is made. Analysis of selected web documents is performed using Calais' Semantic Proxy to support the considerations. Findings - Technical language aside, the principles of both approaches are very similar. Modern classification techniques have the potential to automatically generate metadata to drive more precise information recall by including a semantic layer. Originality/value - Linked data have not been explicitly considered in this context before in the published literature.
Theme
Semantic Web
Klassifikationstheorie: Elemente / Struktur

Similar documents (content)

  1. Smith, D.A.; Shadbolt, N.R.: FacetOntology : expressive descriptions of facets in the Semantic Web (2012) 0.17
    0.1702751 = sum of:
      0.1702751 = product of:
        0.6081254 = sum of:
          0.042497065 = weight(abstract_txt:before in 4209) [ClassicSimilarity], result of:
            0.042497065 = score(doc=4209,freq=1.0), product of:
              0.11865209 = queryWeight, product of:
                5.730645 = idf(docFreq=376, maxDocs=42740)
                0.02070484 = queryNorm
              0.35816532 = fieldWeight in 4209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.730645 = idf(docFreq=376, maxDocs=42740)
                0.0625 = fieldNorm(doc=4209)
          0.015255255 = weight(abstract_txt:have in 4209) [ClassicSimilarity], result of:
            0.015255255 = score(doc=4209,freq=1.0), product of:
              0.0755079 = queryWeight, product of:
                1.1281673 = boost
                3.2325633 = idf(docFreq=4583, maxDocs=42740)
                0.02070484 = queryNorm
              0.2020352 = fieldWeight in 4209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2325633 = idf(docFreq=4583, maxDocs=42740)
                0.0625 = fieldNorm(doc=4209)
          0.019025074 = weight(abstract_txt:using in 4209) [ClassicSimilarity], result of:
            0.019025074 = score(doc=4209,freq=1.0), product of:
              0.087484345 = queryWeight, product of:
                1.214346 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.02070484 = queryNorm
              0.21746832 = fieldWeight in 4209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.0625 = fieldNorm(doc=4209)
          0.19963582 = weight(abstract_txt:faceted in 4209) [ClassicSimilarity], result of:
            0.19963582 = score(doc=4209,freq=4.0), product of:
              0.26415068 = queryWeight, product of:
                2.1101007 = boost
                6.046119 = idf(docFreq=274, maxDocs=42740)
                0.02070484 = queryNorm
              0.7557649 = fieldWeight in 4209, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.046119 = idf(docFreq=274, maxDocs=42740)
                0.0625 = fieldNorm(doc=4209)
          0.109499305 = weight(abstract_txt:data in 4209) [ClassicSimilarity], result of:
            0.109499305 = score(doc=4209,freq=10.0), product of:
              0.16430987 = queryWeight, product of:
                2.3535538 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02070484 = queryNorm
              0.6664195 = fieldWeight in 4209, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.0625 = fieldNorm(doc=4209)
          0.06165628 = weight(abstract_txt:semantic in 4209) [ClassicSimilarity], result of:
            0.06165628 = score(doc=4209,freq=1.0), product of:
              0.2193115 = queryWeight, product of:
                2.3547978 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.02070484 = queryNorm
              0.28113565 = fieldWeight in 4209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.0625 = fieldNorm(doc=4209)
          0.16055661 = weight(abstract_txt:linked in 4209) [ClassicSimilarity], result of:
            0.16055661 = score(doc=4209,freq=1.0), product of:
              0.4568864 = queryWeight, product of:
                3.9246092 = boost
                5.622636 = idf(docFreq=419, maxDocs=42740)
                0.02070484 = queryNorm
              0.35141474 = fieldWeight in 4209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.622636 = idf(docFreq=419, maxDocs=42740)
                0.0625 = fieldNorm(doc=4209)
        0.28 = coord(7/25)
    
  2. Bianchini, C.; Willer, M.: ISBD resource and Its description in the context of the Semantic Web (2014) 0.14
    0.13558874 = sum of:
      0.13558874 = product of:
        0.67794365 = sum of:
          0.02853761 = weight(abstract_txt:using in 3999) [ClassicSimilarity], result of:
            0.02853761 = score(doc=3999,freq=1.0), product of:
              0.087484345 = queryWeight, product of:
                1.214346 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.02070484 = queryNorm
              0.32620248 = fieldWeight in 3999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.09375 = fieldNorm(doc=3999)
          0.061879765 = weight(abstract_txt:purpose in 3999) [ClassicSimilarity], result of:
            0.061879765 = score(doc=3999,freq=1.0), product of:
              0.14656076 = queryWeight, product of:
                1.5717597 = boost
                4.5035987 = idf(docFreq=1285, maxDocs=42740)
                0.02070484 = queryNorm
              0.42221236 = fieldWeight in 3999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5035987 = idf(docFreq=1285, maxDocs=42740)
                0.09375 = fieldNorm(doc=3999)
          0.11614154 = weight(abstract_txt:data in 3999) [ClassicSimilarity], result of:
            0.11614154 = score(doc=3999,freq=5.0), product of:
              0.16430987 = queryWeight, product of:
                2.3535538 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02070484 = queryNorm
              0.70684457 = fieldWeight in 3999, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.09375 = fieldNorm(doc=3999)
          0.13079272 = weight(abstract_txt:semantic in 3999) [ClassicSimilarity], result of:
            0.13079272 = score(doc=3999,freq=2.0), product of:
              0.2193115 = queryWeight, product of:
                2.3547978 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.02070484 = queryNorm
              0.59637874 = fieldWeight in 3999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.09375 = fieldNorm(doc=3999)
          0.34059203 = weight(abstract_txt:linked in 3999) [ClassicSimilarity], result of:
            0.34059203 = score(doc=3999,freq=2.0), product of:
              0.4568864 = queryWeight, product of:
                3.9246092 = boost
                5.622636 = idf(docFreq=419, maxDocs=42740)
                0.02070484 = queryNorm
              0.74546325 = fieldWeight in 3999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.622636 = idf(docFreq=419, maxDocs=42740)
                0.09375 = fieldNorm(doc=3999)
        0.2 = coord(5/25)
    
  3. Mitchell, J.S.; Panzer, M.: Dewey linked data : Making connections with old friends and new acquaintances (2012) 0.13
    0.12864363 = sum of:
      0.12864363 = product of:
        0.64321816 = sum of:
          0.021574186 = weight(abstract_txt:have in 2306) [ClassicSimilarity], result of:
            0.021574186 = score(doc=2306,freq=2.0), product of:
              0.0755079 = queryWeight, product of:
                1.1281673 = boost
                3.2325633 = idf(docFreq=4583, maxDocs=42740)
                0.02070484 = queryNorm
              0.2857209 = fieldWeight in 2306, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2325633 = idf(docFreq=4583, maxDocs=42740)
                0.0625 = fieldNorm(doc=2306)
          0.075092204 = weight(abstract_txt:classification in 2306) [ClassicSimilarity], result of:
            0.075092204 = score(doc=2306,freq=3.0), product of:
              0.17342006 = queryWeight, product of:
                2.0939803 = boost
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.02070484 = queryNorm
              0.4330076 = fieldWeight in 2306, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.0625 = fieldNorm(doc=2306)
          0.09161369 = weight(abstract_txt:data in 2306) [ClassicSimilarity], result of:
            0.09161369 = score(doc=2306,freq=7.0), product of:
              0.16430987 = queryWeight, product of:
                2.3535538 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02070484 = queryNorm
              0.5575665 = fieldWeight in 2306, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.0625 = fieldNorm(doc=2306)
          0.06165628 = weight(abstract_txt:semantic in 2306) [ClassicSimilarity], result of:
            0.06165628 = score(doc=2306,freq=1.0), product of:
              0.2193115 = queryWeight, product of:
                2.3547978 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.02070484 = queryNorm
              0.28113565 = fieldWeight in 2306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.0625 = fieldNorm(doc=2306)
          0.39328182 = weight(abstract_txt:linked in 2306) [ClassicSimilarity], result of:
            0.39328182 = score(doc=2306,freq=6.0), product of:
              0.4568864 = queryWeight, product of:
                3.9246092 = boost
                5.622636 = idf(docFreq=419, maxDocs=42740)
                0.02070484 = queryNorm
              0.86078686 = fieldWeight in 2306, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.622636 = idf(docFreq=419, maxDocs=42740)
                0.0625 = fieldNorm(doc=2306)
        0.2 = coord(5/25)
    
  4. Quick Guide to Publishing a Classification Scheme on the Semantic Web (2008) 0.13
    0.12643228 = sum of:
      0.12643228 = product of:
        0.63216144 = sum of:
          0.02853761 = weight(abstract_txt:using in 62) [ClassicSimilarity], result of:
            0.02853761 = score(doc=62,freq=1.0), product of:
              0.087484345 = queryWeight, product of:
                1.214346 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.02070484 = queryNorm
              0.32620248 = fieldWeight in 62, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.09375 = fieldNorm(doc=62)
          0.1126383 = weight(abstract_txt:classification in 62) [ClassicSimilarity], result of:
            0.1126383 = score(doc=62,freq=3.0), product of:
              0.17342006 = queryWeight, product of:
                2.0939803 = boost
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.02070484 = queryNorm
              0.6495114 = fieldWeight in 62, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.09375 = fieldNorm(doc=62)
          0.08996285 = weight(abstract_txt:data in 62) [ClassicSimilarity], result of:
            0.08996285 = score(doc=62,freq=3.0), product of:
              0.16430987 = queryWeight, product of:
                2.3535538 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02070484 = queryNorm
              0.54751945 = fieldWeight in 62, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.09375 = fieldNorm(doc=62)
          0.16018772 = weight(abstract_txt:semantic in 62) [ClassicSimilarity], result of:
            0.16018772 = score(doc=62,freq=3.0), product of:
              0.2193115 = queryWeight, product of:
                2.3547978 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.02070484 = queryNorm
              0.7304118 = fieldWeight in 62, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.09375 = fieldNorm(doc=62)
          0.24083494 = weight(abstract_txt:linked in 62) [ClassicSimilarity], result of:
            0.24083494 = score(doc=62,freq=1.0), product of:
              0.4568864 = queryWeight, product of:
                3.9246092 = boost
                5.622636 = idf(docFreq=419, maxDocs=42740)
                0.02070484 = queryNorm
              0.52712214 = fieldWeight in 62, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.622636 = idf(docFreq=419, maxDocs=42740)
                0.09375 = fieldNorm(doc=62)
        0.2 = coord(5/25)
    
  5. Vocht, L. De: Exploring semantic relationships in the Web of Data : Semantische relaties verkennen in data op het web (2017) 0.12
    0.11930598 = sum of:
      0.11930598 = product of:
        0.4260928 = sum of:
          0.03853932 = weight(abstract_txt:behind in 233) [ClassicSimilarity], result of:
            0.03853932 = score(doc=233,freq=2.0), product of:
              0.15310057 = queryWeight, product of:
                1.1359277 = boost
                6.5095987 = idf(docFreq=172, maxDocs=42740)
                0.02070484 = queryNorm
              0.25172552 = fieldWeight in 233, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5095987 = idf(docFreq=172, maxDocs=42740)
                0.02734375 = fieldNorm(doc=233)
          0.030386675 = weight(abstract_txt:precise in 233) [ClassicSimilarity], result of:
            0.030386675 = score(doc=233,freq=1.0), product of:
              0.16462894 = queryWeight, product of:
                1.1779189 = boost
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.02070484 = queryNorm
              0.18457675 = fieldWeight in 233, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7502356 = idf(docFreq=135, maxDocs=42740)
                0.02734375 = fieldNorm(doc=233)
          0.00832347 = weight(abstract_txt:using in 233) [ClassicSimilarity], result of:
            0.00832347 = score(doc=233,freq=1.0), product of:
              0.087484345 = queryWeight, product of:
                1.214346 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.02070484 = queryNorm
              0.095142394 = fieldWeight in 233, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.02734375 = fieldNorm(doc=233)
          0.030794565 = weight(abstract_txt:documents in 233) [ClassicSimilarity], result of:
            0.030794565 = score(doc=233,freq=5.0), product of:
              0.122382715 = queryWeight, product of:
                1.4362742 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.02070484 = queryNorm
              0.25162512 = fieldWeight in 233, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.02734375 = fieldNorm(doc=233)
          0.06059676 = weight(abstract_txt:data in 233) [ClassicSimilarity], result of:
            0.06059676 = score(doc=233,freq=16.0), product of:
              0.16430987 = queryWeight, product of:
                2.3535538 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02070484 = queryNorm
              0.3687956 = fieldWeight in 233, product of:
                4.0 = tf(freq=16.0), with freq of:
                  16.0 = termFreq=16.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02734375 = fieldNorm(doc=233)
          0.046721417 = weight(abstract_txt:semantic in 233) [ClassicSimilarity], result of:
            0.046721417 = score(doc=233,freq=3.0), product of:
              0.2193115 = queryWeight, product of:
                2.3547978 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.02070484 = queryNorm
              0.21303678 = fieldWeight in 233, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.02734375 = fieldNorm(doc=233)
          0.21073058 = weight(abstract_txt:linked in 233) [ClassicSimilarity], result of:
            0.21073058 = score(doc=233,freq=9.0), product of:
              0.4568864 = queryWeight, product of:
                3.9246092 = boost
                5.622636 = idf(docFreq=419, maxDocs=42740)
                0.02070484 = queryNorm
              0.4612319 = fieldWeight in 233, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.622636 = idf(docFreq=419, maxDocs=42740)
                0.02734375 = fieldNorm(doc=233)
        0.28 = coord(7/25)