Document (#36173)

Author
Fripp, D.
Title
Using linked data to classify web documents
Source
Aslib proceedings. 62(2010) no.6, S.585 - 595
Year
2010
Abstract
Purpose - The purpose of this paper is to find a relationship between traditional faceted classification schemes and semantic web document annotators, particularly in the linked data environment. Design/methodology/approach - A consideration of the conceptual ideas behind faceted classification and linked data architecture is made. Analysis of selected web documents is performed using Calais' Semantic Proxy to support the considerations. Findings - Technical language aside, the principles of both approaches are very similar. Modern classification techniques have the potential to automatically generate metadata to drive more precise information recall by including a semantic layer. Originality/value - Linked data have not been explicitly considered in this context before in the published literature.
Theme
Semantic Web
Klassifikationstheorie: Elemente / Struktur

Similar documents (content)

  1. Bianchini, C.; Bargioni, S.: Automated classification using linked open data : a case study on faceted classification and Wikidata (2021) 0.18
    0.18102787 = sum of:
      0.18102787 = product of:
        0.75428283 = sum of:
          0.09588422 = weight(abstract_txt:classify in 724) [ClassicSimilarity], result of:
            0.09588422 = score(doc=724,freq=1.0), product of:
              0.15643795 = queryWeight, product of:
                1.1483983 = boost
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.020836072 = queryNorm
              0.6129217 = fieldWeight in 724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.09375 = fieldNorm(doc=724)
          0.028502172 = weight(abstract_txt:using in 724) [ClassicSimilarity], result of:
            0.028502172 = score(doc=724,freq=1.0), product of:
              0.08778884 = queryWeight, product of:
                1.2166233 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.020836072 = queryNorm
              0.32466736 = fieldWeight in 724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.09375 = fieldNorm(doc=724)
          0.14649847 = weight(abstract_txt:faceted in 724) [ClassicSimilarity], result of:
            0.14649847 = score(doc=724,freq=1.0), product of:
              0.2614625 = queryWeight, product of:
                2.0996222 = boost
                5.9765754 = idf(docFreq=304, maxDocs=44218)
                0.020836072 = queryNorm
              0.5603039 = fieldWeight in 724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9765754 = idf(docFreq=304, maxDocs=44218)
                0.09375 = fieldNorm(doc=724)
          0.16041277 = weight(abstract_txt:classification in 724) [ClassicSimilarity], result of:
            0.16041277 = score(doc=724,freq=6.0), product of:
              0.17498198 = queryWeight, product of:
                2.1036756 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.020836072 = queryNorm
              0.9167388 = fieldWeight in 724, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.09375 = fieldNorm(doc=724)
          0.08828368 = weight(abstract_txt:data in 724) [ClassicSimilarity], result of:
            0.08828368 = score(doc=724,freq=3.0), product of:
              0.16295859 = queryWeight, product of:
                2.3441753 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.020836072 = queryNorm
              0.5417553 = fieldWeight in 724, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=724)
          0.2347015 = weight(abstract_txt:linked in 724) [ClassicSimilarity], result of:
            0.2347015 = score(doc=724,freq=1.0), product of:
              0.4510326 = queryWeight, product of:
                3.8999176 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.020836072 = queryNorm
              0.5203648 = fieldWeight in 724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.09375 = fieldNorm(doc=724)
        0.24 = coord(6/25)
    
  2. Smith, D.A.; Shadbolt, N.R.: FacetOntology : expressive descriptions of facets in the Semantic Web (2012) 0.17
    0.16715647 = sum of:
      0.16715647 = product of:
        0.5969874 = sum of:
          0.042206395 = weight(abstract_txt:before in 2208) [ClassicSimilarity], result of:
            0.042206395 = score(doc=2208,freq=1.0), product of:
              0.11861976 = queryWeight, product of:
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.020836072 = queryNorm
              0.35581252 = fieldWeight in 2208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6930003 = idf(docFreq=404, maxDocs=44218)
                0.0625 = fieldNorm(doc=2208)
          0.015055904 = weight(abstract_txt:have in 2208) [ClassicSimilarity], result of:
            0.015055904 = score(doc=2208,freq=1.0), product of:
              0.07517142 = queryWeight, product of:
                1.1258042 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.020836072 = queryNorm
              0.20028761 = fieldWeight in 2208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=2208)
          0.019001449 = weight(abstract_txt:using in 2208) [ClassicSimilarity], result of:
            0.019001449 = score(doc=2208,freq=1.0), product of:
              0.08778884 = queryWeight, product of:
                1.2166233 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.020836072 = queryNorm
              0.21644491 = fieldWeight in 2208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=2208)
          0.1953313 = weight(abstract_txt:faceted in 2208) [ClassicSimilarity], result of:
            0.1953313 = score(doc=2208,freq=4.0), product of:
              0.2614625 = queryWeight, product of:
                2.0996222 = boost
                5.9765754 = idf(docFreq=304, maxDocs=44218)
                0.020836072 = queryNorm
              0.7470719 = fieldWeight in 2208, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9765754 = idf(docFreq=304, maxDocs=44218)
                0.0625 = fieldNorm(doc=2208)
          0.107455485 = weight(abstract_txt:data in 2208) [ClassicSimilarity], result of:
            0.107455485 = score(doc=2208,freq=10.0), product of:
              0.16295859 = queryWeight, product of:
                2.3441753 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.020836072 = queryNorm
              0.6594036 = fieldWeight in 2208, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=2208)
          0.061469186 = weight(abstract_txt:semantic in 2208) [ClassicSimilarity], result of:
            0.061469186 = score(doc=2208,freq=1.0), product of:
              0.21981142 = queryWeight, product of:
                2.3578014 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.020836072 = queryNorm
              0.2796451 = fieldWeight in 2208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=2208)
          0.15646766 = weight(abstract_txt:linked in 2208) [ClassicSimilarity], result of:
            0.15646766 = score(doc=2208,freq=1.0), product of:
              0.4510326 = queryWeight, product of:
                3.8999176 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.020836072 = queryNorm
              0.34690988 = fieldWeight in 2208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.0625 = fieldNorm(doc=2208)
        0.28 = coord(7/25)
    
  3. Jia, J.: From data to knowledge : the relationships between vocabularies, linked data and knowledge graphs (2021) 0.15
    0.1475458 = sum of:
      0.1475458 = product of:
        0.73772895 = sum of:
          0.16763121 = weight(abstract_txt:layer in 106) [ClassicSimilarity], result of:
            0.16763121 = score(doc=106,freq=3.0), product of:
              0.20626919 = queryWeight, product of:
                1.3186777 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.020836072 = queryNorm
              0.81268173 = fieldWeight in 106, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=106)
          0.05753159 = weight(abstract_txt:purpose in 106) [ClassicSimilarity], result of:
            0.05753159 = score(doc=106,freq=2.0), product of:
              0.14582852 = queryWeight, product of:
                1.568042 = boost
                4.463432 = idf(docFreq=1384, maxDocs=44218)
                0.020836072 = queryNorm
              0.39451537 = fieldWeight in 106, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.463432 = idf(docFreq=1384, maxDocs=44218)
                0.0625 = fieldNorm(doc=106)
          0.11270027 = weight(abstract_txt:data in 106) [ClassicSimilarity], result of:
            0.11270027 = score(doc=106,freq=11.0), product of:
              0.16295859 = queryWeight, product of:
                2.3441753 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.020836072 = queryNorm
              0.6915884 = fieldWeight in 106, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=106)
          0.08693055 = weight(abstract_txt:semantic in 106) [ClassicSimilarity], result of:
            0.08693055 = score(doc=106,freq=2.0), product of:
              0.21981142 = queryWeight, product of:
                2.3578014 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.020836072 = queryNorm
              0.39547786 = fieldWeight in 106, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=106)
          0.31293532 = weight(abstract_txt:linked in 106) [ClassicSimilarity], result of:
            0.31293532 = score(doc=106,freq=4.0), product of:
              0.4510326 = queryWeight, product of:
                3.8999176 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.020836072 = queryNorm
              0.69381976 = fieldWeight in 106, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.0625 = fieldNorm(doc=106)
        0.2 = coord(5/25)
    
  4. Bianchini, C.; Willer, M.: ISBD resource and Its description in the context of the Semantic Web (2014) 0.13
    0.13316226 = sum of:
      0.13316226 = product of:
        0.6658113 = sum of:
          0.028502172 = weight(abstract_txt:using in 1998) [ClassicSimilarity], result of:
            0.028502172 = score(doc=1998,freq=1.0), product of:
              0.08778884 = queryWeight, product of:
                1.2166233 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.020836072 = queryNorm
              0.32466736 = fieldWeight in 1998, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.09375 = fieldNorm(doc=1998)
          0.061021462 = weight(abstract_txt:purpose in 1998) [ClassicSimilarity], result of:
            0.061021462 = score(doc=1998,freq=1.0), product of:
              0.14582852 = queryWeight, product of:
                1.568042 = boost
                4.463432 = idf(docFreq=1384, maxDocs=44218)
                0.020836072 = queryNorm
              0.41844672 = fieldWeight in 1998, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.463432 = idf(docFreq=1384, maxDocs=44218)
                0.09375 = fieldNorm(doc=1998)
          0.11397376 = weight(abstract_txt:data in 1998) [ClassicSimilarity], result of:
            0.11397376 = score(doc=1998,freq=5.0), product of:
              0.16295859 = queryWeight, product of:
                2.3441753 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.020836072 = queryNorm
              0.69940317 = fieldWeight in 1998, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=1998)
          0.13039583 = weight(abstract_txt:semantic in 1998) [ClassicSimilarity], result of:
            0.13039583 = score(doc=1998,freq=2.0), product of:
              0.21981142 = queryWeight, product of:
                2.3578014 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.020836072 = queryNorm
              0.5932168 = fieldWeight in 1998, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.09375 = fieldNorm(doc=1998)
          0.33191803 = weight(abstract_txt:linked in 1998) [ClassicSimilarity], result of:
            0.33191803 = score(doc=1998,freq=2.0), product of:
              0.4510326 = queryWeight, product of:
                3.8999176 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.020836072 = queryNorm
              0.73590696 = fieldWeight in 1998, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.09375 = fieldNorm(doc=1998)
        0.2 = coord(5/25)
    
  5. Mitchell, J.S.; Panzer, M.: Dewey linked data : Making connections with old friends and new acquaintances (2012) 0.13
    0.1263101 = sum of:
      0.1263101 = product of:
        0.63155043 = sum of:
          0.021292262 = weight(abstract_txt:have in 305) [ClassicSimilarity], result of:
            0.021292262 = score(doc=305,freq=2.0), product of:
              0.07517142 = queryWeight, product of:
                1.1258042 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.020836072 = queryNorm
              0.28324944 = fieldWeight in 305, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=305)
          0.0756193 = weight(abstract_txt:classification in 305) [ClassicSimilarity], result of:
            0.0756193 = score(doc=305,freq=3.0), product of:
              0.17498198 = queryWeight, product of:
                2.1036756 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.020836072 = queryNorm
              0.4321548 = fieldWeight in 305, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=305)
          0.08990371 = weight(abstract_txt:data in 305) [ClassicSimilarity], result of:
            0.08990371 = score(doc=305,freq=7.0), product of:
              0.16295859 = queryWeight, product of:
                2.3441753 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.020836072 = queryNorm
              0.55169666 = fieldWeight in 305, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=305)
          0.061469186 = weight(abstract_txt:semantic in 305) [ClassicSimilarity], result of:
            0.061469186 = score(doc=305,freq=1.0), product of:
              0.21981142 = queryWeight, product of:
                2.3578014 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.020836072 = queryNorm
              0.2796451 = fieldWeight in 305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=305)
          0.38326597 = weight(abstract_txt:linked in 305) [ClassicSimilarity], result of:
            0.38326597 = score(doc=305,freq=6.0), product of:
              0.4510326 = queryWeight, product of:
                3.8999176 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.020836072 = queryNorm
              0.84975225 = fieldWeight in 305, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.0625 = fieldNorm(doc=305)
        0.2 = coord(5/25)