Document (#37415)

Author
Luo, Y.
Picalausa, F.
Fletcher, G.H.L.
Hidders, J.
Vansummeren, S.
Title
Storing and indexing massive RDF datasets
Source
Semantic search over the Web. Eds.: R. De Virgilio, et al.
Imprint
Berlin : Springer
Year
2012
Pages
pp. 31-60
Series
Data-centric systems and applications
Abstract
The Resource Description Framework (RDF for short) provides a flexible method for modeling information on the Web [34,40]. All data items in RDF are uniformly represented as triples of the form (subject, predicate, object), sometimes also referred to as (subject, property, value) triples. As a running example for this chapter, a small fragment of an RDF dataset concerning music and music fans is given in Fig. 2.1. Spurred by efforts like the Linking Open Data project, increasingly large volumes of data are being published in RDF. Notable contributors in this respect include areas as diverse as government, the life sciences, and Web 2.0 communities. To give an idea of the volumes of RDF data concerned, as of September 2012, data sources participating in the Linking Open Data project had published 31,634,213,770 triples in total. Many individual data sources (e.g., PubMed, DBpedia, MusicBrainz) contain hundreds of millions of triples (797, 672, and 179 million, respectively). These large volumes of RDF data motivate the need for scalable native RDF data management solutions capable of efficiently storing, indexing, and querying RDF data. In this chapter, we present a general and up-to-date survey of the current state of the art in RDF storage and indexing.
Theme
Semantic Web
Object
RDF

Similar documents (author)

  1. Fletcher, L.: In-house publishing with CD-ROM tools (1992) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:fletcher in 4272) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 4272, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=4272)
    
  2. Fletcher, L.: Is there a chance for a standardised user interface? (1993) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:fletcher in 4287) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 4287, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=4287)
    
  3. Fletcher, M.: ¬The CATRIONA project : feasibility study and outcomes (1996) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:fletcher in 3817) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 3817, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=3817)
    
  4. Fletcher, P.D.: Creating the front door to government : a case study of the Firstgov portal (2004) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:fletcher in 872) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 872, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=872)
    
  5. Katzer, J.; Fletcher, P.T.: ¬The information environment of managers (1992) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:fletcher in 6714) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 6714, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=6714)
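
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the classic TF-IDF formulas that the listed numbers appear to follow: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), and fieldWeight = tf * idf * fieldNorm. The function names are hypothetical and are not part of the catalog software. The listing shows single-precision values, so results agree only to about four or five decimal places.

```python
import math

def tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw frequency (assumed formula).
    return math.sqrt(freq)

def idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency as inferred from the listing:
    # idf(docFreq=11, maxDocs=44218) is shown as 9.211981.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight is the product of tf, idf and the stored field norm.
    return tf(freq) * idf(doc_freq, max_docs) * field_norm

MAX_DOCS = 44218

# Entries 1 to 4: one author in the field, fieldNorm = 0.625.
full = field_weight(1.0, 11, MAX_DOCS, 0.625)
# Entry 5: two authors in the field, fieldNorm = 0.5.
half = field_weight(1.0, 11, MAX_DOCS, 0.5)

# Compare with 5.7574883 and 4.6059904 in the listing.
print(full, half)
```

The only difference between the 5.76 and 4.61 scores is the field norm: the fifth record lists two authors, so the match on "fletcher" is weighted down.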
    

Similar documents (content)

  1. Hassanzadeh, O.; Kementsietsidis, A.; Lim, L.; Miller, R.J.; Wang, M.: Semantic link discovery over relational data (2012) 0.38
    0.37534773 = sum of:
      0.37534773 = product of:
        1.0426326 = sum of:
          0.026972834 = weight(abstract_txt:project in 412) [ClassicSimilarity], result of:
            0.026972834 = score(doc=412,freq=1.0), product of:
              0.078854464 = queryWeight, product of:
                1.124836 = boost
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.016011309 = queryNorm
              0.34205842 = fieldWeight in 412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.04015943 = weight(abstract_txt:large in 412) [ClassicSimilarity], result of:
            0.04015943 = score(doc=412,freq=2.0), product of:
              0.081606284 = queryWeight, product of:
                1.1442946 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.016011309 = queryNorm
              0.49211198 = fieldWeight in 412, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.034355707 = weight(abstract_txt:sources in 412) [ClassicSimilarity], result of:
            0.034355707 = score(doc=412,freq=1.0), product of:
              0.0926562 = queryWeight, product of:
                1.2193077 = boost
                4.7460723 = idf(docFreq=1043, maxDocs=44218)
                0.016011309 = queryNorm
              0.3707869 = fieldWeight in 412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7460723 = idf(docFreq=1043, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.03606974 = weight(abstract_txt:open in 412) [ClassicSimilarity], result of:
            0.03606974 = score(doc=412,freq=1.0), product of:
              0.09571292 = queryWeight, product of:
                1.2392569 = boost
                4.8237233 = idf(docFreq=965, maxDocs=44218)
                0.016011309 = queryNorm
              0.37685338 = fieldWeight in 412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8237233 = idf(docFreq=965, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.038149215 = weight(abstract_txt:published in 412) [ClassicSimilarity], result of:
            0.038149215 = score(doc=412,freq=1.0), product of:
              0.09935711 = queryWeight, product of:
                1.2626283 = boost
                4.9146953 = idf(docFreq=881, maxDocs=44218)
                0.016011309 = queryNorm
              0.38396057 = fieldWeight in 412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9146953 = idf(docFreq=881, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.07262394 = weight(abstract_txt:linking in 412) [ClassicSimilarity], result of:
            0.07262394 = score(doc=412,freq=1.0), product of:
              0.15261425 = queryWeight, product of:
                1.5648532 = boost
                6.091085 = idf(docFreq=271, maxDocs=44218)
                0.016011309 = queryNorm
              0.47586602 = fieldWeight in 412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.091085 = idf(docFreq=271, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.08154447 = weight(abstract_txt:chapter in 412) [ClassicSimilarity], result of:
            0.08154447 = score(doc=412,freq=1.0), product of:
              0.16486871 = queryWeight, product of:
                1.6264666 = boost
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.016011309 = queryNorm
              0.49460244 = fieldWeight in 412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.13343348 = weight(abstract_txt:data in 412) [ClassicSimilarity], result of:
            0.13343348 = score(doc=412,freq=5.0), product of:
              0.2289383 = queryWeight, product of:
                4.2856855 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016011309 = queryNorm
              0.582836 = fieldWeight in 412, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
          0.5793237 = weight(abstract_txt:triples in 412) [ClassicSimilarity], result of:
            0.5793237 = score(doc=412,freq=2.0), product of:
              0.6092882 = queryWeight, product of:
                4.421834 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.016011309 = queryNorm
              0.95082045 = fieldWeight in 412, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.078125 = fieldNorm(doc=412)
        0.36 = coord(9/25)
    
  2. Baker, T.: ¬The concepts of knowledge organization systems as hubs in the Web of data (2011) 0.16
    0.1613449 = sum of:
      0.1613449 = product of:
        0.8067245 = sum of:
          0.019166036 = weight(abstract_txt:subject in 4810) [ClassicSimilarity], result of:
            0.019166036 = score(doc=4810,freq=1.0), product of:
              0.06279091 = queryWeight, product of:
                1.0037473 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.016011309 = queryNorm
              0.30523583 = fieldWeight in 4810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.078125 = fieldNorm(doc=4810)
          0.11420577 = weight(abstract_txt:predicate in 4810) [ClassicSimilarity], result of:
            0.11420577 = score(doc=4810,freq=1.0), product of:
              0.16380379 = queryWeight, product of:
                1.1463653 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.016011309 = queryNorm
              0.6972108 = fieldWeight in 4810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.078125 = fieldNorm(doc=4810)
          0.034355707 = weight(abstract_txt:sources in 4810) [ClassicSimilarity], result of:
            0.034355707 = score(doc=4810,freq=1.0), product of:
              0.0926562 = queryWeight, product of:
                1.2193077 = boost
                4.7460723 = idf(docFreq=1043, maxDocs=44218)
                0.016011309 = queryNorm
              0.3707869 = fieldWeight in 4810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7460723 = idf(docFreq=1043, maxDocs=44218)
                0.078125 = fieldNorm(doc=4810)
          0.05967327 = weight(abstract_txt:data in 4810) [ClassicSimilarity], result of:
            0.05967327 = score(doc=4810,freq=1.0), product of:
              0.2289383 = queryWeight, product of:
                4.2856855 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016011309 = queryNorm
              0.26065218 = fieldWeight in 4810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=4810)
          0.5793237 = weight(abstract_txt:triples in 4810) [ClassicSimilarity], result of:
            0.5793237 = score(doc=4810,freq=2.0), product of:
              0.6092882 = queryWeight, product of:
                4.421834 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.016011309 = queryNorm
              0.95082045 = fieldWeight in 4810, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.078125 = fieldNorm(doc=4810)
        0.2 = coord(5/25)
    
  3. Bianchini, D.; Antonellis, V. De: Linked data services and semantics-enabled mashup (2012) 0.13
    0.13076185 = sum of:
      0.13076185 = product of:
        0.65380925 = sum of:
          0.1532607 = weight(abstract_txt:dbpedia in 435) [ClassicSimilarity], result of:
            0.1532607 = score(doc=435,freq=5.0), product of:
              0.1478304 = queryWeight, product of:
                1.0890378 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.016011309 = queryNorm
              1.0367333 = fieldWeight in 435, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.0546875 = fieldNorm(doc=435)
          0.024048995 = weight(abstract_txt:sources in 435) [ClassicSimilarity], result of:
            0.024048995 = score(doc=435,freq=1.0), product of:
              0.0926562 = queryWeight, product of:
                1.2193077 = boost
                4.7460723 = idf(docFreq=1043, maxDocs=44218)
                0.016011309 = queryNorm
              0.25955084 = fieldWeight in 435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7460723 = idf(docFreq=1043, maxDocs=44218)
                0.0546875 = fieldNorm(doc=435)
          0.045048993 = weight(abstract_txt:like in 435) [ClassicSimilarity], result of:
            0.045048993 = score(doc=435,freq=2.0), product of:
              0.11175233 = queryWeight, product of:
                1.3390733 = boost
                5.212252 = idf(docFreq=654, maxDocs=44218)
                0.016011309 = queryNorm
              0.40311456 = fieldWeight in 435, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.212252 = idf(docFreq=654, maxDocs=44218)
                0.0546875 = fieldNorm(doc=435)
          0.14469996 = weight(abstract_txt:data in 435) [ClassicSimilarity], result of:
            0.14469996 = score(doc=435,freq=12.0), product of:
              0.2289383 = queryWeight, product of:
                4.2856855 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016011309 = queryNorm
              0.6320479 = fieldWeight in 435, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0546875 = fieldNorm(doc=435)
          0.2867506 = weight(abstract_txt:triples in 435) [ClassicSimilarity], result of:
            0.2867506 = score(doc=435,freq=1.0), product of:
              0.6092882 = queryWeight, product of:
                4.421834 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.016011309 = queryNorm
              0.47063214 = fieldWeight in 435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0546875 = fieldNorm(doc=435)
        0.2 = coord(5/25)
    
  4. Bizer, C.; Lehmann, J.; Kobilarov, G.; Auer, S.; Becker, C.; Cyganiak, R.; Hellmann, S.: DBpedia: a crystallization point for the Web of Data. (2009) 0.10
    0.100830354 = sum of:
      0.100830354 = product of:
        0.50415176 = sum of:
          0.2349952 = weight(abstract_txt:dbpedia in 1643) [ClassicSimilarity], result of:
            0.2349952 = score(doc=1643,freq=9.0), product of:
              0.1478304 = queryWeight, product of:
                1.0890378 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.016011309 = queryNorm
              1.589627 = fieldWeight in 1643, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.0625 = fieldNorm(doc=1643)
          0.021578267 = weight(abstract_txt:project in 1643) [ClassicSimilarity], result of:
            0.021578267 = score(doc=1643,freq=1.0), product of:
              0.078854464 = queryWeight, product of:
                1.124836 = boost
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.016011309 = queryNorm
              0.27364674 = fieldWeight in 1643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.0625 = fieldNorm(doc=1643)
          0.04760466 = weight(abstract_txt:sources in 1643) [ClassicSimilarity], result of:
            0.04760466 = score(doc=1643,freq=3.0), product of:
              0.0926562 = queryWeight, product of:
                1.2193077 = boost
                4.7460723 = idf(docFreq=1043, maxDocs=44218)
                0.016011309 = queryNorm
              0.5137774 = fieldWeight in 1643, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7460723 = idf(docFreq=1043, maxDocs=44218)
                0.0625 = fieldNorm(doc=1643)
          0.06494844 = weight(abstract_txt:music in 1643) [ClassicSimilarity], result of:
            0.06494844 = score(doc=1643,freq=1.0), product of:
              0.16438457 = queryWeight, product of:
                1.6240768 = boost
                6.321609 = idf(docFreq=215, maxDocs=44218)
                0.016011309 = queryNorm
              0.39510056 = fieldWeight in 1643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.321609 = idf(docFreq=215, maxDocs=44218)
                0.0625 = fieldNorm(doc=1643)
          0.13502519 = weight(abstract_txt:data in 1643) [ClassicSimilarity], result of:
            0.13502519 = score(doc=1643,freq=8.0), product of:
              0.2289383 = queryWeight, product of:
                4.2856855 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016011309 = queryNorm
              0.58978856 = fieldWeight in 1643, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=1643)
        0.2 = coord(5/25)
    
  5. Sakr, S.; Wylot, M.; Mutharaju, R.; Le-Phuoc, D.; Fundulaki, I.: Linked data : storing, querying, and reasoning (2018) 0.10
    0.09540059 = sum of:
      0.09540059 = product of:
        0.47700292 = sum of:
          0.019877903 = weight(abstract_txt:large in 5329) [ClassicSimilarity], result of:
            0.019877903 = score(doc=5329,freq=1.0), product of:
              0.081606284 = queryWeight, product of:
                1.1442946 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.016011309 = queryNorm
              0.243583 = fieldWeight in 5329, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5329)
          0.025248818 = weight(abstract_txt:open in 5329) [ClassicSimilarity], result of:
            0.025248818 = score(doc=5329,freq=1.0), product of:
              0.09571292 = queryWeight, product of:
                1.2392569 = boost
                4.8237233 = idf(docFreq=965, maxDocs=44218)
                0.016011309 = queryNorm
              0.26379737 = fieldWeight in 5329, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8237233 = idf(docFreq=965, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5329)
          0.17124337 = weight(abstract_txt:chapter in 5329) [ClassicSimilarity], result of:
            0.17124337 = score(doc=5329,freq=9.0), product of:
              0.16486871 = queryWeight, product of:
                1.6264666 = boost
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.016011309 = queryNorm
              1.038665 = fieldWeight in 5329, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5329)
          0.1285404 = weight(abstract_txt:storing in 5329) [ClassicSimilarity], result of:
            0.1285404 = score(doc=5329,freq=2.0), product of:
              0.22481553 = queryWeight, product of:
                1.8992809 = boost
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.016011309 = queryNorm
              0.5717594 = fieldWeight in 5329, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5329)
          0.1320924 = weight(abstract_txt:data in 5329) [ClassicSimilarity], result of:
            0.1320924 = score(doc=5329,freq=10.0), product of:
              0.2289383 = queryWeight, product of:
                4.2856855 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016011309 = queryNorm
              0.57697815 = fieldWeight in 5329, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5329)
        0.2 = coord(5/25)
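
The content scores combine one more layer than the author scores: each matching term contributes queryWeight * fieldWeight, and the sum is scaled by the coordination factor (matched terms / query terms). The following is a minimal sketch under the same assumed formulas, recomputing the second entry (Baker, doc 4810) from the boosts, document frequencies, and term frequencies shown in its listing. All names are hypothetical; the values are copied from the listing rather than taken from the index.

```python
import math

QUERY_NORM = 0.016011309   # queryNorm as shown in the listing
MAX_DOCS = 44218           # maxDocs as shown in the listing

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed formula; reproduces e.g. idf(docFreq=21) = 8.6058445.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

def document_score(terms, field_norm: float, matched: int, total: int) -> float:
    # Sum of per-term scores, scaled by coord(matched/total).
    raw = sum(term_score(boost, df, freq, field_norm) for boost, df, freq in terms)
    return raw * (matched / total)

# (boost, docFreq, termFreq) for the five terms matched in doc 4810.
BAKER_TERMS = [
    (1.0037473, 2415, 1.0),  # subject
    (1.1463653, 15, 1.0),    # predicate
    (1.2193077, 1043, 1.0),  # sources
    (4.2856855, 4274, 1.0),  # data
    (4.421834, 21, 2.0),     # triples
]

baker_score = document_score(BAKER_TERMS, field_norm=0.078125, matched=5, total=25)

# Compare with 0.1613449 in the listing.
print(baker_score)
```

The rare term "triples" (docFreq=21) dominates the result, which is why the first two entries rank well above the others despite matching few query terms.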