Document (#36706)

Author
Assem, M. van
Rijgersberg, H.
Wigham, M.
Top, J.
Title
Converting and annotating quantitative data tables
Source
The Semantic Web - ISWC 2010. 9th International Semantic Web Conference, ISWC 2010, Shanghai, China, November 7-11, 2010, Revised Selected Papers, Part I. Eds.: Peter F. Patel-Schneider et al
Imprint
Berlin : Springer
Year
2010
Pages
S.16-31
Series
Lecture notes in computer science; 6496
Abstract
Companies, governmental agencies and scientists produce a large amount of quantitative (research) data, consisting of measurements ranging from e.g. the surface temperatures of an ocean to the viscosity of a sample of mayonnaise. Such measurements are stored in tables in e.g. spreadsheet files and research reports. To integrate and reuse such data, it is necessary to have a semantic description of the data. However, the notation used is often ambiguous, making automatic interpretation and conversion to RDF or other suitable format diffiult. For example, the table header cell "f(Hz)" refers to frequency measured in Hertz, but the symbol "f" can also refer to the unit farad or the quantities force or luminous flux. Current annotation tools for this task either work on less ambiguous data or perform a more limited task. We introduce new disambiguation strategies based on an ontology, which allows to improve performance on "sloppy" datasets not yet targeted by existing systems.
Content
Vgl. unter: http://www.cs.vu.nl/~mark/papers/Assem10a.pdf.
Theme
Wissensrepräsentation
Object
OWL
RDF

Similar documents (author)

  1. Assem, M. van: Converting and integrating vocabularies for the Semantic Web (2010) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:assem in 4639) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 4639, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=4639)
    
  2. Hollink, L.; Assem, M. van: Estimating the relevance of search results in the Culture-Web : a study of semantic distance measures (2010) 4.03
    4.0302415 = sum of:
      4.0302415 = weight(author_txt:assem in 4649) [ClassicSimilarity], result of:
        4.0302415 = fieldWeight in 4649, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.4375 = fieldNorm(doc=4649)
    
  3. Assem, M. van; Gangemi, A.; Schreiber, G.: Conversion of WordNet to a standard RDF/OWL representation (2006) 3.45
    3.4544928 = sum of:
      3.4544928 = weight(author_txt:assem in 4641) [ClassicSimilarity], result of:
        3.4544928 = fieldWeight in 4641, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.375 = fieldNorm(doc=4641)
    
  4. Wielinga, B.; Wielemaker, J.; Schreiber, G.; Assem, M. van: Methods for porting resources to the Semantic Web (2004) 2.88
    2.8787441 = sum of:
      2.8787441 = weight(author_txt:assem in 4640) [ClassicSimilarity], result of:
        2.8787441 = fieldWeight in 4640, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.3125 = fieldNorm(doc=4640)
    
  5. Assem, M. van; Malaisé, V.; Miles, A.; Schreiber, G.: ¬A method to convert thesauri to SKOS (2006) 2.88
    2.8787441 = sum of:
      2.8787441 = weight(author_txt:assem in 4642) [ClassicSimilarity], result of:
        2.8787441 = fieldWeight in 4642, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.3125 = fieldNorm(doc=4642)
    

Similar documents (content)

  1. Whitlatch, J.B.: Reference services : research methodologies for assessment and accountability (1992) 0.07
    0.06931932 = sum of:
      0.06931932 = product of:
        0.57766104 = sum of:
          0.14697273 = weight(abstract_txt:quantitative in 4540) [ClassicSimilarity], result of:
            0.14697273 = score(doc=4540,freq=1.0), product of:
              0.19705442 = queryWeight, product of:
                1.7414412 = boost
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.018964298 = queryNorm
              0.7458484 = fieldWeight in 4540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.125 = fieldNorm(doc=4540)
          0.33984682 = weight(abstract_txt:measurements in 4540) [ClassicSimilarity], result of:
            0.33984682 = score(doc=4540,freq=1.0), product of:
              0.34457505 = queryWeight, product of:
                2.3028076 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.018964298 = queryNorm
              0.9862781 = fieldWeight in 4540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.125 = fieldNorm(doc=4540)
          0.09084144 = weight(abstract_txt:data in 4540) [ClassicSimilarity], result of:
            0.09084144 = score(doc=4540,freq=2.0), product of:
              0.15402375 = queryWeight, product of:
                2.434331 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018964298 = queryNorm
              0.58978856 = fieldWeight in 4540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.125 = fieldNorm(doc=4540)
        0.12 = coord(3/25)
    
  2. Grassi, M.; Morbidoni, C.; Nucci, M.; Fonda, S.; Ledda, G.: Pundit: semantically structured annotations for Web contents and digital libraries (2012) 0.06
    0.059593305 = sum of:
      0.059593305 = product of:
        0.37245816 = sum of:
          0.06957453 = weight(abstract_txt:ranging in 473) [ClassicSimilarity], result of:
            0.06957453 = score(doc=473,freq=1.0), product of:
              0.12995665 = queryWeight, product of:
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.018964298 = queryNorm
              0.5353672 = fieldWeight in 473, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.078125 = fieldNorm(doc=473)
          0.07492546 = weight(abstract_txt:annotation in 473) [ClassicSimilarity], result of:
            0.07492546 = score(doc=473,freq=1.0), product of:
              0.13653728 = queryWeight, product of:
                1.0250059 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.018964298 = queryNorm
              0.5487546 = fieldWeight in 473, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.078125 = fieldNorm(doc=473)
          0.14766495 = weight(abstract_txt:annotating in 473) [ClassicSimilarity], result of:
            0.14766495 = score(doc=473,freq=1.0), product of:
              0.21462648 = queryWeight, product of:
                1.2851162 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.018964298 = queryNorm
              0.688009 = fieldWeight in 473, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.078125 = fieldNorm(doc=473)
          0.08029325 = weight(abstract_txt:data in 473) [ClassicSimilarity], result of:
            0.08029325 = score(doc=473,freq=4.0), product of:
              0.15402375 = queryWeight, product of:
                2.434331 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018964298 = queryNorm
              0.52130437 = fieldWeight in 473, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=473)
        0.16 = coord(4/25)
    
  3. Billal, B.; Fonseca, A.; Sadat, F.; Lounis, H.: Semi-supervised learning and social media text analysis towards multi-labeling categorization (2017) 0.06
    0.058022864 = sum of:
      0.058022864 = product of:
        0.2901143 = sum of:
          0.05244782 = weight(abstract_txt:annotation in 4095) [ClassicSimilarity], result of:
            0.05244782 = score(doc=4095,freq=1.0), product of:
              0.13653728 = queryWeight, product of:
                1.0250059 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.018964298 = queryNorm
              0.38412818 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.059852593 = weight(abstract_txt:disambiguation in 4095) [ClassicSimilarity], result of:
            0.059852593 = score(doc=4095,freq=1.0), product of:
              0.14910366 = queryWeight, product of:
                1.0711367 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.018964298 = queryNorm
              0.401416 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.0731191 = weight(abstract_txt:quantities in 4095) [ClassicSimilarity], result of:
            0.0731191 = score(doc=4095,freq=1.0), product of:
              0.17039369 = queryWeight, product of:
                1.145058 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.018964298 = queryNorm
              0.42911857 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.035857696 = weight(abstract_txt:task in 4095) [ClassicSimilarity], result of:
            0.035857696 = score(doc=4095,freq=1.0), product of:
              0.1335051 = queryWeight, product of:
                1.4333911 = boost
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.018964298 = queryNorm
              0.2685867 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.06883713 = weight(abstract_txt:data in 4095) [ClassicSimilarity], result of:
            0.06883713 = score(doc=4095,freq=6.0), product of:
              0.15402375 = queryWeight, product of:
                2.434331 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018964298 = queryNorm
              0.4469254 = fieldWeight in 4095, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
        0.2 = coord(5/25)
    
  4. Stathopoulos, Y.; Baker, S.; Rei, M.; Teufel, S.: Variable typing : assigning meaning to variables in mathematical text (2018) 0.06
    0.055688847 = sum of:
      0.055688847 = product of:
        0.3480553 = sum of:
          0.08550371 = weight(abstract_txt:disambiguation in 4432) [ClassicSimilarity], result of:
            0.08550371 = score(doc=4432,freq=1.0), product of:
              0.14910366 = queryWeight, product of:
                1.0711367 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.018964298 = queryNorm
              0.57345146 = fieldWeight in 4432, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.078125 = fieldNorm(doc=4432)
          0.12057211 = weight(abstract_txt:symbol in 4432) [ClassicSimilarity], result of:
            0.12057211 = score(doc=4432,freq=1.0), product of:
              0.18749782 = queryWeight, product of:
                1.2011545 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.018964298 = queryNorm
              0.6430587 = fieldWeight in 4432, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.078125 = fieldNorm(doc=4432)
          0.072443485 = weight(abstract_txt:task in 4432) [ClassicSimilarity], result of:
            0.072443485 = score(doc=4432,freq=2.0), product of:
              0.1335051 = queryWeight, product of:
                1.4333911 = boost
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.018964298 = queryNorm
              0.5426271 = fieldWeight in 4432, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.078125 = fieldNorm(doc=4432)
          0.06953599 = weight(abstract_txt:data in 4432) [ClassicSimilarity], result of:
            0.06953599 = score(doc=4432,freq=3.0), product of:
              0.15402375 = queryWeight, product of:
                2.434331 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018964298 = queryNorm
              0.4514628 = fieldWeight in 4432, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=4432)
        0.16 = coord(4/25)
    
  5. Kozak, M.; Hartley, J.: Presenting numerical values within sentences and text tables (2012) 0.06
    0.055649184 = sum of:
      0.055649184 = product of:
        0.3478074 = sum of:
          0.09640529 = weight(abstract_txt:table in 4968) [ClassicSimilarity], result of:
            0.09640529 = score(doc=4968,freq=3.0), product of:
              0.12995665 = queryWeight, product of:
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.018964298 = queryNorm
              0.74182653 = fieldWeight in 4968, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.0625 = fieldNorm(doc=4968)
          0.073486365 = weight(abstract_txt:quantitative in 4968) [ClassicSimilarity], result of:
            0.073486365 = score(doc=4968,freq=1.0), product of:
              0.19705442 = queryWeight, product of:
                1.7414412 = boost
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.018964298 = queryNorm
              0.3729242 = fieldWeight in 4968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.0625 = fieldNorm(doc=4968)
          0.14579843 = weight(abstract_txt:tables in 4968) [ClassicSimilarity], result of:
            0.14579843 = score(doc=4968,freq=2.0), product of:
              0.2469488 = queryWeight, product of:
                1.9494818 = boost
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.018964298 = queryNorm
              0.59039944 = fieldWeight in 4968, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.0625 = fieldNorm(doc=4968)
          0.0321173 = weight(abstract_txt:data in 4968) [ClassicSimilarity], result of:
            0.0321173 = score(doc=4968,freq=1.0), product of:
              0.15402375 = queryWeight, product of:
                2.434331 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018964298 = queryNorm
              0.20852174 = fieldWeight in 4968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=4968)
        0.16 = coord(4/25)