Document (#38123)

Author
Wunner, T.
Buitelaar, P.
O'Riain, S.
Title
Semantic, terminological and linguistic interpretation of XBRL
Source
Reuse and Adaptation of Ontologies and Terminologies Workshop at 17th International Conference on Knowledge Engineering and Knowledge Management (EKAW), October 11-15, 2010, Lisbon: Proceedings
Imprint
? : Digital Enterprise Research Institute (Galway)
Year
2010
Pages
S.xx-xx
Abstract
Standardization efforts in fnancial reporting have led to large numbers of machine-interpretable vocabularies that attempt to model complex accounting practices in XBRL (eXtended Business Reporting Language). Because reporting agencies do not require fine-grained semantic and terminological representations, these vocabularies cannot be easily reused. Ontology-based Information Extraction, in particular, requires much greater semantic and terminological structure, and the introduction of a linguistic structure currently absent from XBRL. In order to facilitate such reuse, we propose a three-faceted methodology that analyzes and enriches the XBRL vocabulary: (1) transform semantic structure by analyzing the semantic relationships between terms (e.g. taxonomic, meronymic); (2) enhance terminological structure by using several domain-specific (XBRL), domain-related (SAPTerm, etc.) and domain-independent (GoogleDefine, Wikipedia, etc.) terminologies; and (3) add linguistic structure at term level (e.g. part-of-speech, morphology, syntactic arguments). This paper outlines a first experiment towards implementing this methodology on the International Financial Reporting Standard XBRL vocabulary.
Content
Vgl.: ceur-ws.org/Vol-925/paper_3.pdf.
Theme
Wissensrepräsentation
Object
XBRL

Similar documents (content)

  1. Rindflesch, T.C.; Fizsman, M.: The interaction of domain knowledge and linguistic structure in natural language processing : interpreting hypernymic propositions in biomedical text (2003) 0.13
    0.12660237 = sum of:
      0.12660237 = product of:
        0.52750987 = sum of:
          0.07368234 = weight(abstract_txt:taxonomic in 2097) [ClassicSimilarity], result of:
            0.07368234 = score(doc=2097,freq=1.0), product of:
              0.14721732 = queryWeight, product of:
                1.1829307 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0155408615 = queryNorm
              0.5005005 = fieldWeight in 2097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.027504515 = weight(abstract_txt:methodology in 2097) [ClassicSimilarity], result of:
            0.027504515 = score(doc=2097,freq=1.0), product of:
              0.09616033 = queryWeight, product of:
                1.3520503 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.0155408615 = queryNorm
              0.28602767 = fieldWeight in 2097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.06464707 = weight(abstract_txt:domain in 2097) [ClassicSimilarity], result of:
            0.06464707 = score(doc=2097,freq=2.0), product of:
              0.15444705 = queryWeight, product of:
                2.098603 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0155408615 = queryNorm
              0.41857108 = fieldWeight in 2097, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.1202991 = weight(abstract_txt:linguistic in 2097) [ClassicSimilarity], result of:
            0.1202991 = score(doc=2097,freq=2.0), product of:
              0.23366229 = queryWeight, product of:
                2.581278 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0155408615 = queryNorm
              0.51484174 = fieldWeight in 2097, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.08397237 = weight(abstract_txt:structure in 2097) [ClassicSimilarity], result of:
            0.08397237 = score(doc=2097,freq=2.0), product of:
              0.21799888 = queryWeight, product of:
                3.2187853 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0155408615 = queryNorm
              0.38519636 = fieldWeight in 2097, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.15740448 = weight(abstract_txt:semantic in 2097) [ClassicSimilarity], result of:
            0.15740448 = score(doc=2097,freq=6.0), product of:
              0.22979166 = queryWeight, product of:
                3.3046997 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0155408615 = queryNorm
              0.6849878 = fieldWeight in 2097, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
        0.24 = coord(6/25)
    
  2. Gil-Berrozpe, J.C.: Description, categorization, and representation of hyponymy in environmental terminology (2022) 0.12
    0.1181389 = sum of:
      0.1181389 = product of:
        0.73836815 = sum of:
          0.063798234 = weight(abstract_txt:linguistic in 1004) [ClassicSimilarity], result of:
            0.063798234 = score(doc=1004,freq=1.0), product of:
              0.23366229 = queryWeight, product of:
                2.581278 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0155408615 = queryNorm
              0.27303606 = fieldWeight in 1004, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.046875 = fieldNorm(doc=1004)
          0.06297928 = weight(abstract_txt:structure in 1004) [ClassicSimilarity], result of:
            0.06297928 = score(doc=1004,freq=2.0), product of:
              0.21799888 = queryWeight, product of:
                3.2187853 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0155408615 = queryNorm
              0.28889728 = fieldWeight in 1004, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.046875 = fieldNorm(doc=1004)
          0.068158135 = weight(abstract_txt:semantic in 1004) [ClassicSimilarity], result of:
            0.068158135 = score(doc=1004,freq=2.0), product of:
              0.22979166 = queryWeight, product of:
                3.3046997 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0155408615 = queryNorm
              0.2966084 = fieldWeight in 1004, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.046875 = fieldNorm(doc=1004)
          0.5434325 = weight(abstract_txt:terminological in 1004) [ClassicSimilarity], result of:
            0.5434325 = score(doc=1004,freq=12.0), product of:
              0.46852463 = queryWeight, product of:
                4.2206182 = boost
                7.14301 = idf(docFreq=94, maxDocs=44218)
                0.0155408615 = queryNorm
              1.1598803 = fieldWeight in 1004, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                7.14301 = idf(docFreq=94, maxDocs=44218)
                0.046875 = fieldNorm(doc=1004)
        0.16 = coord(4/25)
    
  3. Zarrad, R.; Doggaz, N.; Zagrouba, E.: Wikipedia HTML structure analysis for ontology construction (2018) 0.12
    0.117486715 = sum of:
      0.117486715 = product of:
        0.489528 = sum of:
          0.0450956 = weight(abstract_txt:arguments in 4302) [ClassicSimilarity], result of:
            0.0450956 = score(doc=4302,freq=1.0), product of:
              0.10612216 = queryWeight, product of:
                1.0043449 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0155408615 = queryNorm
              0.42494047 = fieldWeight in 4302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0625 = fieldNorm(doc=4302)
          0.16475873 = weight(abstract_txt:taxonomic in 4302) [ClassicSimilarity], result of:
            0.16475873 = score(doc=4302,freq=5.0), product of:
              0.14721732 = queryWeight, product of:
                1.1829307 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0155408615 = queryNorm
              1.1191531 = fieldWeight in 4302, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=4302)
          0.027504515 = weight(abstract_txt:methodology in 4302) [ClassicSimilarity], result of:
            0.027504515 = score(doc=4302,freq=1.0), product of:
              0.09616033 = queryWeight, product of:
                1.3520503 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.0155408615 = queryNorm
              0.28602767 = fieldWeight in 4302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.0625 = fieldNorm(doc=4302)
          0.085064314 = weight(abstract_txt:linguistic in 4302) [ClassicSimilarity], result of:
            0.085064314 = score(doc=4302,freq=1.0), product of:
              0.23366229 = queryWeight, product of:
                2.581278 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0155408615 = queryNorm
              0.3640481 = fieldWeight in 4302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=4302)
          0.10284473 = weight(abstract_txt:structure in 4302) [ClassicSimilarity], result of:
            0.10284473 = score(doc=4302,freq=3.0), product of:
              0.21799888 = queryWeight, product of:
                3.2187853 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0155408615 = queryNorm
              0.47176725 = fieldWeight in 4302, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=4302)
          0.06426011 = weight(abstract_txt:semantic in 4302) [ClassicSimilarity], result of:
            0.06426011 = score(doc=4302,freq=1.0), product of:
              0.22979166 = queryWeight, product of:
                3.3046997 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0155408615 = queryNorm
              0.2796451 = fieldWeight in 4302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=4302)
        0.24 = coord(6/25)
    
  4. Shen, M.; Liu, D.-R.; Huang, Y.-S.: Extracting semantic relations to enrich domain ontologies (2012) 0.11
    0.11321497 = sum of:
      0.11321497 = product of:
        0.56607485 = sum of:
          0.055641066 = weight(abstract_txt:reuse in 267) [ClassicSimilarity], result of:
            0.055641066 = score(doc=267,freq=1.0), product of:
              0.10520594 = queryWeight, product of:
                6.769634 = idf(docFreq=137, maxDocs=44218)
                0.0155408615 = queryNorm
              0.5288776 = fieldWeight in 267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.769634 = idf(docFreq=137, maxDocs=44218)
                0.078125 = fieldNorm(doc=267)
          0.13025321 = weight(abstract_txt:taxonomic in 267) [ClassicSimilarity], result of:
            0.13025321 = score(doc=267,freq=2.0), product of:
              0.14721732 = queryWeight, product of:
                1.1829307 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0155408615 = queryNorm
              0.88476825 = fieldWeight in 267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.078125 = fieldNorm(doc=267)
          0.10496595 = weight(abstract_txt:enriches in 267) [ClassicSimilarity], result of:
            0.10496595 = score(doc=267,freq=1.0), product of:
              0.16062343 = queryWeight, product of:
                1.2356182 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.0155408615 = queryNorm
              0.6534909 = fieldWeight in 267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.078125 = fieldNorm(doc=267)
          0.16161768 = weight(abstract_txt:domain in 267) [ClassicSimilarity], result of:
            0.16161768 = score(doc=267,freq=8.0), product of:
              0.15444705 = queryWeight, product of:
                2.098603 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0155408615 = queryNorm
              1.0464277 = fieldWeight in 267, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.078125 = fieldNorm(doc=267)
          0.113596894 = weight(abstract_txt:semantic in 267) [ClassicSimilarity], result of:
            0.113596894 = score(doc=267,freq=2.0), product of:
              0.22979166 = queryWeight, product of:
                3.3046997 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0155408615 = queryNorm
              0.49434733 = fieldWeight in 267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=267)
        0.2 = coord(5/25)
    
  5. Citkina, F.: Terminology organization and change (1996) 0.11
    0.10813145 = sum of:
      0.10813145 = product of:
        0.6758216 = sum of:
          0.10851565 = weight(abstract_txt:terminologies in 5192) [ClassicSimilarity], result of:
            0.10851565 = score(doc=5192,freq=1.0), product of:
              0.1454289 = queryWeight, product of:
                1.1757236 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.0155408615 = queryNorm
              0.74617666 = fieldWeight in 5192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.09375 = fieldNorm(doc=5192)
          0.12759647 = weight(abstract_txt:linguistic in 5192) [ClassicSimilarity], result of:
            0.12759647 = score(doc=5192,freq=1.0), product of:
              0.23366229 = queryWeight, product of:
                2.581278 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0155408615 = queryNorm
              0.5460721 = fieldWeight in 5192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.09375 = fieldNorm(doc=5192)
          0.12595856 = weight(abstract_txt:structure in 5192) [ClassicSimilarity], result of:
            0.12595856 = score(doc=5192,freq=2.0), product of:
              0.21799888 = queryWeight, product of:
                3.2187853 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0155408615 = queryNorm
              0.57779455 = fieldWeight in 5192, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.09375 = fieldNorm(doc=5192)
          0.31375092 = weight(abstract_txt:terminological in 5192) [ClassicSimilarity], result of:
            0.31375092 = score(doc=5192,freq=1.0), product of:
              0.46852463 = queryWeight, product of:
                4.2206182 = boost
                7.14301 = idf(docFreq=94, maxDocs=44218)
                0.0155408615 = queryNorm
              0.66965723 = fieldWeight in 5192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.14301 = idf(docFreq=94, maxDocs=44218)
                0.09375 = fieldNorm(doc=5192)
        0.16 = coord(4/25)