Document (#40309)

Author
Busch, D.
Title
Organisation eines Thesaurus für die Unterstützung der mehrsprachigen Suche in einer bibliographischen Datenbank im Bereich Planen und Bauen
Source
o-bib: Das offene Bibliotheksjournal. 3(2016) Nr.4, S.202-216
Year
2016
Abstract
Das Problem der mehrsprachigen Suche gewinnt in der letzten Zeit immer mehr an Bedeutung, da viele nützliche Fachinformationen in der Welt in verschiedenen Sprachen publiziert werden. RSWBPlus ist eine bibliographische Datenbank zum Nachweis der Fachliteratur im Bereich Planen und Bauen, welche deutsch- und englischsprachige Metadaten-Einträge enthält. Bis vor Kurzem war es problematisch Einträge zu finden, deren Sprache sich von der Anfragesprache unterschied. Zum Beispiel fand man auf deutschsprachige Anfragen nur deutschsprachige Einträge, obwohl die Datenbank auch potenziell nützliche englischsprachige Einträge enthielt. Um das Problem zu lösen, wurde nach einer Untersuchung bestehender Ansätze, die RSWBPlus weiterentwickelt, um eine mehrsprachige (sprachübergreifende) Suche zu unterstützen, welche unter Einbeziehung eines zweisprachigen begriffbasierten Thesaurus erfolgt. Der Thesaurus wurde aus bereits bestehenden Thesauri automatisch gebildet. Die Einträge der Quell-Thesauri wurden in SKOS-Format (Simple Knowledge Organisation System) umgewandelt, automatisch miteinander vereinigt und schließlich in einen Ziel-Thesaurus eingespielt, der ebenfalls in SKOS geführt wird. Für den Zugriff zum Ziel-Thesaurus werden Apache Jena und MS SQL Server verwendet. Bei der mehrsprachigen Suche werden Terme der Anfrage durch entsprechende Übersetzungen und Synonyme in Deutsch und Englisch erweitert. Die Erweiterung der Suchterme kann sowohl in der Laufzeit, als auch halbautomatisch erfolgen. Das verbesserte Recherchesystem kann insbesondere deutschsprachigen Benutzern helfen, relevante englischsprachige Einträge zu finden. Die Verwendung vom SKOS erhöht die Interoperabilität der Thesauri, vereinfacht das Bilden des Ziel-Thesaurus und den Zugriff zu seinen Einträgen.
Content
https://www.o-bib.de/article/view/2016H4S202-216. DOI: http://dx.doi.org/10.5282/o-bib/2016H4S202-216. Vortrag, Leipziger Bibliothekskongresses 2016.
Theme
Konzeption und Anwendung des Prinzips Thesaurus
Multilinguale Probleme
Semantische Interoperabilität
Field
Raumplanung
Architektur
Object
SKOS

Similar documents (author)

  1. Busch, R.: Neue Wege der Buchaufstellung in den USA (1956) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:busch in 557) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 557, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=557)
    
  2. Busch, J.: Bibliographie zum Bibliotheks- und Büchereiwesen : aus dem Nachlaß bearbeitet von U. von Dietze (1966) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:busch in 1462) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 1462, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=1462)
    
  3. Busch, C.: Bitte ein Bit? : Zur (Be-) Deutung der Informationstheorie (1992) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:busch in 2444) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 2444, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=2444)
    
  4. Busch, J.: ¬A method for evaluating the multiple relations between subject descriptors : related terms in the Thesaurus for Engineering and Scientific Terms, a pilot study (1978) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:busch in 2948) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 2948, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=2948)
    
  5. Busch, J.A.: Thinking ambiguously : organizing source materials for historical research (1994) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:busch in 2978) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 2978, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=2978)
    

Similar documents (content)

  1. Busch, D.: Domänenspezifische hybride automatische Indexierung von bibliographischen Metadaten (2019) 0.14
    0.1356192 = sum of:
      0.1356192 = product of:
        0.67809594 = sum of:
          0.024709124 = weight(abstract_txt:werden in 5628) [ClassicSimilarity], result of:
            0.024709124 = score(doc=5628,freq=3.0), product of:
              0.052079055 = queryWeight, product of:
                1.0937172 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013580459 = queryNorm
              0.47445413 = fieldWeight in 5628, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
          0.051078163 = weight(abstract_txt:bereich in 5628) [ClassicSimilarity], result of:
            0.051078163 = score(doc=5628,freq=2.0), product of:
              0.084511355 = queryWeight, product of:
                1.1375891 = boost
                5.4703507 = idf(docFreq=505, maxDocs=44218)
                0.013580459 = queryNorm
              0.6043941 = fieldWeight in 5628, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4703507 = idf(docFreq=505, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
          0.12912883 = weight(abstract_txt:bauen in 5628) [ClassicSimilarity], result of:
            0.12912883 = score(doc=5628,freq=1.0), product of:
              0.19759852 = queryWeight, product of:
                1.7394812 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.013580459 = queryNorm
              0.6534909 = fieldWeight in 5628, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
          0.13640022 = weight(abstract_txt:planen in 5628) [ClassicSimilarity], result of:
            0.13640022 = score(doc=5628,freq=1.0), product of:
              0.20494859 = queryWeight, product of:
                1.7715375 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.013580459 = queryNorm
              0.66553384 = fieldWeight in 5628, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
          0.33677962 = weight(abstract_txt:einträge in 5628) [ClassicSimilarity], result of:
            0.33677962 = score(doc=5628,freq=1.0), product of:
              0.53997356 = queryWeight, product of:
                4.9805207 = boost
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.013580459 = queryNorm
              0.6236965 = fieldWeight in 5628, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
        0.2 = coord(5/25)
    
  2. WebGND 0.13
    0.13256639 = sum of:
      0.13256639 = product of:
        1.6570799 = sum of:
          0.3099615 = weight(abstract_txt:datenbank in 3877) [ClassicSimilarity], result of:
            0.3099615 = score(doc=3877,freq=1.0), product of:
              0.16092758 = queryWeight, product of:
                1.9225992 = boost
                6.163498 = idf(docFreq=252, maxDocs=44218)
                0.013580459 = queryNorm
              1.9260931 = fieldWeight in 3877, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.163498 = idf(docFreq=252, maxDocs=44218)
                0.3125 = fieldNorm(doc=3877)
          1.3471185 = weight(abstract_txt:einträge in 3877) [ClassicSimilarity], result of:
            1.3471185 = score(doc=3877,freq=1.0), product of:
              0.53997356 = queryWeight, product of:
                4.9805207 = boost
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.013580459 = queryNorm
              2.494786 = fieldWeight in 3877, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.3125 = fieldNorm(doc=3877)
        0.08 = coord(2/25)
    
  3. Mayr, P.; Zapilko, B.; Sure, Y.: ¬Ein Mehr-Thesauri-Szenario auf Basis von SKOS und Crosskonkordanzen (2010) 0.12
    0.11946537 = sum of:
      0.11946537 = product of:
        0.5973269 = sum of:
          0.020174915 = weight(abstract_txt:werden in 3392) [ClassicSimilarity], result of:
            0.020174915 = score(doc=3392,freq=2.0), product of:
              0.052079055 = queryWeight, product of:
                1.0937172 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013580459 = queryNorm
              0.3873902 = fieldWeight in 3392, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=3392)
          0.036117714 = weight(abstract_txt:bereich in 3392) [ClassicSimilarity], result of:
            0.036117714 = score(doc=3392,freq=1.0), product of:
              0.084511355 = queryWeight, product of:
                1.1375891 = boost
                5.4703507 = idf(docFreq=505, maxDocs=44218)
                0.013580459 = queryNorm
              0.42737114 = fieldWeight in 3392, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4703507 = idf(docFreq=505, maxDocs=44218)
                0.078125 = fieldNorm(doc=3392)
          0.07499994 = weight(abstract_txt:thesauri in 3392) [ClassicSimilarity], result of:
            0.07499994 = score(doc=3392,freq=2.0), product of:
              0.124976754 = queryWeight, product of:
                1.6942916 = boost
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.013580459 = queryNorm
              0.6001111 = fieldWeight in 3392, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.078125 = fieldNorm(doc=3392)
          0.3079742 = weight(abstract_txt:skos in 3392) [ClassicSimilarity], result of:
            0.3079742 = score(doc=3392,freq=6.0), product of:
              0.22220701 = queryWeight, product of:
                2.2591882 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.013580459 = queryNorm
              1.3859787 = fieldWeight in 3392, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.078125 = fieldNorm(doc=3392)
          0.15806009 = weight(abstract_txt:thesaurus in 3392) [ClassicSimilarity], result of:
            0.15806009 = score(doc=3392,freq=3.0), product of:
              0.22610822 = queryWeight, product of:
                3.222899 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.013580459 = queryNorm
              0.6990462 = fieldWeight in 3392, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.078125 = fieldNorm(doc=3392)
        0.2 = coord(5/25)
    
  4. Otto, A.: Ordnungssysteme als Wissensbasis für die Suche in textbasierten Datenbeständen : dargestellt am Beispiel einer soziologischen Bibliographie (1998) 0.11
    0.110218436 = sum of:
      0.110218436 = product of:
        0.4592435 = sum of:
          0.022329539 = weight(abstract_txt:werden in 6625) [ClassicSimilarity], result of:
            0.022329539 = score(doc=6625,freq=5.0), product of:
              0.052079055 = queryWeight, product of:
                1.0937172 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013580459 = queryNorm
              0.42876238 = fieldWeight in 6625, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6625)
          0.039187625 = weight(abstract_txt:finden in 6625) [ClassicSimilarity], result of:
            0.039187625 = score(doc=6625,freq=2.0), product of:
              0.08983774 = queryWeight, product of:
                1.1728901 = boost
                5.6401033 = idf(docFreq=426, maxDocs=44218)
                0.013580459 = queryNorm
              0.4362045 = fieldWeight in 6625, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6401033 = idf(docFreq=426, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6625)
          0.052499957 = weight(abstract_txt:thesauri in 6625) [ClassicSimilarity], result of:
            0.052499957 = score(doc=6625,freq=2.0), product of:
              0.124976754 = queryWeight, product of:
                1.6942916 = boost
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.013580459 = queryNorm
              0.42007777 = fieldWeight in 6625, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.431586 = idf(docFreq=525, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6625)
          0.12129163 = weight(abstract_txt:datenbank in 6625) [ClassicSimilarity], result of:
            0.12129163 = score(doc=6625,freq=5.0), product of:
              0.16092758 = queryWeight, product of:
                1.9225992 = boost
                6.163498 = idf(docFreq=252, maxDocs=44218)
                0.013580459 = queryNorm
              0.7537032 = fieldWeight in 6625, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.163498 = idf(docFreq=252, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6625)
          0.1335959 = weight(abstract_txt:suche in 6625) [ClassicSimilarity], result of:
            0.1335959 = score(doc=6625,freq=6.0), product of:
              0.17776974 = queryWeight, product of:
                2.3333066 = boost
                5.6101127 = idf(docFreq=439, maxDocs=44218)
                0.013580459 = queryNorm
              0.7515109 = fieldWeight in 6625, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.6101127 = idf(docFreq=439, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6625)
          0.09033886 = weight(abstract_txt:thesaurus in 6625) [ClassicSimilarity], result of:
            0.09033886 = score(doc=6625,freq=2.0), product of:
              0.22610822 = queryWeight, product of:
                3.222899 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.013580459 = queryNorm
              0.39953816 = fieldWeight in 6625, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0546875 = fieldNorm(doc=6625)
        0.24 = coord(6/25)
    
  5. Nowak, L.: ¬Die INIS Collection Search : Einblicke und Fallbeispiele zu neuen Entwicklungen (2015) 0.11
    0.108597554 = sum of:
      0.108597554 = product of:
        0.3878484 = sum of:
          0.01412244 = weight(abstract_txt:werden in 1837) [ClassicSimilarity], result of:
            0.01412244 = score(doc=1837,freq=2.0), product of:
              0.052079055 = queryWeight, product of:
                1.0937172 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013580459 = queryNorm
              0.27117312 = fieldWeight in 1837, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1837)
          0.032889854 = weight(abstract_txt:welche in 1837) [ClassicSimilarity], result of:
            0.032889854 = score(doc=1837,freq=2.0), product of:
              0.07993448 = queryWeight, product of:
                1.1063561 = boost
                5.3201604 = idf(docFreq=587, maxDocs=44218)
                0.013580459 = queryNorm
              0.41146016 = fieldWeight in 1837, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3201604 = idf(docFreq=587, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1837)
          0.043790404 = weight(abstract_txt:bereich in 1837) [ClassicSimilarity], result of:
            0.043790404 = score(doc=1837,freq=3.0), product of:
              0.084511355 = queryWeight, product of:
                1.1375891 = boost
                5.4703507 = idf(docFreq=505, maxDocs=44218)
                0.013580459 = queryNorm
              0.51816 = fieldWeight in 1837, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4703507 = idf(docFreq=505, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1837)
          0.07160183 = weight(abstract_txt:zugriff in 1837) [ClassicSimilarity], result of:
            0.07160183 = score(doc=1837,freq=3.0), product of:
              0.117294736 = queryWeight, product of:
                1.3401924 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.013580459 = queryNorm
              0.61044365 = fieldWeight in 1837, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1837)
          0.05839344 = weight(abstract_txt:deutsch in 1837) [ClassicSimilarity], result of:
            0.05839344 = score(doc=1837,freq=1.0), product of:
              0.14766546 = queryWeight, product of:
                1.5037212 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.013580459 = queryNorm
              0.39544415 = fieldWeight in 1837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1837)
          0.07671156 = weight(abstract_txt:datenbank in 1837) [ClassicSimilarity], result of:
            0.07671156 = score(doc=1837,freq=2.0), product of:
              0.16092758 = queryWeight, product of:
                1.9225992 = boost
                6.163498 = idf(docFreq=252, maxDocs=44218)
                0.013580459 = queryNorm
              0.4766837 = fieldWeight in 1837, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.163498 = idf(docFreq=252, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1837)
          0.09033886 = weight(abstract_txt:thesaurus in 1837) [ClassicSimilarity], result of:
            0.09033886 = score(doc=1837,freq=2.0), product of:
              0.22610822 = queryWeight, product of:
                3.222899 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.013580459 = queryNorm
              0.39953816 = fieldWeight in 1837, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1837)
        0.28 = coord(7/25)