Document (#40310)

Author
Busch, D.
Title
Organisation eines Thesaurus für die Unterstützung der mehrsprachigen Suche in einer bibliographischen Datenbank im Bereich Planen und Bauen
Source
o-bib: Das offene Bibliotheksjournal. 3(2016) no.4, pp.202-216
Year
2016
Abstract
Multilingual search has recently become increasingly important, because much useful specialist information is published worldwide in different languages. RSWBPlus is a bibliographic database of literature on planning and building that contains metadata records in German and English. Until recently it was difficult to find records whose language differed from the query language. For example, German queries returned only German records, although the database also contained potentially useful English ones. To solve this problem, existing approaches were reviewed and RSWBPlus was then extended to support multilingual (cross-language) search based on a bilingual, concept-based thesaurus. The thesaurus was generated automatically from existing thesauri. The entries of the source thesauri were converted to SKOS (Simple Knowledge Organization System) format, merged automatically, and finally loaded into a target thesaurus that is likewise maintained in SKOS. Apache Jena and MS SQL Server are used to access the target thesaurus. In multilingual search, query terms are expanded with the corresponding translations and synonyms in German and English. Term expansion can be performed either at run time or semi-automatically. The improved retrieval system can in particular help German-speaking users find relevant English-language records. Using SKOS increases the interoperability of the thesauri and simplifies both building the target thesaurus and accessing its entries.
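The term expansion described in the abstract can be illustrated with a minimal sketch. It is not the RSWBPlus implementation, which according to the abstract uses Apache Jena and MS SQL Server; here a plain in-memory structure stands in for the SKOS thesaurus. The `Concept` class, the function names, and the two sample concepts are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """One SKOS-style concept: a preferred label per language plus alternative labels."""
    pref: dict[str, str]
    alt: dict[str, list[str]] = field(default_factory=dict)

    def labels(self) -> list[str]:
        # Preferred labels first, then alternative labels (synonyms).
        out = list(self.pref.values())
        for alts in self.alt.values():
            out.extend(alts)
        return out


# Hypothetical sample data, not taken from the actual thesaurus.
THESAURUS = [
    Concept(pref={"de": "Brücke", "en": "bridge"}, alt={"de": ["Brückenbauwerk"]}),
    Concept(pref={"de": "Beton", "en": "concrete"}),
]


def build_index(concepts: list[Concept]) -> dict[str, list[Concept]]:
    """Map every label (case-insensitively) to the concepts that carry it."""
    index: dict[str, list[Concept]] = {}
    for concept in concepts:
        for label in concept.labels():
            index.setdefault(label.casefold(), []).append(concept)
    return index


def expand_query(terms: list[str], index: dict[str, list[Concept]]) -> list[list[str]]:
    """For each query term, return the term followed by its translations and synonyms."""
    expanded = []
    for term in terms:
        variants = [term]
        seen = {term.casefold()}
        for concept in index.get(term.casefold(), []):
            for label in concept.labels():
                if label.casefold() not in seen:
                    seen.add(label.casefold())
                    variants.append(label)
        expanded.append(variants)
    return expanded


INDEX = build_index(THESAURUS)
```

Each inner list can then be combined with OR in the search backend, so that a German query term also matches records indexed under its English equivalents. Terms without a thesaurus entry are passed through unchanged.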
Content
https://www.o-bib.de/article/view/2016H4S202-216. DOI: http://dx.doi.org/10.5282/o-bib/2016H4S202-216. Paper presented at the Leipziger Bibliothekskongress 2016.
Theme
Design and application of the thesaurus principle
Multilingual problems
Semantic interoperability
Field
Spatial planning
Architecture
Object
SKOS

Similar documents (author)

  1. Busch, R.: Neue Wege der Buchaufstellung in den USA (1956) 5.59
    5.5903964 = sum of:
      5.5903964 = weight(author_txt:busch in 557) [ClassicSimilarity], result of:
        5.5903964 = fieldWeight in 557, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.944634 = idf(docFreq=14, maxDocs=42306)
          0.625 = fieldNorm(doc=557)
    
  2. Busch, J.: Bibliographie zum Bibliotheks- und Büchereiwesen : aus dem Nachlaß bearbeitet von U. von Dietze (1966) 5.59
    5.5903964 = sum of:
      5.5903964 = weight(author_txt:busch in 1462) [ClassicSimilarity], result of:
        5.5903964 = fieldWeight in 1462, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.944634 = idf(docFreq=14, maxDocs=42306)
          0.625 = fieldNorm(doc=1462)
    
  3. Busch, C.: Bitte ein Bit? : Zur (Be-) Deutung der Informationstheorie (1992) 5.59
    5.5903964 = sum of:
      5.5903964 = weight(author_txt:busch in 2444) [ClassicSimilarity], result of:
        5.5903964 = fieldWeight in 2444, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.944634 = idf(docFreq=14, maxDocs=42306)
          0.625 = fieldNorm(doc=2444)
    
  4. Busch, J.: ¬A method for evaluating the multiple relations between subject descriptors : related terms in the Thesaurus for Engineering and Scientific Terms, a pilot study (1978) 5.59
    5.5903964 = sum of:
      5.5903964 = weight(author_txt:busch in 2948) [ClassicSimilarity], result of:
        5.5903964 = fieldWeight in 2948, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.944634 = idf(docFreq=14, maxDocs=42306)
          0.625 = fieldNorm(doc=2948)
    
  5. Busch, J.A.: Thinking ambiguously : organizing source materials for historical research (1994) 5.59
    5.5903964 = sum of:
      5.5903964 = weight(author_txt:busch in 3047) [ClassicSimilarity], result of:
        5.5903964 = fieldWeight in 3047, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.944634 = idf(docFreq=14, maxDocs=42306)
          0.625 = fieldNorm(doc=3047)
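
The author scores above all follow the same arithmetic. As a sketch, assuming Lucene's default ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), the value 5.5903964 can be reproduced from the listed inputs. The function names are illustrative; small deviations in the last digits are expected because Lucene computes in 32-bit floats.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as defined by Lucene's ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain output."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Inputs copied from the author entries: freq=1.0, docFreq=14, maxDocs=42306, fieldNorm=0.625
author_weight = field_weight(freq=1.0, doc_freq=14, max_docs=42306, field_norm=0.625)
print(round(author_weight, 2))
```

Because all five entries share the same term frequency, document frequency, and field norm, they receive identical scores.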
    

Similar documents (content)

  1. Mayr, P.; Zapilko, B.; Sure, Y.: ¬Ein Mehr-Thesauri-Szenario auf Basis von SKOS und Crosskonkordanzen (2010) 0.15
    0.15200599 = sum of:
      0.15200599 = product of:
        0.6333583 = sum of:
          0.024593275 = weight(abstract_txt:wurde in 393) [ClassicSimilarity], result of:
            0.024593275 = score(doc=393,freq=1.0), product of:
              0.06557603 = queryWeight, product of:
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.013660416 = queryNorm
              0.3750345 = fieldWeight in 393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.078125 = fieldNorm(doc=393)
          0.020750027 = weight(abstract_txt:werden in 393) [ClassicSimilarity], result of:
            0.020750027 = score(doc=393,freq=2.0), product of:
              0.053198624 = queryWeight, product of:
                1.1031213 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.013660416 = queryNorm
              0.39004818 = fieldWeight in 393, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.078125 = fieldNorm(doc=393)
          0.036987014 = weight(abstract_txt:bereich in 393) [ClassicSimilarity], result of:
            0.036987014 = score(doc=393,freq=1.0), product of:
              0.086079635 = queryWeight, product of:
                1.1457177 = boost
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.013660416 = queryNorm
              0.4296837 = fieldWeight in 393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.078125 = fieldNorm(doc=393)
          0.07559288 = weight(abstract_txt:thesauri in 393) [ClassicSimilarity], result of:
            0.07559288 = score(doc=393,freq=2.0), product of:
              0.12595302 = queryWeight, product of:
                1.6973732 = boost
                5.432094 = idf(docFreq=502, maxDocs=42306)
                0.013660416 = queryNorm
              0.6001673 = fieldWeight in 393, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.432094 = idf(docFreq=502, maxDocs=42306)
                0.078125 = fieldNorm(doc=393)
          0.31724128 = weight(abstract_txt:skos in 393) [ClassicSimilarity], result of:
            0.31724128 = score(doc=393,freq=6.0), product of:
              0.22721693 = queryWeight, product of:
                2.2797825 = boost
                7.295975 = idf(docFreq=77, maxDocs=42306)
                0.013660416 = queryNorm
              1.3962045 = fieldWeight in 393, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.295975 = idf(docFreq=77, maxDocs=42306)
                0.078125 = fieldNorm(doc=393)
          0.15819384 = weight(abstract_txt:thesaurus in 393) [ClassicSimilarity], result of:
            0.15819384 = score(doc=393,freq=3.0), product of:
              0.22680916 = queryWeight, product of:
                3.221205 = boost
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.013660416 = queryNorm
              0.69747555 = fieldWeight in 393, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.078125 = fieldNorm(doc=393)
        0.24 = coord(6/25)
    
  2. Otto, A.: Ordnungssysteme als Wissensbasis für die Suche in textbasierten Datenbeständen : dargestellt am Beispiel einer soziologischen Bibliographie (1998) 0.14
    0.13633727 = sum of:
      0.13633727 = product of:
        0.48691878 = sum of:
          0.0243461 = weight(abstract_txt:wurde in 626) [ClassicSimilarity], result of:
            0.0243461 = score(doc=626,freq=2.0), product of:
              0.06557603 = queryWeight, product of:
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.013660416 = queryNorm
              0.37126523 = fieldWeight in 626, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.0546875 = fieldNorm(doc=626)
          0.02296607 = weight(abstract_txt:werden in 626) [ClassicSimilarity], result of:
            0.02296607 = score(doc=626,freq=5.0), product of:
              0.053198624 = queryWeight, product of:
                1.1031213 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.013660416 = queryNorm
              0.43170422 = fieldWeight in 626, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.0546875 = fieldNorm(doc=626)
          0.040198285 = weight(abstract_txt:finden in 626) [ClassicSimilarity], result of:
            0.040198285 = score(doc=626,freq=2.0), product of:
              0.0916074 = queryWeight, product of:
                1.1819326 = boost
                5.6737986 = idf(docFreq=394, maxDocs=42306)
                0.013660416 = queryNorm
              0.43881047 = fieldWeight in 626, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6737986 = idf(docFreq=394, maxDocs=42306)
                0.0546875 = fieldNorm(doc=626)
          0.052915014 = weight(abstract_txt:thesauri in 626) [ClassicSimilarity], result of:
            0.052915014 = score(doc=626,freq=2.0), product of:
              0.12595302 = queryWeight, product of:
                1.6973732 = boost
                5.432094 = idf(docFreq=502, maxDocs=42306)
                0.013660416 = queryNorm
              0.42011708 = fieldWeight in 626, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.432094 = idf(docFreq=502, maxDocs=42306)
                0.0546875 = fieldNorm(doc=626)
          0.12077977 = weight(abstract_txt:datenbank in 626) [ClassicSimilarity], result of:
            0.12077977 = score(doc=626,freq=5.0), product of:
              0.16088124 = queryWeight, product of:
                1.9183408 = boost
                6.1392555 = idf(docFreq=247, maxDocs=42306)
                0.013660416 = queryNorm
              0.7507387 = fieldWeight in 626, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.1392555 = idf(docFreq=247, maxDocs=42306)
                0.0546875 = fieldNorm(doc=626)
          0.13529822 = weight(abstract_txt:suche in 626) [ClassicSimilarity], result of:
            0.13529822 = score(doc=626,freq=6.0), product of:
              0.17973107 = queryWeight, product of:
                2.3412836 = boost
                5.619598 = idf(docFreq=416, maxDocs=42306)
                0.013660416 = queryNorm
              0.7527815 = fieldWeight in 626, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.619598 = idf(docFreq=416, maxDocs=42306)
                0.0546875 = fieldNorm(doc=626)
          0.09041531 = weight(abstract_txt:thesaurus in 626) [ClassicSimilarity], result of:
            0.09041531 = score(doc=626,freq=2.0), product of:
              0.22680916 = queryWeight, product of:
                3.221205 = boost
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.013660416 = queryNorm
              0.39864045 = fieldWeight in 626, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.0546875 = fieldNorm(doc=626)
        0.28 = coord(7/25)
    
  3. WebGND 0.13
    0.13248906 = sum of:
      0.13248906 = product of:
        1.6561133 = sum of:
          0.30865344 = weight(abstract_txt:datenbank in 796) [ClassicSimilarity], result of:
            0.30865344 = score(doc=796,freq=1.0), product of:
              0.16088124 = queryWeight, product of:
                1.9183408 = boost
                6.1392555 = idf(docFreq=247, maxDocs=42306)
                0.013660416 = queryNorm
              1.9185174 = fieldWeight in 796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1392555 = idf(docFreq=247, maxDocs=42306)
                0.3125 = fieldNorm(doc=796)
          1.3474598 = weight(abstract_txt:einträge in 796) [ClassicSimilarity], result of:
            1.3474598 = score(doc=796,freq=1.0), product of:
              0.5414336 = queryWeight, product of:
                4.97692 = boost
                7.9638047 = idf(docFreq=39, maxDocs=42306)
                0.013660416 = queryNorm
              2.488689 = fieldWeight in 796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9638047 = idf(docFreq=39, maxDocs=42306)
                0.3125 = fieldNorm(doc=796)
        0.08 = coord(2/25)
    
  4. Nowak, L.: ¬Die INIS Collection Search : Einblicke und Fallbeispiele zu neuen Entwicklungen (2015) 0.13
    0.13048846 = sum of:
      0.13048846 = product of:
        0.40777642 = sum of:
          0.017215293 = weight(abstract_txt:wurde in 3838) [ClassicSimilarity], result of:
            0.017215293 = score(doc=3838,freq=1.0), product of:
              0.06557603 = queryWeight, product of:
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.013660416 = queryNorm
              0.26252416 = fieldWeight in 3838, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3838)
          0.01452502 = weight(abstract_txt:werden in 3838) [ClassicSimilarity], result of:
            0.01452502 = score(doc=3838,freq=2.0), product of:
              0.053198624 = queryWeight, product of:
                1.1031213 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.013660416 = queryNorm
              0.27303374 = fieldWeight in 3838, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3838)
          0.033806805 = weight(abstract_txt:welche in 3838) [ClassicSimilarity], result of:
            0.033806805 = score(doc=3838,freq=2.0), product of:
              0.081619695 = queryWeight, product of:
                1.1156422 = boost
                5.355575 = idf(docFreq=542, maxDocs=42306)
                0.013660416 = queryNorm
              0.4141991 = fieldWeight in 3838, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.355575 = idf(docFreq=542, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3838)
          0.044844374 = weight(abstract_txt:bereich in 3838) [ClassicSimilarity], result of:
            0.044844374 = score(doc=3838,freq=3.0), product of:
              0.086079635 = queryWeight, product of:
                1.1457177 = boost
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.013660416 = queryNorm
              0.5209638 = fieldWeight in 3838, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3838)
          0.07137666 = weight(abstract_txt:zugriff in 3838) [ClassicSimilarity], result of:
            0.07137666 = score(doc=3838,freq=3.0), product of:
              0.11734536 = queryWeight, product of:
                1.3377051 = boost
                6.4215755 = idf(docFreq=186, maxDocs=42306)
                0.013660416 = queryNorm
              0.6082614 = fieldWeight in 3838, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4215755 = idf(docFreq=186, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3838)
          0.059205152 = weight(abstract_txt:deutsch in 3838) [ClassicSimilarity], result of:
            0.059205152 = score(doc=3838,freq=1.0), product of:
              0.14940846 = queryWeight, product of:
                1.5094371 = boost
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.013660416 = queryNorm
              0.39626372 = fieldWeight in 3838, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3838)
          0.07638783 = weight(abstract_txt:datenbank in 3838) [ClassicSimilarity], result of:
            0.07638783 = score(doc=3838,freq=2.0), product of:
              0.16088124 = queryWeight, product of:
                1.9183408 = boost
                6.1392555 = idf(docFreq=247, maxDocs=42306)
                0.013660416 = queryNorm
              0.4748088 = fieldWeight in 3838, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1392555 = idf(docFreq=247, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3838)
          0.09041531 = weight(abstract_txt:thesaurus in 3838) [ClassicSimilarity], result of:
            0.09041531 = score(doc=3838,freq=2.0), product of:
              0.22680916 = queryWeight, product of:
                3.221205 = boost
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.013660416 = queryNorm
              0.39864045 = fieldWeight in 3838, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1544023 = idf(docFreq=663, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3838)
        0.32 = coord(8/25)
    
  5. Landkarten-Datenbank : Datenbank historisch wertvoller Landkartenbestände (1996) 0.13
    0.12583548 = sum of:
      0.12583548 = product of:
        0.7864717 = sum of:
          0.02951193 = weight(abstract_txt:wurde in 2991) [ClassicSimilarity], result of:
            0.02951193 = score(doc=2991,freq=1.0), product of:
              0.06557603 = queryWeight, product of:
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.013660416 = queryNorm
              0.4500414 = fieldWeight in 2991, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.09375 = fieldNorm(doc=2991)
          0.024900032 = weight(abstract_txt:werden in 2991) [ClassicSimilarity], result of:
            0.024900032 = score(doc=2991,freq=2.0), product of:
              0.053198624 = queryWeight, product of:
                1.1031213 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.013660416 = queryNorm
              0.4680578 = fieldWeight in 2991, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.09375 = fieldNorm(doc=2991)
          0.16038102 = weight(abstract_txt:datenbank in 2991) [ClassicSimilarity], result of:
            0.16038102 = score(doc=2991,freq=3.0), product of:
              0.16088124 = queryWeight, product of:
                1.9183408 = boost
                6.1392555 = idf(docFreq=247, maxDocs=42306)
                0.013660416 = queryNorm
              0.9968908 = fieldWeight in 2991, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1392555 = idf(docFreq=247, maxDocs=42306)
                0.09375 = fieldNorm(doc=2991)
          0.57167876 = weight(abstract_txt:einträge in 2991) [ClassicSimilarity], result of:
            0.57167876 = score(doc=2991,freq=2.0), product of:
              0.5414336 = queryWeight, product of:
                4.97692 = boost
                7.9638047 = idf(docFreq=39, maxDocs=42306)
                0.013660416 = queryNorm
              1.0558614 = fieldWeight in 2991, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.9638047 = idf(docFreq=39, maxDocs=42306)
                0.09375 = fieldNorm(doc=2991)
        0.16 = coord(4/25)
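
The content scores combine a query-side and a document-side factor per term and then scale the sum by the share of query terms matched. As a sketch under the same ClassicSimilarity assumptions, the following recomputes the score of entry 5 (doc=2991) from the values listed above. Terms without a listed boost are assumed to have boost 1.0; the function and variable names are illustrative.

```python
import math

QUERY_NORM = 0.013660416  # queryNorm as listed in the explain output


def term_score(boost: float, idf: float, freq: float, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for one matching term."""
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# (term, boost, idf, freq) copied from entry 5; boost 1.0 where none is listed
TERMS_2991 = [
    ("wurde", 1.0, 4.8004417, 1.0),
    ("werden", 1.1031213, 3.530313, 2.0),
    ("datenbank", 1.9183408, 6.1392555, 3.0),
    ("einträge", 4.97692, 7.9638047, 2.0),
]
FIELD_NORM_2991 = 0.09375
QUERY_TERM_COUNT = 25  # denominator of coord(4/25)


def document_score(terms, field_norm: float, query_term_count: int) -> float:
    """Sum the per-term scores and scale by coord = matched terms / query terms."""
    total = sum(term_score(boost, idf, freq, field_norm) for _, boost, idf, freq in terms)
    coord = len(terms) / query_term_count
    return total * coord


score_2991 = document_score(TERMS_2991, FIELD_NORM_2991, QUERY_TERM_COUNT)
print(round(score_2991, 4))
```

The coord factor explains why entry 3 (WebGND), despite very high per-term weights, ranks below entries that match more of the 25 query terms.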