Document (#34875)

Author
Strobel, S.
Title
Englischsprachige Erweiterung des TIB / AV-Portals : Ein GND/DBpedia-Mapping zur Gewinnung eines englischen Begriffssystems
Source
o-bib: Das offene Bibliotheksjournal. 1(2014) Nr.1, S.197-204
Year
2014
Abstract
The videos in the TIB / AV-Portal are indexed automatically with a total of 63,356 GND subject terms from science and technology. In addition to German-language videos, the TIB / AV-Portal holds numerous English-language videos. For the subject terms used in the TIB / AV-Portal knowledge base, the GND contains very few English labels. An English indexing vocabulary for automatically indexing the English-language videos is therefore missing. The solution is as follows: the English labels are to be obtained by mapping the GND subject terms to other datasets that contain an English translation of the terms. The mapping strategies draw on DBpedia, LCSH, MACS results, and the WTI thesaurus. In the end, 35,025 GND subject terms were assigned at least one English label. These English labels can be used directly for automatic indexing of the English-language videos. A further 11,694 GND subject terms could not be "translated" into English but could at least be associated with a broader term that has an English translation. This association serves to expand the search results.
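The two-stage outcome described in the abstract (a direct English label where a mapping exists, otherwise an association with a broader term that has one) can be illustrated with a minimal sketch. It is not the author's implementation: the identifiers, labels, dictionaries, and the function name resolve_english_labels are all hypothetical, and the real mapping against DBpedia, LCSH, MACS, and the WTI thesaurus is reduced here to a pre-filled lookup table.

```python
from typing import Dict, List, Tuple

# Hypothetical toy data. In practice the English labels would come from
# mappings to external datasets; here they are simply hard-coded.
ENGLISH_LABELS: Dict[str, List[str]] = {
    "gnd:A": ["Thermodynamics"],
    "gnd:B": ["Physics"],
}

# Hypothetical broader-term relations between subject terms.
BROADER: Dict[str, List[str]] = {
    "gnd:C": ["gnd:B"],  # gnd:C has no English label, but its broader term does
    "gnd:D": ["gnd:E"],  # neither gnd:D nor its broader term has one
}


def resolve_english_labels(term_id: str) -> Tuple[str, List[str]]:
    """Return (kind, labels), where kind is 'direct', 'broader' or 'none'."""
    direct = ENGLISH_LABELS.get(term_id, [])
    if direct:
        # Stage 1: the term itself has at least one English label.
        return ("direct", direct)

    # Stage 2: fall back to English labels of broader terms.
    inherited: List[str] = []
    for broader_id in BROADER.get(term_id, []):
        inherited.extend(ENGLISH_LABELS.get(broader_id, []))
    if inherited:
        return ("broader", inherited)

    return ("none", [])


for term in ("gnd:A", "gnd:C", "gnd:D"):
    print(term, resolve_english_labels(term))
```

In this sketch a "direct" result would feed automatic indexing, while a "broader" result would only be used to widen search results, mirroring the distinction the abstract draws.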
Content
Article based on an expanded version of a talk given at the 103rd Deutscher Bibliothekartag in Bremen. See: https://www.o-bib.de/article/view/2014H1S197-204.
Theme
Metadata
Automatic indexing
Multilingual problems
Form
AV materials
Object
GND
DBpedia
Location
D
Hannover

Similar documents (author)

  1. Strobel, S.: ¬The complete Linux kit : fully configured LINUX system kernel (1997) 6.08
    6.084933 = sum of:
      6.084933 = weight(author_txt:strobel in 956) [ClassicSimilarity], result of:
        6.084933 = fieldWeight in 956, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.735892 = idf(docFreq=6, maxDocs=43556)
          0.625 = fieldNorm(doc=956)
    
  2. Strobel, G.: Konzeption und Realisierung eines WWW-Servers für den Studiengang Dokumentation der HBI Stuttgart (1995) 6.08
    6.084933 = sum of:
      6.084933 = weight(author_txt:strobel in 6030) [ClassicSimilarity], result of:
        6.084933 = fieldWeight in 6030, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.735892 = idf(docFreq=6, maxDocs=43556)
          0.625 = fieldNorm(doc=6030)
    
  3. Strobel, S.: Firewalls : Einführung - Praxis - Produkte (1999) 6.08
    6.084933 = sum of:
      6.084933 = weight(author_txt:strobel in 2529) [ClassicSimilarity], result of:
        6.084933 = fieldWeight in 2529, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.735892 = idf(docFreq=6, maxDocs=43556)
          0.625 = fieldNorm(doc=2529)
    
  4. Strobel, S.; Uhl, T.: LINUX - vom PC zur Workstation : Grundlagen, Installation und praktischer Einsatz (1994) 4.87
    4.867946 = sum of:
      4.867946 = weight(author_txt:strobel in 3559) [ClassicSimilarity], result of:
        4.867946 = fieldWeight in 3559, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.735892 = idf(docFreq=6, maxDocs=43556)
          0.5 = fieldNorm(doc=3559)
    
  5. Strobel, S.; Marín-Arraiza, P.: Metadata for scientific audiovisual media : current practices and perspectives of the TIB / AV-portal (2015) 4.26
    4.259453 = sum of:
      4.259453 = weight(author_txt:strobel in 665) [ClassicSimilarity], result of:
        4.259453 = fieldWeight in 665, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.735892 = idf(docFreq=6, maxDocs=43556)
          0.4375 = fieldNorm(doc=665)
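
The author-match scores above can be reproduced by hand. The sketch below assumes the tf and idf formulas of Lucene's classic TF-IDF similarity (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), which agree with the figures shown; the function names are illustrative and are not Lucene API calls.

```python
import math


def classic_tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency as reported in the explanation trees.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the first and fifth author matches above.
print(classic_idf(6, 43556))
print(field_weight(1.0, 6, 43556, 0.625))
print(field_weight(1.0, 6, 43556, 0.4375))
```

Because tf and idf are identical for all five hits, the ranking among them is decided entirely by fieldNorm, which is smaller for records with longer author fields.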
    

Similar documents (content)

  1. Carevic, Z.: Semi-automatische Verschlagwortung zur Integration externer semantischer Inhalte innerhalb einer medizinischen Kooperationsplattform (2012) 0.13
    0.12941773 = sum of:
      0.12941773 = product of:
        0.46220618 = sum of:
          0.063609615 = weight(abstract_txt:wissensbasis in 2895) [ClassicSimilarity], result of:
            0.063609615 = score(doc=2895,freq=2.0), product of:
              0.11338246 = queryWeight, product of:
                1.022018 = boost
                8.462927 = idf(docFreq=24, maxDocs=43556)
                0.013108915 = queryNorm
              0.5610181 = fieldWeight in 2895, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.462927 = idf(docFreq=24, maxDocs=43556)
                0.046875 = fieldNorm(doc=2895)
          0.12073306 = weight(abstract_txt:verschlagwortung in 2895) [ClassicSimilarity], result of:
            0.12073306 = score(doc=2895,freq=7.0), product of:
              0.11447891 = queryWeight, product of:
                1.0269477 = boost
                8.503749 = idf(docFreq=23, maxDocs=43556)
                0.013108915 = queryNorm
              1.0546315 = fieldWeight in 2895, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.503749 = idf(docFreq=23, maxDocs=43556)
                0.046875 = fieldNorm(doc=2895)
          0.012184165 = weight(abstract_txt:diese in 2895) [ClassicSimilarity], result of:
            0.012184165 = score(doc=2895,freq=1.0), product of:
              0.05980643 = queryWeight, product of:
                1.0497226 = boost
                4.346169 = idf(docFreq=1533, maxDocs=43556)
                0.013108915 = queryNorm
              0.20372668 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.346169 = idf(docFreq=1533, maxDocs=43556)
                0.046875 = fieldNorm(doc=2895)
          0.018611122 = weight(abstract_txt:können in 2895) [ClassicSimilarity], result of:
            0.018611122 = score(doc=2895,freq=2.0), product of:
              0.062958695 = queryWeight, product of:
                1.0770316 = boost
                4.4592366 = idf(docFreq=1369, maxDocs=43556)
                0.013108915 = queryNorm
              0.29560843 = fieldWeight in 2895, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4592366 = idf(docFreq=1369, maxDocs=43556)
                0.046875 = fieldNorm(doc=2895)
          0.14357226 = weight(abstract_txt:begriffssystems in 2895) [ClassicSimilarity], result of:
            0.14357226 = score(doc=2895,freq=4.0), product of:
              0.15484639 = queryWeight, product of:
                1.1943624 = boost
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.013108915 = queryNorm
              0.92719156 = fieldWeight in 2895, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.046875 = fieldNorm(doc=2895)
          0.058029354 = weight(abstract_txt:verwendeten in 2895) [ClassicSimilarity], result of:
            0.058029354 = score(doc=2895,freq=1.0), product of:
              0.16929697 = queryWeight, product of:
                1.7661402 = boost
                7.312355 = idf(docFreq=78, maxDocs=43556)
                0.013108915 = queryNorm
              0.34276664 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.312355 = idf(docFreq=78, maxDocs=43556)
                0.046875 = fieldNorm(doc=2895)
          0.045466594 = weight(abstract_txt:werden in 2895) [ClassicSimilarity], result of:
            0.045466594 = score(doc=2895,freq=8.0), product of:
              0.097640276 = queryWeight, product of:
                2.1207287 = boost
                3.5121832 = idf(docFreq=3531, maxDocs=43556)
                0.013108915 = queryNorm
              0.4656541 = fieldWeight in 2895, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.5121832 = idf(docFreq=3531, maxDocs=43556)
                0.046875 = fieldNorm(doc=2895)
        0.28 = coord(7/25)
    
  2. Beall, J.: Approaches to expansions : case studies from the German and Vietnamese translations (2003) 0.12
    0.11737249 = sum of:
      0.11737249 = product of:
        0.58686244 = sum of:
          0.017546732 = weight(abstract_txt:können in 2746) [ClassicSimilarity], result of:
            0.017546732 = score(doc=2746,freq=1.0), product of:
              0.062958695 = queryWeight, product of:
                1.0770316 = boost
                4.4592366 = idf(docFreq=1369, maxDocs=43556)
                0.013108915 = queryNorm
              0.2787023 = fieldWeight in 2746, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4592366 = idf(docFreq=1369, maxDocs=43556)
                0.0625 = fieldNorm(doc=2746)
          0.10942119 = weight(abstract_txt:übersetzung in 2746) [ClassicSimilarity], result of:
            0.10942119 = score(doc=2746,freq=2.0), product of:
              0.16929697 = queryWeight, product of:
                1.7661402 = boost
                7.312355 = idf(docFreq=78, maxDocs=43556)
                0.013108915 = queryNorm
              0.64632696 = fieldWeight in 2746, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.312355 = idf(docFreq=78, maxDocs=43556)
                0.0625 = fieldNorm(doc=2746)
          0.030311063 = weight(abstract_txt:werden in 2746) [ClassicSimilarity], result of:
            0.030311063 = score(doc=2746,freq=2.0), product of:
              0.097640276 = queryWeight, product of:
                2.1207287 = boost
                3.5121832 = idf(docFreq=3531, maxDocs=43556)
                0.013108915 = queryNorm
              0.31043607 = fieldWeight in 2746, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5121832 = idf(docFreq=3531, maxDocs=43556)
                0.0625 = fieldNorm(doc=2746)
          0.2020978 = weight(abstract_txt:englischen in 2746) [ClassicSimilarity], result of:
            0.2020978 = score(doc=2746,freq=1.0), product of:
              0.40455347 = queryWeight, product of:
                3.8610332 = boost
                7.9929233 = idf(docFreq=39, maxDocs=43556)
                0.013108915 = queryNorm
              0.4995577 = fieldWeight in 2746, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9929233 = idf(docFreq=39, maxDocs=43556)
                0.0625 = fieldNorm(doc=2746)
          0.22748569 = weight(abstract_txt:englische in 2746) [ClassicSimilarity], result of:
            0.22748569 = score(doc=2746,freq=1.0), product of:
              0.43776152 = queryWeight, product of:
                4.016376 = boost
                8.314507 = idf(docFreq=28, maxDocs=43556)
                0.013108915 = queryNorm
              0.51965666 = fieldWeight in 2746, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.314507 = idf(docFreq=28, maxDocs=43556)
                0.0625 = fieldNorm(doc=2746)
        0.2 = coord(5/25)
    
  3. Online-Enzyklopädie Wikipedia (2003) 0.10
    0.10499801 = sum of:
      0.10499801 = product of:
        0.43749171 = sum of:
          0.017586326 = weight(abstract_txt:diese in 2408) [ClassicSimilarity], result of:
            0.017586326 = score(doc=2408,freq=3.0), product of:
              0.05980643 = queryWeight, product of:
                1.0497226 = boost
                4.346169 = idf(docFreq=1533, maxDocs=43556)
                0.013108915 = queryNorm
              0.29405412 = fieldWeight in 2408, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.346169 = idf(docFreq=1533, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2408)
          0.015509267 = weight(abstract_txt:können in 2408) [ClassicSimilarity], result of:
            0.015509267 = score(doc=2408,freq=2.0), product of:
              0.062958695 = queryWeight, product of:
                1.0770316 = boost
                4.4592366 = idf(docFreq=1369, maxDocs=43556)
                0.013108915 = queryNorm
              0.24634035 = fieldWeight in 2408, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4592366 = idf(docFreq=1369, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2408)
          0.11270436 = weight(abstract_txt:englischsprachigen in 2408) [ClassicSimilarity], result of:
            0.11270436 = score(doc=2408,freq=2.0), product of:
              0.23620477 = queryWeight, product of:
                2.086147 = boost
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.013108915 = queryNorm
              0.47714683 = fieldWeight in 2408, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2408)
          0.023202073 = weight(abstract_txt:werden in 2408) [ClassicSimilarity], result of:
            0.023202073 = score(doc=2408,freq=3.0), product of:
              0.097640276 = queryWeight, product of:
                2.1207287 = boost
                3.5121832 = idf(docFreq=3531, maxDocs=43556)
                0.013108915 = queryNorm
              0.2376281 = fieldWeight in 2408, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5121832 = idf(docFreq=3531, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2408)
          0.12631112 = weight(abstract_txt:englischen in 2408) [ClassicSimilarity], result of:
            0.12631112 = score(doc=2408,freq=1.0), product of:
              0.40455347 = queryWeight, product of:
                3.8610332 = boost
                7.9929233 = idf(docFreq=39, maxDocs=43556)
                0.013108915 = queryNorm
              0.31222355 = fieldWeight in 2408, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9929233 = idf(docFreq=39, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2408)
          0.14217855 = weight(abstract_txt:englische in 2408) [ClassicSimilarity], result of:
            0.14217855 = score(doc=2408,freq=1.0), product of:
              0.43776152 = queryWeight, product of:
                4.016376 = boost
                8.314507 = idf(docFreq=28, maxDocs=43556)
                0.013108915 = queryNorm
              0.3247854 = fieldWeight in 2408, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.314507 = idf(docFreq=28, maxDocs=43556)
                0.0390625 = fieldNorm(doc=2408)
        0.24 = coord(6/25)
    
  4. Weisweiler, H.: Zusätzliche verbale Sacherschließung in englischer Sprache : Zeitschrifteninhaltsdienst Theologie (2001) 0.10
    0.09637024 = sum of:
      0.09637024 = product of:
        0.48185122 = sum of:
          0.014214859 = weight(abstract_txt:diese in 955) [ClassicSimilarity], result of:
            0.014214859 = score(doc=955,freq=1.0), product of:
              0.05980643 = queryWeight, product of:
                1.0497226 = boost
                4.346169 = idf(docFreq=1533, maxDocs=43556)
                0.013108915 = queryNorm
              0.23768112 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.346169 = idf(docFreq=1533, maxDocs=43556)
                0.0546875 = fieldNorm(doc=955)
          0.111571625 = weight(abstract_txt:englischsprachigen in 955) [ClassicSimilarity], result of:
            0.111571625 = score(doc=955,freq=1.0), product of:
              0.23620477 = queryWeight, product of:
                2.086147 = boost
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.013108915 = queryNorm
              0.47235128 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.0546875 = fieldNorm(doc=955)
          0.16047515 = weight(abstract_txt:englischsprachige in 955) [ClassicSimilarity], result of:
            0.16047515 = score(doc=955,freq=2.0), product of:
              0.23888087 = queryWeight, product of:
                2.0979314 = boost
                8.68607 = idf(docFreq=19, maxDocs=43556)
                0.013108915 = queryNorm
              0.671779 = fieldWeight in 955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.68607 = idf(docFreq=19, maxDocs=43556)
                0.0546875 = fieldNorm(doc=955)
          0.018754013 = weight(abstract_txt:werden in 955) [ClassicSimilarity], result of:
            0.018754013 = score(doc=955,freq=1.0), product of:
              0.097640276 = queryWeight, product of:
                2.1207287 = boost
                3.5121832 = idf(docFreq=3531, maxDocs=43556)
                0.013108915 = queryNorm
              0.19207251 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5121832 = idf(docFreq=3531, maxDocs=43556)
                0.0546875 = fieldNorm(doc=955)
          0.17683558 = weight(abstract_txt:englischen in 955) [ClassicSimilarity], result of:
            0.17683558 = score(doc=955,freq=1.0), product of:
              0.40455347 = queryWeight, product of:
                3.8610332 = boost
                7.9929233 = idf(docFreq=39, maxDocs=43556)
                0.013108915 = queryNorm
              0.437113 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9929233 = idf(docFreq=39, maxDocs=43556)
                0.0546875 = fieldNorm(doc=955)
        0.2 = coord(5/25)
    
  5. Anglo-Amerikanische Katalogisierungsregeln : Deutsche Übersetzung der Anglo-American Cataloguing Rules, Second edition, 1998 Revision, einschließlich der Änderungen und Ergänzungen bis März 2001 (2002) 0.09
    0.094762884 = sum of:
      0.094762884 = product of:
        0.39484537 = sum of:
          0.042133953 = weight(abstract_txt:übersetzt in 1599) [ClassicSimilarity], result of:
            0.042133953 = score(doc=1599,freq=1.0), product of:
              0.10854975 = queryWeight, product of:
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.013108915 = queryNorm
              0.38815337 = fieldWeight in 1599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.280605 = idf(docFreq=29, maxDocs=43556)
                0.046875 = fieldNorm(doc=1599)
          0.012184165 = weight(abstract_txt:diese in 1599) [ClassicSimilarity], result of:
            0.012184165 = score(doc=1599,freq=1.0), product of:
              0.05980643 = queryWeight, product of:
                1.0497226 = boost
                4.346169 = idf(docFreq=1533, maxDocs=43556)
                0.013108915 = queryNorm
              0.20372668 = fieldWeight in 1599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.346169 = idf(docFreq=1533, maxDocs=43556)
                0.046875 = fieldNorm(doc=1599)
          0.058029354 = weight(abstract_txt:verwendeten in 1599) [ClassicSimilarity], result of:
            0.058029354 = score(doc=1599,freq=1.0), product of:
              0.16929697 = queryWeight, product of:
                1.7661402 = boost
                7.312355 = idf(docFreq=78, maxDocs=43556)
                0.013108915 = queryNorm
              0.34276664 = fieldWeight in 1599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.312355 = idf(docFreq=78, maxDocs=43556)
                0.046875 = fieldNorm(doc=1599)
          0.16413179 = weight(abstract_txt:übersetzung in 1599) [ClassicSimilarity], result of:
            0.16413179 = score(doc=1599,freq=8.0), product of:
              0.16929697 = queryWeight, product of:
                1.7661402 = boost
                7.312355 = idf(docFreq=78, maxDocs=43556)
                0.013108915 = queryNorm
              0.9694904 = fieldWeight in 1599, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.312355 = idf(docFreq=78, maxDocs=43556)
                0.046875 = fieldNorm(doc=1599)
          0.09563283 = weight(abstract_txt:englischsprachigen in 1599) [ClassicSimilarity], result of:
            0.09563283 = score(doc=1599,freq=1.0), product of:
              0.23620477 = queryWeight, product of:
                2.086147 = boost
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.013108915 = queryNorm
              0.40487254 = fieldWeight in 1599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.046875 = fieldNorm(doc=1599)
          0.022733297 = weight(abstract_txt:werden in 1599) [ClassicSimilarity], result of:
            0.022733297 = score(doc=1599,freq=2.0), product of:
              0.097640276 = queryWeight, product of:
                2.1207287 = boost
                3.5121832 = idf(docFreq=3531, maxDocs=43556)
                0.013108915 = queryNorm
              0.23282705 = fieldWeight in 1599, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5121832 = idf(docFreq=3531, maxDocs=43556)
                0.046875 = fieldNorm(doc=1599)
        0.24 = coord(6/25)
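
The content-match scores combine a query-side weight, a document-side weight, and a coordination factor. The sketch below recomputes the total for the second hit (doc=2746) from the values listed in its explanation tree, again assuming the classic TF-IDF formulas that agree with the figures shown. The function name term_score and the tuple layout are illustrative, not part of Lucene; where a tree shows no boost line, a boost of 1.0 is assumed.

```python
import math

MAX_DOCS = 43556
QUERY_NORM = 0.013108915


def idf(doc_freq: int) -> float:
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (boost, freq, docFreq) for the five query terms matched in doc 2746.
matched_terms = [
    (1.0770316, 1.0, 1369),  # können
    (1.7661402, 2.0, 78),    # übersetzung
    (2.1207287, 2.0, 3531),  # werden
    (3.8610332, 1.0, 39),    # englischen
    (4.016376, 1.0, 28),     # englische
]
FIELD_NORM = 0.0625

raw_sum = sum(term_score(b, f, df, FIELD_NORM) for b, f, df in matched_terms)
coord = 5 / 25  # 5 of the 25 query terms matched
total = raw_sum * coord
print(raw_sum, total)
```

The coord factor explains why a hit with few but strongly weighted matching terms can still rank below one that matches more of the 25 query terms.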