Document (#34877)

Author
Strobel, S.
Title
Englischsprachige Erweiterung des TIB / AV-Portals : Ein GND/DBpedia-Mapping zur Gewinnung eines englischen Begriffssystems
Source
o-bib: Das offene Bibliotheksjournal. 1(2014) Nr.1, S.197-204
Year
2014
Abstract
Die Videos des TIB / AV-Portals werden mit insgesamt 63.356 GND-Sachbegriffen aus Naturwissenschaft und Technik automatisch verschlagwortet. Neben den deutschsprachigen Videos verfügt das TIB / AV-Portal auch über zahlreiche englischsprachige Videos. Die GND enthält zu den in der TIB / AV-Portal-Wissensbasis verwendeten Sachbegriffen nur sehr wenige englische Bezeichner. Es fehlt demnach ein englisches Indexierungsvokabular, mit dem die englischsprachigen Videos automatisch verschlagwortet werden können. Die Lösung dieses Problems sieht wie folgt aus: Die englischen Bezeichner sollen über ein Mapping der GND-Sachbegriffe auf andere Datensätze gewonnen werden, die eine englische Übersetzung der Begriffe enthalten. Die verwendeten Mappingstrategien nutzen die DBpedia, LCSH, MACS-Ergebnisse sowie den WTI-Thesaurus. Am Ende haben 35.025 GND-Sachbegriffe (mindestens) einen englischen Bezeichner ermittelt bekommen. Diese englischen Bezeichner können für die automatische Verschlagwortung der englischsprachigen Videos unmittelbar herangezogen werden. 11.694 GND-Sachbegriffe konnten zwar nicht ins Englische "übersetzt", aber immerhin mit einem Oberbegriff assoziiert werden, der eine englische Übersetzung hat. Diese Assoziation dient der Erweiterung der Suchergebnisse.
Content
Beitrag als ausgearbeitete Form eines Vortrages während des 103. Deutschen Bibliothekartages in Bremen. Vgl.: https://www.o-bib.de/article/view/2014H1S197-204.
Theme
Metadaten
Automatisches Indexieren
Multilinguale Probleme
Form
AV-Materialien
Object
GND
DBpedia
Location
D
Hannover

Similar documents (author)

  1. Strobel, S.: ¬The complete Linux kit : fully configured LINUX system kernel (1997) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:strobel in 8959) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 8959, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=8959)
    
  2. Strobel, G.: Konzeption und Realisierung eines WWW-Servers für den Studiengang Dokumentation der HBI Stuttgart (1995) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:strobel in 5964) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 5964, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=5964)
    
  3. Strobel, S.: Firewalls : Einführung - Praxis - Produkte (1999) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:strobel in 1531) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 1531, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=1531)
    
  4. Strobel, S.; Uhl, T.: LINUX - vom PC zur Workstation : Grundlagen, Installation und praktischer Einsatz (1994) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:strobel in 1561) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 1561, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=1561)
    
  5. Strobel, S.; Marín-Arraiza, P.: Metadata for scientific audiovisual media : current practices and perspectives of the TIB / AV-portal (2015) 4.27
    4.2660527 = sum of:
      4.2660527 = weight(author_txt:strobel in 3667) [ClassicSimilarity], result of:
        4.2660527 = fieldWeight in 3667, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.4375 = fieldNorm(doc=3667)
    

Similar documents (content)

  1. Carevic, Z.: Semi-automatische Verschlagwortung zur Integration externer semantischer Inhalte innerhalb einer medizinischen Kooperationsplattform (2012) 0.13
    0.13001324 = sum of:
      0.13001324 = product of:
        0.46433303 = sum of:
          0.06407173 = weight(abstract_txt:wissensbasis in 897) [ClassicSimilarity], result of:
            0.06407173 = score(doc=897,freq=2.0), product of:
              0.11400297 = queryWeight, product of:
                1.0219779 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.013157722 = queryNorm
              0.56201804 = fieldWeight in 897, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=897)
          0.12160708 = weight(abstract_txt:verschlagwortung in 897) [ClassicSimilarity], result of:
            0.12160708 = score(doc=897,freq=7.0), product of:
              0.115103476 = queryWeight, product of:
                1.0268987 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.013157722 = queryNorm
              1.0565022 = fieldWeight in 897, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.046875 = fieldNorm(doc=897)
          0.012117711 = weight(abstract_txt:diese in 897) [ClassicSimilarity], result of:
            0.012117711 = score(doc=897,freq=1.0), product of:
              0.05962645 = queryWeight, product of:
                1.0452445 = boost
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.013157722 = queryNorm
              0.2032271 = fieldWeight in 897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.046875 = fieldNorm(doc=897)
          0.018510625 = weight(abstract_txt:können in 897) [ClassicSimilarity], result of:
            0.018510625 = score(doc=897,freq=2.0), product of:
              0.062771514 = queryWeight, product of:
                1.0724566 = boost
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.013157722 = queryNorm
              0.29488894 = fieldWeight in 897, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.046875 = fieldNorm(doc=897)
          0.14450394 = weight(abstract_txt:begriffssystems in 897) [ClassicSimilarity], result of:
            0.14450394 = score(doc=897,freq=4.0), product of:
              0.15561388 = queryWeight, product of:
                1.194009 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.013157722 = queryNorm
              0.9286057 = fieldWeight in 897, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.046875 = fieldNorm(doc=897)
          0.05819929 = weight(abstract_txt:verwendeten in 897) [ClassicSimilarity], result of:
            0.05819929 = score(doc=897,freq=1.0), product of:
              0.16973458 = queryWeight, product of:
                1.7635329 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.013157722 = queryNorm
              0.3428841 = fieldWeight in 897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.046875 = fieldNorm(doc=897)
          0.045322645 = weight(abstract_txt:werden in 897) [ClassicSimilarity], result of:
            0.045322645 = score(doc=897,freq=8.0), product of:
              0.09749568 = queryWeight, product of:
                2.1132998 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013157722 = queryNorm
              0.46486822 = fieldWeight in 897, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.046875 = fieldNorm(doc=897)
        0.28 = coord(7/25)
    
  2. Beall, J.: Approaches to expansions : case studies from the German and Vietnamese translations (2003) 0.12
    0.11672618 = sum of:
      0.11672618 = product of:
        0.5836309 = sum of:
          0.017451985 = weight(abstract_txt:können in 1748) [ClassicSimilarity], result of:
            0.017451985 = score(doc=1748,freq=1.0), product of:
              0.062771514 = queryWeight, product of:
                1.0724566 = boost
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.013157722 = queryNorm
              0.27802396 = fieldWeight in 1748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.0625 = fieldNorm(doc=1748)
          0.108634 = weight(abstract_txt:übersetzung in 1748) [ClassicSimilarity], result of:
            0.108634 = score(doc=1748,freq=2.0), product of:
              0.16859056 = queryWeight, product of:
                1.7575797 = boost
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.013157722 = queryNorm
              0.64436585 = fieldWeight in 1748, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.0625 = fieldNorm(doc=1748)
          0.030215096 = weight(abstract_txt:werden in 1748) [ClassicSimilarity], result of:
            0.030215096 = score(doc=1748,freq=2.0), product of:
              0.09749568 = queryWeight, product of:
                2.1132998 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013157722 = queryNorm
              0.30991215 = fieldWeight in 1748, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=1748)
          0.20362997 = weight(abstract_txt:englischen in 1748) [ClassicSimilarity], result of:
            0.20362997 = score(doc=1748,freq=1.0), product of:
              0.4068527 = queryWeight, product of:
                3.8612862 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.013157722 = queryNorm
              0.5005005 = fieldWeight in 1748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=1748)
          0.22369988 = weight(abstract_txt:englische in 1748) [ClassicSimilarity], result of:
            0.22369988 = score(doc=1748,freq=1.0), product of:
              0.43316486 = queryWeight, product of:
                3.9841897 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.013157722 = queryNorm
              0.5164313 = fieldWeight in 1748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=1748)
        0.2 = coord(5/25)
    
  3. Online-Enzyklopädie Wikipedia (2003) 0.10
    0.10479279 = sum of:
      0.10479279 = product of:
        0.43663663 = sum of:
          0.017490407 = weight(abstract_txt:diese in 1410) [ClassicSimilarity], result of:
            0.017490407 = score(doc=1410,freq=3.0), product of:
              0.05962645 = queryWeight, product of:
                1.0452445 = boost
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.013157722 = queryNorm
              0.29333305 = fieldWeight in 1410, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.015425521 = weight(abstract_txt:können in 1410) [ClassicSimilarity], result of:
            0.015425521 = score(doc=1410,freq=2.0), product of:
              0.062771514 = queryWeight, product of:
                1.0724566 = boost
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.013157722 = queryNorm
              0.24574079 = fieldWeight in 1410, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.11351092 = weight(abstract_txt:englischsprachigen in 1410) [ClassicSimilarity], result of:
            0.11351092 = score(doc=1410,freq=2.0), product of:
              0.23748043 = queryWeight, product of:
                2.0859904 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.013157722 = queryNorm
              0.4779801 = fieldWeight in 1410, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.023128616 = weight(abstract_txt:werden in 1410) [ClassicSimilarity], result of:
            0.023128616 = score(doc=1410,freq=3.0), product of:
              0.09749568 = queryWeight, product of:
                2.1132998 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013157722 = queryNorm
              0.23722707 = fieldWeight in 1410, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.12726873 = weight(abstract_txt:englischen in 1410) [ClassicSimilarity], result of:
            0.12726873 = score(doc=1410,freq=1.0), product of:
              0.4068527 = queryWeight, product of:
                3.8612862 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.013157722 = queryNorm
              0.3128128 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.13981242 = weight(abstract_txt:englische in 1410) [ClassicSimilarity], result of:
            0.13981242 = score(doc=1410,freq=1.0), product of:
              0.43316486 = queryWeight, product of:
                3.9841897 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.013157722 = queryNorm
              0.32276955 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
        0.24 = coord(6/25)
    
  4. Weisweiler, H.: Zusätzliche verbale Sacherschließung in englischer Sprache : Zeitschrifteninhaltsdienst Theologie (2001) 0.10
    0.09699942 = sum of:
      0.09699942 = product of:
        0.4849971 = sum of:
          0.0141373295 = weight(abstract_txt:diese in 5957) [ClassicSimilarity], result of:
            0.0141373295 = score(doc=5957,freq=1.0), product of:
              0.05962645 = queryWeight, product of:
                1.0452445 = boost
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.013157722 = queryNorm
              0.23709829 = fieldWeight in 5957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5957)
          0.11237008 = weight(abstract_txt:englischsprachigen in 5957) [ClassicSimilarity], result of:
            0.11237008 = score(doc=5957,freq=1.0), product of:
              0.23748043 = queryWeight, product of:
                2.0859904 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.013157722 = queryNorm
              0.47317618 = fieldWeight in 5957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5957)
          0.16161883 = weight(abstract_txt:englischsprachige in 5957) [ClassicSimilarity], result of:
            0.16161883 = score(doc=5957,freq=2.0), product of:
              0.24016626 = queryWeight, product of:
                2.0977533 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.013157722 = queryNorm
              0.6729456 = fieldWeight in 5957, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5957)
          0.018694635 = weight(abstract_txt:werden in 5957) [ClassicSimilarity], result of:
            0.018694635 = score(doc=5957,freq=1.0), product of:
              0.09749568 = queryWeight, product of:
                2.1132998 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013157722 = queryNorm
              0.19174835 = fieldWeight in 5957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5957)
          0.17817624 = weight(abstract_txt:englischen in 5957) [ClassicSimilarity], result of:
            0.17817624 = score(doc=5957,freq=1.0), product of:
              0.4068527 = queryWeight, product of:
                3.8612862 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.013157722 = queryNorm
              0.43793795 = fieldWeight in 5957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5957)
        0.2 = coord(5/25)
    
  5. Anglo-Amerikanische Katalogisierungsregeln : Deutsche Übersetzung der Anglo-American Cataloguing Rules, Second edition, 1998 Revision, einschließlich der Änderungen und Ergänzungen bis März 2001 (2002) 0.09
    0.09472598 = sum of:
      0.09472598 = product of:
        0.3946916 = sum of:
          0.042445045 = weight(abstract_txt:übersetzt in 4601) [ClassicSimilarity], result of:
            0.042445045 = score(doc=4601,freq=1.0), product of:
              0.10915238 = queryWeight, product of:
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.013157722 = queryNorm
              0.38886046 = fieldWeight in 4601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.046875 = fieldNorm(doc=4601)
          0.012117711 = weight(abstract_txt:diese in 4601) [ClassicSimilarity], result of:
            0.012117711 = score(doc=4601,freq=1.0), product of:
              0.05962645 = queryWeight, product of:
                1.0452445 = boost
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.013157722 = queryNorm
              0.2032271 = fieldWeight in 4601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.046875 = fieldNorm(doc=4601)
          0.16295101 = weight(abstract_txt:übersetzung in 4601) [ClassicSimilarity], result of:
            0.16295101 = score(doc=4601,freq=8.0), product of:
              0.16859056 = queryWeight, product of:
                1.7575797 = boost
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.013157722 = queryNorm
              0.9665488 = fieldWeight in 4601, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.046875 = fieldNorm(doc=4601)
          0.05819929 = weight(abstract_txt:verwendeten in 4601) [ClassicSimilarity], result of:
            0.05819929 = score(doc=4601,freq=1.0), product of:
              0.16973458 = queryWeight, product of:
                1.7635329 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.013157722 = queryNorm
              0.3428841 = fieldWeight in 4601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.046875 = fieldNorm(doc=4601)
          0.09631722 = weight(abstract_txt:englischsprachigen in 4601) [ClassicSimilarity], result of:
            0.09631722 = score(doc=4601,freq=1.0), product of:
              0.23748043 = queryWeight, product of:
                2.0859904 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.013157722 = queryNorm
              0.4055796 = fieldWeight in 4601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.046875 = fieldNorm(doc=4601)
          0.022661323 = weight(abstract_txt:werden in 4601) [ClassicSimilarity], result of:
            0.022661323 = score(doc=4601,freq=2.0), product of:
              0.09749568 = queryWeight, product of:
                2.1132998 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013157722 = queryNorm
              0.23243411 = fieldWeight in 4601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.046875 = fieldNorm(doc=4601)
        0.24 = coord(6/25)