Document (#34878)

Author
Strobel, S.
Title
Englischsprachige Erweiterung des TIB / AV-Portals : Ein GND/DBpedia-Mapping zur Gewinnung eines englischen Begriffssystems
Source
o-bib: Das offene Bibliotheksjournal. 1(2014) Nr.1, S.197-204
Year
2014
Abstract
The videos in the TIB / AV-Portal are indexed automatically with a total of 63,356 GND subject terms from science and technology. Besides German-language videos, the TIB / AV-Portal also holds numerous English-language videos. For the subject terms used in the TIB / AV-Portal knowledge base, the GND contains only very few English labels. An English indexing vocabulary with which the English-language videos could be indexed automatically is therefore missing. The solution to this problem is as follows: the English labels are to be obtained by mapping the GND subject terms to other datasets that contain an English translation of the terms. The mapping strategies used draw on DBpedia, LCSH, MACS results, and the WTI thesaurus. In the end, 35,025 GND subject terms were assigned (at least) one English label. These English labels can be used directly for the automatic indexing of the English-language videos. A further 11,694 GND subject terms could not be "translated" into English, but could at least be associated with a broader term that has an English translation. This association serves to expand the search results.
Content
Contribution as the elaborated version of a talk given at the 103rd Deutscher Bibliothekartag in Bremen. See: https://www.o-bib.de/article/view/2014H1S197-204.
Theme
Metadata
Automatic indexing
Multilingual problems
Form
AV materials
Object
GND
DBpedia
Location
D
Hannover

Similar documents (author)

  1. Strobel, S.: The complete Linux kit : fully configured LINUX system kernel (1997) 6.07
    6.0667334 = sum of:
      6.0667334 = weight(author_txt:strobel in 959) [ClassicSimilarity], result of:
        6.0667334 = fieldWeight in 959, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.706774 = idf(docFreq=6, maxDocs=42306)
          0.625 = fieldNorm(doc=959)
    
  2. Strobel, G.: Konzeption und Realisierung eines WWW-Servers für den Studiengang Dokumentation der HBI Stuttgart (1995) 6.07
    6.0667334 = sum of:
      6.0667334 = weight(author_txt:strobel in 6033) [ClassicSimilarity], result of:
        6.0667334 = fieldWeight in 6033, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.706774 = idf(docFreq=6, maxDocs=42306)
          0.625 = fieldNorm(doc=6033)
    
  3. Strobel, S.: Firewalls : Einführung - Praxis - Produkte (1999) 6.07
    6.0667334 = sum of:
      6.0667334 = weight(author_txt:strobel in 2532) [ClassicSimilarity], result of:
        6.0667334 = fieldWeight in 2532, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.706774 = idf(docFreq=6, maxDocs=42306)
          0.625 = fieldNorm(doc=2532)
    
  4. Strobel, S.; Uhl, T.: LINUX - vom PC zur Workstation : Grundlagen, Installation und praktischer Einsatz (1994) 4.85
    4.853387 = sum of:
      4.853387 = weight(author_txt:strobel in 3562) [ClassicSimilarity], result of:
        4.853387 = fieldWeight in 3562, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.706774 = idf(docFreq=6, maxDocs=42306)
          0.5 = fieldNorm(doc=3562)
    
  5. Strobel, S.; Marín-Arraiza, P.: Metadata for scientific audiovisual media : current practices and perspectives of the TIB / AV-portal (2015) 4.25
    4.2467136 = sum of:
      4.2467136 = weight(author_txt:strobel in 586) [ClassicSimilarity], result of:
        4.2467136 = fieldWeight in 586, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.706774 = idf(docFreq=6, maxDocs=42306)
          0.4375 = fieldNorm(doc=586)
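
The author scores above are each a single product of tf, idf and fieldNorm. The following is a minimal sketch of that arithmetic, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative and not part of the database. The fieldNorm value is taken from the listing as given, since it is a length normalization stored at index time.

```python
import math


def classic_tf(freq):
    # ClassicSimilarity term-frequency factor: square root of the raw count.
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the explain output.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values from the first author hit (doc=959): freq=1, docFreq=6, maxDocs=42306.
first_hit = field_weight(1.0, 6, 42306, 0.625)
print(round(first_hit, 4))
```

Because tf and idf are identical for all five author hits, the ranking among them is decided by fieldNorm alone (0.625, 0.5, 0.4375), which is smaller for records with longer author fields.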
    

Similar documents (content)

  1. Carevic, Z.: Semi-automatische Verschlagwortung zur Integration externer semantischer Inhalte innerhalb einer medizinischen Kooperationsplattform (2012) 0.13
    0.13285992 = sum of:
      0.13285992 = product of:
        0.4744997 = sum of:
          0.066348486 = weight(abstract_txt:wissensbasis in 2898) [ClassicSimilarity], result of:
            0.066348486 = score(doc=2898,freq=2.0), product of:
              0.11690087 = queryWeight, product of:
                1.0457672 = boost
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.013056467 = queryNorm
              0.567562 = fieldWeight in 2898, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.046875 = fieldNorm(doc=2898)
          0.1283184 = weight(abstract_txt:verschlagwortung in 2898) [ClassicSimilarity], result of:
            0.1283184 = score(doc=2898,freq=7.0), product of:
              0.119518094 = queryWeight, product of:
                1.0574089 = boost
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.013056467 = queryNorm
              1.0736315 = fieldWeight in 2898, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.046875 = fieldNorm(doc=2898)
          0.012517955 = weight(abstract_txt:diese in 2898) [ClassicSimilarity], result of:
            0.012517955 = score(doc=2898,freq=1.0), product of:
              0.061043482 = queryWeight, product of:
                1.0687122 = boost
                4.374746 = idf(docFreq=1447, maxDocs=42306)
                0.013056467 = queryNorm
              0.2050662 = fieldWeight in 2898, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.374746 = idf(docFreq=1447, maxDocs=42306)
                0.046875 = fieldNorm(doc=2898)
          0.01906413 = weight(abstract_txt:können in 2898) [ClassicSimilarity], result of:
            0.01906413 = score(doc=2898,freq=2.0), product of:
              0.06413351 = queryWeight, product of:
                1.0954275 = boost
                4.484104 = idf(docFreq=1297, maxDocs=42306)
                0.013056467 = queryNorm
              0.29725692 = fieldWeight in 2898, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.484104 = idf(docFreq=1297, maxDocs=42306)
                0.046875 = fieldNorm(doc=2898)
          0.14335996 = weight(abstract_txt:begriffssystems in 2898) [ClassicSimilarity], result of:
            0.14335996 = score(doc=2898,freq=4.0), product of:
              0.15507399 = queryWeight, product of:
                1.204469 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.013056467 = queryNorm
              0.9244617 = fieldWeight in 2898, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.046875 = fieldNorm(doc=2898)
          0.058375046 = weight(abstract_txt:verwendeten in 2898) [ClassicSimilarity], result of:
            0.058375046 = score(doc=2898,freq=1.0), product of:
              0.17038651 = queryWeight, product of:
                1.7854954 = boost
                7.308879 = idf(docFreq=76, maxDocs=42306)
                0.013056467 = queryNorm
              0.34260368 = fieldWeight in 2898, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.308879 = idf(docFreq=76, maxDocs=42306)
                0.046875 = fieldNorm(doc=2898)
          0.046515703 = weight(abstract_txt:werden in 2898) [ClassicSimilarity], result of:
            0.046515703 = score(doc=2898,freq=8.0), product of:
              0.099380255 = queryWeight, product of:
                2.156062 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.013056467 = queryNorm
              0.4680578 = fieldWeight in 2898, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.046875 = fieldNorm(doc=2898)
        0.28 = coord(7/25)
    
  2. Beall, J.: Approaches to expansions : case studies from the German and Vietnamese translations (2003) 0.12
    0.117389336 = sum of:
      0.117389336 = product of:
        0.58694667 = sum of:
          0.017973833 = weight(abstract_txt:können in 2749) [ClassicSimilarity], result of:
            0.017973833 = score(doc=2749,freq=1.0), product of:
              0.06413351 = queryWeight, product of:
                1.0954275 = boost
                4.484104 = idf(docFreq=1297, maxDocs=42306)
                0.013056467 = queryNorm
              0.2802565 = fieldWeight in 2749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.484104 = idf(docFreq=1297, maxDocs=42306)
                0.0625 = fieldNorm(doc=2749)
          0.110664696 = weight(abstract_txt:übersetzung in 2749) [ClassicSimilarity], result of:
            0.110664696 = score(doc=2749,freq=2.0), product of:
              0.17099652 = queryWeight, product of:
                1.7886888 = boost
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.013056467 = queryNorm
              0.64717513 = fieldWeight in 2749, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.0625 = fieldNorm(doc=2749)
          0.03101047 = weight(abstract_txt:werden in 2749) [ClassicSimilarity], result of:
            0.03101047 = score(doc=2749,freq=2.0), product of:
              0.099380255 = queryWeight, product of:
                2.156062 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.013056467 = queryNorm
              0.31203854 = fieldWeight in 2749, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.0625 = fieldNorm(doc=2749)
          0.20330164 = weight(abstract_txt:englischen in 2749) [ClassicSimilarity], result of:
            0.20330164 = score(doc=2749,freq=1.0), product of:
              0.40715688 = queryWeight, product of:
                3.9033458 = boost
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.013056467 = queryNorm
              0.49932015 = fieldWeight in 2749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.0625 = fieldNorm(doc=2749)
          0.22399603 = weight(abstract_txt:englische in 2749) [ClassicSimilarity], result of:
            0.22399603 = score(doc=2749,freq=1.0), product of:
              0.43433827 = queryWeight, product of:
                4.031533 = boost
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.013056467 = queryNorm
              0.5157179 = fieldWeight in 2749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.0625 = fieldNorm(doc=2749)
        0.2 = coord(5/25)
    
  3. Online-Enzyklopädie Wikipedia (2003) 0.11
    0.10537644 = sum of:
      0.10537644 = product of:
        0.4390685 = sum of:
          0.018068112 = weight(abstract_txt:diese in 2411) [ClassicSimilarity], result of:
            0.018068112 = score(doc=2411,freq=3.0), product of:
              0.061043482 = queryWeight, product of:
                1.0687122 = boost
                4.374746 = idf(docFreq=1447, maxDocs=42306)
                0.013056467 = queryNorm
              0.29598758 = fieldWeight in 2411, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.374746 = idf(docFreq=1447, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.015886774 = weight(abstract_txt:können in 2411) [ClassicSimilarity], result of:
            0.015886774 = score(doc=2411,freq=2.0), product of:
              0.06413351 = queryWeight, product of:
                1.0954275 = boost
                4.484104 = idf(docFreq=1297, maxDocs=42306)
                0.013056467 = queryNorm
              0.24771409 = fieldWeight in 2411, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.484104 = idf(docFreq=1297, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.11431512 = weight(abstract_txt:englischsprachigen in 2411) [ClassicSimilarity], result of:
            0.11431512 = score(doc=2411,freq=2.0), product of:
              0.23903619 = queryWeight, product of:
                2.1148179 = boost
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.013056467 = queryNorm
              0.47823355 = fieldWeight in 2411, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.023737444 = weight(abstract_txt:werden in 2411) [ClassicSimilarity], result of:
            0.023737444 = score(doc=2411,freq=3.0), product of:
              0.099380255 = queryWeight, product of:
                2.156062 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.013056467 = queryNorm
              0.23885474 = fieldWeight in 2411, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.12706351 = weight(abstract_txt:englischen in 2411) [ClassicSimilarity], result of:
            0.12706351 = score(doc=2411,freq=1.0), product of:
              0.40715688 = queryWeight, product of:
                3.9033458 = boost
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.013056467 = queryNorm
              0.31207508 = fieldWeight in 2411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.13999753 = weight(abstract_txt:englische in 2411) [ClassicSimilarity], result of:
            0.13999753 = score(doc=2411,freq=1.0), product of:
              0.43433827 = queryWeight, product of:
                4.031533 = boost
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.013056467 = queryNorm
              0.3223237 = fieldWeight in 2411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.251487 = idf(docFreq=29, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
        0.24 = coord(6/25)
    
  4. Weisweiler, H.: Zusätzliche verbale Sacherschließung in englischer Sprache : Zeitschrifteninhaltsdienst Theologie (2001) 0.10
    0.096977465 = sum of:
      0.096977465 = product of:
        0.4848873 = sum of:
          0.014604282 = weight(abstract_txt:diese in 958) [ClassicSimilarity], result of:
            0.014604282 = score(doc=958,freq=1.0), product of:
              0.061043482 = queryWeight, product of:
                1.0687122 = boost
                4.374746 = idf(docFreq=1447, maxDocs=42306)
                0.013056467 = queryNorm
              0.23924391 = fieldWeight in 958, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.374746 = idf(docFreq=1447, maxDocs=42306)
                0.0546875 = fieldNorm(doc=958)
          0.1131662 = weight(abstract_txt:englischsprachigen in 958) [ClassicSimilarity], result of:
            0.1131662 = score(doc=958,freq=1.0), product of:
              0.23903619 = queryWeight, product of:
                2.1148179 = boost
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.013056467 = queryNorm
              0.47342706 = fieldWeight in 958, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.0546875 = fieldNorm(doc=958)
          0.16004117 = weight(abstract_txt:englischsprachige in 958) [ClassicSimilarity], result of:
            0.16004117 = score(doc=958,freq=2.0), product of:
              0.23903619 = queryWeight, product of:
                2.1148179 = boost
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.013056467 = queryNorm
              0.66952693 = fieldWeight in 958, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.0546875 = fieldNorm(doc=958)
          0.019186748 = weight(abstract_txt:werden in 958) [ClassicSimilarity], result of:
            0.019186748 = score(doc=958,freq=1.0), product of:
              0.099380255 = queryWeight, product of:
                2.156062 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.013056467 = queryNorm
              0.19306399 = fieldWeight in 958, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.0546875 = fieldNorm(doc=958)
          0.17788894 = weight(abstract_txt:englischen in 958) [ClassicSimilarity], result of:
            0.17788894 = score(doc=958,freq=1.0), product of:
              0.40715688 = queryWeight, product of:
                3.9033458 = boost
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.013056467 = queryNorm
              0.43690515 = fieldWeight in 958, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9891224 = idf(docFreq=38, maxDocs=42306)
                0.0546875 = fieldNorm(doc=958)
        0.2 = coord(5/25)
    
  5. Anglo-Amerikanische Katalogisierungsregeln : Deutsche Übersetzung der Anglo-American Cataloguing Rules, Second edition, 1998 Revision, einschließlich der Änderungen und Ergänzungen bis März 2001 (2002) 0.10
    0.09591997 = sum of:
      0.09591997 = product of:
        0.39966655 = sum of:
          0.04251904 = weight(abstract_txt:übersetzt in 1602) [ClassicSimilarity], result of:
            0.04251904 = score(doc=1602,freq=1.0), product of:
              0.10947863 = queryWeight, product of:
                1.012024 = boost
                8.285388 = idf(docFreq=28, maxDocs=42306)
                0.013056467 = queryNorm
              0.38837755 = fieldWeight in 1602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.285388 = idf(docFreq=28, maxDocs=42306)
                0.046875 = fieldNorm(doc=1602)
          0.012517955 = weight(abstract_txt:diese in 1602) [ClassicSimilarity], result of:
            0.012517955 = score(doc=1602,freq=1.0), product of:
              0.061043482 = queryWeight, product of:
                1.0687122 = boost
                4.374746 = idf(docFreq=1447, maxDocs=42306)
                0.013056467 = queryNorm
              0.2050662 = fieldWeight in 1602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.374746 = idf(docFreq=1447, maxDocs=42306)
                0.046875 = fieldNorm(doc=1602)
          0.058375046 = weight(abstract_txt:verwendeten in 1602) [ClassicSimilarity], result of:
            0.058375046 = score(doc=1602,freq=1.0), product of:
              0.17038651 = queryWeight, product of:
                1.7854954 = boost
                7.308879 = idf(docFreq=76, maxDocs=42306)
                0.013056467 = queryNorm
              0.34260368 = fieldWeight in 1602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.308879 = idf(docFreq=76, maxDocs=42306)
                0.046875 = fieldNorm(doc=1602)
          0.16599704 = weight(abstract_txt:übersetzung in 1602) [ClassicSimilarity], result of:
            0.16599704 = score(doc=1602,freq=8.0), product of:
              0.17099652 = queryWeight, product of:
                1.7886888 = boost
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.013056467 = queryNorm
              0.9707627 = fieldWeight in 1602, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.321951 = idf(docFreq=75, maxDocs=42306)
                0.046875 = fieldNorm(doc=1602)
          0.0969996 = weight(abstract_txt:englischsprachigen in 1602) [ClassicSimilarity], result of:
            0.0969996 = score(doc=1602,freq=1.0), product of:
              0.23903619 = queryWeight, product of:
                2.1148179 = boost
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.013056467 = queryNorm
              0.40579462 = fieldWeight in 1602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.046875 = fieldNorm(doc=1602)
          0.023257852 = weight(abstract_txt:werden in 1602) [ClassicSimilarity], result of:
            0.023257852 = score(doc=1602,freq=2.0), product of:
              0.099380255 = queryWeight, product of:
                2.156062 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.013056467 = queryNorm
              0.2340289 = fieldWeight in 1602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.046875 = fieldNorm(doc=1602)
        0.24 = coord(6/25)
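
The content scores above combine several matching terms. The following is a minimal sketch of that computation, assuming the ClassicSimilarity formulas tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm, with the sum of the term scores scaled by coord = matched terms / query terms. The function names are illustrative. The boost, queryNorm and fieldNorm values are copied from the listing for hit 2 (doc=2749).

```python
import math

MAX_DOCS = 42306
QUERY_NORM = 0.013056467


def term_score(freq, doc_freq, boost, field_norm):
    # Score of one matching term: queryWeight * fieldWeight.
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, matched, total):
    # Sum of term scores, scaled by the fraction of query terms matched.
    return sum(term_scores) * (matched / total)


# (freq, docFreq, boost) for the five terms matched in doc=2749.
beall_terms = [
    (1.0, 1297, 1.0954275),  # können
    (2.0, 75, 1.7886888),    # übersetzung
    (2.0, 3368, 2.156062),   # werden
    (1.0, 38, 3.9033458),    # englischen
    (1.0, 29, 4.031533),     # englische
]
scores = [term_score(f, df, b, 0.0625) for f, df, b in beall_terms]
total = document_score(scores, matched=5, total=25)
print(round(total, 4))
```

The coord factor explains why hit 1 outranks hit 2 despite a smaller raw sum: it matches 7 of the 25 query terms rather than 5.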