Document (#42630)

Author
Busch, D.
Title
Domänenspezifische hybride automatische Indexierung von bibliographischen Metadaten
Source
B.I.T.online. 22(2019) H.6, S.465-469
Year
2019
Abstract
At the Fraunhofer-Informationszentrum Raum und Bau (IRB), specialist literature on planning and construction is catalogued bibliographically. The resulting documents (metadata records) are used, among other things, to produce the IRB's bibliographic databases. Fig. 1 shows a document that describes a journal article. The documents are indexed with descriptors from a nomenclature (the Schlagwortliste IRB). A descriptor is "a term that can be used on its own, is suited to characterizing content unambiguously, and is permitted in the documentation system concerned". At present, indexing is carried out intellectually by human experts. Intellectual indexing is time-consuming and expensive. One solution is automatic indexing, in which descriptors are assigned by a computer program. Such programs are also referred to below as classifiers. This article describes a system for the automatic indexing of German-language documents in the construction domain with descriptors from the Schlagwortliste IRB.
Content
See: https://www.b-i-t-online.de/heft/2019-06-index.php.
Theme
Automatisches Indexieren
Object
IRB
Location
D

Similar documents (author)

  1. Busch, R.: Neue Wege der Buchaufstellung in den USA (1956) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:busch in 557) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 557, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=557)
    
  2. Busch, J.: Bibliographie zum Bibliotheks- und Büchereiwesen : aus dem Nachlaß bearbeitet von U. von Dietze (1966) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:busch in 1462) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 1462, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=1462)
    
  3. Busch, C.: Bitte ein Bit? : Zur (Be-) Deutung der Informationstheorie (1992) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:busch in 2444) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 2444, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=2444)
    
  4. Busch, J.: ¬A method for evaluating the multiple relations between subject descriptors : related terms in the Thesaurus for Engineering and Scientific Terms, a pilot study (1978) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:busch in 2948) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 2948, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=2948)
    
  5. Busch, J.A.: Thinking ambiguously : organizing source materials for historical research (1994) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:busch in 4047) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 4047, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=4047)
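
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which match the values printed in the listing; the function names are illustrative and not part of any library API.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form that reproduces the listed values (assumed)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:busch entries above.
author_idf = classic_idf(doc_freq=15, max_docs=43254)
author_score = field_weight(freq=1.0, doc_freq=15, max_docs=43254, field_norm=0.625)
print(f"idf={author_idf:.6f} score={author_score:.5f}")
```

All five author hits share the same term frequency, idf, and fieldNorm, which is why they receive the identical score of 5.56.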
    

Similar documents (content)

  1. Kempf, A.O.: Automatische Indexierung in der sozialwissenschaftlichen Fachinformation : eine Evaluationsstudie zur maschinellen Erschließung für die Datenbank SOLIS (2012) 0.21
    0.21417803 = sum of:
      0.21417803 = product of:
        0.8924085 = sum of:
          0.029586107 = weight(abstract_txt:werden in 2368) [ClassicSimilarity], result of:
            0.029586107 = score(doc=2368,freq=2.0), product of:
              0.07606723 = queryWeight, product of:
                1.2712574 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.016997261 = queryNorm
              0.3889468 = fieldWeight in 2368, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.078125 = fieldNorm(doc=2368)
          0.0853219 = weight(abstract_txt:metadaten in 2368) [ClassicSimilarity], result of:
            0.0853219 = score(doc=2368,freq=1.0), product of:
              0.16962597 = queryWeight, product of:
                1.5500126 = boost
                6.438403 = idf(docFreq=187, maxDocs=43254)
                0.016997261 = queryNorm
              0.50300026 = fieldWeight in 2368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.438403 = idf(docFreq=187, maxDocs=43254)
                0.078125 = fieldNorm(doc=2368)
          0.18090217 = weight(abstract_txt:automatischen in 2368) [ClassicSimilarity], result of:
            0.18090217 = score(doc=2368,freq=3.0), product of:
              0.1941068 = queryWeight, product of:
                1.6580951 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.016997261 = queryNorm
              0.9319723 = fieldWeight in 2368, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.078125 = fieldNorm(doc=2368)
          0.11863846 = weight(abstract_txt:bibliographischen in 2368) [ClassicSimilarity], result of:
            0.11863846 = score(doc=2368,freq=1.0), product of:
              0.21131758 = queryWeight, product of:
                1.7300429 = boost
                7.1862087 = idf(docFreq=88, maxDocs=43254)
                0.016997261 = queryNorm
              0.5614226 = fieldWeight in 2368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1862087 = idf(docFreq=88, maxDocs=43254)
                0.078125 = fieldNorm(doc=2368)
          0.23163067 = weight(abstract_txt:deskriptoren in 2368) [ClassicSimilarity], result of:
            0.23163067 = score(doc=2368,freq=1.0), product of:
              0.3778735 = queryWeight, product of:
                2.8334002 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.016997261 = queryNorm
              0.61298466 = fieldWeight in 2368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.078125 = fieldNorm(doc=2368)
          0.24632922 = weight(abstract_txt:indexierung in 2368) [ClassicSimilarity], result of:
            0.24632922 = score(doc=2368,freq=1.0), product of:
              0.4667768 = queryWeight, product of:
                4.0654974 = boost
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.016997261 = queryNorm
              0.5277238 = fieldWeight in 2368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.078125 = fieldNorm(doc=2368)
        0.24 = coord(6/25)
    
  2. Bunk, T.: Deskriptoren Stoppwortlisten und kryptische Zeichen (2008) 0.16
    0.1642648 = sum of:
      0.1642648 = product of:
        1.3688734 = sum of:
          0.20888785 = weight(abstract_txt:automatischen in 4472) [ClassicSimilarity], result of:
            0.20888785 = score(doc=4472,freq=1.0), product of:
              0.1941068 = queryWeight, product of:
                1.6580951 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.016997261 = queryNorm
              1.076149 = fieldWeight in 4472, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.15625 = fieldNorm(doc=4472)
          0.46326134 = weight(abstract_txt:deskriptoren in 4472) [ClassicSimilarity], result of:
            0.46326134 = score(doc=4472,freq=1.0), product of:
              0.3778735 = queryWeight, product of:
                2.8334002 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.016997261 = queryNorm
              1.2259693 = fieldWeight in 4472, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.15625 = fieldNorm(doc=4472)
          0.6967242 = weight(abstract_txt:indexierung in 4472) [ClassicSimilarity], result of:
            0.6967242 = score(doc=4472,freq=2.0), product of:
              0.4667768 = queryWeight, product of:
                4.0654974 = boost
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.016997261 = queryNorm
              1.4926281 = fieldWeight in 4472, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.15625 = fieldNorm(doc=4472)
        0.12 = coord(3/25)
    
  3. Schirmer, K.; Haller, J.: Zugang zu mehrsprachigen Nachrichten im Internet (2000) 0.13
    0.12753002 = sum of:
      0.12753002 = product of:
        0.63765013 = sum of:
          0.091646284 = weight(abstract_txt:indexiert in 563) [ClassicSimilarity], result of:
            0.091646284 = score(doc=563,freq=1.0), product of:
              0.1412056 = queryWeight, product of:
                8.307549 = idf(docFreq=28, maxDocs=43254)
                0.016997261 = queryNorm
              0.6490273 = fieldWeight in 563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.307549 = idf(docFreq=28, maxDocs=43254)
                0.078125 = fieldNorm(doc=563)
          0.05535054 = weight(abstract_txt:werden in 563) [ClassicSimilarity], result of:
            0.05535054 = score(doc=563,freq=7.0), product of:
              0.07606723 = queryWeight, product of:
                1.2712574 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.016997261 = queryNorm
              0.72765285 = fieldWeight in 563, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.078125 = fieldNorm(doc=563)
          0.111316614 = weight(abstract_txt:dokumente in 563) [ClassicSimilarity], result of:
            0.111316614 = score(doc=563,freq=2.0), product of:
              0.16074914 = queryWeight, product of:
                1.5089102 = boost
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.016997261 = queryNorm
              0.6924865 = fieldWeight in 563, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.078125 = fieldNorm(doc=563)
          0.147706 = weight(abstract_txt:automatischen in 563) [ClassicSimilarity], result of:
            0.147706 = score(doc=563,freq=2.0), product of:
              0.1941068 = queryWeight, product of:
                1.6580951 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.016997261 = queryNorm
              0.76095223 = fieldWeight in 563, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.078125 = fieldNorm(doc=563)
          0.23163067 = weight(abstract_txt:deskriptoren in 563) [ClassicSimilarity], result of:
            0.23163067 = score(doc=563,freq=1.0), product of:
              0.3778735 = queryWeight, product of:
                2.8334002 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.016997261 = queryNorm
              0.61298466 = fieldWeight in 563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.078125 = fieldNorm(doc=563)
        0.2 = coord(5/25)
    
  4. Scherer, B.: Automatische Indexierung und ihre Anwendung im DFG-Projekt "Gemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" (2003) 0.12
    0.11673178 = sum of:
      0.11673178 = product of:
        0.7295736 = sum of:
          0.037423797 = weight(abstract_txt:werden in 748) [ClassicSimilarity], result of:
            0.037423797 = score(doc=748,freq=5.0), product of:
              0.07606723 = queryWeight, product of:
                1.2712574 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.016997261 = queryNorm
              0.49198315 = fieldWeight in 748, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.0625 = fieldNorm(doc=748)
          0.042334866 = weight(abstract_txt:bereich in 748) [ClassicSimilarity], result of:
            0.042334866 = score(doc=748,freq=1.0), product of:
              0.123364665 = queryWeight, product of:
                1.321857 = boost
                5.490696 = idf(docFreq=484, maxDocs=43254)
                0.016997261 = queryNorm
              0.3431685 = fieldWeight in 748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.490696 = idf(docFreq=484, maxDocs=43254)
                0.0625 = fieldNorm(doc=748)
          0.16711026 = weight(abstract_txt:automatischen in 748) [ClassicSimilarity], result of:
            0.16711026 = score(doc=748,freq=4.0), product of:
              0.1941068 = queryWeight, product of:
                1.6580951 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.016997261 = queryNorm
              0.8609192 = fieldWeight in 748, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.0625 = fieldNorm(doc=748)
          0.4827047 = weight(abstract_txt:indexierung in 748) [ClassicSimilarity], result of:
            0.4827047 = score(doc=748,freq=6.0), product of:
              0.4667768 = queryWeight, product of:
                4.0654974 = boost
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.016997261 = queryNorm
              1.0341232 = fieldWeight in 748, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.0625 = fieldNorm(doc=748)
        0.16 = coord(4/25)
    
  5. Panyr, J.: Automatische Indexierung und Klassifikation (1983) 0.11
    0.114294 = sum of:
      0.114294 = product of:
        0.95245004 = sum of:
          0.03347286 = weight(abstract_txt:werden in 762) [ClassicSimilarity], result of:
            0.03347286 = score(doc=762,freq=1.0), product of:
              0.07606723 = queryWeight, product of:
                1.2712574 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.016997261 = queryNorm
              0.4400431 = fieldWeight in 762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.125 = fieldNorm(doc=762)
          0.23632962 = weight(abstract_txt:automatischen in 762) [ClassicSimilarity], result of:
            0.23632962 = score(doc=762,freq=2.0), product of:
              0.1941068 = queryWeight, product of:
                1.6580951 = boost
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.016997261 = queryNorm
              1.2175236 = fieldWeight in 762, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8873534 = idf(docFreq=119, maxDocs=43254)
                0.125 = fieldNorm(doc=762)
          0.6826475 = weight(abstract_txt:indexierung in 762) [ClassicSimilarity], result of:
            0.6826475 = score(doc=762,freq=3.0), product of:
              0.4667768 = queryWeight, product of:
                4.0654974 = boost
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.016997261 = queryNorm
              1.462471 = fieldWeight in 762, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.125 = fieldNorm(doc=762)
        0.12 = coord(3/25)
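
The content scores combine several term clauses. As a worked example, the sketch below recomputes the score of the first hit (doc 2368) from the numbers in its explain tree. It assumes that each clause is queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm and fieldWeight = sqrt(freq) * idf * fieldNorm, and that the sum is multiplied by the coord factor (matched clauses / total clauses). The helper names are hypothetical.

```python
import math

MAX_DOCS = 43254          # maxDocs from the listing
QUERY_NORM = 0.016997261  # queryNorm from the listing
FIELD_NORM = 0.078125     # fieldNorm(doc=2368) from the listing


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """idf form that reproduces the listed values (assumed)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def clause_score(freq: float, doc_freq: int, boost: float) -> float:
    """Score of one term clause: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


# term -> (termFreq, docFreq, boost), copied from the explain tree of doc 2368
clauses = {
    "werden": (2.0, 3478, 1.2712574),
    "metadaten": (1.0, 187, 1.5500126),
    "automatischen": (3.0, 119, 1.6580951),
    "bibliographischen": (1.0, 88, 1.7300429),
    "deskriptoren": (1.0, 45, 2.8334002),
    "indexierung": (1.0, 136, 4.0654974),
}

clause_sum = sum(clause_score(*values) for values in clauses.values())
coord = len(clauses) / 25  # 6 of the 25 query clauses matched
final_score = clause_sum * coord
print(f"sum={clause_sum:.4f} coord={coord:.2f} score={final_score:.4f}")
```

The coord factor explains why the second hit, despite a larger clause sum (1.3688734), ranks below the first: it matches only 3 of 25 clauses, so its sum is scaled by 0.12 rather than 0.24.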