Document (#37907)

Author
Kempf, A.O.
Title
Automatische Inhaltserschließung in der Fachinformation
Source
Information - Wissenschaft und Praxis. 64(2013) no.2/3, pp.96-106
Year
2013
Abstract
The article is based on a master's thesis entitled "Automatische Indexierung in der sozialwissenschaftlichen Fachinformation. Eine Evaluationsstudie zur maschinellen Erschließung für die Datenbank SOLIS" (Kempf 2012), written in the postgraduate programme in Library and Information Science at the Humboldt-Universität zu Berlin, at the Chair of Information Retrieval. Building on the shell model (Schalenmodell) of subject indexing in specialized information, the article presents evaluation results for an automatic indexing method intended for use in social science information services. Starting from the application scenario described by Krause, according to which SOLIS (Sozialwissenschaftliches Literaturinformationssystem) holdings of lesser relevance should be indexed automatically, two test series were carried out on this document basis with the indexing software MindServer from the company Recommind. The first test series examined the effects of general system settings; the second compared the software's indexing performance for the peripheral and the core areas of the literature database. For the second series, a separate version of the indexing software was built for each of the two areas and trained on document corpora from the respective area. The evaluation, which is based on intellectually (manually) generated comparison data, points to differences in indexing performance between peripheral and core areas, which on the one hand argue against using automatic indexing methods in the peripheral areas. On the other hand, there are indications that indexing results can be improved by building training sets specific to subfields of the discipline.
Content
Cf.: http://www.degruyter.com/view/j/iwp.2013.64.issue-2-3/iwp-2013-0011/iwp-2013-0011.xml?format=INT.
Theme
Automatic indexing
Field
Social sciences
Object
SOLIS

Similar documents (author)

  1. Kempf, G.: Klassifikationsprobleme der Rechtswissenschaft (1972) 5.30
    5.296644 = sum of:
      5.296644 = weight(author_txt:kempf in 4743) [ClassicSimilarity], result of:
        5.296644 = fieldWeight in 4743, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.47463 = idf(docFreq=23, maxDocs=42306)
          0.625 = fieldNorm(doc=4743)
    
  2. Kempf, A.: Thematischer Zugang zu Fachinformationen im Internet (1994) 5.30
    5.296644 = sum of:
      5.296644 = weight(author_txt:kempf in 975) [ClassicSimilarity], result of:
        5.296644 = fieldWeight in 975, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.47463 = idf(docFreq=23, maxDocs=42306)
          0.625 = fieldNorm(doc=975)
    
  3. Kempf, A.: Forstliche Klassifikation und Meta-Information zum Wald im Internet (1995) 5.30
    5.296644 = sum of:
      5.296644 = weight(author_txt:kempf in 3273) [ClassicSimilarity], result of:
        5.296644 = fieldWeight in 3273, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.47463 = idf(docFreq=23, maxDocs=42306)
          0.625 = fieldNorm(doc=3273)
    
  4. Kempf, A.: Advocating global forest issues on the Internet (1996) 5.30
    5.296644 = sum of:
      5.296644 = weight(author_txt:kempf in 94) [ClassicSimilarity], result of:
        5.296644 = fieldWeight in 94, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.47463 = idf(docFreq=23, maxDocs=42306)
          0.625 = fieldNorm(doc=94)
    
  5. Kempf, K.: Dalla Germania un esempio avanzato di sistema integrato (1997) 5.30
    5.296644 = sum of:
      5.296644 = weight(author_txt:kempf in 847) [ClassicSimilarity], result of:
        5.296644 = fieldWeight in 847, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.47463 = idf(docFreq=23, maxDocs=42306)
          0.625 = fieldNorm(doc=847)
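
All five author matches above carry the same score because each tree multiplies the same three factors. The following is a minimal sketch of that arithmetic, assuming the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the figures shown here. The function names are hypothetical and are not part of any Lucene API.

```python
import math


def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency (assumed formula)."""
    return math.sqrt(freq)


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency (assumed formula that matches the listing)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain trees."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:kempf entries in the listing.
author_score = field_weight(freq=1.0, doc_freq=23, max_docs=42306, field_norm=0.625)
print(f"idf   = {idf(23, 42306):.5f}")
print(f"score = {author_score:.6f}")
```

Since the author query consists of a single term and the listing shows no query weight or coord factor for it, the field weight is the whole score.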
    

Similar documents (content)

  1. Kempf, A.O.: Automatische Indexierung in der sozialwissenschaftlichen Fachinformation : eine Evaluationsstudie zur maschinellen Erschließung für die Datenbank SOLIS (2012) 0.49
    0.4897093 = sum of:
      0.4897093 = product of:
        1.7489617 = sum of:
          0.1091196 = weight(abstract_txt:literaturdatenbank in 2904) [ClassicSimilarity], result of:
            0.1091196 = score(doc=2904,freq=1.0), product of:
              0.15615293 = queryWeight, product of:
                1.0750144 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.016239524 = queryNorm
              0.69879955 = fieldWeight in 2904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.078125 = fieldNorm(doc=2904)
          0.117491715 = weight(abstract_txt:indexierungsverfahren in 2904) [ClassicSimilarity], result of:
            0.117491715 = score(doc=2904,freq=1.0), product of:
              0.16404127 = queryWeight, product of:
                1.1018329 = boost
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.016239524 = queryNorm
              0.71623266 = fieldWeight in 2904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.078125 = fieldNorm(doc=2904)
          0.07056544 = weight(abstract_txt:datenbank in 2904) [ClassicSimilarity], result of:
            0.07056544 = score(doc=2904,freq=1.0), product of:
              0.14712495 = queryWeight, product of:
                1.4756975 = boost
                6.1392555 = idf(docFreq=247, maxDocs=42306)
                0.016239524 = queryNorm
              0.47962934 = fieldWeight in 2904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1392555 = idf(docFreq=247, maxDocs=42306)
                0.078125 = fieldNorm(doc=2904)
          0.1443545 = weight(abstract_txt:automatische in 2904) [ClassicSimilarity], result of:
            0.1443545 = score(doc=2904,freq=2.0), product of:
              0.18817787 = queryWeight, product of:
                1.6689312 = boost
                6.943154 = idf(docFreq=110, maxDocs=42306)
                0.016239524 = queryNorm
              0.7671174 = fieldWeight in 2904, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.943154 = idf(docFreq=110, maxDocs=42306)
                0.078125 = fieldNorm(doc=2904)
          0.2092052 = weight(abstract_txt:sozialwissenschaftlichen in 2904) [ClassicSimilarity], result of:
            0.2092052 = score(doc=2904,freq=1.0), product of:
              0.3036267 = queryWeight, product of:
                2.119943 = boost
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.016239524 = queryNorm
              0.6890211 = fieldWeight in 2904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.81947 = idf(docFreq=16, maxDocs=42306)
                0.078125 = fieldNorm(doc=2904)
          0.3236883 = weight(abstract_txt:solis in 2904) [ClassicSimilarity], result of:
            0.3236883 = score(doc=2904,freq=2.0), product of:
              0.32237867 = queryWeight, product of:
                2.1844258 = boost
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.016239524 = queryNorm
              1.0040624 = fieldWeight in 2904, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.078125 = fieldNorm(doc=2904)
          0.77453697 = weight(title_txt:fachinformation in 2904) [ClassicSimilarity], result of:
            0.77453697 = score(doc=2904,freq=1.0), product of:
              0.42159593 = queryWeight, product of:
                3.5327864 = boost
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.016239524 = queryNorm
              1.8371547 = fieldWeight in 2904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.25 = fieldNorm(doc=2904)
        0.28 = coord(7/25)
    
  2. Seeger, T.: Entwicklung der Fachinformation und -kommunikation (2004) 0.17
    0.1743234 = sum of:
      0.1743234 = product of:
        1.452695 = sum of:
          0.070266776 = weight(abstract_txt:beschriebenen in 3908) [ClassicSimilarity], result of:
            0.070266776 = score(doc=3908,freq=1.0), product of:
              0.13512063 = queryWeight, product of:
                8.320479 = idf(docFreq=27, maxDocs=42306)
                0.016239524 = queryNorm
              0.52002996 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.320479 = idf(docFreq=27, maxDocs=42306)
                0.0625 = fieldNorm(doc=3908)
          0.026988488 = weight(abstract_txt:wurde in 3908) [ClassicSimilarity], result of:
            0.026988488 = score(doc=3908,freq=1.0), product of:
              0.08995335 = queryWeight, product of:
                1.1538858 = boost
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.016239524 = queryNorm
              0.3000276 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.0625 = fieldNorm(doc=3908)
          1.3554398 = weight(title_txt:fachinformation in 3908) [ClassicSimilarity], result of:
            1.3554398 = score(doc=3908,freq=1.0), product of:
              0.42159593 = queryWeight, product of:
                3.5327864 = boost
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.016239524 = queryNorm
              3.215021 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.4375 = fieldNorm(doc=3908)
        0.12 = coord(3/25)
    
  3. Capurro, R.: Hermeneutik der Fachinformation (1986) 0.13
    0.12500545 = sum of:
      0.12500545 = product of:
        1.5625682 = sum of:
          0.013494244 = weight(abstract_txt:wurde in 532) [ClassicSimilarity], result of:
            0.013494244 = score(doc=532,freq=1.0), product of:
              0.08995335 = queryWeight, product of:
                1.1538858 = boost
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.016239524 = queryNorm
              0.1500138 = fieldWeight in 532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.03125 = fieldNorm(doc=532)
          1.5490739 = weight(title_txt:fachinformation in 532) [ClassicSimilarity], result of:
            1.5490739 = score(doc=532,freq=1.0), product of:
              0.42159593 = queryWeight, product of:
                3.5327864 = boost
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.016239524 = queryNorm
              3.6743095 = fieldWeight in 532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.5 = fieldNorm(doc=532)
        0.08 = coord(2/25)
    
  4. Herb, U.: Wege zur psychologischen Fachinformation : Eine Bilanz aus der Virtuellen Fachbibliothek Psychologie (2002) 0.10
    0.10495056 = sum of:
      0.10495056 = product of:
        0.874588 = sum of:
          0.026988488 = weight(abstract_txt:wurde in 2178) [ClassicSimilarity], result of:
            0.026988488 = score(doc=2178,freq=1.0), product of:
              0.08995335 = queryWeight, product of:
                1.1538858 = boost
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.016239524 = queryNorm
              0.3000276 = fieldWeight in 2178, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8004417 = idf(docFreq=945, maxDocs=42306)
                0.0625 = fieldNorm(doc=2178)
          0.07306259 = weight(abstract_txt:wurden in 2178) [ClassicSimilarity], result of:
            0.07306259 = score(doc=2178,freq=2.0), product of:
              0.15875062 = queryWeight, product of:
                1.877403 = boost
                5.2069645 = idf(docFreq=629, maxDocs=42306)
                0.016239524 = queryNorm
              0.46023497 = fieldWeight in 2178, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2069645 = idf(docFreq=629, maxDocs=42306)
                0.0625 = fieldNorm(doc=2178)
          0.77453697 = weight(title_txt:fachinformation in 2178) [ClassicSimilarity], result of:
            0.77453697 = score(doc=2178,freq=1.0), product of:
              0.42159593 = queryWeight, product of:
                3.5327864 = boost
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.016239524 = queryNorm
              1.8371547 = fieldWeight in 2178, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.25 = fieldNorm(doc=2178)
        0.12 = coord(3/25)
    
  5. Habermann, K.: vifamath - mathematische Fachinformation aus einer Hand (2010) 0.10
    0.10121051 = sum of:
      0.10121051 = product of:
        1.2651315 = sum of:
          0.103326105 = weight(abstract_txt:wurden in 2038) [ClassicSimilarity], result of:
            0.103326105 = score(doc=2038,freq=1.0), product of:
              0.15875062 = queryWeight, product of:
                1.877403 = boost
                5.2069645 = idf(docFreq=629, maxDocs=42306)
                0.016239524 = queryNorm
              0.65087056 = fieldWeight in 2038, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2069645 = idf(docFreq=629, maxDocs=42306)
                0.125 = fieldNorm(doc=2038)
          1.1618054 = weight(title_txt:fachinformation in 2038) [ClassicSimilarity], result of:
            1.1618054 = score(doc=2038,freq=1.0), product of:
              0.42159593 = queryWeight, product of:
                3.5327864 = boost
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.016239524 = queryNorm
              2.755732 = fieldWeight in 2038, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.375 = fieldNorm(doc=2038)
        0.08 = coord(2/25)
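
The content-based scores combine more factors than the author matches: each matching term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by the coord factor (matching query terms divided by total query terms). The sketch below recomputes the score of the first hit (doc 2904) from the numbers in its explain tree. It assumes the same tf and idf formulas that reproduce the listing, and all names in it are hypothetical.

```python
import math

MAX_DOCS = 42306          # maxDocs from the listing
QUERY_NORM = 0.016239524  # queryNorm from the listing


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed idf formula: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (term, boost, docFreq, termFreq, fieldNorm) for doc 2904, copied from the listing.
TERMS = [
    ("abstract_txt:literaturdatenbank",       1.0750144, 14,  1.0, 0.078125),
    ("abstract_txt:indexierungsverfahren",    1.1018329, 11,  1.0, 0.078125),
    ("abstract_txt:datenbank",                1.4756975, 247, 1.0, 0.078125),
    ("abstract_txt:automatische",             1.6689312, 110, 2.0, 0.078125),
    ("abstract_txt:sozialwissenschaftlichen", 2.119943,  16,  1.0, 0.078125),
    ("abstract_txt:solis",                    2.1844258, 12,  2.0, 0.078125),
    ("title_txt:fachinformation",             3.5327864, 73,  1.0, 0.25),
]

raw_sum = sum(term_score(b, df, f, n) for _, b, df, f, n in TERMS)
coord = len(TERMS) / 25   # 7 of the 25 query terms matched
total = raw_sum * coord

print(f"sum   = {raw_sum:.4f}")
print(f"coord = {coord:.2f}")
print(f"score = {total:.4f}")
```

The coord factor explains the ranking: hits 2 to 5 each earn a larger contribution from title_txt:fachinformation than hit 1 does, because their shorter titles give a higher fieldNorm, but they match only two or three of the 25 query terms and are scaled down accordingly.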